Last updated: September 4, 2025
Senior Site Reliability Engineer
Via DuckDuckGo
Compensation
$178,500.00
USD / Year
About
Your Team and Role
Working on the Site Reliability Team, you'll help build and maintain world-class infrastructure to meet the needs of millions of users protecting their privacy online. You'll utilize high-level languages like Perl, Go, or Python and work on related projects. Recent projects include:
- Preparing Duck.ai image uploads for production
- Reducing the user impact of instances serving errors
As a Site Reliability Engineer, you'll dive deep into complex operational challenges, including software, systems, automation, and process analysis. We are looking for candidates who can read, write, troubleshoot, and deploy all types of software to help us tackle the reliability challenges of large-scale deployments.
About You
- 7+ years of relevant professional experience in reliability, platform, infrastructure, or software engineering.
- Experience participating in a 24x7 on-call rotation for a large-scale deployment.
- Ability to lead and collaborate on high-impact and complex projects from proposal through post-mortem.
- Skills to wrangle vague problems, propose innovative solutions, and execute them with a strong focus on metrics.
- Experience developing effective tools, services, alerts, and responses to identify and address reliability risks.
- Investigative ability to root-cause sources of instability in high-traffic, distributed systems.
- Deep experience administering and troubleshooting Linux and web technologies.
- Ability to implement automation around infrastructure provisioning and configuration management to prioritize efficiency, scalability and reliability.
- Foresight to help identify the future technical direction of our deployment, with the aim of improving reliability and performance.
- Advanced programming skills enabling close partnership with software engineers to triage production issues and identify appropriate remediation, including code changes and performance considerations.
- Ability to leverage cloud-native services and architectures to enhance reliability and scalability, with hands-on experience packaging and deploying applications using Docker and Docker Compose.
Other Information
This listing highlights the key details of the position. The full description is available in the original posting.