Senior Site Reliability Engineer
2 days ago
Are you in Brazil or Argentina? We are actively recruiting in these locations and offer a comfortable remote environment. Submit your CV in English, and we'll get back to you.
We invite a Senior Site Reliability Engineer to join our dynamic team. In this hands-on role, you'll focus on improving the stability, observability, and efficiency of our services. You'll lead initiatives to enhance monitoring, automation, and reliability practices while collaborating with engineering teams to ensure our systems run smoothly and remain resilient.
What's in it for you:
- Join a top S&P 500 company shaping the future of global payments and financial technology
- Lead initiatives to improve stability, observability, and efficiency of critical services
- Collaborate with engineering teams to solve complex problems and drive operational excellence
Is that you?
- 5+ years in site reliability, observability, or platform engineering
- Experience building SRE or observability practices from scratch
- Hands-on OpenTelemetry experience (SDKs and Collector)
- Strong experience with PromQL/SPL and at least one APM platform (Datadog, Splunk APM, Google Cloud APM)
- Experience designing SLOs and alerting strategies (burn rate, multi-window)
- Familiarity with MuleSoft or API gateway observability
- Awareness of security best practices (PII redaction, access control)
- Experience building automation scripts for CI/CD tasks
- Experience with logging frameworks (Logback, Serilog) and structured JSON logging
- Collaboration, communication, and independent problem-solving skills
- Upper-Intermediate+ English level
Key responsibilities and your contribution
In this role, you'll own and lead efforts to ensure the reliability, observability, and operational efficiency of our services.
- Define and enforce logging, tracing, and metrics standards across services
- Implement and maintain centralized telemetry pipelines and APM integrations
- Build reusable instrumentation libraries for core languages (Java, .NET, Python)
- Establish dashboards and SLO/error budget alerts
- Ensure log/trace correlation and schema consistency
- Implement PII/secret redaction, retention, and cost optimization
- Collaborate with development teams to onboard services and ensure observability readiness
- Develop runbook templates, documentation, and training materials for engineering teams
- Audit alerts, reduce noise, and maintain alert quality standards
- Support incident response through tooling improvement and post-incident telemetry analysis
What's working at Dev.Pro like?
Dev.Pro is a global company that has been building great software since 2011. Our team values fairness, high standards, openness, and inclusivity for everyone, no matter your background.
- We are 99.9% remote: you can work from anywhere in the world
- 30 paid days off per year to use however you like: vacations, holidays, or personal time
- 5 paid sick days, up to 60 days of medical leave, and up to 6 paid days off per year for major family events such as weddings, funerals, or the birth of a child
- Partially covered health insurance after the probation period, plus a wellness bonus for gym memberships, sports nutrition, and similar needs after 6 months
- Pay in U.S. dollars, with all approved overtime covered
- English lessons, Dev.Pro University programs, and fun online activities and team-building events
Our next steps:
Submit a CV in English — Intro call with a Recruiter — Internal interview — Client interview — Offer