Site Reliability Engineer
Há 4 horas
About the CompanyThis company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide.They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric single-tenant cloud infrastructure on the market. If you share this passion, this role offers the opportunity to help shape the future of internet-scale infrastructure.This position is being managed in partnership with an external recruitment consultancy supporting the company throughout the hiring process.SummaryThe Reliability team is responsible for the health and resilience of the infrastructure powering a global bare metal cloud platform. As a Senior Site Reliability Engineer (SRE), you'll focus on building reliable, observable, and self-healing systems at scale.SREs here operate at the intersection of software engineering and infrastructure — designing tools that automate operations, improve incident response, and enhance observability, ensuring the platform delivers high performance and reliability to customers worldwide.This role is ideal for engineers passionate about reliability, automation, distributed systems, and bringing cloud-like experiences to bare metal environments.Key ResponsibilitiesContinuously improve platform reliability and performance.Design, build, and maintain tools to automate operational workflows and incident response.Implement and enhance observability systems (monitoring, alerting, tracing).Collaborate with engineering and platform teams to design scalable and resilient systems.Participate in on-call rotations and lead post-incident reviews with a learning-focused approach.Develop and document operational playbooks and processes.Contribute to defining SLOs/SLIs and driving reliability metrics across teams.Skills & QualificationsRequired:Fluent verbal and written English communication skillsAdvanced experience with Linux/Unix in production environmentsHands-on experience with Kubernetes and container orchestrationProficiency with IaC tools (e.g., Terraform, Ansible)Experience with observability stacks (Prometheus, Grafana, Loki, ELK, etc.)Proficiency with scripting/programming languages such as Bash, Python, Go, or RubyWorking knowledge of Git and CI/CD pipelinesExperience with incident response and root cause analysisKnowledge of cloud-native reliability and security best practicesWhat's OfferedContractor engagement (PJ)Paid Time OffCompetitive compensation packageWellness benefit (Wellhub / Gympass equivalent)Annual performance-based bonusFlexible working hoursOpportunities for technical and career growth
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Mouts TI Tempo inteiroNa Mouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a) SRE (Site Reliability Engineer) para atuar presencialmente, com foco em infraestrutura, automação e observabilidade em ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções de observabilidade
-
Site Reliability Engineer
Há 6 dias
São Paulo, Brasil PayRetailers Tempo inteiroSite Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...
-
Site Reliability Engineer
Há 6 dias
São Paulo, Brasil PayRetailers Tempo inteiroSite Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...
-
Site Reliability Engineer
2 semanas atrás
São Paulo, Brasil PayRetailers Tempo inteiroJob Overview We’re PayRetailers, and we offer cutting‑edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee’s contribution is valued. We’ve got big plans to expand into new markets and make a meaningful impact on the...
-
Senior Site Reliability Engineer
2 semanas atrás
São Paulo, Brasil K2 Solutions Tempo inteiroTrabalho híbrido na região de Pinheiros/ SP - 3x por semana no escritórioEstamos selecionando um Senior Site Reliability Engineer - SRE para se juntar ao nosso time e desempenhar um papel essencial na manutenção, automação e melhoria da confiabilidade dos sistemas que impulsionam a rede logística da empresa em múltiplas regiões. Essa pessoa...
-
Site Reliability Engineer
Há 6 horas
São Paulo, Brasil Review ALL Tempo inteiroAbout the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...
-
Site reliability engineer
1 semana atrás
São Paulo, Brasil Mouts TI Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Mouts TI Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades: Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Mouts Ti Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site Reliability Engineer
Há 6 dias
são paulo, Brasil Mouts TI Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades: Implementar e gerenciar soluções deobservabilidade(Datadog,...