Site Reliability Engineer

Há 4 dias


Greater São Paulo Area, Brasil PayRetailers Tempo inteiro R$80.000 - R$120.000 por ano

Job Description
We're PayRetailers, and we offer cutting-edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee's contribution is valued.

We've got big plans to expand into new markets and make a meaningful impact on the world of payments. To help us get there, our Technology team is on the lookout for a new Site Reliability Engineer with a strong focus on Data.

About The Role
Site Reliability Engineers are the guardians of our reliability promise. They deliver a highly reliable, resilient, and cost-efficient platform that consistently meets business and customer expectations for availability and performance.

Job Requirements
The ideal candidate should have all the following requirements.

However, we believe in self-learning and adaptation, so we can be flexible on certain requirements.

What Is a MUST

  • Proactive attitude, always on the lookout for improvement opportunities.
  • Strong scripting skills (Python, Bash).
  • Experience in Cloud.
  • Knowledge of Grafana, Application Insights, OpenTelemetry, Prometheus.
  • DBA experience in creating and maintaining DDBB in SQL Server (Mongo or postgreSQL).
  • Fluent level of English, able to conduct technical meetings in English.

What Is Nice To Have

  • Experience with non-functional and production testing.
  • Analytical mindset, being able to connect the dots and establish cause and effect.
  • Experience with containers and container orchestration platforms (EKS/AKS).
  • Understanding of APIs and asynchronous distributed software architectures.
  • Working knowledge of AI-enabled tools like VS Code, Claude Code, etc.
  • Demonstrable experience with applying AI to Site Reliability Engineering.
  • Knowledge with process automation tools like N8N.
  • Working experience with chaos engineering.

Job Responsibilities

  • Increase automation of operational activities to reduce downtime risk, in collaboration with Platform Engineering and Domain Squads.
  • Drive systemic improvements across engineering teams based on incident RCAs and telemetry insights.
  • Implement non-functional improvements (resilience, performance, reliability) directly in code, with Domain Squads reviewing and approving changes.
  • Promote adoption of SRE best practices across development teams (integration patterns, monitoring, alerting, real-time tracing.
  • Provide cross-platform observability capabilities above and beyond what the Domain Squads provide.

Investigate issues and incidents and propose/implement changes as deemed necessary.

  • Continuously review logs, metrics, and alerts to identify and/or implement continuous improvements.
  • Design non-functional test and continuously run them to ensure that we build quality up-to and including production.

Job Benefits

  • Individual development plans
  • Excellent working environment and collaboration
  • Private medical insurance covered by the company
  • Meal and Food Allowance
  • Life Insurance
  • Wellhub (Gym)

If you're passionate about tech, innovation, and want to thrive in an environment that values collaboration and diversity, this role might be the perfect fit for you

Apply today and help us shape the future of the PayTech industry

To get an idea of what life at PayRetailers is like, check out our Instagram and our About Us page.

Our commitment to diversity, equity & inclusion

At PayRetailers, diversity, equity, and inclusion aren't just values – they're fundamental to who we are. We're dedicated to fostering an environment where every individual feels valued, respected, and empowered to bring their authentic selves to work. We welcome applicants from all backgrounds and identities, recognizing that diversity drives innovation and strengthens our team.

So, if you're passionate about making a difference and excited about the role, we encourage you to apply. Join us in building a global company where everyone can thrive and feel proud to belong.

Please feel free to include your pronouns in your application (e.g. she/her, he/him, they/them, etc.).


  • Site Reliability Engineer

    1 semana atrás


    Greater São Paulo Area, Brasil WEX Tempo inteiro R$60.000 - R$120.000 por ano

    About The Team/RoleWe are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.The ideal candidate will have...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil Mouts TI Tempo inteiro

    Na Mouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a) SRE (Site Reliability Engineer) para atuar presencialmente, com foco em infraestrutura, automação e observabilidade em ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções de observabilidade

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Site Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Site Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...

  • Site Reliability Engineer

    3 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Job Overview We’re PayRetailers, and we offer cutting‑edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee’s contribution is valued. We’ve got big plans to expand into new markets and make a meaningful impact on the...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Review ALL Tempo inteiro

    About the CompanyThis company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide.They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...


  • São Paulo, Brasil Scubyt Tempo inteiro

    Software Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our RBI and SSPM services. We are a team of software engineers focused on improving availability, latency,...


  • São Paulo, Brasil K2 Solutions Tempo inteiro

    Trabalho híbrido na região de Pinheiros/ SP - 3x por semana no escritórioEstamos selecionando um Senior Site Reliability Engineer - SRE para se juntar ao nosso time e desempenhar um papel essencial na manutenção, automação e melhoria da confiabilidade dos sistemas que impulsionam a rede logística da empresa em múltiplas regiões. Essa pessoa...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Review ALL Tempo inteiro

    About the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...


  • são paulo, Brasil MetaCTO Tempo inteiro

    About Us At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE) , you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers...