Site Reliability Engineer

1 week ago


São Paulo, São Paulo, Brazil WEX Inc. Full-time R$70,000 - R$120,000 per year

About the Team/Role

We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.

The ideal candidate will have related experience and will be a key player in fostering a culture of continuous improvement and collaboration across engineering teams.

SRE is an ongoing journey of continuous improvement, and the core principles apply regardless of the technology's complexity, the customer's needs, or the business context. If you're passionate about building resilient and highly available systems, we encourage you to apply.

How you'll make an impact

As a Site Reliability Engineer, your responsibilities will include:

  • Embrace Observability: You'll build and maintain comprehensive monitoring and observability systems by meticulously instrumenting applications, infrastructure, and dependencies. You'll create clear dashboards that provide a direct view of system health, standardizing metrics, logs, and tracing to enable effective correlation and analysis.
  • Design for Performance and Resilience: You will design systems with a focus on scalability, redundancy, and fault tolerance. This includes setting clear performance targets (SLIs/SLOs) aligned with business goals and regularly conducting load testing and chaos engineering to find issues proactively.
  • Proactive Reliability: You'll help shift our team from a reactive to a proactive mindset by defining explicit Service Level Objectives (SLOs) that reflect user expectations. You'll use error budgets to guide the balance between development and operations, slowing down releases when necessary to maintain reliability.
  • Incident Management and Learning: You will treat outages and performance degradations as opportunities to improve resilience. This involves streamlining incident response with clear procedures and conducting blameless postmortems to learn from mistakes.
  • Automate Everything (with Caution): You'll automate repetitive and error-prone tasks to minimize toil and free up the team for high-value work. You'll build robust testing and rollback capabilities into automation pipelines, always maintaining careful oversight and human judgment.
  • Impact Engineering and Corporate Culture: You'll collaborate with development and product teams to improve system quality and performance. This includes highlighting impacts on quality, bringing focus to customer journey bottlenecks, and helping to prioritize product stories related to defects.

Experience you'll bring

  • Expertise in software design, development, and testing for software enhancements and new products.
  • Knowledge of automated testing tools and traditional quality assurance approaches.
  • Experience with cloud development, including designing, developing, and maintaining applications on platforms like Amazon Web Services/EC2.
  • Understanding of cloud storage services, including EBS, Amazon S3, and EFS.
  • Ability to create documentation for future maintenance and issue resolution.
  • Experience with APIs, pre-scripting, post-scripting, and integration testing.


  • São Paulo, São Paulo, Brazil Truelogic Full-time US$120,000 - US$180,000 per year

    About Truelogic: At Truelogic we are a leading provider of nearshore staff augmentation services headquartered in New York. For over two decades, we've been delivering top-tier technology solutions to companies of all sizes, from innovative startups to industry leaders, helping them achieve their digital transformation goals. Our team of 600+ highly skilled...


  • São Paulo, São Paulo, Brazil Enter Full-time R$80,000 - R$120,000 per year

    Enter (formerly Talisman AI) was founded in 2023 with the mission of making Brazil a leading player in Artificial Intelligence. We combine human expertise with the efficiency of AI to help large Latin American companies optimize critical, high-volume processes that require intensive manual work. We began our journey by applying AI to...


  • São Paulo, São Paulo, Brazil WEX Inc. Full-time R$80,000 - R$160,000 per year

    About the Team/Role: The WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. As part of the Site Reliability Engineering organization, you will support internal stakeholders and Payment Platform...

  • Site Reliability Engineer

    2 weeks ago


    São Paulo, São Paulo, Brazil Loadsmart Full-time R$80,000 - R$120,000 per year

    Are you interested in joining an innovative logistics technology company? Loadsmart is a growth-stage technology company valued at over $1 billion (a true Tech Unicorn). We are a collection of industry veterans and user-centered engineers using innovative technology to fearlessly reinvent the future of freight by helping shippers, brokers, warehouses and...

  • Site Reliability Engineer

    2 weeks ago


    São Paulo, São Paulo, Brazil Loadsmart Full-time R$120,000 - R$240,000 per year

    Are you interested in joining an innovative logistics technology company? Loadsmart is a growth-stage technology company valued at over $1 billion (a true Tech Unicorn). We are a collection of industry veterans and user-centered engineers using innovative technology to fearlessly reinvent the future of freight by helping shippers, brokers, warehouses and...

  • Site Reliability Engineer

    2 weeks ago


    São Paulo, São Paulo, Brazil Loadsmart Full-time R$80,000 - R$120,000 per year

    Are you interested in joining an innovative logistics technology company? Loadsmart is a growth-stage technology company valued at over $1 billion (a true Tech Unicorn). We are a collection of industry veterans and user-centered engineers using innovative technology to fearlessly reinvent the future of freight by helping shippers, brokers, warehouses and...


  • São Paulo, São Paulo, Brazil K2 Solutions Full-time R$90,000 - R$120,000 per year

    Hybrid work in the Pinheiros area of São Paulo, 3 days a week in the office. We are hiring a Senior Site Reliability Engineer (SRE) to join our team and play an essential role in maintaining, automating, and improving the reliability of the systems that power the company's logistics network across multiple regions. This person...


  • São Paulo, São Paulo, Brazil Thales Full-time R$80,000 - R$120,000 per year

    Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...

  • Site Reliability Engineer

    2 weeks ago


    São Paulo, São Paulo, Brazil DELIVER IT Full-time R$80,000 - R$120,000 per year

    Do you consider yourself someone with a thirst for learning who enjoys teamwork and aims for career growth? Then this opportunity is for you. We are looking for a Junior SRE (Site Reliability Engineer) to join a highly technical team committed to operational excellence. The professional will focus on...

  • Site Reliability Engineer

    1 week ago


    São Paulo, São Paulo, Brazil DELIVER IT Full-time R$60,000 - R$120,000 per year

    Are you someone with solid experience in reliability engineering, strategic thinking, and a collaborative profile who constantly seeks to raise the technical level of the teams and systems you work with? Then this opportunity is for you. We are looking for a Senior SRE (Site Reliability Engineer) to join a technical team of...