Site Reliability Engineer

2 semanas atrás


Salvador, Bahia, Brasil WEX Tempo inteiro R$104.000 - R$130.878 por ano

About the Team/Role

We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.

The ideal candidate will have related experience and will be a key player in fostering a culture of continuous improvement and collaboration across engineering teams.

SRE is an ongoing journey of continuous improvement, and the core principles apply regardless of the technology's complexity, the customer's needs, or the business context. If you're passionate about building resilient and highly available systems, we encourage you to apply.

How you'll make an impact

As a Site Reliability Engineer, your responsibilities will include:

  • Embrace Observability: You'll build and maintain comprehensive monitoring and observability systems by meticulously instrumenting applications, infrastructure, and dependencies. You'll create clear dashboards that provide a direct view of system health, standardizing metrics, logs, and tracing to enable effective correlation and analysis.

  • Design for Performance and Resilience: You will design systems with a focus on scalability, redundancy, and fault tolerance. This includes setting clear performance targets (SLIs/SLOs) aligned with business goals and regularly conducting load testing and chaos engineering to find issues proactively.

  • Proactive Reliability: You'll help shift our team from a reactive to a proactive mindset by defining explicit Service Level Objectives (SLOs) that reflect user expectations. You'll use error budgets to guide the balance between development and operations, slowing down releases when necessary to maintain reliability.

  • Incident Management and Learning: You will treat outages and performance degradations as opportunities to improve resilience. This involves streamlining incident response with clear procedures and conducting blameless postmortems to learn from mistakes.

  • Automate Everything (with Caution): You'll automate repetitive and error-prone tasks to minimize toil and free up the team for high-value work. You'll build in robust testing and rollback capabilities into automation pipelines, always maintaining careful oversight and human judgment.

  • Impact Engineering and Corporate Culture: You'll collaborate with development and product teams to improve system quality and performance. This includes highlighting impacts on quality, bringing focus to customer journey bottlenecks, and helping to prioritize product stories related to defects.

Experience you'll bring

  • Expertise in software design, development, and testing for software enhancements and new products.

  • Knowledge of automated testing tools and traditional quality assurance approaches.

  • Experience with cloud development, including designing, developing, and maintaining applications on platforms like Amazon Web Services/EC2.

  • Understanding of cloud storage services, including EBS, Amazon S3, and EFS.

  • Ability to create documentation for future maintenance and issue resolution.

  • Experience with APIs, pre-scripting, post-scripting, and integration testing.


  • Sr Software Engineer

    Há 2 horas


    Salvador, Bahia, Brasil WEX Tempo inteiro R$90.000 - R$120.000 por ano

    About the Team/Role As a Sr SWE of Data Lake Engineering, this technologist will help with the design and implementation of the Data Lake platform (supporting both GenAI and traditional AI/ML technology and applications), AI model productionalization (E2E AI/ML model production lifecycle: AI/ML model development, deployment, monitoring, refresh, etc), Data...


  • Salvador, Bahia, Brasil Acronis Tempo inteiro R$80.000 - R$120.000 por ano

    Acronis is a world leader in cyber protection—empowering people by providing them with cutting-edge technology that enables them to monitor, control, and protect the data that their businesses and lives depend on. We are looking for а Senior Linux Systems Administrator who is ready to join our mission in creating a #CyberFit futureThe Senior Network...

  • ArgoCD Specialist

    1 semana atrás


    Salvador, Bahia, Brasil WEX Tempo inteiro R$80.000 - R$120.000 por ano

    About the Team/RoleWe're looking for an experienced DevOps Specialist to lead the design, implementation, and operation of our GitOps platform using ArgoCD. You will be responsible for running ArgoCD at enterprise scale—supporting hundreds of Kubernetes clusters across multiple environments—with a focus on reliability, security, and developer...

  • Site Reliability Engineer

    2 semanas atrás


    Salvador, Brasil Bebeeengenheiro Tempo inteiro

    Afirma sua carreira como Engenheiro de Confiabilidade de SitesA Stone Co. está em busca de um profissional experiente para ocupar o cargo de Site Reliability Engineer, contribuindo para a evolução da cultura de excelência operacional em engenharia e promovendo a implementação de soluções inovadoras.Responsabilidades:- Criar, manter e melhorar...

  • Site Reliability Engineer

    1 semana atrás


    Salvador, Brasil Canonical Tempo inteiro

    OverviewJoin to apply for the Site Reliability Engineer role at CanonicalCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT.Our customers...

  • Site Reliability Engineer

    2 semanas atrás


    Salvador, Brasil AgileEngine Tempo inteiro

    OverviewSite Reliability Engineer (Middle) ID38916 AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. If...

  • Site Reliability Engineer

    2 semanas atrás


    Salvador, Brasil AgileEngine Tempo inteiro

    Overview Site Reliability Engineer (Middle) ID38916 AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. If you're...

  • Site Reliability Engineer

    4 semanas atrás


    Salvador, Brasil buscojobs Brasil Tempo inteiro

    Sobre a Empresa Com mais de 20 anos de mercado, a ITeam se destaca pelo comprometimento com o cliente. Baseamos nosso relacionamento em valores sólidos e objetivos claros, oferecendo soluções e serviços de TI que auxiliam na realização das metas dos nossos clientes. Nossa missão é fornecer serviços de TI que se alinhem com a estratégia e processos...

  • Site Reliability Engineer

    4 semanas atrás


    Salvador, Brasil HCLTech Tempo inteiro

    Your role and responsabilities: - Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution. - Performing deep-dive application troubleshooting and identifying preventive actions. - Managing CIRS-related requests including deployments, feature toggles, and data fixes. - Following up on major...

  • Senior SRE

    Há 3 dias


    Salvador, Brasil Remessa Online Tempo inteiro

    Sua carreira com liberdade e propósito 🌏Na Remessa Online, não se trata apenas de transferências internacionais, criamos conexões que rompem fronteiras e transformam realidades. Somos movidos pela ousadia, respeito, colaboração, encantamento e responsabilidade.Nosso segredo? Trabalhar juntos com transparência, comprometimento e autonomia, sempre...

  • Lead SRE Engineer

    Há 3 dias


    Salvador, Brasil Avenue Code Tempo inteiro

    About the Company:Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We’re privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word...

  • .NET Engineer

    3 semanas atrás


    Salvador, Brasil AgileEngine Tempo inteiro

    Join to apply for the .NET Engineer (Senior/Lead) ID41557 role at AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and startups across 17+ industries. We rank among the leaders in application development and AI/ML, and our people-first culture has earned us Best Place to Work awards. ABOUT THE ROLE ...

  • Software Engineer

    2 semanas atrás


    Salvador, Brasil Agileengine Tempo inteiro

    OverviewSoftware Engineer (Mid-Senior) – AgileEngineAgileEngine is an Inc. 5000 company that creates software for Fortune 500 brands and startups across 17+ industries.We focus on application development, AI/ML, and a people-first culture with multiple Best Place to Work awards.As a Python/React Engineer, you'll develop and maintain applications that...