
Site Reliability Engineer
1 week ago
About the Team/Role
We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.
The ideal candidate will have related experience and will be a key player in fostering a culture of continuous improvement and collaboration across engineering teams.
SRE is an ongoing journey of continuous improvement, and the core principles apply regardless of the technology's complexity, the customer's needs, or the business context. If you're passionate about building resilient and highly available systems, we encourage you to apply.
How you'll make an impact
As a Site Reliability Engineer, your responsibilities will include:
- Embrace Observability: You'll build and maintain comprehensive monitoring and observability systems by meticulously instrumenting applications, infrastructure, and dependencies. You'll create clear dashboards that provide a direct view of system health, standardizing metrics, logs, and tracing to enable effective correlation and analysis.
- Design for Performance and Resilience: You will design systems with a focus on scalability, redundancy, and fault tolerance. This includes setting clear performance targets (SLIs/SLOs) aligned with business goals and regularly conducting load testing and chaos engineering to find issues proactively.
- Proactive Reliability: You'll help shift our team from a reactive to a proactive mindset by defining explicit Service Level Objectives (SLOs) that reflect user expectations. You'll use error budgets to guide the balance between development and operations, slowing down releases when necessary to maintain reliability.
- Incident Management and Learning: You will treat outages and performance degradations as opportunities to improve resilience. This involves streamlining incident response with clear procedures and conducting blameless postmortems to learn from mistakes.
- Automate Everything (with Caution): You'll automate repetitive and error-prone tasks to minimize toil and free up the team for high-value work. You'll build robust testing and rollback capabilities into automation pipelines, always maintaining careful oversight and human judgment.
- Impact Engineering and Corporate Culture: You'll collaborate with development and product teams to improve system quality and performance. This includes highlighting impacts on quality, bringing focus to customer journey bottlenecks, and helping to prioritize product stories related to defects.
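As a rough illustration of the error-budget idea mentioned under Proactive Reliability, the sketch below computes how much of a budget remains for a request-based SLO and flags when releases might be slowed. The function names, the 99.9% target, and the 25% threshold are hypothetical choices for the example, not part of this role's actual tooling.

```python
def error_budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Return the unspent fraction of the error budget (negative if overspent).

    slo_target is the fraction of requests that must succeed, e.g. 0.999.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        # No traffic observed yet, so nothing has been spent.
        return 1.0
    # The budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


def should_slow_releases(remaining: float, threshold: float = 0.25) -> bool:
    """Hypothetical policy: slow releases once less than `threshold` of the budget is left."""
    return remaining < threshold


if __name__ == "__main__":
    remaining = error_budget_remaining(0.999, 1_000_000, 400)
    print(f"budget remaining: {remaining:.0%}, slow releases: {should_slow_releases(remaining)}")
```

With a 99.9% target over one million requests, roughly 1,000 failures are tolerated, so 400 failures leave about 60% of the budget and releases continue at the normal pace under this example policy.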
Experience you'll bring
- Expertise in software design, development, and testing for software enhancements and new products.
- Knowledge of automated testing tools and traditional quality assurance approaches.
- Experience with cloud development, including designing, developing, and maintaining applications on platforms like Amazon Web Services/EC2.
- Understanding of cloud storage services, including EBS, Amazon S3, and EFS.
- Ability to create documentation for future maintenance and issue resolution.
- Experience with APIs, pre-scripting, post-scripting, and integration testing.
-
Site Reliability Engineer
4 weeks ago
São Paulo, State of São Paulo, Brazil | Appoena | Full-time
We are hiring: Site Reliability Engineer [Specialist]. Location: São Paulo, SP (hybrid model, with the possibility of partial remote work). Company: Appoena, a consultancy specializing in observability and a Datadog Premier Partner. Job description: We are looking for a Site Reliability Engineer (SRE) [Specialist] to work on ensuring the...
-
Site Reliability Engineer
2 days ago
São Paulo, São Paulo, Brazil | Enter | Full-time | R$80,000 - R$160,000 per year
Enter (formerly Talisman AI) was founded in 2023 with the mission of making Brazil a leading player in Artificial Intelligence. We combine human expertise with the efficiency of AI to help large Latin American companies optimize critical, high-volume processes that require intensive manual work. We began our journey by applying AI to...
-
Remote Site Reliability Engineer
3 weeks ago
São Paulo, State of São Paulo, Brazil | INDI Staffing Services | Full-time
At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work. Overview of the role: We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Site Reliability Engineer
2 days ago
São Paulo, São Paulo, Brazil | Thales | Full-time | R$80,000 - R$120,000 per year
Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000...
-
Site Reliability Engineer
1 week ago
São Paulo, São Paulo, Brazil | DELIVER IT | Full-time | R$60,000 - R$120,000 per year
Do you have solid experience in reliability engineering, a strategic mindset, and a collaborative profile, and do you constantly seek to raise the technical level of the teams and systems you work with? Then this opportunity is for you. We are looking for a Senior SRE (Site Reliability Engineer) to join a technical team of...
-
Reliability Solutions, BDM Latin America
1 week ago
São Paulo, São Paulo, Brazil | Emerson Career Site | Full-time | R$90,000 - R$120,000 per year
In this Role, Your Responsibilities Will Be: Enable the delivery of the RS strategic plans through engagement, ensuring the right mix of technical support and application knowledge is available. Ability to manage priorities and focus on what will drive the Latin American business goals, balancing these with the needs of RS Country Leaders, Multi-BU Sales Team...
-
Service Engineer
2 days ago
São Paulo, São Paulo, Brazil | Estun Automation | Full-time | R$60,000 - R$80,000 per year
Job Summary: The Service Engineer is responsible for conducting service interventions, repairing industrial robots, and performing preventive maintenance. This includes diagnosing and resolving technical issues, ensuring the reliable operation of robots, and supporting customers in maintaining optimal performance. Responsibilities: Perform on-site service...
-
Intermediate Site Reliability Engineer
2 weeks ago
São Paulo, São Paulo, Brazil | Dev | Full-time | R$96,000 - R$180,000 per year
Are you in Brazil, Argentina or Colombia? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to you. We invite a proactive and motivated specialist to join our SRE team and provide reliable administrative support to one of the world's most recognized corporations with...
-
Intermediate Site Reliability Engineer
2 weeks ago
São Paulo, São Paulo, Brazil | Dev | Full-time | R$40,000 - R$80,000 per year
Are you in Brazil, Argentina or Colombia? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to you. We invite a proactive and motivated specialist to join our SRE team and provide reliable administrative support to one of the world's most recognized corporations with...
-
Site Reliability Engineer PL
1 week ago
São Paulo, São Paulo, Brazil | BRQ Digital Solutions | Full-time | R$80,000 - R$120,000 per year
Opportunity code: 54328. With more than 32 years in the market, BRQ Digital Solutions has established itself as one of the largest digital transformation companies in the country. With an end-to-end services platform, we offer the most efficient and innovative solutions, technologies, and methodologies, driving a transformation journey for major brands, from...