Site Reliability Engineer

2 days ago

Brazil HCLTech Full-time

Your role and responsibilities:

Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution.
Performing deep-dive application troubleshooting and identifying preventive actions.
Managing CIRS-related requests including deployments, feature toggles, and data fixes.
Following up on major production incidents and coordinating with cross-functional teams.
Enhancing monitoring capabilities using tools like Dynatrace, Kibana, and Splunk.
Writing and improving monitoring scripts and alerts based on incident learnings.
Handling customer escalations and coordinating with Support & Engineering teams.
Supporting planned activities and responding to ad-hoc requests from CES teams.

Requirements and Qualifications:

Deep experience in DevOps and Production Support.
Experience in automation and CI/CD practices.
Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
Hands-on experience with monitoring tools such as Dynatrace, Kibana, and Splunk.
Strong troubleshooting skills and ability to deep dive into application issues.
Excellent communication and coordination skills across teams.

Please submit your résumé in English.

Senior Site Reliability Engineer

2 weeks ago

Brazil/Remote Articul8 Full-time US$90,000 - US$120,000 per year

About Us: Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our platform empowers organizations to leverage the power of artificial intelligence in a reliable, scalable, and secure environment. Position Overview: We are seeking an experienced Site Reliability Engineer...
Site Reliability Engineer

2 days ago

Brazil Gauge Full-time

We are a Stefanini Group company. Specializing in digital marketing, we use an integrated approach that combines technology, data intelligence, design, and deep knowledge of consumer behavior. Our focus is on boosting our partners' results, offering solutions that range from strategic consulting to...
Chief AWS Site Reliability Engineer

2 weeks ago

Buenos Aires, Espírito Santo, Brazil EPAM Systems Full-time

Overview: EPAM Systems is looking for a Chief AWS SRE Engineer who fully understands and practices SRE activities and philosophy to join the global engineering team that ensures fleet services reliability and availability under the SRE model. If you're passionate about innovation, we invite you to apply and become part of our team. Responsibilities: Collaborate...
BSAtech | Recife

2 weeks ago

Manaus, Pernambuco, Brazil BSATech Full-time

BSAtech is a company specializing in the development of entertainment games with global reach. Our commitment is to deliver high-quality digital experiences, combining innovation, creativity, and technology. We are in a period of expansion and are looking for exceptional professionals to help us grow our business areas and...
Senior DevOps Engineer Latam

2 weeks ago

Remote, São Paulo, Brazil Wizdaa Full-time US$90,000 - US$120,000 per year

Level: Senior (5+ years) | Department: Foundation/Platform Engineering. Role Overview: Lead development of internal Kubernetes platform enabling scalable application deployment through GitOps. Engineer solutions for deployment complexity, database migrations, multi-environment management, and developer productivity. Drive DevOps practices including CI/CD...
DevOps Engineer

2 days ago

Brazil Flowmentum, Inc. Full-time

We’re Flowmentum and our clients are fast-moving teams building reliable, scalable, and secure infrastructure for companies shaping the future of AI, fintech, cloud services, and beyond. Our engineers work on high-traffic, mission-critical systems that power millions of users across the globe. We believe in autonomy, ownership, and solving hard problems...
