
Site Reliability Engineer
Há 2 dias
Your role and responsabilities:
- Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution.
- Performing deep-dive application troubleshooting and identifying preventive actions.
- Managing CIRS-related requests including deployments, feature toggles, and data fixes.
- Following up on major production incidents and coordinating with cross-functional teams.
- Enhancing monitoring capabilities using tools like Dynatrace, Kibana, and Splunk.
- Writing and improving monitoring scripts and alerts based on incident learnings.
- Handling customer escalations and coordinating with Support & Engineering teams.
- Supporting planned activities and responding to ad-hoc requests from CES teams.
Requirements and Qualifications:
- Deep experience in DevOps and Production Support.
- Experience in automation and CI/CD practices.
- Familiarity with cloud platforms (GCP, AWS, or Azure preferred).
- Hands-on experience with monitoring tools such as Dynatrace, Kibana, Splunk.
- Strong troubleshooting skills and ability to deep dive into application issues.
- Excellent communication and coordination skills across teams.
Please submit resumé in English.
-
Senior Site Reliability Engineer
2 semanas atrás
Brazil/Remote Articul8 Tempo inteiro US$90.000 - US$120.000 por anoAbout Us Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our platform empowers organizations to leverage the power of artificial intelligence in a reliable, scalable, and secure environment. Position Overview We are seeking an experienced Site Reliability Engineer...
-
Site Reliability Engineer
Há 2 dias
Brazil Gauge Tempo inteiroSomos uma empresa do Grupo Stefanini. Especializados em marketing digital, utilizamos uma abordagem integrada que combina tecnologia, inteligência de dados, design e profundo conhecimento do comportamento do consumidor. Nosso foco está em potencializar os resultados de nossos parceiros, oferecendo soluções que vão desde consultoria estratégica até a...
-
Site Reliability Engineer
Há 2 dias
Brazil, BR HCLTech Tempo inteiroYour role and responsabilities:Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution.Performing deep-dive application troubleshooting and identifying preventive actions.Managing CIRS-related requests including deployments, feature toggles, and data fixes.Following up on major production incidents...
-
Chief AWS Site Reliability Engineer
2 semanas atrás
Buenos Aires, Espírito Santo, Brazil EPAM Systems Tempo inteiroOverviewEPAM Systems is looking for a Chief AWS SRE Engineer who fully understands and practices SRE activities and philosophy to join the global engineering team that ensures fleet services reliability and availability under the SRE model.If you're passionate about innovation, we invite you to apply and become part of our teamResponsibilitiesCollaborate...
-
BSAtech | Recife
2 semanas atrás
Manaus, Pernambuco, Brazil BSATech Tempo inteiroA BSAtech é uma empresa especializada no desenvolvimento de jogos de entretenimento com alcance global. Nosso compromisso é entregar experiências digitais de alta qualidade, combinando inovação, criatividade e tecnologia.Estamos em um momento de expansão e buscamos profissionais excepcionais para nos ajudar a ampliar nossas áreas de negócio e...
-
Senior DevOps Engineer Latam
2 semanas atrás
Remote, São Paulo, Brazil Wizdaa Tempo inteiro US$90.000 - US$120.000 por anoLevel: Senior (5+ years) | Department: Foundation/Platform Engineering Role Overview Lead development of internal Kubernetes platform enabling scalable application deployment through GitOps. Engineer solutions for deployment complexity, database migrations, multi-environment management, and developer productivity. Drive DevOps practices including CI/CD...
-
Site Reliability Engineer
Há 2 dias
Federative Republic Of Brazil HCLTech Tempo inteiroYour role and responsabilities:Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution. Performing deep-dive application troubleshooting and identifying preventive actions. Managing CIRS-related requests including deployments, feature toggles, and data fixes. Following up on major production...
-
DevOps Engineer
Há 2 dias
Brazil Flowmentum, Inc. Tempo inteiroWe’re Flowmentum and our clients are fast-moving teams building reliable, scalable, and secure infrastructure for companies shaping the future of AI, fintech, cloud services, and beyond.Our engineers work on high-traffic, mission-critical systems that power millions of users across the globe.We believe in autonomy, ownership, and solving hard problems —...
-
DevOps Engineer
Há 2 dias
Brazil, BR Flowmentum, Inc. Tempo inteiroWe’re Flowmentum and our clients are fast-moving teams building reliable, scalable, and secure infrastructure for companies shaping the future of AI, fintech, cloud services, and beyond.Our engineers work on high-traffic, mission-critical systems that power millions of users across the globe.We believe in autonomy, ownership, and solving hard problems —...
-
DevOps Engineer
Há 2 dias
Brazil Flowmentum, Inc. Tempo inteiroWe’re Flowmentum and our clients are fast-moving teams building reliable, scalable, and secure infrastructure for companies shaping the future of AI, fintech, cloud services, and beyond. Our engineers work on high-traffic, mission-critical systems that power millions of users across the globe. We believe in autonomy, ownership, and solving hard problems...