Site reliability engineer
1 semana atrás
Role Summary The SRE Technical Member will: Deliver engineering, operational, and administrative support for the application and its technology landscape. Address reliability and operational challenges such as application failures, production issues, infrastructure performance (disk, memory), monitoring, and security. Serve as a mid-level subject matter expert, integrating with multiple teams to develop and evolve SRE practices for Azure-based environments. Participate in production support activities, including deployments, upgrades, and critical issue resolution. This role is central to designing, implementing, and maintaining monitoring, alerting, and reporting solutions across servers, containers, databases, and cloud infrastructure components. Key Responsibilities Collaborate with Central SRE, Dev Ops, and Info Sec teams on new projects, platform builds, and deployments. Contribute to the design, implementation, and operation of large-scale, Azure-based platforms. Apply industry best practices in monitoring, alerting, reporting, and cloud architecture. Participate in infrastructure, application, and security planning , focusing on scalability, redundancy, and data preservation. Support high-availability topologies with development teams. Produce documentation and weekly operational status reports , detailing project progress and key metrics. Provide engineering and support for technical infrastructure, cloud, databases, and application performance. Manage incident response, change management, and user permissions following SRE best practices (Google SRE model). Maintain close collaboration between Application, Central SRE, Dev Ops, Info Sec, and business units. Assist in configuring and onboarding new applications into the Azure Dev Ops (ADO) platform. Core Technical Skills Strong understanding of SRE fundamentals : monitoring, alerting, reporting, performance, availability, and incident response. Hands-on experience with CI/CD tools (Git, Azure Pipelines, Ansible, etc.). Infrastructure as Code (Ia C) design, scripting, and setup. Deep knowledge of Azure Web Services — installation, configuration, and management. Experience administering Microsoft applications (. NET, C#, Angular) with focus on automation, optimization, and security. Proficiency in Cosmos DB and MS SQL operational tasks. Excellent troubleshooting, root-cause analysis , and problem-solving skills. Experience with disaster recovery, scalability testing, and capacity planning . Qualifications Bachelor's degree in a technical discipline (Computer Science, Engineering, or related field). 5+ years of industry experience in SRE, Dev Ops, or related technical operations roles. Proven experience in cloud infrastructure , automation , and application reliability engineering within large-scale, enterprise environments.
-
Site Reliability Engineer
Há 7 dias
São Paulo, Brasil Mouts TI Tempo inteiroNa Mouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a) SRE (Site Reliability Engineer) para atuar presencialmente, com foco em infraestrutura, automação e observabilidade em ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções de observabilidade
-
Site Reliability Engineer
Há 5 dias
São Paulo, Brasil PayRetailers Tempo inteiroSite Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...
-
Site Reliability Engineer
Há 5 dias
São Paulo, Brasil PayRetailers Tempo inteiroSite Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil PayRetailers Tempo inteiroJob Overview We’re PayRetailers, and we offer cutting‑edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee’s contribution is valued. We’ve got big plans to expand into new markets and make a meaningful impact on the...
-
Senior Site Reliability Engineer
2 semanas atrás
São Paulo, Brasil K2 Solutions Tempo inteiroTrabalho híbrido na região de Pinheiros/ SP - 3x por semana no escritórioEstamos selecionando um Senior Site Reliability Engineer - SRE para se juntar ao nosso time e desempenhar um papel essencial na manutenção, automação e melhoria da confiabilidade dos sistemas que impulsionam a rede logística da empresa em múltiplas regiões. Essa pessoa...
-
Site Reliability Engineer
Há 4 dias
são paulo, Brasil Mouts TI Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades: Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Mouts TI Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades: Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Mouts Ti Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Mouts Ti Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções deobservabilidade(Datadog,...
-
Site reliability engineer
Há 7 dias
São Paulo, Brasil Mouts TI Tempo inteiroNaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções deobservabilidade(Datadog,...