Site Reliability Engineer

4 semanas atrás


São Paulo, São Paulo, Brasil Willis Towers Watson Tempo inteiro
Description

Summary :

We're looking for an experienced Platform/Infrastructure Engineer with a strong Microsoft Azure background and deep knowledge of Kubernetes. You'll play a key role in designing, deploying, and maintaining infrastructure and services that power our products. This role requires hands-on experience with automation, modern IaC practices, CI/CD, and maintaining production-grade environments.

The Role:

  • Operate, monitor, and improve cloud infrastructure for high-availability services in Azure
  • Deploy, configure and manage Kubernetes workloads at scale, including the use of Helm, ArgoCD, Flux, or similar GitOps tools
  • Build and maintain CI/CD pipelines using Azure DevOps or similar tooling
  • Write and maintain Infrastructure as Code using Terraform or OpenTofu
  • Develop scripts and automation to support infrastructure and deployment workflows - PowerShell is preferred
  • Collaborate with engineering teams to support platform reliability and enable delivery
  • Maintain visibility and awareness through monitoring and logging tools such as Datadog, Azure Monitor, App Insights etc.
  • Support incident resolution and participate in an on-call rota to help maintain service uptime
Qualifications

The Requirements:

Essential Experience:

  • Proven experience in a Platform, Infrastructure, or DevOps engineering role
  • Hands-on experience operating 24x7 services in a public cloud, ideally Azure
  • Strong experience managing infrastructure using Terraform or OpenTofu
  • Experience managing and scaling Kubernetes clusters in production environments
  • Proficient with CI/CD tooling, preferably Azure DevOps (YAML pipelines)
  • Strong scripting skills using PowerShell
  • Experience with monitoring and logging solutions such as Azure Monitor, App Insights, or similar
  • Clear communicator with the ability to collaborate across cross-functional teams

Nice to Have:

  • Azure certifications (e.g. Azure Administrator, Azure DevOps Engineer)
  • Experience with GitOps and tools such as ArgoCD or Flux
  • Familiarity with Configuration as Code tools like Ansible or Puppet
  • Exposure to large-scale distributed systems or high-volume web APIs
  • Awareness of incident response processes and platform reliability best practices

Equal Opportunity Employer

At WTW, we believe difference makes us stronger. We want our workforce to reflect the different and varied markets we operate in and to build a culture of inclusivity that makes colleagues feel welcome, valued and empowered to bring their whole selves to work every day. We are an equal opportunity employer committed to fostering an inclusive work environment throughout our organisation. We embrace all types of diversity.

At WTW, we trust you to know your work and the people, tools and environment you need to be successful. The majority of our colleagues work in a "hybrid" style, with a mix of remote, in-person and in-office interactions dependent on the needs of the team, role and clients. Our flexibility is rooted in trust and "hybrid" is not a one-size-fits-all solution.

#J-18808-Ljbffr
  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, São Paulo, Brasil buscojobs Brasil Tempo inteiro

    Overview About the Role We are looking for a Senior Site Reliability Engineer (SRE) to join a mission-critical project for one of our U.S.-based clients. This role focuses on maintaining platform reliability and implementing proactive solutions to minimize system downtimes and performance bottlenecks. Responsibilities Design and maintain scalable,...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, São Paulo, Brasil buscojobs Brasil Tempo inteiro

    OverviewAbout the RoleWe are looking for a Senior Site Reliability Engineer (SRE) to join a mission-critical project for one of our U.S.-based clients. This role focuses on maintaining platform reliability and implementing proactive solutions to minimize system downtimes and performance bottlenecks.ResponsibilitiesDesign and maintain scalable,...

  • Senior Site Reliability

    4 semanas atrás


    São Paulo, São Paulo, Brasil Canonical Tempo inteiro

    Senior Site Reliability / Gitops Engineer Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer 1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features....


  • São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiro

    At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...


  • São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiro

    At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work. Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, São Paulo, Brasil AgileEngine Tempo inteiro

    Site Reliability Engineer (Middle) ID38916 Join to apply for the Site Reliability Engineer (Middle) ID38916 role at AgileEngine. AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and startups across 17+ industries. We rank among the leaders in application development and AI/ML, and our people-first culture has...


  • São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiro

    Overview We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. We will need you to approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation, and reliability. Responsibilities Build, automate, and...


  • São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiro

    Overview We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. We will need you to approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation, and reliability. Responsibilities Build, automate, and...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Estado de São Paulo, Brasil Appoena Tempo inteiro

    Estamos contratando: Site Reliability Engineer [Especialista] Local: São Paulo, SP (modelo híbrido – possibilidade de home office parcial) Empresa: Appoena – Consultoria especializada em Observabilidade e Parceira Premier da DatadogDescrição da Vaga: Buscamos um(a) Site Reliability Engineer (SRE) [Especialista] para atuar garantindo a...


  • São Paulo, São Paulo, Brasil Nearsure Tempo inteiro

    Staff Site Reliability Engineer - Work from home Staff Site Reliability Engineer - Work from home 1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say...