Senior Reliability Engineer

Há 20 horas


Belo Horizonte, Minas Gerais, Brasil beBeeReliability Tempo inteiro US$160.000 - US$190.000
Site Reliability Engineer Lead

We are seeking an experienced SRE / DevOps / Platform Engineer to lead our site reliability engineering efforts. The ideal candidate will have a strong track record of designing, automating, and operating hybrid/on-prem environments for low-latency deployments.

The Opportunity

This role involves owning end-to-end reliability, architecting, deploying, and monitoring production clusters running micro-services, LLM workloads, and GPU back-ends. The candidate will automate the stack by building infrastructure as code pipelines (Terraform), GitOps workflows, and zero-downtime rollout strategies. They will also observe and respond to issues by instrumenting apps with Prometheus/Grafana, setting service level objectives/service level indicators, leading incident response, performing root-cause analysis, and hardening runbooks.

Key Responsibilities
  • Design, automate, and operate hybrid/on-prem environments for low-latency deployments
  • Own end-to-end reliability - architect, deploy, and monitor production clusters
  • Automate the stack - build IaC pipelines (Terraform), GitOps workflows, and zero-downtime rollout strategies
  • Observe & respond - instrument apps with Prometheus/Grafana, set SLOs/SLIs, lead incident response, perform root-cause analysis, and harden runbooks
Requirements
  • 5+ years building and operating production systems as an SRE / DevOps / Platform Engineer
  • Hands-on expertise with Kubernetes and Docker in hybrid or bare-metal setups
  • Strong Python skills for automation tooling; proficiency reading TypeScript services
  • Deep Linux administration knowledge (kernel tuning, networking, storage, security hardening)
  • Proven track record delivering 99.9 %+ uptime for latency-sensitive services
  • Observability stack experience (Prometheus, Grafana, Loki / ELK, Alertmanager)
  • Proficiency with Terraform (or equivalent IaC) and Git-based workflows
Culture Fit
  • Challenge status quo
  • Strong opinions, loosely held
  • Ship fast, ship quality
  • Proud of our craft

  • Senior Site Reliability

    3 semanas atrás


    Belo Horizonte, Minas Gerais, Brasil Canonical Tempo inteiro

    Senior Site Reliability / Gitops EngineerJoin or sign in to find your next jobJoin to apply for the Senior Site Reliability / Gitops Engineer role at CanonicalSenior Site Reliability / Gitops Engineer3 days ago Be among the first 25 applicantsJoin to apply for the Senior Site Reliability / Gitops Engineer role at CanonicalCanonical is a leading provider of...


  • Belo Horizonte, Minas Gerais, Brasil AgileEngine Tempo inteiro

    Site Reliability Engineer (Middle) ID38916 Join to apply for the Site Reliability Engineer (Middle) ID38916 role at AgileEngine Site Reliability Engineer (Middle) ID38916 3 weeks ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Middle) ID38916 role at AgileEngine Get AI-powered advice on this job and more exclusive...


  • Belo Horizonte, Minas Gerais, Brasil beBeeSiteReliabilityEngineer Tempo inteiro US$200.000 - US$250.000

    Job Title: Site Reliability EngineerWe are seeking a skilled and experienced Site Reliability Engineer to join our team.This is a challenging and rewarding role that requires strong technical skills, excellent communication skills, and a passion for delivering high-quality results.The successful candidate will be responsible for designing, building, and...

  • Site Reliability

    3 semanas atrás


    Belo Horizonte, Minas Gerais, Brasil Canonical Tempo inteiro

    Join or sign in to find your next jobJoin to apply for the Site Reliability / Gitops Engineer role at Canonical3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability / Gitops Engineer role at CanonicalGet AI-powered advice on this job and more exclusive features.Canonical is a leading provider of open source software and operating...

  • Site Reliability

    2 semanas atrás


    Belo Horizonte, Minas Gerais, Brasil Canonical Tempo inteiro

    Join or sign in to find your next job Join to apply for the Site Reliability / Gitops Engineer role at Canonical 3 days ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features. Canonical is a leading provider of open source software and...

  • DevOps Engineer

    2 semanas atrás


    Belo Horizonte, Minas Gerais, Brasil AgileEngine Tempo inteiro

    Join to apply for the DevOps Engineer (Middle/Senior) ID39763 role at AgileEngine 3 days ago Be among the first 25 applicants Join to apply for the DevOps Engineer (Middle/Senior) ID39763 role at AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We...

  • DevOps Engineer

    3 semanas atrás


    Belo Horizonte, Minas Gerais, Brasil AgileEngine Tempo inteiro

    Join to apply for the DevOps Engineer (Middle/Senior) ID39763 role at AgileEngine3 days ago Be among the first 25 applicantsJoin to apply for the DevOps Engineer (Middle/Senior) ID39763 role at AgileEngineAgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank...

  • DevOps Engineer

    Há 5 dias


    Belo Horizonte, Minas Gerais, Brasil AgileEngine Tempo inteiro

    Join to apply for the DevOps Engineer (Middle/Senior) ID39763 role at AgileEngine 3 days ago Be among the first 25 applicants Join to apply for the DevOps Engineer (Middle/Senior) ID39763 role at AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We...

  • React Native Engineer

    1 dia atrás


    Belo Horizonte, Minas Gerais, Brasil AgileEngine Tempo inteiro

    Join to apply for the React Native Engineer (Lead) ID36430 role at AgileEngine 3 days ago Be among the first 25 applicants Join to apply for the React Native Engineer (Lead) ID36430 role at AgileEngine AgileEngine is one of the Inc. 5000 fastest-growing companies in the US and a top-3 ranked dev shop according to Clutch. We create award-winning custom...

  • React Native Engineer

    2 semanas atrás


    Belo Horizonte, Minas Gerais, Brasil AgileEngine Tempo inteiro

    Join to apply for the React Native Engineer (Lead) ID36430 role at AgileEngine3 days ago Be among the first 25 applicantsJoin to apply for the React Native Engineer (Lead) ID36430 role at AgileEngineAgileEngine is one of the Inc. 5000 fastest-growing companies in the US and a top-3 ranked dev shop according to Clutch. We create award-winning custom software...