Site Reliability Engineer
4 weeks ago
About the Company
This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric single-tenant cloud infrastructure on the market. If you share this passion, this role offers the opportunity to help shape the future of internet-scale infrastructure.
This position is being managed in partnership with an external recruitment consultancy supporting the company throughout the hiring process.

Summary
The Reliability team is responsible for the health and resilience of the infrastructure powering a global bare metal cloud platform. As a Senior Site Reliability Engineer (SRE), you'll focus on building reliable, observable, and self-healing systems at scale.
SREs here operate at the intersection of software engineering and infrastructure, designing tools that automate operations, improve incident response, and enhance observability, ensuring the platform delivers high performance and reliability to customers worldwide.
This role is ideal for engineers passionate about reliability, automation, distributed systems, and bringing cloud-like experiences to bare metal environments.

Key Responsibilities
Continuously improve platform reliability and performance.
Design, build, and maintain tools to automate operational workflows and incident response.
Implement and enhance observability systems (monitoring, alerting, tracing).
Collaborate with engineering and platform teams to design scalable and resilient systems.
Participate in on-call rotations and lead post-incident reviews with a learning-focused approach.
Develop and document operational playbooks and processes.
Contribute to defining SLOs/SLIs and driving reliability metrics across teams.

Skills & Qualifications
Required:
Fluent verbal and written English communication skills
Advanced experience with Linux/Unix in production environments
Hands-on experience with Kubernetes and container orchestration
Proficiency with IaC tools (e.g., Terraform, Ansible)
Experience with observability stacks (Prometheus, Grafana, Loki, ELK, etc.)
Proficiency with scripting/programming languages such as Bash, Python, Go, or Ruby
Working knowledge of Git and CI/CD pipelines
Experience with incident response and root cause analysis
Knowledge of cloud-native reliability and security best practices

What’s Offered
Contractor engagement (PJ)
Paid Time Off
Competitive compensation package
Wellness benefit (Wellhub / Gympass equivalent)
Annual performance-based bonus
Flexible working hours
Opportunities for technical and career growth
-
Site Reliability Engineer
3 weeks ago
Rio de Janeiro, Brazil | BairesDev | Full-time
Overview: Site Reliability Engineer at BairesDev. We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. You will approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation and reliability. What You...
-
Site Reliability Engineer - Remote Work
3 weeks ago
Rio de Janeiro, Brazil | BairesDev | Full-time
At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works...
-
Senior Site Reliability / GitOps Engineer
3 weeks ago
Rio de Janeiro, Brazil | Canonical | Full-time
Canonical is a leading provider...
-
Site Reliability Engineer - Remote Work
4 weeks ago
Rio de Janeiro, Brazil | BairesDev | Full-time
At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely...
-
Site Reliability Engineer
3 weeks ago
Rio de Janeiro, Brazil | Canonical | Full-time
Overview: Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our...
-
Site Reliability Engineer ID45689
3 weeks ago
Rio de Janeiro, Brazil | AgileEngine | Full-time
WHY JOIN US
If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!
ABOUT THE ROLE
As a Site Reliability Engineer (SRE), you’ll shape the foundation of secure, reliable, and scalable cloud-native systems that power critical...
-
Staff Site Reliability Engineer - Work from home
3 weeks ago
Rio de Janeiro, Brazil | Nearsure | Full-time
Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your teammates and management. Say...
-
Site Reliability Engineer ID45689
2 days ago
Rio de Janeiro, Brazil | AgileEngine | Full-time
Job Description
AgileEngine is an Inc. **** company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.
WHY JOIN US
If you're looking for a place to...