Site Reliability Engineer

Há 14 horas


Três Lagoas, Brasil Review ALL Tempo inteiro

About the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide.
They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric single-tenant cloud infrastructure on the market. If you share this passion, this role offers the opportunity to help shape the future of internet-scale infrastructure.
This position is being managed in partnership with an external recruitment consultancy supporting the company throughout the hiring process.

Summary
The Reliability team is responsible for the health and resilience of the infrastructure powering a global bare metal cloud platform. As a Senior Site Reliability Engineer (SRE) , you'll focus on building reliable, observable, and self-healing systems at scale.
SREs here operate at the intersection of software engineering and infrastructure — designing tools that automate operations, improve incident response, and enhance observability, ensuring the platform delivers high performance and reliability to customers worldwide.
This role is ideal for engineers passionate about reliability, automation, distributed systems, and bringing cloud-like experiences to bare metal environments.

Key Responsibilities
Continuously improve platform reliability and performance.
Design, build, and maintain tools to automate operational workflows and incident response.
Implement and enhance observability systems (monitoring, alerting, tracing).
Collaborate with engineering and platform teams to design scalable and resilient systems.
Participate in on-call rotations and lead post-incident reviews with a learning-focused approach.
Develop and document operational playbooks and processes.
Contribute to defining SLOs/SLIs and driving reliability metrics across teams.

Skills & Qualifications

Required:
Fluent verbal and written English communication skills
Advanced experience with Linux/Unix in production environments
Hands-on experience with Kubernetes and container orchestration
Proficiency with IaC tools (e.g., Terraform, Ansible)
Experience with observability stacks (Prometheus, Grafana, Loki, ELK, etc.)
Proficiency with scripting/programming languages such as Bash, Python, Go, or Ruby
Working knowledge of Git and CI/CD pipelines
Experience with incident response and root cause analysis
Knowledge of cloud-native reliability and security best practices

What’s Offered
Contractor engagement (PJ)
Paid Time Off
Competitive compensation package
Wellness benefit (Wellhub / Gympass equivalent)
Annual performance-based bonus
Flexible working hours
Opportunities for technical and career growth


  • Sr. Data Engineer

    Há 2 dias


    Sete Lagoas, Brasil Xml International Tempo inteiro

    Sr Data Engineer (São Paulo, Brazil)Become part of a community of experts in Information Digital Engineering (IDE), turning ideas into reality by embedding AI across the organization. From enhancing existing operations to accelerating the development of next-generation clean energy solutions, your impact will be transformative.The IDE team creates business...


  • Três Lagoas, Brasil Genpact Tempo inteiro

    Ready to build the future with AI? At Genpact, we don’t just keep up with technology—we set the pace. AI and digital innovation are redefining industries, and we’re leading the charge. Genpact’s AI Gigafactory, our industry-first accelerator, is an example of how we’re scaling advanced technology solutions to help global enterprises work smarter,...


  • Sete Lagoas, Brasil Applaudo Tempo inteiro

    About YouYou are a back-end engineer passionate about building scalable, reliable, and maintainable applications. You thrive on writing clean code, applying modern architecture patterns, and leveraging cloud-based solutions. You enjoy solving complex challenges, automating infrastructure, and ensuring that systems are secure, performant, and ready to scale....

  • Safety Risk

    Há 7 dias


    Três Rios, Rio de Janeiro, Brasil GSK Tempo inteiro R$60.000 - R$120.000 por ano

    Site Name: Warsaw Rzymowskiego 53, Bangalore, Terra CampusPosted Date: Oct **Location: Costa Rica, India, Poland or As the role is truly global in nature, we reserve the option to modify the site location to align to the location of the successful candidate**Business Introduction:  GSK is focused on ambitious growth, aiming for £40 billion in annual sales...

  • Safety Risk

    Há 7 dias


    Três Rios, Rio de Janeiro, Brasil GSK Tempo inteiro R$80.000 - R$120.000 por ano

    Nazwa biura: Warsaw Rzymowskiego 53, Bangalore, Terra CampusPosted Date: Oct **Location: Costa Rica, India, Poland or As the role is truly global in nature, we reserve the option to modify the site location to align to the location of the successful candidate**Business Introduction:  GSK is focused on ambitious growth, aiming for £40 billion in annual...