
Site Reliability Engineer
1 semana atrás
Come join us at Odisea and work with some of the most exciting start-ups in the US
Role
:
Are you a seasoned Site Reliability Engineer looking to make a real impact? We're seeking a high-caliber technical expert to join our team and help shape the future of our infrastructure and product delivery.
This role is ideal for someone who thrives on ownership, values deep technical challenges, and is eager to lead initiatives that drive meaningful improvements across the stack. You'll contribute to architecture decisions, mentor peers, and take the lead on projects that are critical to system reliability and scalability.
As a Site Reliability Engineer, you'll be part of a collaborative team responsible for building, automating, and maintaining our multi-region infrastructure. You'll work hands-on with the Observability stack, Kubernetes, Infrastructure-as-Code, CI/CD systems and GitOps workflows. Your work will directly support infrastructure availability, performance, security and efficiency across the organization.
Responsibilities:
Manage and maintain the observability stack across all environments using tools such as
OpenTelemetry, DataDog, Prometheus, Grafana, and others
to ensure system visibility and performance.
Develop and manage Infrastructure as Code (IaC)
using Terraform, OpenTofu, Terragrunt, Atlantis, Spacelift, and related tools to provision and maintain cloud infrastructure.
Contribute to implementation and improvement of
SRE practices
, such as SLOs, Error Budgets, PRRs, Problem Management
Administer and support
CI/CD pipelines
, including TeamCity and GitHub Actions, ensuring reliable and efficient software delivery.
Own and resolve Jira tickets related to infrastructure projects, support requests, and ongoing operational tasks.
Create and maintain operational documentation, including runbooks, playbooks, architecture diagrams, and SOPs to support knowledge sharing and incident response.
Respond to production incidents, diagnose and triage issues, and follow established escalation protocols and standard operating procedures.
Identify and drive down toil with creative innovation and automation
Participate in a shared on-call rotation, helping ensure the reliability and uptime of critical production systems.
What We Are Looking For:
Proficient with infrastructure monitoring and observability tools
like Datadog and Prometheus
3+ years of hands-on experience in
AWS administration, Site Reliability Engineering, DevOps, or build and release roles
Deep knowledge of the
AWS ecosystem
, including but not limited to: EC2, EKS/ECS, RDS, IAM, KMS, SQS, CloudWatch, Lambda, Config, and Glue
3+ years experience with Infrastructure as
Code (IaC) tools
including Terraform, Atlantis, Spacelift, OpenTofu, and/or Terragrunt
Deep understanding of SRE practices, such as SLOs, Error Budgets, PRRs, Problem Management
A process improvement mindset, especially around DevOps/SRE areas such as deployment workflows, automation, security, and developer productivity
Scripting experience Python, Bash, or Shell
Bonus points:
Kubernetes and GCP experience
Production-level expertise with container orchestration and tooling such as EKS, ECS/Fargate, ArgoCD, Helm, and Istio.
Hands-on experience deploying, configuring, and automating CI/CD pipelines (TeamCity and GitHub Actions).
Multi-region (>3) production support
Prior work in FedRamp and/or SOC 2 environments
Familiarity with GitOps workflows (Argo CD, Akuity, Github Actions)
Familiarity with Security workflows and tooling (Wiz, Orca, AWS SecurityHub.
-
Senior AI Infrastructure Engineer
Há 2 dias
Colômbia, Brasil Solvd Tempo inteiro US$90.000 - US$120.000 por anoSolvd is an AI-first advisory and digital engineering firm delivering measurable business impact through strategic digital transformation. Taking an AI-first approach, we bridge the critical gap between experimentation and real ROI, weaving artificial intelligence into everything we do and helping clients at all stages accelerate AI integration into each...
-
Technical Support Engineer
2 semanas atrás
Colômbia, Brasil Exadel Inc Tempo inteiroTechnical Support Engineer (with DevOps Experience) We're seeking a talented Technical Support Engineer (with DevOps Experience) to join our team and work on a long-term project. If you have a background in a support role and the ability to identify and resolve unique problems effectively, take your chance to apply. Why Join Exadel We're an AI-first global...
-
Senior/Lead Full Stack Engineer
Há 3 dias
Colômbia, Brasil Exadel Inc Tempo inteiroSenior/Lead Full Stack Engineer (ReactJS, MongoDB) Join Exadel as a Senior/Lead Full-Stack Software Engineer and build scalable apps that make an impact. You'll explore new tech, work with the latest tools, and thrive in a fast-moving Agile team. Why Join Exadel We're an AI-first global tech company with 25+ years of engineering leadership, 2,000+ team...
-
Senior/Lead Full Stack Engineer
Há 3 dias
Colômbia, Brasil Exadel Inc Tempo inteiroSenior/Lead Full Stack Engineer (ReactJS, MongoDB)Join Exadel as a Senior/Lead Full-Stack Software Engineer and build scalable apps that make an impact. You'll explore new tech, work with the latest tools, and thrive in a fast-moving Agile team.Why Join ExadelWe're an AI-first global tech company with 25+ years of engineering leadership, 2,000+ team members,...
-
Technical Support Engineer
2 semanas atrás
Colômbia, Brasil Exadel open positions Tempo inteiroTechnical Support Engineer (with DevOps Experience) We're seeking a talented Technical Support Engineer (with DevOps Experience) to join our team and work on a long-term project. If you have a background in a support role and the ability to identify and resolve unique problems effectively, take your chance to apply. Why Join Exadel We're an AI-first global...
-
Dbt Developer
Há 4 dias
Colômbia, Brasil Jobgether Tempo inteiroThis position is posted by Jobgether on behalf of Allshore Talent. We are currently looking for a DBT Developer (Snowflake & Sigma) in Latin America.As a DBT Developer, you will play a key role in transforming raw data into actionable insights that drive business decisions. You will design, build, and maintain scalable data transformation pipelines while...
-
Senior Backend Software Developer
Há 3 dias
Colômbia, Brasil beBeeEngineer Tempo inteiro US$90.000 - US$120.000About UsWe are a cutting-edge tech firm dedicated to harnessing the power of AI and innovation.Our mission is to empower pathologists with intelligent, scalable systems that accelerate cancer diagnostics.Job OverviewWe seek an accomplished Full Stack Engineer to spearhead the development of our clinical data systems.This role will involve designing and...
-
Colômbia, Brasil Odisea-Cultura Tempo inteiroCome join us at Odisea and work with some of the most exciting start-ups in the US.Our client is a construction SaaS startup that provides data analytics to construction teams to help them better manage their job sites. Working with top land developers, home builders, contractors, and engineers across the US and Canada. This is a great chance to work with a...
-
Data Quality Assurance Analyst
2 semanas atrás
Colômbia, Brasil First Line Software Tempo inteiroEngineering & Development - | - Senior - Senior - Colombia - Brazil - | - Colombia - Brazil - June 6, 2025 **About the company**: First Line Software works with some of the worlds top businesses and organisations in industries like healthcare, real estate, _data engineering_, warehouse automation, retail digitalisation, mobile app development, and...