Site Reliability Engineer
Há 13 horas
About the CompanyThis company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide.They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric single-tenant cloud infrastructure on the market. If you share this passion, this role offers the opportunity to help shape the future of internet-scale infrastructure.This position is being managed in partnership with an external recruitment consultancy supporting the company throughout the hiring process.SummaryThe Reliability team is responsible for the health and resilience of the infrastructure powering a global bare metal cloud platform. As a Senior Site Reliability Engineer (SRE), you'll focus on building reliable, observable, and self-healing systems at scale.SREs here operate at the intersection of software engineering and infrastructure — designing tools that automate operations, improve incident response, and enhance observability, ensuring the platform delivers high performance and reliability to customers worldwide.This role is ideal for engineers passionate about reliability, automation, distributed systems, and bringing cloud-like experiences to bare metal environments.Key ResponsibilitiesContinuously improve platform reliability and performance.Design, build, and maintain tools to automate operational workflows and incident response.Implement and enhance observability systems (monitoring, alerting, tracing).Collaborate with engineering and platform teams to design scalable and resilient systems.Participate in on-call rotations and lead post-incident reviews with a learning-focused approach.Develop and document operational playbooks and processes.Contribute to defining SLOs/SLIs and driving reliability metrics across teams.Skills & QualificationsRequired:Fluent verbal and written English communication skillsAdvanced experience with Linux/Unix in production environmentsHands-on experience with Kubernetes and container orchestrationProficiency with IaC tools (e.g., Terraform, Ansible)Experience with observability stacks (Prometheus, Grafana, Loki, ELK, etc.)Proficiency with scripting/programming languages such as Bash, Python, Go, or RubyWorking knowledge of Git and CI/CD pipelinesExperience with incident response and root cause analysisKnowledge of cloud-native reliability and security best practicesWhat’s OfferedContractor engagement (PJ)Paid Time OffCompetitive compensation packageWellness benefit (Wellhub / Gympass equivalent)Annual performance-based bonusFlexible working hoursOpportunities for technical and career growth
-
Senior Data Platform Engineer
Há 13 horas
Brasil, BR Elios Talent Tempo inteiroData Platform Engineer – SeniorKey Highlights️ Lead the design and expansion of large-scale data platform components supporting analytics, experimentation, and machine learning workloads⚡ Architect and optimize ETL, streaming, metadata, and federated querying systems at scale Drive performance tuning, observability, and reliability efforts across...
-
Intermediate Data Platform Engineer
Há 13 horas
Brasil, BR Elios Talent Tempo inteiroData Platform Engineer – IntermediateKey Highlights️ Build and expand core data platform components powering analytics, experimentation, and algorithm development⚡ Develop scalable ETL, streaming, and metadata systems using Spark, Kafka, and modern lakehouse technologies Support high-volume data transformation, federated querying, and performance...
-
Front End AI Engineer
Há 13 horas
Brasil, BR Dry Ground AI Tempo inteiroPosition Overview:We are seeking a Front End AI Engineer with deep expertise in building high-quality, production-ready interfaces for AI-powered applications. This role focuses on leading the front-end architecture for advanced AI systems, including conversational interfaces, agent control panels, automation dashboards, and embedded AI workflows for client...
-
Data Engineer GCP
Há 13 horas
Brasil, BR Tata Consultancy Services Tempo inteiroCome to one of the biggest IT Services companies in the world!! Here you can transform your career!Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a culture of unlimited learning full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing to...
-
Quality Assurance Automation Engineer
Há 13 horas
Brasil, BR Tata Consultancy Services Tempo inteiroCome to one of the biggest IT Services companies in the world!! Here you can transform your career!Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a culture of unlimited learning full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing to...
-
Site Reliability Engineer
Há 4 horas
Brasil Review ALL Tempo inteiroAbout the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...
-
Site Reliability Engineer
Há 3 horas
Brasil Review ALL Tempo inteiroAbout the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...
-
Site Reliability Engineer
Há 12 horas
Brasil MetaCTO Tempo inteiroAbout Us At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE) , you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers...
-
Site reliability engineer sr
2 semanas atrás
Brasil Mercado Eletrônico Tempo inteiroO Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2 B. Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, governança e colaboração. Com escritórios no Brasil, Estados Unidos, México e Portugal, contabiliza mais de 1 milhão de fornecedores, 10 mil...
-
Software Engineer Site Reliability Engineer
Há 8 horas
Brasil Scubyt Tempo inteiroSoftware Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our and services. We are a team of software engineers focused on improving availability, latency, performance,...
-
Software Engineer Site Reliability Engineer
Há 4 horas
Brasil Scubyt Tempo inteiroSoftware Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our and services. We are a team of software engineers focused on improving availability, latency, performance,...
-
Software Engineer Site Reliability Engineer
Há 3 horas
Brasil Scubyt Tempo inteiroSoftware Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our and services. We are a team of software engineers focused on improving availability, latency,...
-
Staff Devops Site Reliability Engineer
2 semanas atrás
Brasil Housecall Pro Tempo inteiro US$7.500 - US$15.000 por anoTO BE CONSIDERED FOR THIS ROLE, PLEASE SUBMIT AN UPDATED RESUME TRANSLATED TO ENGLISHWhy Housecall Pro?Help us build solutions that build better lives. At Housecall Pro, we show up to work every day to make a difference for real people: the home service professionals that support America's 100 million homes. We're all about the Pro, and dedicate our days to...
-
Senior Site Reliability Engineer
2 semanas atrás
Vitória Brasil Mercado Eletrônico Tempo inteiroO Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2 B. Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, governança e colaboração.Com escritórios no Brasil, Estados Unidos, México e Portugal, contabiliza mais de 1 milhão de fornecedores, 10 mil...
-
Site reliability engineer
Há 7 dias
Brasil Softensity Inc Tempo inteiroRole Summary The SRE Technical Member will: Deliver engineering, operational, and administrative support for the application and its technology landscape. Address reliability and operational challenges such as application failures, production issues, infrastructure performance (disk, memory), monitoring, and security. Serve as a mid-level subject matter...