Site Reliability Engineer
Há 5 dias
About the Company This company operates a global computing platform that enables businesses to programmatically deploy single‑tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer‑centric single‑tenant cloud infrastructure on the market. If you share this passion, this role offers the opportunity to help shape the future of internet‑scale infrastructure. This position is being managed in partnership with an external recruitment consultancy supporting the company throughout the hiring process. Summary The Reliability team is responsible for the health and resilience of the infrastructure powering a global bare metal cloud platform. As a Senior Site Reliability Engineer (SRE), you’ll focus on building reliable, observable, and self‑healing systems at scale. SREs here operate at the intersection of software engineering and infrastructure — designing tools that automate operations, improve incident response, and enhance observability, ensuring the platform delivers high performance and reliability to customers worldwide. This role is ideal for engineers passionate about reliability, automation, distributed systems, and bringing cloud‑like experiences to bare metal environments. Key Responsibilities Continuously improve platform reliability and performance. Design, build, and maintain tools to automate operational workflows and incident response. Implement and enhance observability systems (monitoring, alerting, tracing). Collaborate with engineering and platform teams to design scalable and resilient systems. Participate in on‑call rotations and lead post‑incident reviews with a learning‑focused approach. Develop and document operational playbooks and processes. Contribute to defining SLOs / SLIs and driving reliability metrics across teams. Skills & Qualifications Fluent verbal and written English communication skills Advanced experience with Linux / Unix in production environments Hands‑on experience with Kubernetes and container orchestration Proficiency with IaC tools (e.g., Terraform, Ansible) Experience with observability stacks (Prometheus, Grafana, Loki, ELK, etc.) Proficiency with scripting / programming languages such as Bash, Python, Go, or Ruby Working knowledge of Git and CI / CD pipelines Experience with incident response and root cause analysis Knowledge of cloud‑native reliability and security best practices What’s Offered Contractor engagement (PJ) Paid Time Off Competitive compensation package Wellness benefit (Wellhub / Gympass equivalent) Annual performance‑based bonus Flexible working hours Opportunities for technical and career growth #J-18808-Ljbffr
-
Aracaju, Brasil Scubyt Tempo inteiroSoftware Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational technologies for real‑time protection, as well as our RBI and SSPM services. We are a team of software engineers focused on improving availability, latency,...
-
Site Reliability Engineer
Há 7 dias
Aracaju, Brasil Nearsure Tempo inteiroExplore the Nearsure experience!Join our close-knit LATAM remote team:Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management.Say goodbye to micromanagement!We champion autonomy, open communication, and respect for diversity as our core values.⚖️Your well-being matters:Our People Care team is here from...
-
Site Reliability Engineer
Há 7 dias
Aracaju, Brasil Nearsure Tempo inteiroExplore the Nearsure experience! Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values. ⚖️Your well-being matters: Our People Care team...
-
Site Reliability Engineer
Há 7 dias
Aracaju, Brasil HCLTech Tempo inteiroYour role and responsibilities : Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution. Performing deep-dive application troubleshooting and identifying preventive actions. Managing CIRS-related requests including deployments, feature toggles, and data fixes. Following up on major production...
-
Full Cycle Engineer
Há 7 dias
Aracaju, Brasil Titan Clarity Tempo inteiroJob Summary We are seeking a highly skilled Full Cycle Engineer with a strong command of React Native, Node.js with NestJS, and DevOps practices, especially AWS and Terraform. This role is perfect for someone who thrives in a startup environment and enjoys owning the full development lifecycle, from planning and coding to deployment and monitoring....
-
Cloud Infrastructure Engineer
Há 7 dias
Aracaju, Brasil Avenue Code Tempo inteiroAbout the company : Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We’re privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word...
-
Data Engineer | Azure Data Platform
3 semanas atrás
Aracaju, Brasil Neo BI Solution Tempo inteiroWe’re expanding our Data Platform Operations team and looking for an experiencedData Engineerwith strongAzure administrationandsoftware engineeringskills. This role combinesoperational excellenceanddevelopment capability , supporting and enhancing enterprise data solutions within the Azure ecosystem.Key ResponsibilitiesManage and optimize Azure...
-
Sr. Full-Stack Software Engineer
Há 5 dias
Aracaju, Brasil Tecla Tempo inteiroNative / Bilingual English is required for this role (read / written / spoken) Please upload your CV Resume in English. Monthly salary : $5,000 - $6,000 USD Along with our partner, we are seeking a highly capable and experienced Full-Stack Software Engineer to join their team. This role is ideal for a senior-level engineer who thrives on owning features...
-
Senior Network Support Engineer
2 semanas atrás
Aracaju, Brasil NETSCOUT Tempo inteiroSenior Network Support Engineer / Resident Engineer - BrasiliaThe Senior Resident Engineer is responsible for providing onsite technical support services to a specific NETSCOUT | Arbor customer and performing the role of a Subject Matter Expert on NETSCOUT | Arbor security products.Specific Job Duties and Responsibilities The Senior Resident Support Engineer...
-
Senior Network Support Engineer
Há 5 dias
Aracaju, Brasil Netscout Tempo inteiroSenior Network Support Engineer / Resident Engineer - BrasiliaThe Senior Resident Engineer is responsible for providing onsite technical support services to a specific NETSCOUT | Arbor customer and performing the role of a Subject Matter Expert on NETSCOUT | Arbor security products.Specific Job Duties and ResponsibilitiesThe Senior Resident Support Engineer...