Site Reliability Engineer

4 weeks ago


Porto Alegre, Brazil Sur LATAM Full-time



Our US-based client is looking for a mission-driven Site Reliability Engineer to support and scale the infrastructure powering their secure, mission-critical SaaS platform.
You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to respond to incidents quickly, support ongoing automation, and scale systems reliably.
Responsibilities

  • Be part of the team that owns the uptime and performance of our core backend infrastructure (Windows + Linux)
  • Maintain and enhance observability across systems using Kibana, CloudWatch, and custom telemetry
  • Manage CI/CD pipelines, infrastructure as code (Terraform, Ansible), and deployment automation
  • Support and maintain production Windows environments:
      • .NET Framework/Core apps running in IIS
      • SQL Server with AlwaysOn replication and Service Broker-based messaging
  • Support and operate cloud-native services:
      • AWS Lambdas, DynamoDB, Postgres/Aurora, Redshift, Redis, and containerized workloads in Docker
  • Participate in on-call rotation and incident response
  • Collaborate closely with engineering teams to improve system reliability and deployment workflows
Requirements
  • 5+ years of SRE, DevOps, or WebOps experience supporting production SaaS systems
  • Strong experience with Windows Server, IIS, and .NET applications in production
  • Hands-on experience with SQL Server administration, including AlwaysOn and Service Broker
  • Proficiency in AWS operations, including Lambda, DynamoDB, CloudWatch, and IAM
  • Familiarity with Postgres, Redis, Kibana/Elasticsearch, and centralized logging
  • Experience with Docker, Terraform, and Ansible for infrastructure management
  • Strong scripting skills (PowerShell, Python)
  • Experience running and debugging containerized and distributed systems in production
  • Excellent incident response and debugging skills
Benefits
  • Salary: $6,000 USD/month + Holidays
  • Unlimited PTO
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Other
Industries
  • IT Services and IT Consulting


  • Porto Alegre, Brazil Canonical Full-time

    1 month ago

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's...


  • Porto Alegre, Brazil Azion Full-time

    3 days ago

    About Azion: We are a global leader in the application and security industry. Our platform allows companies to operate with agility, reducing latency and increasing the reliability...

  • Site Reliability Engineer

    3 weeks ago

    Porto Alegre, Brazil BairesDev Full-time

    Overview: At BairesDev, we've been leading technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and to startups in Silicon Valley. Our 4,000+ remote team includes top tech talent, and we offer roles that drive significant impact worldwide. This position is for a Site Reliability Engineer to build and maintain highly...

  • Site Reliability Engineer

    4 weeks ago

    Porto Alegre, Brazil Azion Full-time

    About Azion: We are a global leader in the application and security industry. Our platform allows companies to operate with agility, reducing latency and increasing the...


  • Porto Alegre, Brazil INDI Staffing Services Full-time

    Overview: We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. Approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation and reliability. Responsibilities: Build, automate, and maintain...


  • Site Reliability Engineer

    1 week ago

    Porto Alegre, Rio Grande do Sul, Brazil WEX Full-time R$80,000 - R$120,000 per year

    About The Team/Role: We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems. The ideal candidate will have...


  • Porto Alegre, Brazil Wex Full-time

    About The Team/Role: The WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. As part of the Site Reliability Engineering organization, you will support internal stakeholders and Payment Platform...


  • Porto Alegre, Brazil WEX Full-time

    The WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. As part of the Site Reliability Engineering organization, you will support internal stakeholders and Payment Platform teams, tackling...

