Site Reliability Engineer

Há 19 horas


São Paulo, São Paulo, Brasil Truelogic Tempo inteiro US$120.000 - US$180.000 por ano
About Truelogic

At Truelogic we are a leading provider of nearshore staff augmentation services headquartered in New York. For over two decades, we've been delivering top-tier technology solutions to companies of all sizes, from innovative startups to industry leaders, helping them achieve their digital transformation goals.

Our team of 600+ highly skilled tech professionals, based in Latin America, drives digital disruption by partnering with U.S. companies on their most impactful projects. Whether collaborating with Fortune 500 giants or scaling startups, we deliver results that make a difference.

By applying for this position, you're taking the first step in joining a dynamic team that values your expertise and aspirations. We aim to align your skills with opportunities that foster exceptional career growth and success while contributing to transformative projects that shape the future.

Our Client

A data-driven technology company that partners with high-growth brands to optimize customer acquisition and retention. It specializes in delivering high-LTV audiences and enrichment data to increase repeat purchase rates. The company collaborates with major platforms and agencies such as Shopify, Experian, TransUnion, and top media partners, all focused on driving profitable revenue growth.


Job Summary

The Site Reliability Engineer plays a key role in platform enablement by building and maintaining core infrastructure tooling that enables teams to deploy and operate services reliably using AWS and Kubernetes. This position focuses on managing and evolving internal Infrastructure as Code (IaC) constructs, primarily Python-based abstractions built with AWS CDK and CDK8s. These constructs encompass networking, EKS configuration, data stores, observability, autoscaling patterns, and deployment primitives. The engineer collaborates closely with backend teams to ensure infrastructure is secure, consistent, and easy to integrate, driving platform reliability and developer productivity.

Responsibilities
  • Designs, implements, and evolves shared AWS CDK and CDK8s constructs used across multiple services and teams.

  • Maintains core infrastructure components including VPC, EKS clusters and node groups, RDS, OpenSearch, and MSK.

  • Operates and extends Kubernetes cluster addons such as ingress controllers, cert-manager, autoscalers, and monitoring/logging stacks.

  • Ensures high reliability through structured alerting systems (Prometheus, CloudWatch), autoscaling strategies, and recovery mechanisms.

  • Manages and publishes baseline templates, configuration schemas, and comprehensive documentation for infrastructure usage.

  • Owns the CI/CD pipelines for Infrastructure as Code (IaC) codebases and platform component releases.

  • Collaborates with engineering teams to troubleshoot infrastructure-related issues and deliver scalable, reliable solutions.

  • Applies Site Reliability Engineering (SRE) principles—including SLIs, SLOs, observability, and fault tolerance—to all shared platform services.

  • Supports IAM roles, secrets management, and tenant isolation best practices.

Qualifications and Job Requirements
  • Has 5+ years of experience in infrastructure or Site Reliability Engineering (SRE), including hands-on work with AWS services such as VPC, IAM, RDS, MSK, and S3, as well as Kubernetes components like Helm, RBAC, and ServiceAccounts.

  • Demonstrates fluency in Python and has practical experience with Infrastructure-as-Code using AWS CDK, CDK8s, or equivalent frameworks such as Pulumi.

  • Possesses a strong understanding of Prometheus, Grafana, and effective alert routing practices.

  • Has experience designing reusable infrastructure patterns or building internal developer platforms.

  • Shows a proven track record of improving system reliability through automation, monitoring, and operational best practices.

  • Has experience supporting Spark on Kubernetes, Argo, or Kafka-based batch pipelines.

What We Offer
  • 100% Remote Work: Enjoy the freedom to work from the location that helps you thrive. All it takes is a laptop and a reliable internet connection.

  • Highly Competitive USD Pay: Earn an excellent, market-leading compensation in USD, that goes beyond typical market offerings.

  • Paid Time Off: We value your well-being. Our paid time off policies ensure you have the chance to unwind and recharge when needed.

  • Work with Autonomy: Enjoy the freedom to manage your time as long as the work gets done. Focus on results, not the clock.

  • Work with Top American Companies: Grow your expertise working on innovative, high-impact projects with Industry-Leading U.S. Companies.

Why You'll Like Working Here
  • A Culture That Values You: We prioritize well-being and work-life balance, offering engagement activities and fostering dynamic teams to ensure you thrive both personally and professionally.

  • Diverse, Global Network: Connect with over 600 professionals in 25+ countries, expand your network, and collaborate with a multicultural team from Latin America.

  • Team Up with Skilled Professionals: Join forces with senior talent. All of our team members are seasoned experts, ensuring you're working with the best in your field.

Apply now



  • São Paulo, São Paulo, Brasil WEX Inc. Tempo inteiro R$70.000 - R$120.000 por ano

    About the Team/RoleWe are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.The ideal candidate will have...


  • São Paulo, São Paulo, Brasil WEX Inc. Tempo inteiro R$80.000 - R$160.000 por ano

    About the Team/RoleThe WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. As part of the Site Reliability Engineering organization, you will support internal stakeholders and Payment Platform...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, São Paulo, Brasil Loadsmart Tempo inteiro R$80.000 - R$120.000 por ano

    ARE YOU INTERESTED IN JOINING AN INNOVATIVE LOGISTICS TECHNOLOGY COMPANY? Loadsmart is a growth-stage technology company valued at over $1 billion (a true Tech Unicorn We are a collection of industry veterans and user-centered engineers using innovative technology to fearlessly reinvent the future of freight by helping shippers, brokers, warehouses and...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, São Paulo, Brasil Loadsmart Tempo inteiro R$120.000 - R$240.000 por ano

    ARE YOU INTERESTED IN JOINING AN INNOVATIVE LOGISTICS TECHNOLOGY COMPANY?Loadsmart is a growth-stage technology company valued at over $1 billion (a true Tech Unicorn)We are a collection of industry veterans and user-centered engineers using innovative technology to fearlessly reinvent the future of freight by helping shippers, brokers, warehouses and...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, São Paulo, Brasil Loadsmart Tempo inteiro R$80.000 - R$120.000 por ano

    ARE YOU INTERESTED IN JOINING AN INNOVATIVE LOGISTICS TECHNOLOGY COMPANY?Loadsmart is a growth-stage technology company valued at over $1 billion (a true Tech Unicorn)We are a collection of industry veterans and user-centered engineers using innovative technology to fearlessly reinvent the future of freight by helping shippers, brokers, warehouses and...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, São Paulo, Brasil act digital Tempo inteiro

    SAP Site Reliability Engineer (SRE) SeniorLocal:São Paulo – Híbrido (Morumbi Shopping), presencial 3x por semanaIdioma:Inglês conversacional (a partir de B2)Modelo:Tempo integralSobre a oportunidadeBuscamos um(a) SRE Senior com forte vivência emoperações e confiabilidade de SAP S/4HANA, responsável por garantir estabilidade, performance e...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, São Paulo, Brasil DELIVER IT Tempo inteiro R$80.000 - R$120.000 por ano

    Você se considera uma pessoa que tem sede por aprendizado, gosta de trabalhar em equipe e almeja desenvolvimento na carreira? Então essa oportunidade é para vocêEstamos em busca de um(a) SRE Júnior (Site Reliability Engineer) para integrar uma equipe altamente técnica e comprometida com a excelência operacional. O profissional atuará com foco na...


  • São Paulo, São Paulo, Brasil DELIVER IT Tempo inteiro R$60.000 - R$120.000 por ano

    Você é uma pessoa com sólida experiência em engenharia de confiabilidade, tem pensamento estratégico, perfil colaborativo e busca constantemente elevar o nível técnico dos times e sistemas com os quais trabalha? Então essa oportunidade é para vocêEstamos em busca de um(a) SRE Sênior (Site Reliability Engineer) para compor uma equipe técnica de...


  • São Paulo, São Paulo, Brasil Workana Tempo inteiro R$150.000 - R$200.000 por ano

    Na Workana, estamos em busca de um(a) Senior Site Reliability Engineer (SRE) para integrar o time de um dos nossos clientes e desempenhar um papel essencial na manutenção, automação e melhoria da confiabilidade dos sistemas que impulsionam sua rede logística em múltiplas regiões.Sobre o cliente:Trata-se de uma plataforma que gerencia fluxos...


  • São Paulo, São Paulo, Brasil Workana Tempo inteiro R$150.000 - R$200.000 por ano

    Na Workana, estamos em busca de um(a)Senior Site Reliability Engineer (SRE)para integrar o time de um dos nossos clientes e desempenhar um papel essencial na manutenção, automação e melhoria da confiabilidade dos sistemas que impulsionam sua rede logística em múltiplas regiões.Sobre o cliente:Trata-se de uma plataforma que gerencia fluxos logísticos...