Site Reliability Engineer

13 hours ago


Brasília, Brazil MetaCTO Full-time

About Us

At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers innovative applications for our clients. This role will involve managing cloud environments, optimizing databases, automating deployments, and improving system observability.

Job Description

As a Site Reliability Engineer (SRE) at MetaCTO, you will be responsible for designing, implementing, and maintaining highly available, scalable, and secure infrastructure solutions. You will collaborate with software engineers to improve system performance, automate operations, and ensure the smooth functioning of critical backend services. You’ll work extensively with cloud platforms like AWS, leveraging technologies such as Terraform, Docker, Kubernetes, and CI/CD pipelines to enhance system reliability.

Responsibilities

  • Architect, build, and maintain cloud infrastructure on AWS (Lambda, EC2, RDS, S3, EKS, SQS, CloudWatch).
  • Manage and optimize databases (MySQL, PostgreSQL) for performance, reliability, and security.
  • Implement monitoring, alerting, and logging solutions to ensure system health and performance, with specific experience using Zabbix and Elastic Logging.
  • Design and maintain CI/CD pipelines for automated deployment and scaling of applications.
  • Work with containerization and orchestration tools such as Docker and Kubernetes.
  • Develop and enforce security best practices for cloud environments and infrastructure.
  • Automate operational processes using Infrastructure-as-Code (Terraform, CloudFormation) and scripting languages like Python or Bash.
  • Troubleshoot and resolve infrastructure-related incidents and optimize system performance.
  • Collaborate with backend engineers to ensure high availability, fault tolerance, and scalable system design, with a strong focus on Django-based applications.

Qualifications

  • 5-10 years of experience in Site Reliability Engineering (SRE), DevOps, or Cloud Engineering roles.
  • Strong expertise in AWS cloud services (EC2, RDS, S3, Lambda, CloudFront, EKS, SQS, IAM).
  • Hands-on experience with containerization (Docker) and orchestration (Kubernetes, ECS, or EKS).
  • Deep knowledge of relational databases (MySQL, PostgreSQL), including performance tuning, query optimization, monitoring, and migration management.
  • Proficiency in Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi.
  • Strong experience with CI/CD pipelines and automation tools (GitHub Actions, Jenkins, CircleCI, or GitLab CI/CD).
  • Proficiency in monitoring tools, specifically Zabbix, and logging solutions like Elastic Logging.
  • Scripting experience with Python, Bash, or Go for automating operational tasks.
  • Experience working with Django-based applications in a cloud environment.
  • Experience implementing security best practices for cloud-based applications.
  • Knowledge of distributed systems and microservices architecture.

Preferred Skills

  • AWS certifications (Solutions Architect, DevOps Engineer) are a plus.
  • Experience with serverless computing and event-driven architectures.
  • Familiarity with message queue services (SQS, RabbitMQ, Kafka).
  • Understanding of zero-downtime deployments and disaster recovery strategies.

Position Details

  • Type: Full-Time
  • Location: 100% Remote
  • Hours: US Pacific Time hours

How to Apply

If you are passionate about scalability, automation, and reliability, and thrive in a collaborative, fast-paced environment, we’d love to hear from you. Please submit your resume and an optional brief cover letter outlining your relevant experience.

MetaCTO is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.



  • Brasília, Brazil Review ALL Full-time

    About the Company: This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...


  • Brasília, DF, Brazil MetaCTO Full-time

    About Us: At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers...


  • Brasília, Brazil INDI Staffing Services Full-time

    At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work. Overview of the role: We are looking for a Site Reliability Engineer to build and maintain highly reliable,...

  • Site Reliability Engineer Sr

    2 weeks ago


    Brasília, Brazil Mercado Eletrônico Full-time

    Mercado Eletrônico is the Latin American leader in B2B procurement management solutions. Its technologies and services for procurement departments help companies achieve greater savings, agility, governance, and collaboration. With offices in Brazil, the United States, Mexico, and Portugal, it counts more than 1 million suppliers, 10 thousand...


  • Brasília, Brazil AgileEngine Full-time

    AgileEngine is an Inc. **** company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. WHY JOIN US: If you're looking for a place to grow, make an...


  • Brasília, Brazil Scubyt Full-time

    Software Engineer, Site Reliability Engineer. Location: Brazil, REMOTE. Duration: Full-time CLT / REMOTE. About the role: The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our RBI and SSPM services. We are a team of software engineers focused on improving availability, latency,...

  • Site Reliability Engineer

    3 weeks ago


    Brasília, Brazil AgileEngine Full-time

    Overview: Site Reliability Engineer (Middle/Senior), ID38916. AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. Why...