Site Reliability Engineer

Há 8 horas


Itajaí, Brasil MetaCTO Tempo inteiro

About Us At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers innovative applications for our clients. This role will involve managing cloud environments, optimizing databases, automating deployments, and improving system observability. Job Description As a Site Reliability Engineer (SRE) at MetaCTO, you will be responsible for designing, implementing, and maintaining highly available, scalable, and secure infrastructure solutions. You will collaborate with software engineers to improve system performance, automate operations, and ensure the smooth functioning of critical backend services. You'll work extensively with cloud platforms like AWS, leveraging technologies such as Terraform, Docker, Kubernetes, and CI/CD pipelines to enhance system reliability. Responsibilities Architect, build, and maintain cloud infrastructure on AWS (Lambda, EC2, RDS, S3, EKS, SQS, CloudWatch). Manage and optimize databases (MySQL, PostgreSQL) for performance, reliability, and security. Implement monitoring, alerting, and logging solutions to ensure system health and performance, with specific experience using Zabbix and Elastic Logging. Design and maintain CI/CD pipelines for automated deployment and scaling of applications. Work with containerization and orchestration tools such as Docker and Kubernetes. Develop and enforce security best practices for cloud environments and infrastructure. Automate operational processes using Infrastructure-as-Code (Terraform, CloudFormation) and scripting languages like Python or Bash. Troubleshoot and resolve infrastructure-related incidents and optimize system performance. Collaborate with backend engineers to ensure high availability, fault tolerance, and scalable system design, with a strong focus on Django-based applications. Qualifications 5-10 years of experience in Site Reliability Engineering (SRE), DevOps, or Cloud Engineering roles. Strong expertise in AWS cloud services (EC2, RDS, S3, Lambda, CloudFront, EKS, SQS, IAM). Hands-on experience with containerization (Docker) and orchestration (Kubernetes, ECS, or EKS). Deep knowledge of relational databases (MySQL, PostgreSQL), including performance tuning, query optimization, monitoring, and migration management. Proficiency in Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi. Strong experience with CI/CD pipelines and automation tools (GitHub Actions, Jenkins, CircleCI, or GitLab CI/CD). Proficiency in monitoring tools, specifically Zabbix, and logging solutions like Elastic Logging. Scripting experience with Python, Bash, or Go for automating operational tasks. Experience working with Django-based applications in a cloud environment. Experience implementing security best practices for cloud-based applications. Knowledge of distributed systems and microservices architecture. Preferred Skills AWS certifications (Solutions Architect, DevOps Engineer) are a plus. Experience with serverless computing and event-driven architectures. Familiarity with message queue services (SQS, RabbitMQ, Kafka). Understanding of zero-downtime deployments and disaster recovery strategies. Position Details Type: Full-Time Location: 100% Remote Hours: US Pacific Time hours How to Apply If you are passionate about scalability, automation, and reliability, and thrive in a collaborative, fast-paced environment, we'd love to hear from you. Please submit your resume and an optional brief cover letter outlining your relevant experience. MetaCTO is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.



  • Itajaí, SC, Brasil Review ALL Tempo inteiro

    About the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...

  • Site Reliability Engineer

    1 semana atrás


    Itajaí, Brasil Insight Global Tempo inteiro

    Remote Automation Cloud EngineerRequired Skills & ExperienceRequired Skills & Qualifications: • Minimum 8 years of experience in infrastructure automation and DevOps. • Strong hands-on experience with Terraform for IaC across Azure, GCP, and OCI. • Proficiency in Jenkins pipeline development using Groovy. • Solid experience with Ansible for...


  • Itajaí, Brasil Turbi Tempo inteiro

    E aí, tudo azul por aí?A Turbi é a locadora do futuro: 100% digital, movida a tecnologia, gente boa e paixão por transformar a forma como as pessoas se locomovem.A gente abre o carro pelo app (sim, sem chave!) e acreditamos que a inovação de verdade começa com um time engajado e com liberdade para criar.Estamos procurando uma pessoa para a posição...

  • Reliability Supervisor

    3 semanas atrás


    Itajaí, Brasil BRF S.A. Tempo inteiro

    Have you ever imagined to be part of one of the biggest food companies in the world? Nourish life is our commitment. This is not limited to food production – it extends to projects, initiatives and causes we embrace. In order to deliver quality products, we have a team dedicated to innovating every day. We have more than 100,000 employees worldwide. A...

  • Data Engineer

    2 semanas atrás


    Itajaí, Brasil Artefact Tempo inteiro

    The current vacancy is for the Brazilian office and we work in a Free Office model. Who we are At Artefact LatAm, we believe in and live a culture based on empathy! A healthy work environment is a place where all voices are heard, respected, and valued. Our commitment is to build a more diverse and inclusive environment, because empathy is for everyone,...


  • Itajaí, Brasil Damco Spain SL Tempo inteiro

    APM Terminals Global DBA Engineer Business Unit: APM Terminals Locations: Itajai, Pecem, Suape Brazil (remote flexibility) Job classification: 3 PURPOSE APM Terminals, a global leader in port and terminal operations and part of the A.P. Moller-Maersk Group, is committed to delivering world‑class container handling, logistics solutions, and maritime...

  • Senior Devops Engineer

    4 semanas atrás


    Itajaí, Brasil Avenue Code Tempo inteiro

    About the jobAt Avenue Code, we are passionate about transforming businesses through technology. We are a leading end-to-end development consultancy for digital transformation across various markets, growing sustainably since day one. We believe that great results are born from strong relationships. Our team combines technical expertise, collaboration, and a...


  • Itajaí, Brasil Nearsure Tempo inteiro

    Explore the Nearsure experience! Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values. ⚖️Your well-being matters: Our People Care...