Site Reliability Engineer
Há 13 horas
About UsAt MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As aSite Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers innovative applications for our clients. This role will involve managing cloud environments, optimizing databases, automating deployments, and improving system observability. Job Description As aSite Reliability Engineer (SRE) at MetaCTO, you will be responsible for designing, implementing, and maintaining highly available, scalable, and secure infrastructure solutions. You will collaborate with software engineers to improve system performance, automate operations, and ensure the smooth functioning of critical backend services. You’ll work extensively with cloud platforms like AWS, leveraging technologies such as Terraform, Docker, Kubernetes, and CI/CD pipelines to enhance system reliability. ResponsibilitiesArchitect, build, and maintain cloud infrastructure onAWS(Lambda, EC2, RDS, S3, EKS, SQS, CloudWatch). Manage and optimize databases (MySQL, PostgreSQL) for performance, reliability, and security. Implementmonitoring, alerting, and loggingsolutions to ensure system health and performance, with specific experience usingZabbixandElastic Logging. Design and maintainCI/CD pipelinesfor automated deployment and scaling of applications. Work withcontainerization and orchestration toolssuch asDockerandKubernetes. Develop and enforcesecurity best practicesfor cloud environments and infrastructure. Automate operational processes usingInfrastructure-as-Code (Terraform, CloudFormation)and scripting languages like Python or Bash. Troubleshoot and resolve infrastructure-related incidents and optimize system performance. Collaborate with backend engineers to ensure high availability, fault tolerance, and scalable system design, with a strong focus onDjango-based applications. Qualifications5-10 yearsof experience inSite Reliability Engineering (SRE), DevOps, or Cloud Engineeringroles. Strong expertise inAWScloud services (EC2, RDS, S3, Lambda, CloudFront, EKS, SQS, IAM). Hands-on experience withcontainerization (Docker) and orchestration (Kubernetes, ECS, or EKS). Deep knowledge ofrelational databases (MySQL, PostgreSQL), including performance tuning, query optimization, monitoring, and migration management. Proficiency inInfrastructure-as-Code toolssuch asTerraform, CloudFormation, or Pulumi. Strong experience withCI/CD pipelinesand automation tools (GitHub Actions, Jenkins, CircleCI, or GitLab CI/CD). Proficiency inmonitoring tools, specificallyZabbix, and logging solutions likeElastic Logging. Scripting experience withPython, Bash, or Gofor automating operational tasks. Experience working withDjango-based applicationsin a cloud environment. Experience implementing security best practices for cloud-based applications. Knowledge of distributed systems andmicroservices architecture. Preferred SkillsAWS certifications (Solutions Architect, DevOps Engineer) are a plus. Experience withserverless computingand event-driven architectures. Familiarity withmessage queue services(SQS, RabbitMQ, Kafka). Understanding ofzero-downtime deploymentsand disaster recovery strategies. Position DetailsType:Full-Time Location:100% Remote Hours:US Pacific Time hours How to ApplyIf you are passionate aboutscalability, automation, and reliability, and thrive in a collaborative, fast-paced environment, we’d love to hear from you. Please submit yourresumeand an optionalbrief cover letteroutlining your relevant experience. MetaCTOis an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
-
Site Reliability Engineer
Há 11 horas
Brasília, Brasil Review ALL Tempo inteiroAbout the CompanyThis company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...
-
Site Reliability Engineer
1 dia atrás
Brasília, DF, Brasil MetaCTO Tempo inteiroAbout Us At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE) , you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers...
-
Site Reliability Engineer
Há 11 horas
Brasília, Brasil MetaCTO Tempo inteiroAbout UsAt MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As aSite Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers...
-
Site Reliability Engineer
Há 2 dias
Brasília, Brasil INDI Staffing Services Tempo inteiroAt INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work. Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Site Reliability Engineer Sr
2 semanas atrás
Brasília, Brasil Mercado Eletrônico Tempo inteiroO Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2B. Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, governança e colaboração. Com escritórios no Brasil, Estados Unidos, México e Portugal, contabiliza mais de 1 milhão de fornecedores, 10 mil...
-
Site Reliability Engineer Id45689
Há 9 horas
Brasília, Brasil Agileengine Tempo inteiroAgileEngine is an Inc. **** company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.WHY JOIN US If you're looking for a place to grow, make an...
-
Software Engineer Site Reliability Engineer
Há 11 horas
Brasília, Brasil Scubyt Tempo inteiroSoftware Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTEAbout the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as ourRBIandSSPMservices. We are a team of software engineers focused on improving availability, latency,...
-
Site reliability engineer sr
2 semanas atrás
Brasília, Brasil Mercado Eletrônico Tempo inteiroO Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2 B. Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, governança e colaboração.Com escritórios no Brasil, Estados Unidos, México e Portugal, contabiliza mais de 1 milhão de fornecedores, 10 mil...
-
Software Engineer Site Reliability Engineer
Há 11 horas
Brasília, Brasil Scubyt Tempo inteiroSoftware Engineer Site Reliability EngineerLocation: Brazil REMOTE Duration: Fulltime CLT / REMOTEAbout the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as ourandservices. We are a team of software engineers focused on improving availability, latency, performance,...
-
Site Reliability Engineer
3 semanas atrás
Brasília, Brasil AgileEngine Tempo inteiroOverview Site Reliability Engineer (Middle/Senior) ID38916 AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. Why...