Site Reliability Engineer
Há 6 dias
About Us About UsAt MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. Job Overview As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers innovative applications for our clients. This role will involve managing cloud environments, optimizing databases, automating deployments, and improving system observability. Job DescriptionAs a Site Reliability Engineer (SRE) at MetaCTO, you will be responsible for designing, implementing, and maintaining highly available, scalable, and secure infrastructure solutions. You will collaborate with software engineers to improve system performance, automate operations, and ensure the smooth functioning of critical backend services. You'll work extensively with cloud platforms like AWS, leveraging technologies such as Terraform, Docker, Kubernetes, and CI / CD pipelines to enhance system reliability. Responsibilities Architect, build, and maintain cloud infrastructure on AWS (Lambda, EC2, RDS, S3, EKS, SQS, CloudWatch). Manage and optimize databases (MySQL, PostgreSQL) for performance, reliability, and security. Implement monitoring, alerting, and logging solutions to ensure system health and performance, with specific experience using Zabbix and Elastic Logging. Design and maintain CI / CD pipelines for automated deployment and scaling of applications. Work with containerization and orchestration tools such as Docker and Kubernetes. Develop and enforce security best practices for cloud environments and infrastructure. Automate operational processes using Infrastructure-as-Code (Terraform, CloudFormation) and scripting languages like Python or Bash. Troubleshoot and resolve infrastructure-related incidents and optimize system performance. Collaborate with backend engineers to ensure high availability, fault tolerance, and scalable system design, with a strong focus on Django-based applications. Qualifications 5-10 years of experience in Site Reliability Engineering (SRE), DevOps, or Cloud Engineering roles. Strong expertise in AWS cloud services (EC2, RDS, S3, Lambda, CloudFront, EKS, SQS, IAM). Hands-on experience with containerization (Docker) and orchestration (Kubernetes, ECS, or EKS). Deep knowledge of relational databases (MySQL, PostgreSQL), including performance tuning, query optimization, monitoring, and migration management. Proficiency in Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi. Strong experience with CI / CD pipelines and automation tools (GitHub Actions, Jenkins, CircleCI, or GitLab CI / CD). Proficiency in monitoring tools, specifically Zabbix, and logging solutions like Elastic Logging. Scripting experience with Python, Bash, or Go for automating operational tasks. Experience working with Django-based applications in a cloud environment. Experience implementing security best practices for cloud-based applications. Knowledge of distributed systems and microservices architecture. Preferred Skills: AWS certifications (Solutions Architect, DevOps Engineer) are a plus. Preferred Skills: Experience with serverless computing and event-driven architectures. Preferred Skills: Familiarity with message queue services (SQS, RabbitMQ, Kafka). Understanding of zero-downtime deployments and disaster recovery strategies. Position Details Type : Full-Time Location : 100% Remote Hours : US Pacific Time hours How to Apply If you are passionate about scalability, automation, and reliability, and thrive in a collaborative, fast-paced environment, we'd love to hear from you. Please submit your resume and an optional brief cover letter outlining your relevant experience. EEO Statement MetaCTO is an equal opportunity employer.We celebrate diversity and are committed to creating an inclusive environment for all employees. #J-18808-Ljbffr
-
Site Reliability Engineer
2 semanas atrás
Estância Velha, RS, Brasil HCLTech Tempo inteiroYour role and responsabilities: Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution. Performing deep-dive application troubleshooting and identifying preventive actions. Managing CIRS-related requests including deployments, feature toggles, and data fixes. Following up on major production...
-
DevOps Engineer
3 semanas atrás
Vila Velha, Brasil DEUNA Tempo inteiroDEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates, reduce costs, and unlock new revenue. Built by the team behind DEUNA—the fastest-growing Commerce OS in Latin America—ATHIA combines payment intelligence, checkout...
-
DevOps Engineer
3 semanas atrás
Vila Velha, Brasil DEUNA Tempo inteiroDEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates, reduce costs, and unlock new revenue. Built by the team behind DEUNA—the fastest-growing Commerce OS in Latin America—ATHIA combines payment intelligence, checkout...
-
Senior Software Engineer
4 semanas atrás
Vila Velha, Brasil Alloy Automation Tempo inteiroOverview Alloy Automation (YC W20) is the connectivity layer for companies building AI Agents. With our platform, companies can power their agents and products with 400+ ready-to-use connectors in minutes, not months. Our engineering team’s goal is to deliver an incredible experience for our customers and users, trusted by global leaders including Amazon,...
-
Software Engineer
4 semanas atrás
Vila Velha, Brasil Alloy Automation Tempo inteiroOverview Alloy Automation (YC W20) is the connectivity layer for companies building AI Agents. With our platform, companies can power their agents and products with 400+ ready-to-use connectors in minutes, not months. And our engineering team’s goal is to deliver an incredible experience for our customers and users, trusted by global leaders including...
-
Quality Automation Engineer
1 semana atrás
Vila Velha, Brasil Oowlish Tempo inteiroJoin Our Team Oowlish, one of Latin America's rapidly expanding software development companies, is seeking experienced technology professionals to enhance our diverse and vibrant team. As a valued member of Oowlish, you will collaborate with premier clients from the United States and Europe, contributing to pioneering digital solutions. Our commitment to...
-
Backend Engineer
3 semanas atrás
Vila Velha, Brasil Rocket.Chat Tempo inteiroJoin to apply for the Backend Engineer role at Rocket.Chat You will report to our Senior Engineering Manager and join the Engineering team. On TheOrg you can view the complete structure of our organization, including information about every team member, hiring managers and the size of each department. We’re seeking a Mid-Level Backend Engineer who thrives...
-
Senior Software Engineer
3 semanas atrás
Vila Velha, Brasil Connectly.ai Tempo inteiroJob Summary Senior Software Engineer • Connectly • Latam We’re looking for an exceptional Senior Software Engineer who thrives in building large‑scale, enterprise‑grade systems and is passionate about shaping how AI transforms retail commerce. You’ll work across backend and frontend domains, collaborating closely with product, sales, and AI...
-
DevOps Engineer
1 dia atrás
Vila Velha, Brasil Nearsure Tempo inteiroGet AI-powered advice on this job and more exclusive features. Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values. ⚖️Your well-being...
-
React Engineer
1 dia atrás
Vila Velha, Brasil Nearsure Tempo inteiroAs a Senior Frontend React Engineer , you will be responsible for all aspects of the 3D decoration experience, including rendering quality, performance, and e-commerce outcomes like customer checkout. Responsibilities Design, build, test, deploy, and monitor web features for a new way to shop for a home. Design a seamless customer journey integration. Work...