Sre/Production Support Engineer

Há 14 horas


Cachoeiro de Itapemirim, Brasil Tecla Tempo inteiro

*Native/Bilingual English is required for this role (read/written/spoken) Please upload your CV Resume in English.Monthly salary: $4,000 - $5,500 USDAlong with our partner, we are seeking a Senior SRE/Production Support Engineer to lead the operational reliability, stability, and performance of their production systems.The selected professional will serve as a technical leader for incident response, root cause analysis, and long-term operational improvements.This role requires deep expertise in AWS serverless architectures, Python backends, PostgreSQL, and frontend technologies like React/Amplify.The Senior Production Support Engineer not only resolves incidents but also drives system improvements, mentors junior engineers, and shapes processes for reliability and monitoring.Responsibilities:Lead incident management for production issues across: AWS Lambda-based microservices, PostgreSQL (RDS), and React/Amplify frontend applicationsInvestigate, diagnose, and resolve complex production issues, including performance, data, and configuration problems.Conduct and lead post-incident reviews and root cause analyses (RCA), driving preventive solutions.Mentor and guide junior/mid-level production support engineers in troubleshooting and operational best practices.Maintain and enhance monitoring, alerting, logging, and observability tools (CloudWatch, X-Ray, DataDog, etc.).Collaborate with engineering teams to improve system reliability, scalability, and maintainability.Own and improve runbooks, playbooks, and operational documentation.Participate in on-call rotations, providing technical leadership during high-impact incidents.Analyze recurring issues and propose architectural or procedural improvements to prevent recurrence.Support deployment validation, emergency rollbacks, and operational changes.Partner with DevOps and Engineering teams to optimize performance, cost, and availability of cloud resources.Required Qualifications:5+ years of experience in production support, SRE, DevOps, or backend engineering roles.Strong expertise with AWS services, particularly Lambda, API Gateway, RDS (PostgreSQL), S3, Cognito, and CloudWatch.Proficient in Python, with the ability to read, debug, and modify code to resolve issues.Deep understanding of PostgreSQL, including query optimization, data integrity, and troubleshooting.Experience managing and improving observability, monitoring, and alerting in production systems.Proven experience handling high-severity incidents and leading incident response.Strong problem-solving skills and ability to navigate distributed systems.Excellent communication skills for incident reporting, collaboration, and mentoring.Preferred Qualifications:Experience with frontend technologies (React, Amplify) for debugging full-stack issues.Familiarity with serverless architecture best practices and cost/performance optimization.Experience with infrastructure-as-code (CloudFormation, CDK, Terraform).Knowledge of automation and scripting for operational tasks (Python preferred).Prior experience in defining or improving SLOs, SLAs, and operational KPIs.Familiarity with modern CI/CD pipelines and automated deployment strategies.Hands-on experience with observability and monitoring platforms (DataDog, New Relic, Sentry).Success Indicators:Production incidents are resolved quickly and effectively, minimizing business impact.Post-incident RCAs lead to measurable improvements in system reliability.Operational playbooks and runbooks are well-maintained and widely used.Junior/mid-level engineers are mentored effectively and develop troubleshooting skills.Systems are proactively monitored, optimized, and improved for stability, scalability, and cost efficiency.Tools You May Use:AWS Services: Lambda, RDS (PostgreSQL), S3, API Gateway, Cognito, CloudWatch, X-Ray, SNS/SQS, EventBridgeLanguages & Scripting: PythonMonitoring & Observability: CloudWatch, DataDog, Sentry, X-RayVersion Control & CI/CD: GitHub/GitLab, CI/CD pipelinesFrontend Collaboration: React, AmplifyTicketing & Collaboration: Jira, ConfluenceAI Prompting: Cursor, ChatGPTBenefits:A fully remote position with a structured schedule that supports work-life balance.The opportunity to join a forward-thinking company transforming the future of film and television production through cutting-edge technology.Two weeks of paid vacation per year.10 paid days for local holidays.Work Schedule: US Pacific Standard Time*Please note our partner is only looking for full-time dedicated team members who are eager to fully integrate within their team.


  • Production Support Engineer

    4 semanas atrás


    Região Geográfica Intermediária de São Paulo, Brasil Tata Consultancy Services Tempo inteiro

    1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Direct message the job poster from Tata Consultancy Services Strategic Hiring Manager | Expertise in Sourcing Top Talent & Streamlining Hiring Processes for Maximum Efficiency Come to one of the biggest IT Services companies in the world!! Here you can...

  • Senior SRE

    3 semanas atrás


    Cachoeiro de Itapemirim, Brasil Remessa Online Tempo inteiro

    Sua carreira com liberdade e propósito Na Remessa Online, não se trata apenas de transferências internacionais, criamos conexões que rompem fronteiras e transformam realidades. Somos movidos pela ousadia, respeito, colaboração, encantamento e responsabilidade. Nosso segredo? Trabalhar juntos com transparência, comprometimento e autonomia, sempre...


  • Juiz de Fora, Brasil Signify Technology Tempo inteiro

    The Company A well-established tech organization building advanced AI products for healthcare and clinical research. The team focuses on secure, reliable platforms that process sensitive medical data and support research and clinical workflows.Role & Responsibilities As aSenior SRE , you will: Design and automate infrastructure(infrastructure-as-code tools)...


  • Cachoeiro de Itapemirim, Brasil The Methodical Group Tempo inteiro

    Job Title: Senior Full Stack Engineer Reports to: Director of EngineeringPosition OverviewThe Senior Full Stack Engineer will play a key role in designing, developing, and maintaining our company's core business applications.We're seeking a true systems analyst who thrives on solving complex business challenges and translating them into robust, scalable, and...

  • Linux System Engineer

    2 semanas atrás


    Cachoeiro de Itapemirim, Brasil InComm Payments Tempo inteiro

    We are seeking a highly skilled and experienced Senior Linux System Engineer to join our InComm Operations team. Ideally, you will have a strong background in Red Hat and Oracle Linux system administration, automation with Ansible, as well as deep expertise in Linux patching, scripting, and GIT version control. 100% Remote + CLT + Benefits (Health Insurance...

  • Senior Software Engineer

    3 semanas atrás


    Cachoeiro de Itapemirim, Brasil Teachable Tempo inteiro

    Teachable is a no-code platform for creators who want to build a more impactful business through courses, coaching, downloadable content, and community. With Teachable, creators can engage their online audiences and get paid—on their own terms. Are you ready to join a dynamic, cross-cultural team at an exciting turning point in our company’s journey? Now...

  • DevOps Engineer

    3 semanas atrás


    Rio de Janeiro, RJ, Brasil Flowmentum, Inc. Tempo inteiro

    DevOps & Platform Engineers We’re hiring DevOps/Platform Engineers with strong SRE skills to work on high-scale SaaS platforms. Our stack is heavy on EKS, MongoDB/Atlas, and you’ll be tackling database contention, scaling challenges, and complex deployments every day. This role is for problem solvers who thrive on multitasking, navigating ambiguity, and...


  • Rio De Janeiro, Brasil ALLSTARSIT Tempo inteiro

    About the Project Our client is a Business Intelligence platform enabling customers to connect multiple data sources (e.G., Excel, SQL) into a unified environment, build interactive dashboards, apply formulas, and customize via plugins, scripts, and embedding. The product supports both SaaS and on-prem deployments. Support is organized into 4 pods:...

  • DevOps Engineer

    3 semanas atrás


    Feira de Santana, Brasil Raidiam Services Limited Tempo inteiro

    About Raidiam Raidiam is the global organisation at the forefront of data sharing technologies that are changing the world. We believe in empowering everyone to share their data safely, securely and simply; in a trusted and consented way; creating the potential to be seamlessly connected to the products and services they need. Since our inception, Raidiam...


  • Rio de Janeiro, Brasil Oracle Tempo inteiro

    EBS AR specialist to support production environment-22000FPX **Applicants are required to read, write, and speak the following languages***: English, Spanish, Portuguese **Preferred Qualifications** Oracle is looking for an Oracle e-business Suite consultant who will be responsible for implementing end-to-end solutions for accounts receivable, receipts...