Sre/production support engineer

Há 2 dias


Brasil Tecla Tempo inteiro

*Native/Bilingual English is required for this role (read/written/spoken) Please upload your CV Resume in English. Monthly salary: $4,000 - $5,500 USD Along with our partner, we are seeking a  Senior SRE/Production Support Engineer to lead the operational reliability, stability, and performance of their production systems. The selected professional will serve as a technical leader for incident response, root cause analysis, and long-term operational improvements. This role requires deep expertise in AWS serverless architectures, Python backends, Postgre SQL, and frontend technologies like React/Amplify. The Senior Production Support Engineer not only resolves incidents but also drives system improvements, mentors junior engineers, and shapes processes for reliability and monitoring. Responsibilities: Lead incident management for production issues across: AWS Lambda-based microservices, Postgre SQL (RDS), and React/Amplify frontend applications Investigate, diagnose, and resolve complex production issues, including performance, data, and configuration problems. Conduct and lead post-incident reviews and root cause analyses (RCA), driving preventive solutions. Mentor and guide junior/mid-level production support engineers in troubleshooting and operational best practices. Maintain and enhance monitoring, alerting, logging, and observability tools (Cloud Watch, X-Ray, Data Dog, etc.). Collaborate with engineering teams to improve system reliability, scalability, and maintainability. Own and improve runbooks, playbooks, and operational documentation. Participate in on-call rotations, providing technical leadership during high-impact incidents. Analyze recurring issues and propose architectural or procedural improvements to prevent recurrence. Support deployment validation, emergency rollbacks, and operational changes. Partner with Dev Ops and Engineering teams to optimize performance, cost, and availability of cloud resources. Required Qualifications: 5+ years of experience in production support, SRE, Dev Ops, or backend engineering roles. Strong expertise with AWS services, particularly Lambda, API Gateway, RDS (Postgre SQL), S3, Cognito, and Cloud Watch. Proficient in Python, with the ability to read, debug, and modify code to resolve issues. Deep understanding of Postgre SQL, including query optimization, data integrity, and troubleshooting. Experience managing and improving observability, monitoring, and alerting in production systems. Proven experience handling high-severity incidents and leading incident response. Strong problem-solving skills and ability to navigate distributed systems. Excellent communication skills for incident reporting, collaboration, and mentoring. Preferred Qualifications: Experience with frontend technologies (React, Amplify) for debugging full-stack issues. Familiarity with serverless architecture best practices and cost/performance optimization. Experience with infrastructure-as-code (Cloud Formation, CDK, Terraform). Knowledge of automation and scripting for operational tasks (Python preferred). Prior experience in defining or improving SLOs, SLAs, and operational KPIs. Familiarity with modern CI/CD pipelines and automated deployment strategies. Hands-on experience with observability and monitoring platforms (Data Dog, New Relic, Sentry). Success Indicators: Production incidents are resolved quickly and effectively, minimizing business impact. Post-incident RCAs lead to measurable improvements in system reliability. Operational playbooks and runbooks are well-maintained and widely used. Junior/mid-level engineers are mentored effectively and develop troubleshooting skills. Systems are proactively monitored, optimized, and improved for stability, scalability, and cost efficiency. Tools You May Use: AWS Services:  Lambda, RDS (Postgre SQL), S3, API Gateway, Cognito, Cloud Watch, X-Ray, SNS/SQS, Event Bridge Languages & Scripting:  Python Monitoring & Observability:  Cloud Watch, Data Dog, Sentry, X-Ray Version Control & CI/CD:  Git Hub/Git Lab, CI/CD pipelines Frontend Collaboration:  React, Amplify Ticketing & Collaboration:  Jira, Confluence AI Prompting: Cursor, Chat GPT Benefits: A fully remote position with a structured schedule that supports work-life balance. The opportunity to join a forward-thinking company transforming the future of film and television production through cutting-edge technology. Two weeks of paid vacation per year. 10 paid days for local holidays. Work Schedule:  US Pacific Standard Time *Please note our partner is only looking for full-time dedicated team members who are eager to fully integrate within their team.



  • Índio do Brasil Tecla Tempo inteiro

    *Native/Bilingual English is required for this role (read/written/spoken)Please upload your CV Resume in English.Monthly salary: $4,000 - $5,500 USDAlong with our partner, we are seeking a Senior SRE/Production Support Engineer to lead the operational reliability, stability, and performance of their production systems. The selected professional will serve as...

  • Senior Platform Engineer

    2 semanas atrás


    Índio do Brasil TurnKey Tech Staffing Tempo inteiro

    About the role Skyflow is a data privacy vault delivered via APIs. It helps companies isolate, protect, govern, and use sensitive data such as personal, health, or payment information securely, without losing usability. It’s built on a zero-trust architecture, meaning no one gets access to data unless explicitly allowed. Skyflow is looking for a Platform...

  • Lead SRE Engineer

    4 semanas atrás


    Brasil Avenue Code Tempo inteiro

    About the Company: Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We’re privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word...

  • Lead SRE Engineer

    4 semanas atrás


    Brasil Avenue Code Tempo inteiro

    About the Company: Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We’re privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word...

  • Lead SRE Engineer

    3 semanas atrás


    Brasil Avenue Code Tempo inteiro

    About the Company: Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We’re privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word...

  • Lead sre engineer

    4 semanas atrás


    Brasil Avenue Code Tempo inteiro

    About the Company: Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We’re privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word...

  • Support Engineer

    Há 3 dias


    Brasil PRAGMATIKE Tempo inteiro US$80.000 - US$120.000 por ano

    About the job Support Engineer We're building a new 24/7 support engineering function and need technical experts who can debug complex cloud and Kubernetes issues. This is NOT traditional L1 support - you'll work directly with our compute platform, requiring deep technical skills similar to a senior system admin. Location: RemoteStart date:...

  • Lead SRE Engineer

    2 semanas atrás


    Índio do Brasil Avenue Code Tempo inteiro

    About the Company: Avenue Code is the leading software consultancy focused on delivering end-to-end development solutions for digital transformation across every vertical. We're privately held, profitable, and have been on a solid growth trajectory since day one. We care deeply about our clients, our partners, and our people. We prefer the word 'partner'...


  • Brasil Signify Technology Tempo inteiro

    The Company A well-established tech organization building advanced AI products for healthcare and clinical research. The team focuses on secure, reliable platforms that process sensitive medical data and support research and clinical workflows. Role & Responsibilities As a Senior SRE , you will: - Design and automate infrastructure...


  • Brasil Signify Technology Tempo inteiro

    The Company A well-established tech organization building advanced AI products for healthcare and clinical research. The team focuses on secure, reliable platforms that process sensitive medical data and support research and clinical workflows. Role & Responsibilities As a Senior SRE , you will: Design and automate infrastructure (infrastructure-as-code...