
Site Reliability Engineer
3 semanas atrás
Join to apply for the Site Reliability Engineer role at DEUNA
DEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates, reduce costs, and unlock new revenue. Built by the team behind DEUNA—the fastest-growing Commerce OS in Latin America—ATHIA combines payment intelligence, checkout optimization, and data orchestration in one powerful solution.
With deep integrations across 300+ PSPs and alternative payment methods, and over 20% of Mexico’s digital economy running through our platform, we simplify global payments through a single integration and centralized reconciliation.
We are a rapidly growing startup expanding into the U.S. to meet the urgent needs of large retailers, marketplaces, airlines, and QSRs. Join us to shape the future of payments
Role OverviewAs a Mid SRE at Deuna, you’ll ensure the reliability, scalability, and performance of our AWS-based platform by integrating observability, automation, and SRE best practices across the software lifecycle. You will work closely with development teams to improve uptime, provide observability tooling, and ensure we scale efficiently and securely.
Key Responsibilities- Design, define, and maintain observability and monitoring for our AWS infrastructure
- Define and track SLIs, SLOs, and SLAs for critical systems
- Improve system uptime, latency, and fault tolerance across the platform
- Provide internal libraries and toolsets to developers for diagnostics and debugging
- Manage scaling, performance, and resilience efforts related to system reliability
- Collaborate with technical teams on capacity planning, load testing, and scaling policies
- Improve production operations by defining and evolving deployment strategies and conducting disaster recovery (DR) testing
- Expertise with Prometheus, Grafana, OpenTelemetry, AWS CloudWatch, or other observability tools
- Experience designing dashboards, alerts, and log aggregation pipelines
- Deep understanding of AWS services: ECS, Lambda, RDS, CodePipeline
- Strong proficiency in Go programming language
- Skilled at defining SLIs, SLOs, error budgets, and improving Mean Time to Recovery (MTTR)
- Experience conducting failure drills (e.g., Chaos Monkey, Gremlin) to ensure system resilience
- Excellent communication and collaboration skills
- Adaptability to thrive in dynamic, fast-paced environments
- Strong time management and task prioritization
- Proficiency in English
- A multicultural team distributed throughout LATAM
- Dynamism, agility and constant innovation
- Being part of a high-impact solution for an entire region
- The best tools and technology to operate
- Being part of the startup culture
- We are in full expansion
- Vacations and additional PTO
- Remote work from anywhere
- Economic support for health insurance, internet and cell phone line
- We all own DEUNA, we offer stock options
- Learning and development platform
- Multidisciplinary, diverse and dynamic team
- Growth and career path
- Be part of a dynamic team that's creating the next generation payments platform
- Join us at DEUNA
- Not Applicable
- Full-time
- Engineering and Information Technology
- Software Development
-
Site Reliability Engineer
2 semanas atrás
Curitiba, Brasil Ryz Labs Tempo inteiroJoin to apply for the Site Reliability Engineer role at Ryz Labs1 week ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Ryz LabsGet AI-powered advice on this job and more exclusive features.Remote position within South AmericaRYZ is seeking a Site Reliability Engineer to join one of our clients, who is developing...
-
Site Reliability
3 semanas atrás
Curitiba, Brasil Canonical Tempo inteiroJoin to apply for the Site Reliability / Gitops Engineer role at Canonical 1 day ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features. Canonical is a leading provider of open source software and operating systems to the global...
-
Site reliability engineer sre
1 dia atrás
Curitiba, Brasil Netvagas Tempo inteiroJoin to apply for the Site reliability engineer sre role at Netvagas Sobre a UEX Somos Nerds e Empreendedores! A UEX é um estúdio de tecnologia, que opera no modelo de Startup Studio. Uma empresa de tecnologia, especialista em desenvolvimento, lançamento e operação de produtos e plataformas digitais. COMO VOCÊ VAI CRIAR DRAGÕES? Como Site...
-
Senior Site Reliability
3 semanas atrás
Curitiba, Brasil Canonical Tempo inteiroSenior Site Reliability / Gitops EngineerJoin to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features....
-
Site Reliability Engineer 3
1 semana atrás
Curitiba, Brasil Granicus Tempo inteiroJob Summary:Granicus is seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) to join our SRE team.As a Senior SRE, you will play a pivotal role in ensuring the reliability, scalability, and performance of our services.You will lead efforts in building and maintaining a robust infrastructure, automating processes, and guiding the...
-
[GenAI Core]
3 semanas atrás
Curitiba, Brasil Stone Tempo inteiro(GenAI Core) - Staff Site Reliability Engineer (GenAI Core) - Staff Site Reliability Engineer AWS Terraform ArgoCD Hashicorp Vault Quem é Stone Tech?A Stone nasceu com o propósito de ser protagonista na transformação da indústria de pagamentos, lutando para oferecer as melhores soluções para quem empreende no Brasil.Pensando nisso, construímos a...
-
Site Reliability Engineer Iii
1 semana atrás
Curitiba, Brasil Guidewire Software Tempo inteiroSummaryAt Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks.Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their customers.We also offer a suite of innovative products for data...
-
Site Reliability Engineer Iii
Há 5 dias
Curitiba, Brasil Guidewire Software, Inc. Tempo inteiroBrazil - CuritibaProduct Development and Operations/Full time/HybridAt Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks.Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their...
-
Site Reliability Engineer III
2 semanas atrás
Curitiba, Paraná, Brasil Guidewire Software Tempo inteiro R$90.000 - R$120.000 por anoSummaryAt Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks. Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their customers. We also offer a suite of innovative products for data...
-
Site Reliability Engineer III
Há 7 dias
Curitiba, Paraná, Brasil Guidewire Software, Inc. Tempo inteiro R$90.000 - R$120.000 por anoBrazil - CuritibaProduct Development and Operations/Full time/HybridAt Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks. Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their...