Site Reliability Engineer

3 semanas atrás


São Paulo, Brasil PayRetailers Tempo inteiro

Site Reliability Engineer (focus on Data) Apply for the Site Reliability Engineer (focus on Data) role at PayRetailers. We’re PayRetailers, offering cutting‑edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee’s contribution is valued. About the Role Site Reliability Engineers are the guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform that consistently meets business and customer expectations for availability and performance, especially focused on data infrastructure. Job Requirements What Is a MUST Proactive attitude, always on the lookout for improvement opportunities. Strong scripting skills (Python, Bash). Experience in Cloud. Knowledge of Grafana, Application Insights, OpenTelemetry, Prometheus. DBA experience in creating and maintaining databases in SQL Server (Mongo or PostgreSQL). Fluent level of English, able to conduct technical meetings in English. What Is Nice To Have Experience with non‑functional and production testing. Analytical mindset, being able to connect the dots and establish cause and effect. Experience with containers and container orchestration platforms (EKS/AKS). Understanding of APIs and asynchronous distributed software architectures. Working knowledge of AI‑enabled tools like VS Code, Claude Code, etc. Demonstrable experience with applying AI to Site Reliability Engineering. Knowledge with process automation tools like N8N. Working experience with chaos engineering. Job Responsibilities Increase automation of operational activities to reduce downtime risk, in collaboration with Platform Engineering and Domain Squads. Drive systemic improvements across engineering teams based on incident RCAs and telemetry insights. Implement non‑functional improvements (resilience, performance, reliability) directly in code, with Domain Squads reviewing and approving changes. Promote adoption of SRE best practices across development teams (integration patterns, monitoring, alerting, real‑time tracing). Provide cross‑platform observability capabilities above and beyond what the Domain Squads provide. Investigate issues and incidents and propose/implement changes as deemed necessary. Continuously review logs, metrics, and alerts to identify and/or implement continuous improvements. Design non‑functional tests and continuously run them to ensure that we build quality up to and including production. Job Benefits Individual development plans


  • Site Reliability Engineer

    3 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Site Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...


  • São Paulo, Brasil Dev.Pro Tempo inteiro

    Senior Site Reliability Engineer - OP01988 6 days ago Be among the first 25 applicants

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Handoff Tempo inteiro

    Why Join Us Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform,...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, São Paulo, Brasil Handoff Tempo inteiro

    Why join us? Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform, we're...


  • São Paulo, Brasil Handoff Tempo inteiro

    Site Reliability Engineer at Handoff We are transforming the construction industry with Handoff, an AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work—backed by real‑time cost data, intuitive design, and workflows that “speak contractor.” With over 10,000 monthly active users...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil Handoff Tempo inteiro

    Why join us? Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work – backed by real‑time cost data, intuitive design, and workflows that “speak contractor.” With over 10,000 monthly active users and $6B in annualized project volume already flowing through our...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Handoff Tempo inteiro

    Why Join Us Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform,...

  • Site Reliability Engineer

    3 semanas atrás


    São Paulo, Brasil INDI Staffing Services Tempo inteiro

    At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...


  • São Paulo, Brasil Indi Staffing Services Tempo inteiro

    Overview We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. Approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation and reliability. Responsibilities Build, automate, and maintain...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Handoff Tempo inteiro

    Why join us? Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work – backed by real‑time cost data, intuitive design, and workflows that “speak contractor.” With over 10,000 monthly active users and $6B in annualized project volume already flowing through our...