Site Reliability Engineer

Há 5 dias


São Paulo, Brasil WSO2 Tempo inteiro

About WSO2
Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) products. WSO2's products and platforms—including our next-gen internal developer platform, Choreo—empower organizations to leverage the full potential of APIs for secure delivery of digital services and applications, enabling thousands of enterprises in over 90 countries globally to drive their digital transformation journeys. Our open-source, API-first approach frees developers and architects from vendor lock-in, enabling rapid digital product creation. Recognized as leaders by industry analysts, WSO2 has over 800 employees worldwide with offices in Australia, Brazil, Germany, India, Sri Lanka, the UAE, the UK, and the US, with nearly USD100M in annual recurring revenue. Visit to learn more. Follow WSO2 on LinkedIn and X (formerly Twitter).

About the Role
As a Site Reliability Engineer at WSO2, you'll be instrumental in both supporting our existing customers with their managed or private cloud deployments and initiating new deployments across leading cloud platforms such as Azure, AWS, and GCP. Your mission will include ensuring the seamless operation, scalability, and security of WSO2 cloud services, alongside automating processes to boost both efficiency and reliability.

Your Key Responsibilities
Deployment Setup and Management:
Lead the design and implementation of new cloud deployments, tailoring solutions to meet stakeholder requirements on platforms like Azure, AWS, GCP, and Kubernetes.
Optimize cloud architectures for scalability and cost-effectiveness, adhering to best practices for networking, security, and access controls.
Gain and maintain deep knowledge of cloud infrastructure providers to create robust solutions.
Automation and CI/CD:
Craft and manage automation scripts and infrastructure as code (IaC) with Terraform, Ansible, or CloudFormation.
Deploy CI/CD pipelines to streamline software delivery, testing, and deployment processes, ensuring efficient version control and configuration management.
Managed Cloud Support:
Ensure the availability of the services by configuring system monitors and alerts and attending to critical alerts in a timely manner.
Offer continuous support and maintenance for existing deployments, monitoring system performance and swiftly resolving issues to maintain high availability and reliability.
Implement strategies for performance optimization and failure prevention, conducting thorough root cause analyses to avoid future issues.
Monitoring and Security:
Establish comprehensive monitoring and alerting systems to oversee customer deployments, setting thresholds for incident response.
Conduct regular security assessments and stay abreast of the latest threats and trends to fortify cloud environments against risks.
Collaboration and Knowledge Sharing:
Foster a collaborative environment with product developers, operations, and QA teams to enhance workflows and product quality.
Share knowledge and best practices, contributing to the team’s collective expertise through documentation, training, and mentorship.
Skills and Experience
Fluent in English.
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
Expertise in cloud platforms such as Azure, AWS and GCP.
Expertise in Linux, virtualization and containerization technologies such as Docker and Kubernetes.
A solid understanding of networking, security principles, and compliance frameworks.
Proficiency in IaC tools (Terraform, CloudFormation), configuration management (Puppet, Chef, Helm), and scripting languages (Python, Bash, PowerShell).
Experience with CI/CD tools (Github Actions, Jenkins) and monitoring/logging tools (Prometheus, ELK stack, Splunk).
Exceptional problem-solving, analytical, and troubleshooting skills, coupled with a proactive, customer-centric mindset.
Strong communication skills and the ability to collaborate effectively in a team environment.
In Addition to a Competitive Compensation Package, WSO2 Offers:
A work culture and environment where we value both hard work AND flexibility.
A flexible vacation/leave plan that fits your needs.
Health, dental, and life insurance for you and your family.
Diversity Drives Innovation:
We've built our business on a commitment to diversity and inclusion. We believe it's important to foster an environment that values and respects each individual's strengths, perspectives, and ideas. Doing so not only drives innovation; it also ensures that we can create superior experiences for our customers, partners, and employees worldwide. We value the diversity of our team regardless of race, ethnicity, religion, gender, age, national origin, disability, sexual orientation, or veteran or marital status, and we do not tolerate any form of discrimination.


  • Site Reliability Engineer

    3 semanas atrás


    São Paulo, Brasil Mouts TI Tempo inteiro

    Na Mouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a) SRE (Site Reliability Engineer) para atuar presencialmente, com foco em infraestrutura, automação e observabilidade em ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções de observabilidade

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Site Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Site Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...

  • Site Reliability Engineer

    3 semanas atrás


    São Paulo, Brasil PayRetailers Tempo inteiro

    Job Overview We’re PayRetailers, and we offer cutting‑edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee’s contribution is valued. We’ve got big plans to expand into new markets and make a meaningful impact on the...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Review ALL Tempo inteiro

    About the CompanyThis company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide.They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...


  • São Paulo, Brasil Scubyt Tempo inteiro

    Software Engineer Site Reliability Engineer Location: Brazil REMOTE Duration: Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as our RBI and SSPM services. We are a team of software engineers focused on improving availability, latency,...


  • São Paulo, Brasil K2 Solutions Tempo inteiro

    Trabalho híbrido na região de Pinheiros/ SP - 3x por semana no escritórioEstamos selecionando um Senior Site Reliability Engineer - SRE para se juntar ao nosso time e desempenhar um papel essencial na manutenção, automação e melhoria da confiabilidade dos sistemas que impulsionam a rede logística da empresa em múltiplas regiões. Essa pessoa...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, Brasil Review ALL Tempo inteiro

    About the Company This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric...


  • são paulo, Brasil MetaCTO Tempo inteiro

    About Us At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services. As a Site Reliability Engineer (SRE) , you will play a critical role in ensuring the reliability, scalability, and security of the backend infrastructure that powers...

  • Site reliability engineer

    2 semanas atrás


    São Paulo, Brasil Mouts TI Tempo inteiro

    NaMouts TI, entregamos soluções que impulsionam a transformação digital de forma ágil, eficiente e descomplicada.Buscamos um(a)SRE (Site Reliability Engineer)para atuarpresencialmente, com foco eminfraestrutura, automação e observabilidadeem ambientes de missão crítica.Responsabilidades:Implementar e gerenciar soluções deobservabilidade(Datadog,...