Senior Site Reliability Engineer

1 dia atrás


Brasil Rocket Tempo inteiro

Job Summary

As a Senior Site Reliability Engineer at Rocket.Chat, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based communications platform. Your expertise in designing, implementing, and maintaining robust infrastructure will be instrumental in delivering exceptional user experiences.

Key Responsibilities

  • Develop and maintain Infrastructure as Code (IaC) using tools like Terraform
  • Automate deployment processes to achieve consistent and repeatable infrastructure provisioning
  • Configure and maintain CI/CD automation pipelines
  • Continuously monitor and plan for capacity increases to accommodate traffic growth and ensure that the infrastructure remains fault-tolerant under varying load conditions
  • Take leadership and accountability in writing blameless post mortems
  • Lead teams in disaster recovery procedures
  • Coordinate the efforts of responding teams efficiently during incidents

Requirements
  • Strong proficiency in Linux/Unix systems administration
  • Proficiency in scripting languages such as Python, Go, or Bash
  • In-depth knowledge of cloud platforms such as AWS, Azure, or GCP
  • Experience with containerization tools such as Docker and container orchestration platforms such as Kubernetes
  • Proficiency in monitoring tools such as Prometheus and Grafana for collecting, analyzing, and visualizing system metrics, logs, and events
  • Experience with CI/CD pipelines and tools such as ArgoCD

About Rocket.Chat
Rocket.Chat is the world's largest open-source communications platform. Built for organizations needing more control over their communications, it enables collaboration between colleagues, partners, customers, communities, and platforms without compromising data ownership, customizations, or integrations. Tens of millions of users in over 150 countries and organizations such as Deutsche Bahn, the U.S. Navy, and Credit Suisse trust Rocket.Chat every day to keep their communications completely private and secure. As Rocket.Chat, we believe in reconnecting the world, one conversation at a time.

  • Brasil Tbwa ChiatDay Inc Tempo inteiro

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing, implementing, and maintaining our cloud-based infrastructure.Key Responsibilities:Design and implement scalable, secure, and highly available...


  • Brasil Swile Tempo inteiro

    About SwileAt Swile, we're dedicated to creating innovative solutions that simplify daily professional life and boost employee satisfaction. Our products cater to various areas, including Fintech, Travel, HR, and Employee Benefits, serving over 5.5 million users across 85,000 companies in France and Brazil.Job DescriptionWe're seeking a Senior Site...


  • Brasil Swile Tempo inteiro

    About SwileSwile is a leading tech company that provides innovative solutions to reduce friction in daily professional life and boost employee satisfaction. With a strong presence in France and Brazil, we serve over 5.5 million users in 85,000 companies.Job DescriptionWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key...


  • Brasil Rocket Tempo inteiro

    Job Title: Senior Site Reliability EngineerAbout the Role:As a Senior Site Reliability Engineer at Rocket.Chat, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure. Your expertise in designing, implementing, and maintaining robust infrastructure will be instrumental in delivering exceptional...


  • Brasil Tbwa ChiatDay Inc Tempo inteiro

    Job Title: Senior Site Reliability Engineer - Cloud InfrastructureWe are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure team. As a key member of our team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high availability, scalability, and security.Key...


  • Brasil Aptonet Inc Tempo inteiro

    About the RoleWe are seeking a highly skilled Sr. Site Reliability Engineer to join our Backend Engineering Team at Aptonet Inc. As a key member of our team, you will be responsible for ensuring the reliability, security, and performance of our tools and services.Key ResponsibilitiesLifecycle Management & Vulnerability Management: Keep software and services...


  • Brasil Wellhub Inc. Tempo inteiro

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Platform Engineering team in Brazil. As a Senior Site Reliability Engineer, you will be responsible for building and maintaining a global, secure, scalable, and cost-effective cloud platform using Kubernetes in AWS.Key ResponsibilitiesDevelop and evolve Kubernetes...

  • Site Reliability Engineer

    1 semana atrás


    Brasil Guidewire Tempo inteiro

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Guidewire. As a key member of our SRE-Application team, you will be responsible for building and evolving our SRE practice for applications running on our Guidewire Cloud Platform.Key ResponsibilitiesCollaborate with development teams to troubleshoot and solve...


  • brasil Sigma Software Group Tempo inteiro

    We have an excellent opportunity for a bright, smart, and highly motivated Senior/Principal Site Reliability Engineer to join our mature project team. You have a unique chance to become part of our team and work with best practices and methodologies. This role empowers you to take the lead and excel to your fullest potential. CUSTOMER Our customer is...


  • Brasil Sigma Software Group Tempo inteiro

    We have an excellent opportunity for a bright, smart, and highly motivated Senior/Principal Site Reliability Engineer to join our mature project team. You have a unique chance to become part of our team and work with best practices and methodologies. This role empowers you to take the lead and excel to your fullest potential. CUSTOMER Our customer is...

  • Site Reliability Engineer

    2 semanas atrás


    Brasil Gigster Inc. Tempo inteiro

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Gigster Inc. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based services and applications.Key ResponsibilitiesWork collaboratively with customers and partners to resolve complex technical issues and drive...

  • Site Reliability Engineer

    2 semanas atrás


    Brasil Guidewire Software Tempo inteiro

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Guidewire Software. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and scalability of our cloud-based solutions. You will work closely with our development teams to troubleshoot and solve problems, develop automated...


  • brasil Pentasia Tempo inteiro

    My client is looking for a Mid Site Reliability Engineer with strong skills and focus on development and automation to address complex challenges. This is a remote positions in Brazil to work in a payrolls system, and the candidate needs to be at the office 1x a quarter to meet the team in Sao Pablo.🌏 Remote in Brazil💰 Salary: Circa R$14k - R$16k...


  • Brasil Pentasia Tempo inteiro

    My client is looking for a Mid Site Reliability Engineer with strong skills and focus on development and automation to address complex challenges. This is a remote positions in Brazil to work in a payrolls system , and the candidate needs to be at the office 1x a quarter to meet the team in Sao Pablo. 🌏 Remote in Brazil 💰 Salary: Circa R$14k -...


  • Brasil onecontact TECH Tempo inteiro

    Roles and Responsibilities: Architect and manage scalable infrastructure on Microsoft Azure, utilizing services such as Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Functions, and Logic Apps. Optimize the use of Azure DevOps for continuous integration and continuous deployment (CI/CD) pipelines. Maintain and improve the availability,...


  • brasil onecontact TECH Tempo inteiro

    Roles and Responsibilities:Architect and manage scalable infrastructure on Microsoft Azure, utilizing services such as Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Functions, and Logic Apps.Optimize the use of Azure DevOps for continuous integration and continuous deployment (CI/CD) pipelines.Maintain and improve the availability,...


  • Brasil, BR onecontact TECH Tempo inteiro

    Roles and Responsibilities:Architect and manage scalable infrastructure on Microsoft Azure, utilizing services such as Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Functions, and Logic Apps.Optimize the use of Azure DevOps for continuous integration and continuous deployment (CI/CD) pipelines.Maintain and improve the availability,...


  • brasil onecontact TECH Tempo inteiro

    Roles and Responsibilities: Architect and manage scalable infrastructure on Microsoft Azure, utilizing services such as Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Functions, and Logic Apps. Optimize the use of Azure DevOps for continuous integration and continuous deployment (CI/CD) pipelines. Maintain and improve the availability,...


  • Brasil Tbwa ChiatDay Inc Tempo inteiro

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Data Reliability Engineering squad. As a Site Reliability Engineer, you will be responsible for building, maintaining, and evolving cloud-native and containerized infrastructure. You will work closely with our team to design, implement, and operate scalable and reliable...

  • Site Reliability Engineer

    1 semana atrás


    Brasil Vigil365 Tempo inteiro

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Vigil365 team in Portugal or Brazil. As a key member of our engineering team, you will play a pivotal role in ensuring the stability, resilience, and scale of our services.Key ResponsibilitiesTake ownership of your work, ensuring high-quality outcomes that consider security,...