Senior System Reliability Specialist

Há 3 dias


São Paulo, São Paulo, Brasil beBeeSiteReliabilityEngineer Tempo inteiro R$95.000 - R$122.000
Job Description:

We are seeking a skilled Site Reliability Engineer to join our team. This role involves pushing the limits of technology to create state-of-the-art solutions.

The SRE team faces the challenge of creating scalable solutions for monitoring live trading infrastructures, building command frameworks, and generating actionable alerts for on-call operations members. Additionally, they play a vital role in providing proactive support by responding to alerts, diagnosing issues, and ensuring the continuous availability of their trading platforms.

Responsibilities include:

  • Code, script, and automate using Python and Go Lang
  • Implement new product features, as well as enhance and maintain existing functionality by monitoring solutions and performance characteristics
  • Create/enhance tools to make operational workflows more automated and less error-prone
  • Provide troubleshooting and support for trading system issues across the software, hardware, and network stacks to ensure that services are restored immediately
  • Participate in design discussions, review sessions, and prototyping
  • Ensure the scalability and quality of all code
  • Assist with product documentation, unit testing, monitoring, and ensuring overall product quality
  • Work with application teams to ensure they provide proper monitoring and tools before their application moves into prod environment

Required Skills and Qualifications:

To be successful in this role, you will need:

  • Minimum AWS Certification (Associate Level)
  • Minimum RedHat Certification (RHCSA or higher)
  • Minimum 3 years of experience with Python
  • Familiarity with Terraform
  • Experience with Ruby and Golang a plus
  • Experience with observability and monitoring tools like Grafana or ELK a plus
  • Ability to write Chef Manifests
  • Understanding of network protocols, load balancing, and HA Proxy
  • Solid understanding of functional programming, object-oriented programming, and computer science foundations
  • Good understanding of low-latency backend and server-side components
  • Proven and strong communication skills
  • Proven experience working within Agile/Scrum development methodologies, participating in sprint planning, daily stand-ups, and retrospectives

Benefits:

This role offers a hybrid work arrangement, with two days per week spent in the office. The ideal candidate will thrive in an environment that promotes collaboration and innovation.


About the Role:

This is a challenging and rewarding opportunity for a skilled Site Reliability Engineer. If you are passionate about technology and eager to push the boundaries of what is possible, we encourage you to apply.



  • São Paulo, São Paulo, Brasil beBeeReliability Tempo inteiro US$48.000 - US$72.000

    Reliability ExpertThis role involves ensuring the high availability and performance of our systems, including operating and debugging cloud-native services as well as classic Windows environments.Key Responsibilities:Owning the uptime and performance of core backend infrastructure (Windows + Linux)Maintaining and enhancing observability across systems using...


  • São Paulo, São Paulo, Brasil beBeeRelevance Tempo inteiro

    Job Title:Reliability Management Specialist This is a high-impact regional leadership role that blends technical expertise with people development. The TPM Reliability Manager will lead the deployment of Equipment & Facility Ownership standards and drive maintenance excellence across our LATAM regional operations. Key Responsibilities:Deliver coaching and...


  • São Paulo, São Paulo, Brasil beBeeAutomation Tempo inteiro R$800.000 - R$1.200.000

    Senior Automation SpecialistWe are seeking a skilled professional to join our team as a Senior Automation Specialist. This role involves collaborating with cross-functional teams to design, develop and deploy scalable systems.The ideal candidate will have a strong background in automation, expertise in infrastructure as code (IaC) practices, and experience...


  • São Paulo, São Paulo, Brasil beBeeQuality Tempo inteiro R$70.000 - R$97.000

    Job Title: Senior Quality Assurance SpecialistAre you a meticulous professional with a passion for delivering high-quality software solutions?As a Senior Quality Assurance Specialist, you will play a vital role in ensuring the quality and reliability of our products. Your attention to detail and analytical skills will be essential in identifying and...


  • São Paulo, São Paulo, Brasil beBeeReliability Tempo inteiro

    Job Title:Technical Leadership Role in System Reliability Engineering About the Job:We are seeking an experienced Technical Leader to join our team and lead efforts in system reliability engineering.Key Responsibilities:Lead high-complexity projects as SRE and infrastructure teams.Ensure availability and scalability of critical systems and business...


  • São Paulo, São Paulo, Brasil On Tempo inteiro

    Join to apply for the Specialist - B2B Systems role at On Join to apply for the Specialist - B2B Systems role at On In shortAs a Specialist - B2B Systems, you will work on the ERP system that enables our growth and helps to bridge the gap between our


  • São Paulo, São Paulo, Brasil beBeeData Tempo inteiro

    Senior Data SpecialistWe're seeking a highly skilled Senior Data Specialist to join our team.This individual will be responsible for developing and implementing data solutions that drive business value, leveraging advanced technologies and methodologies.About the Role:Develop and maintain large-scale data systems, ensuring scalability, performance, and...

  • System Support Specialist

    3 semanas atrás


    São Paulo, São Paulo, Brasil Global System™ Tempo inteiro

    Modelo de trabalho: 100% RemotoHorário: Comercial – das 9h às 18hContratação: PJBuscamos um(a) profissional com sólida experiência em monitoramento de ambientes de TI, com foco em Zabbix e Grafana, para atuar na sustentação e evolução de nossas esteiras de monitoramento.Você será responsável por:Manutenção e suporte do ambiente Zabbix...


  • São Paulo, São Paulo, Brasil beBeeReliability Tempo inteiro R$100.000 - R$150.000

    Reliability Expert RoleWe are seeking a highly skilled and experienced Reliability Expert to join our team.Job DescriptionAs a Reliability Engineer at our organization, you will play a pivotal role in ensuring the reliability of our products, projects, and services. You will collaborate closely with cross-functional teams across the business to perform...


  • São Paulo, São Paulo, Brasil beBeeOperational Tempo inteiro US$59.808 - US$76.150

    Job Overview">The System Operations Specialist is a critical role that ensures client satisfaction by providing timely and effective solutions to software, hardware, and network issues.This involves resolving tickets raised by clients, maintaining high-quality service request solutions, performing root cause analysis, and ensuring the acceptance/resolution...