
Senior System Reliability Specialist
Há 3 dias
We are seeking a skilled Site Reliability Engineer to join our team. This role involves pushing the limits of technology to create state-of-the-art solutions.
The SRE team faces the challenge of creating scalable solutions for monitoring live trading infrastructures, building command frameworks, and generating actionable alerts for on-call operations members. Additionally, they play a vital role in providing proactive support by responding to alerts, diagnosing issues, and ensuring the continuous availability of their trading platforms.
Responsibilities include:
- Code, script, and automate using Python and Go Lang
- Implement new product features, as well as enhance and maintain existing functionality by monitoring solutions and performance characteristics
- Create/enhance tools to make operational workflows more automated and less error-prone
- Provide troubleshooting and support for trading system issues across the software, hardware, and network stacks to ensure that services are restored immediately
- Participate in design discussions, review sessions, and prototyping
- Ensure the scalability and quality of all code
- Assist with product documentation, unit testing, monitoring, and ensuring overall product quality
- Work with application teams to ensure they provide proper monitoring and tools before their application moves into prod environment
Required Skills and Qualifications:
To be successful in this role, you will need:
- Minimum AWS Certification (Associate Level)
- Minimum RedHat Certification (RHCSA or higher)
- Minimum 3 years of experience with Python
- Familiarity with Terraform
- Experience with Ruby and Golang a plus
- Experience with observability and monitoring tools like Grafana or ELK a plus
- Ability to write Chef Manifests
- Understanding of network protocols, load balancing, and HA Proxy
- Solid understanding of functional programming, object-oriented programming, and computer science foundations
- Good understanding of low-latency backend and server-side components
- Proven and strong communication skills
- Proven experience working within Agile/Scrum development methodologies, participating in sprint planning, daily stand-ups, and retrospectives
Benefits:
This role offers a hybrid work arrangement, with two days per week spent in the office. The ideal candidate will thrive in an environment that promotes collaboration and innovation.
About the Role:
This is a challenging and rewarding opportunity for a skilled Site Reliability Engineer. If you are passionate about technology and eager to push the boundaries of what is possible, we encourage you to apply.
-
System Reliability Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeReliability Tempo inteiro US$48.000 - US$72.000Reliability ExpertThis role involves ensuring the high availability and performance of our systems, including operating and debugging cloud-native services as well as classic Windows environments.Key Responsibilities:Owning the uptime and performance of core backend infrastructure (Windows + Linux)Maintaining and enhancing observability across systems using...
-
Reliability Management Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeRelevance Tempo inteiroJob Title:Reliability Management Specialist This is a high-impact regional leadership role that blends technical expertise with people development. The TPM Reliability Manager will lead the deployment of Equipment & Facility Ownership standards and drive maintenance excellence across our LATAM regional operations. Key Responsibilities:Deliver coaching and...
-
Senior Automation Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeAutomation Tempo inteiro R$800.000 - R$1.200.000Senior Automation SpecialistWe are seeking a skilled professional to join our team as a Senior Automation Specialist. This role involves collaborating with cross-functional teams to design, develop and deploy scalable systems.The ideal candidate will have a strong background in automation, expertise in infrastructure as code (IaC) practices, and experience...
-
Senior Quality Assurance Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeQuality Tempo inteiro R$70.000 - R$97.000Job Title: Senior Quality Assurance SpecialistAre you a meticulous professional with a passion for delivering high-quality software solutions?As a Senior Quality Assurance Specialist, you will play a vital role in ensuring the quality and reliability of our products. Your attention to detail and analytical skills will be essential in identifying and...
-
São Paulo, São Paulo, Brasil beBeeReliability Tempo inteiroJob Title:Technical Leadership Role in System Reliability Engineering About the Job:We are seeking an experienced Technical Leader to join our team and lead efforts in system reliability engineering.Key Responsibilities:Lead high-complexity projects as SRE and infrastructure teams.Ensure availability and scalability of critical systems and business...
-
Specialist - B2B Systems
Há 2 dias
São Paulo, São Paulo, Brasil On Tempo inteiroJoin to apply for the Specialist - B2B Systems role at On Join to apply for the Specialist - B2B Systems role at On In shortAs a Specialist - B2B Systems, you will work on the ERP system that enables our growth and helps to bridge the gap between our
-
Senior Data Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeData Tempo inteiroSenior Data SpecialistWe're seeking a highly skilled Senior Data Specialist to join our team.This individual will be responsible for developing and implementing data solutions that drive business value, leveraging advanced technologies and methodologies.About the Role:Develop and maintain large-scale data systems, ensuring scalability, performance, and...
-
System Support Specialist
3 semanas atrás
São Paulo, São Paulo, Brasil Global System™ Tempo inteiroModelo de trabalho: 100% RemotoHorário: Comercial – das 9h às 18hContratação: PJBuscamos um(a) profissional com sólida experiência em monitoramento de ambientes de TI, com foco em Zabbix e Grafana, para atuar na sustentação e evolução de nossas esteiras de monitoramento.Você será responsável por:Manutenção e suporte do ambiente Zabbix...
-
Expert Reliability Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeReliability Tempo inteiro R$100.000 - R$150.000Reliability Expert RoleWe are seeking a highly skilled and experienced Reliability Expert to join our team.Job DescriptionAs a Reliability Engineer at our organization, you will play a pivotal role in ensuring the reliability of our products, projects, and services. You will collaborate closely with cross-functional teams across the business to perform...
-
System Operations Specialist
Há 2 dias
São Paulo, São Paulo, Brasil beBeeOperational Tempo inteiro US$59.808 - US$76.150Job Overview">The System Operations Specialist is a critical role that ensures client satisfaction by providing timely and effective solutions to software, hardware, and network issues.This involves resolving tickets raised by clients, maintaining high-quality service request solutions, performing root cause analysis, and ensuring the acceptance/resolution...