
Site Reliability Engineer
1 semana atrás
About the Team/Role
We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.
The ideal candidate will have related experience and will be a key player in fostering a culture of continuous improvement and collaboration across engineering teams.
SRE is an ongoing journey of continuous improvement, and the core principles apply regardless of the technology's complexity, the customer's needs, or the business context. If you're passionate about building resilient and highly available systems, we encourage you to apply.
How you'll make an impact
As a Site Reliability Engineer, your responsibilities will include:
- Embrace Observability: You'll build and maintain comprehensive monitoring and observability systems by meticulously instrumenting applications, infrastructure, and dependencies. You'll create clear dashboards that provide a direct view of system health, standardizing metrics, logs, and tracing to enable effective correlation and analysis.
- Design for Performance and Resilience: You will design systems with a focus on scalability, redundancy, and fault tolerance. This includes setting clear performance targets (SLIs/SLOs) aligned with business goals and regularly conducting load testing and chaos engineering to find issues proactively.
- Proactive Reliability: You'll help shift our team from a reactive to a proactive mindset by defining explicit Service Level Objectives (SLOs) that reflect user expectations. You'll use error budgets to guide the balance between development and operations, slowing down releases when necessary to maintain reliability.
- Incident Management and Learning: You will treat outages and performance degradations as opportunities to improve resilience. This involves streamlining incident response with clear procedures and conducting blameless postmortems to learn from mistakes.
- Automate Everything (with Caution): You'll automate repetitive and error-prone tasks to minimize toil and free up the team for high-value work. You'll build in robust testing and rollback capabilities into automation pipelines, always maintaining careful oversight and human judgment.
- Impact Engineering and Corporate Culture: You'll collaborate with development and product teams to improve system quality and performance. This includes highlighting impacts on quality, bringing focus to customer journey bottlenecks, and helping to prioritize product stories related to defects.
Experience you'll bring
- Expertise in software design, development, and testing for software enhancements and new products.
- Knowledge of automated testing tools and traditional quality assurance approaches.
- Experience with cloud development, including designing, developing, and maintaining applications on platforms like Amazon Web Services/EC2.
- Understanding of cloud storage services, including EBS, Amazon S3, and EFS.
- Ability to create documentation for future maintenance and issue resolution.
- Experience with APIs, pre-scripting, post-scripting, and integration testing.
-
Site Reliability Engineer
4 semanas atrás
São Paulo, São Paulo, Brasil Coderio Tempo inteiroSite Reliability Engineer (SRE) - Technical Referent Site Reliability Engineer (SRE) - Technical Referent 1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. About UsCoderio designs and delivers scalable digital solutions for global businesses. With a strong technical foundation and a product mindset, our...
-
Site Reliability Engineer
4 semanas atrás
São Paulo, São Paulo, Brasil Grupo Foxbit Tempo inteiroEstamos à procura de um SRE (Site Reliability Engineer) para nos ajudar a garantir a estabilidade, segurança e escalabilidade de uma das maiores exchanges de criptomoedas do BrasilO principal objetivo do time de SRE é, em conjunto com Desenvolvimento e Segurança, garantir a confiabilidade dos sistemas, monitorar, melhorar a performance e automatizar...
-
Site Reliability Engineer
4 semanas atrás
São Paulo, São Paulo, Brasil Coderio Tempo inteiroSite Reliability Engineer (SRE) - Technical ReferentSite Reliability Engineer (SRE) - Technical Referent1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. About Us Coderio designs and delivers scalable digital solutions for global businesses. With a strong technical foundation and a product mindset, our...
-
Site Reliability Engineer
4 semanas atrás
São Paulo, São Paulo, Brasil Trading Technologies Tempo inteiroJoin to apply for the Site Reliability Engineer role at Trading Technologies 3 months ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer role at Trading Technologies We are seeking Site Reliability Engineers for our team who thrive on pushing the limits of technology to produce state of the art solutions. Our SRE team is...
-
Site Reliability Engineer
3 semanas atrás
São Paulo, São Paulo, Brasil Thales Group Tempo inteiroSite Reliability Engineer page is loaded## Site Reliability Engineerremote type: On-Sitelocations: São Paulotime type: Full timeposted on: Posted 2 Days Agojob requisition id: R Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the...
-
Site Reliability Engineer
3 semanas atrás
São Paulo, São Paulo, Brasil buscojobs Brasil Tempo inteiroOverviewAbout the RoleWe are looking for a Senior Site Reliability Engineer (SRE) to join a mission-critical project for one of our U.S.-based clients. This role focuses on maintaining platform reliability and implementing proactive solutions to minimize system downtimes and performance bottlenecks.ResponsibilitiesDesign and maintain scalable,...
-
Site Reliability Engineer
3 semanas atrás
São Paulo, São Paulo, Brasil buscojobs Brasil Tempo inteiroOverview About the Role We are looking for a Senior Site Reliability Engineer (SRE) to join a mission-critical project for one of our U.S.-based clients. This role focuses on maintaining platform reliability and implementing proactive solutions to minimize system downtimes and performance bottlenecks. Responsibilities Design and maintain scalable,...
-
Remote Site Reliability Engineer
3 semanas atrás
São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiroAt INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Remote Site Reliability Engineer
1 semana atrás
São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiroAt INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work. Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Senior Site Reliability
3 semanas atrás
São Paulo, São Paulo, Brasil Canonical Tempo inteiroSenior Site Reliability / Gitops Engineer Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer 1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features....