
Site Reliability Engineer
1 semana atrás
About the Team/Role
We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.
The ideal candidate will have related experience and will be a key player in fostering a culture of continuous improvement and collaboration across engineering teams.
SRE is an ongoing journey of continuous improvement, and the core principles apply regardless of the technology's complexity, the customer's needs, or the business context. If you're passionate about building resilient and highly available systems, we encourage you to apply.
How you'll make an impact
As a Site Reliability Engineer, your responsibilities will include:
Embrace Observability: You'll build and maintain comprehensive monitoring and observability systems by meticulously instrumenting applications, infrastructure, and dependencies. You'll create clear dashboards that provide a direct view of system health, standardizing metrics, logs, and tracing to enable effective correlation and analysis.
Design for Performance and Resilience: You will design systems with a focus on scalability, redundancy, and fault tolerance. This includes setting clear performance targets (SLIs/SLOs) aligned with business goals and regularly conducting load testing and chaos engineering to find issues proactively.
Proactive Reliability: You'll help shift our team from a reactive to a proactive mindset by defining explicit Service Level Objectives (SLOs) that reflect user expectations. You'll use error budgets to guide the balance between development and operations, slowing down releases when necessary to maintain reliability.
Incident Management and Learning: You will treat outages and performance degradations as opportunities to improve resilience. This involves streamlining incident response with clear procedures and conducting blameless postmortems to learn from mistakes.
Automate Everything (with Caution): You'll automate repetitive and error-prone tasks to minimize toil and free up the team for high-value work. You'll build in robust testing and rollback capabilities into automation pipelines, always maintaining careful oversight and human judgment.
Impact Engineering and Corporate Culture: You'll collaborate with development and product teams to improve system quality and performance. This includes highlighting impacts on quality, bringing focus to customer journey bottlenecks, and helping to prioritize product stories related to defects.
Experience you'll bring
Expertise in software design, development, and testing for software enhancements and new products.
Knowledge of automated testing tools and traditional quality assurance approaches.
Experience with cloud development, including designing, developing, and maintaining applications on platforms like Amazon Web Services/EC2.
Understanding of cloud storage services, including EBS, Amazon S3, and EFS.
Ability to create documentation for future maintenance and issue resolution.
Experience with APIs, pre-scripting, post-scripting, and integration testing.
-
Senior Site Reliability Engineer
1 semana atrás
Salvador, Bahia, Brasil Marvik Tempo inteiro R$80.000 - R$120.000 por anoWhat's the opportunity?We're looking for a Site Reliability Engineer (SRE) to join our team As an SRE, you're expected to ask key questions like:What data do we need to understand how our systems are performing?How do we collect that data?What patterns are we looking for, and what do they mean?Who needs to be alerted when something isn't working?Are there...
-
Site Reliability Engineer
3 semanas atrás
Salvador, Bahia, Brasil AgileEngine Tempo inteiroOverview Join to apply for the Site Reliability Engineer (Middle) ID38916 role at AgileEngine. AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us...
-
Site Reliability Expert
2 semanas atrás
Salvador, Bahia, Brasil beBeeReliability Tempo inteiro US$150.000 - US$170.000Job OverviewWe are seeking a highly skilled System Reliability Engineer to join our team. As a key technology leader, advisor for our clients, and mentor for other team members, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions.Key ResponsibilitiesOperate, maintain, and administer solutions...
-
Linux Site Reliability Consultant
3 semanas atrás
Salvador, Bahia, Brasil Pythian Tempo inteiroOverviewSite Reliability Consultant. Brazil | Remote | Work from Home. One available position for the following time zone: PST.Why PythianAt Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the...
-
Linux Site Reliability Consultant
3 semanas atrás
Salvador, Bahia, Brasil Pythian Tempo inteiroOverview Site Reliability Consultant. Brazil | Remote | Work from Home. One available position for the following time zone: PST. Why Pythian At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the...
-
Senior Data Engineer
1 semana atrás
Salvador, Bahia, Brasil Pride Global Tempo inteiroWe're Hiring: Senior Data Engineer (MLOps) | Remote from Brazil | Fluent English required | USD-Hourly payLocation: Remote – Brazil only Language: Fluent English requiredAre you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular...
-
Senior Hardware discipline engineer
3 semanas atrás
Salvador, Bahia, Brasil Dow Tempo inteiroOverview Join to apply for the Senior Hardware discipline engineer role at Dow . Dow (NYSE: DOW) is one of the world's leading materials science companies, serving customers in high-growth markets such as packaging, infrastructure, mobility and consumer applications. Our global breadth, asset integration and scale, focused innovation, leading business...
-
Security Engineer
3 semanas atrás
Salvador, Bahia, Brasil Varsity Tutors, a Nerdy Company Tempo inteiroSecurity Engineer - Detection & Response Join to apply for the Security Engineer - Detection & Response role at Varsity Tutors, a Nerdy Company Security Engineer - Detection & Response 1 day ago Be among the first 25 applicants Join to apply for the Security Engineer - Detection & Response role at Varsity Tutors, a Nerdy Company Overview:You are an...
-
Azure Devops Engineer
3 semanas atrás
Salvador, Bahia, Brasil Decskill Tempo inteiroOverview Join to apply for the Azure Devops Engineer role at Decskill Decskill was founded in 2014 as an IT Consulting Company and their main mission is to delivery value through the knowledge. We enable companies to meet the chalenges of digital world by providing our clients with business models that ensure technological capacity, flexibility and...
-
Cloud Infrastructure Specialist
1 semana atrás
Salvador, Bahia, Brasil beBeeCloudEngineer Tempo inteiroCloud Engineer Opportunity We are seeking a Cloud Engineer to join our team, where you will be working on high-traffic, mission-critical systems that power millions of users across the globe. Our team is passionate about infrastructure done right, and we believe in autonomy, ownership, and solving hard problems — at scale. You will be responsible for...