
Site Reliability Engineer
Há 2 dias
About The Team/Role
We are seeking a Software Development Engineer Level 3 to join our SRE team dedicated to the Mobility line of business. This role is for a professional with a software development background who will apply SRE principles to ensure the reliability, scalability, and performance of our complex software systems.
The ideal candidate will have related experience and will be a key player in fostering a culture of continuous improvement and collaboration across engineering teams.
SRE is an ongoing journey of continuous improvement, and the core principles apply regardless of the technology's complexity, the customer's needs, or the business context. If you're passionate about building resilient and highly available systems, we encourage you to apply.
*How you'll make an impact
As a Site Reliability Engineer, Your Responsibilities Will Include*
- Embrace Observability: You'll build and maintain comprehensive monitoring and observability systems by meticulously instrumenting applications, infrastructure, and dependencies. You'll create clear dashboards that provide a direct view of system health, standardizing metrics, logs, and tracing to enable effective correlation and analysis.
- Design for Performance and Resilience: You will design systems with a focus on scalability, redundancy, and fault tolerance. This includes setting clear performance targets (SLIs/SLOs) aligned with business goals and regularly conducting load testing and chaos engineering to find issues proactively.
- Proactive Reliability: You'll help shift our team from a reactive to a proactive mindset by defining explicit Service Level Objectives (SLOs) that reflect user expectations. You'll use error budgets to guide the balance between development and operations, slowing down releases when necessary to maintain reliability.
- Incident Management and Learning: You will treat outages and performance degradations as opportunities to improve resilience. This involves streamlining incident response with clear procedures and conducting blameless postmortems to learn from mistakes.
- Automate Everything (with Caution): You'll automate repetitive and error-prone tasks to minimize toil and free up the team for high-value work. You'll build in robust testing and rollback capabilities into automation pipelines, always maintaining careful oversight and human judgment.
- Impact Engineering and Corporate Culture: You'll collaborate with development and product teams to improve system quality and performance. This includes highlighting impacts on quality, bringing focus to customer journey bottlenecks, and helping to prioritize product stories related to defects.
*Experience you'll bring*
- Expertise in software design, development, and testing for software enhancements and new products.
- Knowledge of automated testing tools and traditional quality assurance approaches.
- Experience with cloud development, including designing, developing, and maintaining applications on platforms like Amazon Web Services/EC2.
- Understanding of cloud storage services, including EBS, Amazon S3, and EFS.
- Ability to create documentation for future maintenance and issue resolution.
- Experience with APIs, pre-scripting, post-scripting, and integration testing.
-
Site reliability engineer
4 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil azion Tempo inteiroAbout Azion We are a global leader in the application and security industry. Our platform allows companies to operate with agility, reducing latency and increasing the reliability of their applications. We are focused on simplifying application building and looking for passionate and innovative individuals to join our team At Azion you will have the...
-
Site reliability engineer sre
4 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil Netvagas Tempo inteiroAbout AzionWe are a global leader in the application and security industry. Our platform allows companies to operate with agility, reducing latency and increasing the reliability of their applications. We are focused on simplifying application building and looking for passionate and innovative individuals to join our teamAt Azion you will have the...
-
Site reliability engineer sre
4 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil Netvagas Tempo inteiroAbout Azion We are a global leader in the application and security industry. Our platform allows companies to operate with agility, reducing latency and increasing the reliability of their applications. We are focused on simplifying application building and looking for passionate and innovative individuals to join our team At Azion you will have the...
-
Mid level Site Reliability Engineer
1 semana atrás
Porto Alegre, Rio Grande do Sul, Brasil WEX Tempo inteiro R$90.000 - R$120.000 por anoAbout The Team/RoleThe WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. As part of the Site Reliability Engineering organization, you will support internal stakeholders and Payment Platform...
-
Senior Site Reliability Engineer
1 semana atrás
Porto Alegre, Rio Grande do Sul, Brasil Azion Technologies Tempo inteiro R$90.000 - R$120.000 por anoSobre a AzionSomos uma empresa global de tecnologia especializada em aplicações e segurança digital. Nossa plataforma ajuda empresas a operar com mais agilidade, reduzindo o tempo de resposta e aumentando a confiabilidade de seus sistemas.Na Azion, nosso propósito é simplificar a construção de aplicações e transformar o futuro com tecnologia de...
-
Senior Site Reliability Engineer
4 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil Canonical Tempo inteiroOverview Join to apply for the Senior Site Reliability Engineer role at Canonical . Location: Globally remote role. Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI,...
-
Data Engineer
2 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil Ambush Tempo inteiro R$90.000 - R$120.000 por anoAmbush is a People Company. But what does that mean exactly? It means we care about our people as much as we care about building great products. We take a human-centered approach to identifying, retaining and integrating highly-talented, long-term remote people into America's best product and development team.We began our consulting journey in 2015 and have...
-
Lead ML Engineer
4 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil Launch Potato Tempo inteiroJoin to apply for the Lead ML Engineer role at Launch Potato 5 days ago Be among the first 25 applicants Join to apply for the Lead ML Engineer role at Launch Potato Overview WHO ARE WE? Launch Potato is a profitable digital media company that reaches over 30M+ monthly visitors through brands such as FinanceBuzz, All About Cookies, and...
-
Specialist Software Engineer
2 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil WEX Tempo inteiro R$90.000 - R$120.000 por anoDesign, develop, and maintain high-performance, scalable, and secure software applications using Java and related frameworks.Collaborate with product owners, designers, and other engineers to understand requirements and translate them into technical specifications.Implement new features and enhance existing functionalities, ensuring adherence to coding...
-
Senior SRE/DevOps Engineer
2 semanas atrás
Porto Alegre, Rio Grande do Sul, Brasil ADP Tempo inteiro R$90.000 - R$120.000 por anoADP is hiring a Senior SRE/DevOps EngineerDevOps Engineer/Site reliability engineer is responsible for the infrastructure, configuration and pipeline automation which enables the efficient delivery and reliable operations of ADP Multi-National Country (MNC) products in all environments from development to production and DR. In addition, they are responsible...