
Senior Site Reliability Engineer
Há 2 dias
As a Senior Site Reliability Engineer at WSO2, you'll be instrumental in both supporting our existing customers with their managed or private cloud deployments and initiating new deployments across leading cloud platforms such as Azure, AWS, and GCP. Your mission will include ensuring the seamless operation, scalability, and security of WSO2 cloud services, alongside automating processes to boost both efficiency and reliability.
Your Key Responsibilities Deployment Setup and Management:- Lead the design and implementation of new cloud deployments, tailoring solutions to meet stakeholder requirements on platforms like Azure, AWS, GCP, and Kubernetes.
- Optimize cloud architectures for scalability and cost-effectiveness, adhering to best practices for networking, security, and access controls.
- Gain and maintain deep knowledge of cloud infrastructure providers to create robust solutions.
- Proactively introduce continuous improvements and cost-optimized solutions to enhance infrastructure adaptability and streamline deployment processes.
- Craft and manage automation scripts and infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation.
- Deploy CI/CD pipelines to streamline software delivery, testing, and deployment processes, ensuring efficient version control and configuration management.
- Ensure the availability of services by configuring system monitors and alerts and attending to critical alerts in a timely manner.
- Offer continuous support and maintenance for existing deployments, monitoring system performance and swiftly resolving issues to maintain high availability and reliability.
- Implement strategies for performance optimization and failure prevention, conducting thorough root cause analyses to avoid future issues.
- Demonstrate strong ownership during critical incident scenarios, ensuring smooth operations under pressure by delivering timely resolutions. Implement effective workarounds and conduct thorough root cause analysis (RCA)
- Establish comprehensive monitoring and alerting systems to oversee customer deployments, setting thresholds for incident response.
- Conduct regular security assessments and stay abreast of the latest threats and trends to fortify cloud environments against risks.
- Foster a collaborative environment with product developers, operations, and QA teams to enhance workflows and product quality.
- Share knowledge and best practices, contributing to the team’s collective expertise through documentation, training, and mentorship.
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
- 2+ years of hands-on experience as a Site Reliability Engineer, managing and improving production systems at scale.
- Strong collaboration and leadership skills, with a proven ability to drive cross-functional initiatives, deliver results, and align efforts toward organizational goals.
- Expertise in cloud platforms such as Azure, AWS and GCP.
- Expertise in Linux and virtualization and containerization technologies such as Docker and Kubernetes.
- A solid understanding of networking, security principles, and compliance frameworks.
- Proficiency in IaC tools (Terraform, CloudFormation), configuration management (Puppet, Chef, Helm), and scripting languages (Python, Bash, PowerShell).
- Experience with CI/CD tools (Github Actions, Jenkins) and monitoring/logging tools (Prometheus, ELK stack, Splunk).
- Exceptional problem-solving, analytical, and troubleshooting skills, coupled with a proactive, customer-centric mindset.
- Strong communication skills and the ability to collaborate effectively in a team environment.
- A work culture and environment where we value both hard work AND flexibility.
- A flexible vacation/leave plan that fits your needs.
- Health, dental, and life insurance for you and your family.
We've built our business on a commitment to diversity and inclusion.We believe it's important to foster an environment that values and respects each individual's strengths, perspectives, and ideas.Doing so not only drives innovation; it also ensures that we can create superior experiences for our customers, partners, and employees worldwide.We value the diversity of our team regardless of race, ethnicity, religion, gender, age, national origin, disability, sexual orientation,or veteran or marital status, and we do not tolerate any form of discrimination.
Apply NowFirst Name *
Last Name *
Email *
Country Code *
Phone *
Address *
Country *
Upload CV (PDF only / 5MB) *
Do you have authorization to work in selected job location ? * Yes No
Yes, I give WSO2 permission to use my personal data for recruitment purposes only.
I would like to receive emails from WSO2 to learn about new releases, security announcements, and other updates.
#J-18808-Ljbffr-
Site Reliability Engineer
2 semanas atrás
Canoas, Brasil Gauge Tempo inteiroSomos uma empresa do Grupo Stefanini.Especializados em marketing digital, utilizamos uma abordagem integrada que combina tecnologia, inteligência de dados, design e profundo conhecimento do comportamento do consumidor.Nosso foco está em potencializar os resultados de nossos parceiros, oferecendo soluções que vão desde consultoria estratégica até a...
-
Site Reliability Engineer
Há 4 dias
Canoas, Brasil Canonical Tempo inteiroJoin to apply for the Site Reliability Engineer role at CanonicalCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT.Our customers include...
-
Linux Site Reliability Consultant
3 semanas atrás
Canoas, Brasil Pythian Tempo inteiroOverview Join to apply for the Linux Site Reliability Consultant role at Pythian . 2 weeks ago Be among the first 25 applicants. Site Reliability Consultant Brazil | Remote | Work from Home. One available position for the following time zone: PST . Why Pythian At Pythian, we are experts in strategic database and analytics services, driving digital...
-
Site Reliability Engineer
3 semanas atrás
Canoas, Rio Grande do Sul, Brasil buscojobs Brasil Tempo inteiroOverview Sobre a oportunidade: Junte-se à nossa equipe de SRE e seja fundamental para garantir a confiabilidade, escalabilidade e segurança dos ambientes em nuvem dos nossos clientes. Você será responsável por configurar e gerenciar a esteira CI / CD, o ambiente on-cloud e os componentes arquiteturais, garantindo a entrega de soluções robustas e de...
-
Senior Advisor, Machine Learning Engineer
4 semanas atrás
Canoas, Rio Grande do Sul, Brasil Dell Tempo inteiroSenior MLOps Engineer The AI-Centric Engineering unit is a fast paced and exhilarating part of the Dell Technologies Office of the CTO. We drive research and engineering for the future of Dell Technologies in a high-visibility and high-collaboration environment. Our team is specialized in developing advanced generative AI solutions that empower businesses...
-
Software Engineer
1 semana atrás
Canoas, Brasil The Flex Tempo inteiroOverviewJoin to apply for the Software Engineer - Remote role at The Flex.The Flex is on a mission to transform the rental sector globally.We believe renting a home should be as seamless as buying an item online.Our vision is to give tenants the freedom to rent anywhere in the world while enabling landlords to manage their properties with ease, without high...
-
Senior Software Engineer, SaaS
3 semanas atrás
Canoas, Rio Grande do Sul, Brasil Savant Labs Tempo inteiroOverview Join to apply for the Senior Software Engineer, SaaS (Remote Brazil) role at Savant Labs . Get AI-powered advice on this job and more exclusive features. About Savant Labs Savant is a rapidly growing (Series A: $18M) SaaS company focused on building an all-in-one platform for analytics automation. We aim to transform the way business analysts...
-
Senior Bootloader
3 semanas atrás
Canoas, Brasil Canonical Tempo inteiro2 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. We are seeking an experienced software engineer passionate about Linux systems, hardware architectures, Ubuntu, and the open source community, to join the Ubuntu Foundations Engineering team to maintain and enhance Ubuntu bootloader stack to provide...
-
.NET Engineer
Há 2 dias
Canoas, Brasil AgileEngine Tempo inteiroOverview AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. As a Senior/Lead .NET Backend Engineer at...
-
Information Technology Support Engineer
2 semanas atrás
Canoas, Brasil TECEZE Tempo inteiroOverview We are looking for a dedicated and proactive On-Site IT Support Engineer to provide hands-on support for our local infrastructure, users, and critical systems. This role ensures smooth IT operations, continuity of services, and timely resolution of incidents during the designated support period. The engineer will serve as the primary point of...