Senior Site Reliability
4 semanas atrás
Senior Site Reliability / Gitops Engineer Canonical is a leading provider of open‑source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. We are hiring a Senior Site Reliability / Gitops Engineer to our Information Systems (IS) team. This role is an opportunity for an “automation‑first” senior technologist with a passion for Linux to build a career with Canonical and drive success for users of Ubuntu and our open‑source products. Job Summary The IS team supports and maintains all of Canonical’s IT production services, running services used by over 60 million Ubuntu users. As a Senior SRE & Gitops engineer you will drive operations automation—using IaC, CI/CD pipelines, and Canonical’s leading products—to both private and public clouds. You will also improve Canonical products and open‑source technologies by providing feedback, writing bugs, or contributing pull requests, and collaborate on design and implementation with other teams. As a Senior Site Reliability / Gitops Engineer you will Drive the development of automation and Gitops in your team as an embedded tech lead Collaborate closely with the IS architect to align solutions with the IS architecture vision Design and architect services that IS can offer to the organization as products Apply IaC experience to develop infrastructure as code practice within IS, constantly increasing automation and improving IaC processes Automate software operations for re‑usability and consistency across private and public clouds, considering distributed systems complexities Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure Develop skills in troubleshooting, capacity planning, and performance investigation; set up and use observability tools such as Prometheus, Grafana, and Elasticsearch; design and maintain monitoring and alerting Assist and work with globally distributed engineering, operations, and support peers Receive uninterrupted development time to focus on larger projects and automation of manual tasks Share experience, know‑how, and best practices with team members in design sessions, mentorship and collaborative work Take final responsibility for time‑critical escalations What we are looking for in you A modern view on hosting architecture, driven by IaC across private and public clouds A product mindset, thriving to develop products rather than solutions Python software development experience with large projects Experience with Kubernetes or other container orchestration systems Proven exposure to manage and deploy cloud infrastructure with code Practical knowledge of Linux networking, routing, and firewalls Affinity with various forms of Linux storage, from Ceph to databases Hands‑on experience administering enterprise Linux servers Extensive knowledge of cloud computing concepts and technologies Bachelor's degree or greater, preferably in computer science or related engineering field Excellent communication in English across email, chat, video or voice calls and in person Motivated to troubleshoot from kernel to web and willing to ask others when appropriate Willingness to be flexible and learn new ideas quickly Passion for fast‑changing environments Comfortable working within distributed teams Passionate about open‑source, especially Ubuntu or Debian What we offer Distributed work environment with twice‑yearly team sprints in person Personal learning and development budget of USD 2 000 per year Annual compensation review Recognition rewards Annual holiday leave Maternity and paternity leave Team member assistance program & wellness platform Opportunity to travel to new locations to meet colleagues Priority Pass and travel upgrades for long‑haul company events About Canonical Canonical is a pioneering tech firm at the forefront of the global move to open source. As the publisher of Ubuntu, one of the most important open‑source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a high standard for people joining the company. We expect excellence; to succeed, we must be the best. Most colleagues at Canonical have worked from home since our inception in 2004. Working here is a step into the future and will challenge you to think differently, work smarter, learn new skills, and raise your game. We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration. #J-18808-Ljbffr
-
Site Reliability Engineer
4 semanas atrás
Rio de Janeiro, Brasil BairesDev Tempo inteiroOverview Site Reliability Engineer at BairesDev. We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. We will need you to approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation, and reliability....
-
Site Reliability Engineer
3 semanas atrás
Rio de Janeiro, Brasil BairesDev Tempo inteiroJoin to apply for the Site Reliability Engineer - Remote Work role at BairesDev At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely...
-
Site Reliability Engineer
3 semanas atrás
Rio de Janeiro, Brasil BairesDev Tempo inteiroJoin or sign in to find your next job Join to apply for the Site Reliability Engineer - Remote Work role at BairesDev . 3 days ago Be among the first 25 applicants. At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting‑edge solutions to giants like Google and the most innovative startups in Silicon Valley....
-
Site Reliability Engineer
Há 2 dias
Rio de Janeiro, Rio de Janeiro, Brasil BairesDev Tempo inteiroAt BairesDev, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley.Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact worldwide.When you apply for this position,...
-
Site Reliability Engineer
4 semanas atrás
Rio de Janeiro, Brasil BairesDev Tempo inteiroOverview Site Reliability Engineer at BairesDev. We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. You will approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation and reliability. What You...
-
Site Reliability Engineer
Há 5 dias
Rio de Janeiro, Brasil BairesDev Tempo inteiroOverview At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact worldwide. When you apply for...
-
Site Reliability Engineer
3 semanas atrás
Rio de Janeiro, Brasil Canonical Tempo inteiroOverview Site Reliability Engineer role at Canonical (globally remote). Canonical provides open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, supports enterprise initiatives in cloud, data science, AI, engineering innovation, and IoT. We are hiring a Site Reliability Engineer to help perfect...
-
Staff Site Reliability Engineer
3 semanas atrás
Rio de Janeiro, Brasil Nearsure Tempo inteiroStaff Site Reliability Engineer - Work from home Staff Site Reliability Engineer - Work from home 1 week ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say...
-
Site Reliability Engineer
1 dia atrás
Cabo de Santo Agostinho, Brasil Psm Company Tempo inteiroSobre a vagaA PSM Company é especializada na identificação de Talentos para as áreas de TI / Telecom como também para as áreas operacionais e administrativas.Nossa história de sucesso está baseada em nosso modelo de negócios que proporcionam assertividade e qualidade no processo seletivo, baixo Turn Over e isenção de riscos e passivos...
-
Site Reliability Engineer
2 semanas atrás
Cabo de Santo Agostinho, Brasil Psm Company Tempo inteiroSobre a vaga A PSM Company é especializada na identificação de Talentos para as áreas de TI / Telecom como também para as áreas operacionais e administrativas. Nossa história de sucesso está baseada em nosso modelo de negócios que proporcionam assertividade e qualidade no processo seletivo, baixo Turn Over e isenção de riscos e passivos...