Senior Software Engineer, Reliability Engineering
Há 6 horas
Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way.
The Community You Will Join:
We are looking for a Senior Software Engineer to join our Site Reliability Engineering team. As a Senior Software Engineer in Production SRE, you will be responsible for developing and maintaining the tools and systems that enable our engineering teams to operate our services reliably and at scale. You will work closely with our SREs and other engineering teams to ensure our services are properly instrumented and able to scale with our growing business.
The Difference You Will Make:
In this role, your expertise in developing and maintaining tools and systems will be instrumental in bolstering our services' reliability and improving how the company manages incidents broadly. By collaborating closely with other engineering teams you will help establish a culture of reliability throughout the organization by providing a comprehensive incident management platform that is being used for instrumentation, operability, and around incidents. Your ability to identify opportunities for improvement and drive their implementation will contribute significantly to our overall operational efficiency and growth, ensuring that our services remain resilient as our business continues to expand.
Additionally, as an essential part of this role, you will serve as an active member of the Production SRE team, responding to and managing high severity incidents. Your vast technical experience and leadership skills will be invaluable as you step into the role of Incident Commander during these critical events. You will guide cross-functional teams during crisis situations and ensure timely resolution, minimizing the impact on our customers and business. This aspect of your work will require not just strong technical acumen, but also excellent communication and coordination skills, resilience under pressure, and a firm commitment to our culture of blamelessness and continuous learning.
A Typical Day:
- Design, implement and maintain the tools and systems that support service reliability, monitoring, and alerting.
- Collaborate with other engineering teams to ensure services are designed with reliability in mind, and provide guidance on the appropriate use of tooling and automation.
- Identify opportunities to improve the reliability, scalability, and efficiency of our services and drive their implementation.
- Work with infrastructure engineers to understand the challenges they face in operating our services and develop tools and systems to help them manage these challenges.
- Participate in incident response and post-mortems to identify and address systemic issues.
- Continuously evaluate new technologies and industry best practices to improve our SRE tooling and incident response procedures.
- Gain and maintain an intimate understanding of how the critical parts of the site work (services, infrastructure, product, tools, and processes)
- Lead high-urgency incidents and mentor less-experienced engineers in effectively handling incidents.
Your Expertise:
- Bachelor's degree in Computer Science or related field.
- 5+ years of experience in software engineering or SRE roles, with a focus on large scale distributed systems.
- Strong coding skills in at least one programming language, such as Java, Python, or Go.
- Experience with distributed systems and service-oriented architectures.
- Experience with cloud computing platforms such as AWS or Google Cloud Platform.
- Strong conviction in software development best practices, including version control, automated testing, and continuous integration and delivery.
- Experience with containerization technologies such as Docker and Kubernetes.
- Excellent problem-solving and analytical skills, with a strong attention to detail.
- Ability to work effectively in a fast-paced and dynamic environment.
- Strong communication and interpersonal skills.
- Fluent in English (Professional Level)
Our Commitment To Inclusion & Belonging:
Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply.
We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: Please include your full name, the role you're applying for and the accommodation necessary to assist you with the recruiting process.
We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.
-
Software Engineering Manager
Há 6 horas
São Paulo, São Paulo, Brasil TRACTIAN 𝗕𝗥 Tempo inteiroEngineering at TRACTIANThe Engineering team at TRACTIAN is responsible for building and scaling the infrastructure that powers our products: from advanced industrial monitoring systems to our CMMS (Computerized Maintenance Management System) and complex integrations with enterprise systems. We work with massive volumes of IoT data, real-time streams, and...
-
Site Reliability Engineer
Há 6 horas
São Paulo, São Paulo, Brasil PayRetailers Tempo inteiroJob DescriptionWe're PayRetailers, and we offer cutting-edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee's contribution is valued.We've got big plans to expand into new markets and make a meaningful impact on the world...
-
Site Reliability Engineer
1 semana atrás
São Paulo, São Paulo, Brasil INDI Staffing Services Tempo inteiroAt INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Engineering Manager
Há 6 horas
São Paulo, São Paulo, Brasil Handoff Tempo inteiroWhy join us?Handoff is the AI agent that runs a construction company.We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform, we're...
-
Software Engineer
3 semanas atrás
São Paulo, Estado de São Paulo, Brasil GraceMark Solutions Tempo inteiroSoftware Engineer (Fintech Engineering) Location: São Paulo, Brazil (Onsite Tue–Thu)⏱️ Duration: 6 months (contract with extension potential) Compensation: R$25,000 per monthWhat You’ll DoDesign, develop, test, and support scalable applications serving Finance and Tax teams globally.Partner with internal stakeholders to gather business requirements...
-
Site Reliability Engineer
Há 2 horas
São Paulo, Estado de São Paulo, Brasil INDI Staffing Services Tempo inteiroAt INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Senior Azure Engineer – New Development
Há 6 horas
São Paulo, São Paulo, Brasil Engineering Search Firm Inc. Tempo inteiroRemote, HybridWe are seeking a highly experiencedSenior Azure Engineerto lead and executenew application development and cloud operationson Microsoft Azure. This role requires deep hands-on expertise in Azure architecture, DevOps practices, and modern application development, with a strong focus on building scalable, secure, and reliable cloud-native...
-
Senior Software Engineer
Há 6 horas
São Paulo, São Paulo, Brasil SumUp Tempo inteiroBecoming part of the Brazil Engineering team (BR Market) means helping solve our small merchant's problems on your daily routine and helping shape the future of our products in the Brazilian market.As a Senior Software Engineer, you will be responsible for improving the functionality and user experience of SumUp's Receivables platform by building scalable,...
-
Site Reliability Engineer
2 semanas atrás
São Paulo, Estado de São Paulo, Brasil Conquest One Tempo inteiroVaga: SRE Sênior️ Inglês para conversação é imprescindívelHíbrido – presencial 2x na semana no Jardim Paulista (Av. Nove de Julho – São Paulo/SP) + 3x na semana de home office Contratação: CLT Horário de trabalho: 09:00 às 18:00Estamos em busca de um(a) Site Reliability Engineer Sênior para atuar de forma estratégica na transformação e...
-
Software Engineer, Credit Limit Engineering
Há 6 horas
São Paulo, São Paulo, Brasil Brex Tempo inteiroWhy join usBrex is the AI-powered spend platform. We help companies spend with confidence with integrated corporate cards, banking, and global payments, plus intuitive software for travel and expenses. Tens of thousands of companies from startups to enterprises — including DoorDash, Flexport, and Compass — use Brex to proactively control spend, reduce...