
Staff Site Reliability Engineer
2 semanas atrás
Staff Site Reliability Engineer - Work from homeStaff Site Reliability Engineer - Work from home1 week ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say goodbye to micromanagement We champion autonomy, open communication, and respect for diversity as our core values. ️Your well-being matters: Our People Care team is here from day one to support you with everything from time-off requests to wellness check-ins. Plus, our Accounts Management team ensures smooth, effective client relationships, so you can focus on what you do best. Ready to grow with us? Here’s what we offer you by joining us Competitive USD salary – We value your skills and contributions 100% remote work – While you can work from anywhere, you’re always welcome to connect with teammates and grow your network at our coworking spaces across LATAM Paid time off – Take the time you need according to your country’s regulations, all while receiving your full salary. Rest, recharge, and come back stronger National Holidays celebrated – Take time off to celebrate important events and traditions with loved ones, fully embracing your culture. Sick leave – Focus on your health without the stress. Take the necessary time to recover and feel better. Refundable Annual Credit – Spend it on the perks you love to enhance your work-life balance Team-building activities – Join us for coffee breaks, tech talks, and after-work gatherings to bond with your Nearsure family and feel part of our vibrant community. Birthday day off – Enjoy an extra day off during your birthday week to celebrate in style with friends and family About the project: As a Staff Site Reliability Engineer , you will own and optimize OpenTelemetry pipelines, enabling scalable and efficient observability. You’ll build tools that empower teams, support incident response, and drive best practices. Your work ensures a reliable, secure infrastructure and actionable alerting across the organization. How your day-to-day work will look like Design, implement, and maintain observability pipelines across the three main signals—logs, metrics, and traces—ensuring standardized, scalable, and efficient data ingestion. Optimize ingestion strategies to balance cost, performance, and usability. Build self-service automation and tooling that enables development teams to instrument and leverage observability without requiring manual intervention from the SRE team. Drive adoption of best practices while ensuring teams own their telemetry. Design the processes, playbooks, checklists, and automations for them and other engineers to follow during an incident. Interact with members from almost all teams across the business to understand their monitoring, alerting, and SLO / SLA requirements and design systems and processes that ensure we meet or exceed these requirements. Influence architectural decisions during initial design stages to ensure resiliency and scale at the outset of software development. Design the processes, playbooks, checklists, and automations for them and other engineers to follow during an incident. Leverage Infrastructure-as-Code (IaC) to provision and manage monitoring tools, alerting rules, and our observability configurations across OTEL Pipelines. Design base-level requirements for new and existing services to ensure that all client infrastructure and code are monitored consistently and accurately at a basic level. Take full ownership of client infrastructure reliability, ensuring adherence to key availability and security KPIs. This would make you the ideal candidate Bachelor's Degree in Computer Science, Engineering, or a related field. 8+ Years of experience working as an SRE Engineer or in a very similar role, more focused on observability. 5+ Years of experience working with cloud (AWS). 5+ Years of experience working with IaC tools (Terraform) and GitOps CI/CD solutions (ArgoCD, GitHub Actions, or similar). 4+ Years of experience working with monitoring and logging tools such as Grafana, Prometheus, Loki, New Relic, or Datadog (experience managing observability pipelines at scale in high-throughput environments). 4+ Years of experience working in Kubernetes, including its core components, deployment methodologies, and monitoring best practices. Strong communication skills with team members and stakeholders (technical and nontechnical communication). Strong scripting abilities (Python, Go, or similar) for automating observability tasks. Experience integrating incident management platforms (PagerDuty, Jira) with automated alerting workflows. Advanced English Level is required for this role as you will work with US clients. Effective communication in English is essential to deliver the best solutions to our clients and expand your horizons. What to expect from our hiring process 1. Let’s chat about your experience 2. Impress our recruiters, and you’ll move on to a technical interview with our top developers. 3. Nail that, and you’ll meet our client - your final step to joining our amazing team At Nearsure, we’re dedicated to solving complex business challenges through cutting-edge technology and we believe in the power of tailored solutions. Whether you are passionate about transforming businesses with Generative AI, building innovative software products, or implementing comprehensive enterprise platform solutions, we invite you to be part of our dynamic team We would love to hear from you if you are eager to make an impact and join a collaborative team that values creativity and expertise. Let’s work together to shape the future of technology By applying to this position, you authorize Nearsure to collect, store, transfer, and process your personal data in accordance with our Privacy Policy. For more information, please review our Privacy Policy. ( Seniority levelSeniority levelMid-Senior level Employment typeEmployment typeFull-time Job functionJob functionInformation Technology IndustriesSoftware Development Referrals increase your chances of interviewing at Nearsure by 2x Sign in to set job alerts for “Site Reliability Engineer” roles.Staff Site Reliability Engineer - Work from homeSenior Site Reliability / Gitops EngineerSoftware Engineer (Python/Linux/Packaging)Python and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, Workflows, AI/ML & AnalyticsJunior Software Engineer - Cross-platform C++ - MultipassSoftware Engineer - Solutions EngineeringPython Software Engineer - Ubuntu Hardware Certification TeamWe’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
-
Senior Site Reliability
1 semana atrás
Rio de Janeiro, Brasil Canonical Tempo inteiroSenior Site Reliability / Gitops EngineerJoin or sign in to find your next job Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Canonical is a leading provider...
-
Site Reliability
4 semanas atrás
Região Geográfica Imediata de Criciúma, Brasil Canonical Tempo inteiroJoin to apply for the Site Reliability / Gitops Engineer role at Canonical 1 day ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features. Canonical is a leading provider of open source software and operating systems to the global...
-
Site Reliability Engineer
2 semanas atrás
Região Geográfica Intermediária de Juiz de Fora, Brasil Pacifica Continental Tempo inteiroOur engineering team has built the largest private Medicare marketplace in the country. We passionately focus on the continuous improvement of the systems we build. We have spent many years growing and fostering a DevOps culture by bridging the divide between our Software and Infrastructure Engineering departments. We want the cross-functional teams that we...
-
Senior Site Reliability Engineer
Há 7 dias
Rio de Janeiro, Brasil Canonical Tempo inteiroOverview Join to apply for the Senior Site Reliability Engineer role at Canonical . Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT....
-
Senior Site Reliability Engineer
Há 5 dias
Rio de Janeiro, Brasil Mercado Eletrônico Tempo inteiroO Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2B.Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, governança e colaboração.Com escritórios no Brasil, Estados Unidos, México e Portugal, contabiliza mais de 1 milhão de fornecedores, 10 mil compradores...
-
Site Reliability Engineer
1 semana atrás
Greater Rio de Janeiro, Brasil Personetics Tempo inteiro R$90.000 - R$120.000 por anoDescriptionPersonetics is shaping the Cognitive Banking era, harnessing AI to help banks anticipate customer needs, provide actionable insights, and deliver intelligent financial guidance. Our platform continuously analyzes and leverages real-time transactional data, enabling banks to proactively support customers in managing their finances and reaching...
-
Site Reliability Engineer
2 semanas atrás
Região Geográfica Intermediária de Sorocaba, Brasil BairesDev Tempo inteiroSite Reliability Engineer - Remote Work: At BairesDev, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact...
-
Site Reliability
1 semana atrás
Rio de Janeiro, Brasil Canonical Tempo inteiroJob Summary Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. We are hiring a Site Reliability / Gitops Engineer to our...
-
Staff Frontend Engineer
2 semanas atrás
Rio de Janeiro, Brasil Cint Tempo inteiroJob DescriptionThe OpportunityCint is looking to raise the bar on engineering standards as well as user experiences.We offer a smaller company atmosphere, where engineers can shape technical strategy, lead cross-team initiatives, and set the direction for how we build and deliver products at scale.Our products are redefining experiences within the...
-
Staff Frontend Engineer
2 semanas atrás
Rio de Janeiro, Brasil Cint Tempo inteiroWho We Are Cint is a pioneer in research technology (ResTech).Our customers use the Cint platform to post questions and get answers from real people to build business strategies, confidently publish research, accurately measure the impact of digital advertising, and more.The Cint platform is built on a programmatic marketplace, which is the world's largest,...