Senior Site Reliability Engineer

1 semana atrás


BrazilRemote, Brasil Articul8 Tempo inteiro US$90.000 - US$120.000 por ano
About Us

Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our platform empowers organizations to leverage the power of artificial intelligence in a reliable, scalable, and secure environment.

Position Overview

We are seeking an experienced Site Reliability Engineer (SRE) to join our team and help ensure the reliability, performance, and scalability of our GenAI SaaS platform. As an SRE, you will bridge the gap between development and operations, implementing automation and best practices to maintain our service reliability objectives while supporting rapid innovation.

Key Responsibilities
  • Architect and maintain scalable, highly available infrastructure for our GenAI platform.

  • Design and implement robust monitoring, alerting, and observability solutions to proactively ensure system health and performance.

  • Automate deployment, scaling, and management of our cloud-native infrastructure, reducing toil and improving efficiency.

  • Define, measure, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to deliver outstanding service quality.

  • Participate in on-call rotations and provide rapid response to production incidents, minimizing downtime and user impact.

  • Collaborate closely with development teams to build reliable, scalable, and efficient systems for complex AI workloads.

  • Lead incident response efforts, conduct thorough post-mortems, and champion continuous improvement initiatives.

  • Optimize infrastructure for performance, scalability, and cost-effectiveness—especially for high-demand AI workloads.

  • Implement and enforce security best practices across all systems and environments.

  • Create and maintain comprehensive documentation, including runbooks and knowledge base articles, to foster a culture of shared knowledge.

Qualifications Required
  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience

  • 5+ years of experience in DevOps, SRE, or similar roles

  • Strong experience with cloud platforms (AWS, GCP, or Azure)

  • Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.)

  • Hands-on experience with infrastructure as code tools (Terraform, CloudFormation, etc.)

  • Solid background in containerization technologies (Docker, Kubernetes)

  • Proven experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, etc.)

  • Strong understanding of CI/CD pipelines and automation

  • Exceptional troubleshooting and problem-solving skills and ability to troubleshoot complex systems

Preferred
  • Experience supporting AI/ML systems in production

  • Knowledge of GPU infrastructure management and optimization

  • Familiarity with distributed systems and high-performance computing

  • Experience with database systems (SQL and NoSQL)

  • Certifications in cloud platforms (AWS, GCP, Azure)

  • Experience with chaos engineering and resilience testing

  • Knowledge of security best practices and compliance requirements

Ready to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow's AI at Articul8 AI


  • Site Reliability Engineer

    1 semana atrás


    Remote, Brasil Seedify Tempo inteiro R$90.000 - R$120.000 por ano

    Seedify is a leading cryptocurrency launchpad platform dedicated to fostering innovation and success in the Web3 space. Our mission is to identify and assist promising teams and projects and offer outstanding returns to our investor base Job Description We are seeking a highly skilled Site Reliability Engineer with extensive experience in DevOps,...


  • Remote, São Paulo, Brazil Wizdaa Tempo inteiro US$90.000 - US$120.000 por ano

    Level: Senior (5+ years) | Department: Foundation/Platform Engineering Role Overview Lead development of internal Kubernetes platform enabling scalable application deployment through GitOps. Engineer solutions for deployment complexity, database migrations, multi-environment management, and developer productivity. Drive DevOps practices including CI/CD...

  • Senior Solutions Engineer

    1 semana atrás


    Remote - Brazil Twilio Tempo inteiro US$90.000 - US$120.000 por ano

    Who we are At Twilio, we're shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences. Our dedication to remote-first work, and strong culture of connection and global inclusion...

  • Senior Data Engineer

    Há 15 horas


    Brazil, BR Pride Global Tempo inteiro

    We're Hiring: Senior Data Engineer (MLOps) | Remote from Brazil | Fluent English required | USD-Hourly payLocation: Remote – Brazil only ️Language: Fluent English requiredAre you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular...

  • Senior Data Engineer

    Há 2 horas


    Brazil Pride Global Tempo inteiro

    We're Hiring: Senior Data Engineer (MLOps) | Remote from Brazil | Fluent English required | USD-Hourly pay Location : Remote – Brazil only ️Language : Fluent English required Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and...

  • BSAtech | Recife

    1 semana atrás


    Manaus, Pernambuco, Brazil BSATech Tempo inteiro

    A BSAtech é uma empresa especializada no desenvolvimento de jogos de entretenimento com alcance global. Nosso compromisso é entregar experiências digitais de alta qualidade, combinando inovação, criatividade e tecnologia.Estamos em um momento de expansão e buscamos profissionais excepcionais para nos ajudar a ampliar nossas áreas de negócio e...


  • Brazil, BR HCLTech Tempo inteiro

    Your role and responsabilities:Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution.Performing deep-dive application troubleshooting and identifying preventive actions.Managing CIRS-related requests including deployments, feature toggles, and data fixes.Following up on major production incidents...

  • Senior Data Engineer

    1 semana atrás


    Buenos Aires, Espírito Santo, Brazil beBeeDataEngineer Tempo inteiro US$105.000 - US$140.000

    Job TitleWe are seeking a Senior Data Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and implementing scalable data pipelines, building and maintaining large-scale data systems, and collaborating with cross-functional teams to deliver high-quality software solutions.Key ResponsibilitiesDesign...

  • Senior Backend Engineer

    1 hora atrás


    Federative Republic Of Brazil Sphise Tempo inteiro

    Senior Backend Engineer (PHP/Laravel) Location:Brazil (Remote) Our trusted high-growth healthcare technology partner is seeking a talented Senior Backend Engineer (PHP/Laravel) to join their dynamic team. This innovative company is dedicated to revolutionizing the healthcare industry through cutting-edge technology solutions. Position Overview:As a Senior

  • Senior Web Engineer

    1 hora atrás


    Federative Republic Of Brazil beBeefrontend Tempo inteiro

    Job Title: Senior Web Engineer We are seeking a highly skilled and experienced Senior Web Engineer to join our team. As a key member of the development team, you will be responsible for designing, developing, and testing complex web applications using cutting-edge technologies.