
Senior Site Reliability Engineer
8 hours ago
Senior Site Reliability Engineer (SRE) - (Brazil) at Articul8 AI

Position Overview
We are seeking an experienced Site Reliability Engineer (SRE) to join our team and help ensure the reliability, performance, and scalability of our GenAI SaaS platform. As an SRE, you will bridge the gap between development and operations, implementing automation and best practices to maintain our service reliability objectives while supporting rapid innovation.

Key Responsibilities
Architect and maintain scalable, highly available infrastructure for our GenAI platform.
Design and implement robust monitoring, alerting, and observability solutions to proactively ensure system health and performance.
Automate deployment, scaling, and management of our cloud-native infrastructure, reducing toil and improving efficiency.
Define, measure, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to deliver outstanding service quality.
Participate in on-call rotations and provide rapid response to production incidents, minimizing downtime and user impact.
Collaborate closely with development teams to build reliable, scalable, and efficient systems for complex AI workloads.
Lead incident response efforts, conduct thorough post-mortems, and champion continuous improvement initiatives.
Optimize infrastructure for performance, scalability, and cost-effectiveness, especially for high-demand AI workloads.
Implement and enforce security best practices across all systems and environments.
Create and maintain comprehensive documentation, including runbooks and knowledge base articles, to foster a culture of shared knowledge.

Required Qualifications
Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience.
5+ years of experience in DevOps, SRE, or similar roles.
Strong experience with cloud platforms (AWS, GCP, or Azure).
Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.).
Hands-on experience with infrastructure as code tools (Terraform, CloudFormation, etc.).
Solid background in containerization technologies (Docker, Kubernetes).
Proven experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, etc.).
Strong understanding of CI/CD pipelines and automation.
Exceptional troubleshooting and problem-solving skills, including the ability to debug complex systems.

Preferred Qualifications
Experience supporting AI/ML systems in production.
Knowledge of GPU infrastructure management and optimization.
Familiarity with distributed systems and high-performance computing.
Experience with database systems (SQL and NoSQL).
Certifications in cloud platforms (AWS, GCP, Azure).
Experience with chaos engineering and resilience testing.
Knowledge of security best practices and compliance requirements.
-
Site Reliability
6 hours ago
Fortaleza, Brazil Canonical Full-time
Site Reliability / Gitops Engineer
Canonical is a leading provider of open source software and operating systems to the global enterprise and...
-
Site Reliability Engineer
2 days ago
Fortaleza, Brazil INDI Staffing Services Full-time
At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.
Overview...
-
Senior Site Reliability Engineer
6 days ago
Fortaleza, Brazil Mercado Eletrônico Full-time
Mercado Eletrônico is the Latin American leader in B2B procurement management solutions. Its technologies and services for purchasing departments help companies achieve greater savings, agility, governance, and collaboration. With offices in Brazil, the United States, Mexico, and Portugal, it counts more than 1 million suppliers, 10 thousand...
-
[GenAI Core]
2 weeks ago
Fortaleza, Brazil Stone Full-time
(GenAI Core) - Staff Site Reliability Engineer
AWS, Terraform, ArgoCD, HashiCorp Vault
Who is Stone Tech?
Stone was founded with the purpose of playing a leading role in transforming the payments industry, striving to offer the best solutions for entrepreneurs in Brazil. With that in mind, we built Stone...
-
Site Reliability Engineer
2 weeks ago
Fortaleza, Brazil Premiersoft Full-time
Overview
At Premiersoft, we turn challenges into solutions. With more than a decade of pioneering work in mobile development, we are driven by a clear purpose: to create technology experiences that drive our clients' growth and transformation. Our team, made up of more than 200 #Heroes, combines technical expertise with our DNA:...
-
Site Reliability Engineer
2 weeks ago
Fortaleza, Brazil AgileEngine Full-time
Site Reliability Engineer (Middle) ID38916
Overview
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and...
-
Site Reliability Engineer
6 days ago
Fortaleza, Brazil AgileEngine Full-time
Site Reliability Engineer (Middle) ID38916
Overview
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML,...
-
Senior Data Engineer
7 days ago
Fortaleza, Brazil Microtalent is becoming INSPYR Global Solutions Full-time
We are hiring a Data Engineer. 100% remote offer, Brazil only. Direct contract with the client.
The Senior Cloud Data Engineer leads the design, architecture, and implementation of secure, scalable data solutions on AWS, utilizing Snowflake, dbt, and modern automation tools. This role drives best practices for data quality, validation, and governance, while optimizing...
-
AWS/Kubernetes DevOps Engineer
9 hours ago
Fortaleza, Brazil FullStack Labs Full-time
AWS/Kubernetes DevOps Engineer - Remote - Latin America
About FullStack
FullStack is the most transparent IT talent network, connecting highly skilled individuals with top...
-
Senior Site Reliability Engineer
2 weeks ago
Fortaleza, Brazil Stone Full-time
Who is Stone Tech?
Stone was founded with the purpose of playing a leading role in transforming the payments industry, striving to offer the best solutions for entrepreneurs in Brazil. With that in mind, we built Stone Tech! It brings together the Stone Co. technology teams and the group's financial companies, which recognize the entrepreneurial potential of...