Senior Site Reliability Engineer
3 weeks ago
Senior Site Reliability Engineer (SRE) - (Brazil)

About Us
Articul8 AI is at the forefront of Generative AI innovation, delivering cutting‑edge SaaS products that transform how businesses operate. Our platform empowers organizations to leverage the power of artificial intelligence in a reliable, scalable, and secure environment.

Position Overview
We are seeking an experienced Site Reliability Engineer (SRE) to join our team and help ensure the reliability, performance, and scalability of our GenAI SaaS platform. As an SRE, you will bridge the gap between development and operations, implementing automation and best practices to maintain our service reliability objectives while supporting rapid innovation.

Key Responsibilities
Architect and maintain scalable, highly available infrastructure for our GenAI platform.
Design and implement robust monitoring, alerting, and observability solutions to proactively ensure system health and performance.
Automate deployment, scaling, and management of our cloud‑native infrastructure, reducing toil and improving efficiency.
Define, measure, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to deliver outstanding service quality.
Participate in on‑call rotations and provide rapid response to production incidents, minimizing downtime and user impact.
Collaborate closely with development teams to build reliable, scalable, and efficient systems for complex AI workloads.
Lead incident response efforts, conduct thorough post‑mortems, and champion continuous improvement initiatives.
Optimize infrastructure for performance, scalability, and cost‑effectiveness, especially for high‑demand AI workloads.
Implement and enforce security best practices across all systems and environments.
Create and maintain comprehensive documentation, including runbooks and knowledge base articles, to foster a culture of shared knowledge.

Qualifications

Required
Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent practical experience.
5+ years of experience in DevOps, SRE, or similar roles.
Strong experience with cloud platforms (AWS, GCP, or Azure).
Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.).
Hands‑on experience with infrastructure as code tools (Terraform, CloudFormation, etc.).
Solid background in containerization technologies (Docker, Kubernetes).
Proven experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, etc.).
Strong understanding of CI/CD pipelines and automation.
Exceptional problem‑solving skills and the ability to troubleshoot complex systems.

Preferred
Experience supporting AI/ML systems in production.
Knowledge of GPU infrastructure management and optimization.
Familiarity with distributed systems and high‑performance computing.
Experience with database systems (SQL and NoSQL).
Certifications in cloud platforms (AWS, GCP, Azure).
Experience with chaos engineering and resilience testing.
Knowledge of security best practices and compliance requirements.

Ready to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow’s AI at Articul8 AI.
-
Site Reliability Engineer
4 weeks ago
Santo André, Brazil BairesDev Full-time
Site Reliability Engineer - Remote Work | REF#
At BairesDev® we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of...
-
Software Engineer II
4 weeks ago
Espírito Santo, Brazil Microsoft Full-time
Software Engineer II / Senior Software Engineer
We are hiring multiple Software Engineers II and Senior Software Engineers to join the Microsoft 365 team. These are remote positions, allowing you to work from the comfort of your home. The Microsoft 365 team is looking...
-
Senior Backend Engineer
4 weeks ago
Espírito Santo, Brazil Sphise Full-time
Senior Backend Engineer (PHP / Laravel)
Location: Brazil (Remote)
Our trusted high-growth healthcare technology partner is seeking a talented Senior Backend Engineer (PHP / Laravel) to join their dynamic team. This innovative company is dedicated to revolutionizing the healthcare industry through cutting-edge technology solutions.
Position Overview
As a...
-
Linux Site Reliability Consultant
3 weeks ago
Espírito Santo, Brazil Pythian Full-time
Overview
Linux Site Reliability Consultant - Brazil | Remote | Work from Home. One available position for the following time zone: PST.
Why Pythian
At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring...
-
Senior Software Engineer
1 week ago
Espírito Santo, Brazil GeorgiaTEK Systems Inc. Full-time
Senior Software Engineer (Golang)
Location: Remote (Brazil)
Experience: 8+ years overall
Engagement: Contractor
About the Role
We are seeking an experienced Senior Software Engineer with strong expertise in Golang, Node.js, and React to develop and enhance high-performance, scalable backend services. The role emphasizes Go, gRPC, Kafka, Kubernetes, and...
-
Senior Backend
4 weeks ago
Cabo de Santo Agostinho, Brazil Kake Full-time
Senior Backend (Node+Python) Engineer
Summary
We’re looking for a Senior Backend Engineer to join one of our key partners in building the next generation of digital commerce infrastructure. This role is focused on creating high-performance, scalable backend services that power mission-critical features for large-scale retailers and digital platforms. You...
-
Senior Data Engineer
4 weeks ago
Espírito Santo, Brazil Workana Full-time
Workana is the largest remote work platform for talents in Latin America. Our new segment, Workana Premium, focuses on matching the most exceptional professionals with leading and innovative companies around the globe. Enjoy competitive compensation, dedicated support, and the flexibility of remote work within a dynamic environment that fosters collaboration...
-
DevOps Engineer
3 weeks ago
Santo André, Brazil Nearsure Full-time
DevOps Engineer - Work from home
Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and...
-
Senior Mobile Flutter Engineer
1 week ago
Cabo de Santo Agostinho, Brazil Kake Full-time
Senior Mobile Flutter Engineer
Summary
We're looking for a Senior Mobile Engineer to join one of our partners in building innovative and high-performing mobile applications. You'll play a key role in designing and developing Flutter-based apps that deliver seamless user experiences, ensuring performance, scalability, and clean architecture across multiple...