Site Reliability Engineer
Há 3 dias
Job Description Join our Data & AI Platform team as a Site Reliability Engineer (SRE) – Platform Operation. You will support and maintain scalable, resilient, and efficient infrastructure for our Data & AI Platform, ensuring reliable infrastructure availability and enhancing business as usual. You will collaborate closely with Platform Engineers, Architects, Data Engineers, DevOps, and Security teams to maintain and optimize our platforms. Your Missions Support, manage, and maintain Azure resources: Azure SQL, Synapse, Data Factory, Databricks, Unity Catalog Monitor Azure workloads, troubleshoot incidents, alerts, and performance bottlenecks Implement and manage RBAC, identity & access policies, and compliance controls Optimize Azure cost and performance using Azure Monitor, DataDog, and Cost Management tools Automate tasks using PowerShell, Azure CLI, Terraform, and Python Utilize Git, GitHub Actions, and Airflow for workflow automation Provide L2/L3 support for data pipelines, reporting, and cloud services Conduct incident response, root cause analysis (RCA), and proactive issue resolution Collaborate with Cloud Engineering, Data Engineers, BI Developers, and Cloud Architects Follow ITSM processes: Incident, Change, and Problem Management Ensure platform security and compliance with frameworks like MICS Your Profile Academic background: Bachelor’s or Master’s degree in Computer Science, Information Technology, or related field (minimum 3 years of experience) Experience: 5+ years hands‑on with cloud platforms (Azure, AWS, GCP), programming (Bash, PowerShell, Terraform, Python, Java), and Infrastructure as Code (IaC) English language: Professional working proficiency in English and the local language Tools / software: Deep expertise in Azure, Databricks, Unity Catalog, Kubernetes, Helm, Docker, Power BI, Datadog, Grafana, GitHub, Azure DevOps, ArgoCD, Airflow, SSIS, Power Query, and relational/NoSQL databases AI experience: Experience supporting enterprise Data & AI platforms Soft skills: Analytical problem‑solving Effective communication and active listening Team player with respect for others Strong troubleshooting and platform monitoring skills Automation (Python, PowerShell, CLI, KQL, Terraform) ITIL-based workflow experience What we offer An international community bringing together 110+ different nationalities An environment where trust has a central place: 70% of our key leaders started their careers at the first level of responsibility A robust training system with our internal Academy and 250+ available modules A vibrant workplace that frequently gathers for internal events (afterworks, team buildings, etc.) Strong commitments to CSR, notably through participation in our WeCare Together program Diversity and Inclusion Mantu is proud to be an equal opportunity workplace. We are committed to promoting diversity within the workforce and creating an inclusive working environment. For this purpose, we welcome applications from all qualified candidates regardless of gender, sexual orientation, race, ethnicity, beliefs, age, marital status, disability, or other characteristics. #J-18808-Ljbffr
-
Site Reliability Engineer
Há 2 horas
São Paulo, Brasil Conquest One Tempo inteiro🎯 Vaga: SRE Sênior🗣️ Inglês para conversação é imprescindível📍Híbrido – presencial 2x na semana no Jardim Paulista (Av. Nove de Julho – São Paulo/SP) + 3x na semana de home office📑 Contratação: CLT🕓 Horário de trabalho: 09:00 às 18:00Estamos em busca de um(a) Site Reliability Engineer Sênior para atuar
-
Site Reliability Engineer
4 semanas atrás
São Paulo, Brasil PayRetailers Tempo inteiroSite Reliability Engineer Join PayRetailers in São Paulo. We are expanding across Latin America and Africa, building cutting‑edge payment solutions. We value creativity, growth, and collaboration. About the role Site Reliability Engineers are guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform...
-
Senior Site Reliability Engineer
3 semanas atrás
São Paulo, Brasil Dev.Pro Tempo inteiroSenior Site Reliability Engineer - OP01988 6 days ago Be among the first 25 applicants
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Handoff Tempo inteiroWhy Join Us Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform,...
-
Site Reliability Engineer
2 semanas atrás
São Paulo, São Paulo, Brasil Handoff Tempo inteiroWhy join us? Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform, we're...
-
Site Reliability Engineer
Há 3 dias
São Paulo, Brasil Handoff Tempo inteiroSite Reliability Engineer at Handoff We are transforming the construction industry with Handoff, an AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work—backed by real‑time cost data, intuitive design, and workflows that “speak contractor.” With over 10,000 monthly active users...
-
Site Reliability Engineer
1 semana atrás
São Paulo, Brasil Handoff Tempo inteiroWhy Join Us Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work - backed by real-time cost data, intuitive design, and workflows that "speak contractor." With over 10,000 monthly active users and $6B in annualized project volume already flowing through our platform,...
-
Site Reliability Engineer
2 semanas atrás
São Paulo, Brasil Handoff Tempo inteiroWhy join us? Handoff is the AI agent that runs a construction company. We help remodelers automate estimating, streamline operations, and win more work – backed by real‑time cost data, intuitive design, and workflows that “speak contractor.” With over 10,000 monthly active users and $6B in annualized project volume already flowing through our...
-
Site Reliability Engineer
3 semanas atrás
São Paulo, Brasil INDI Staffing Services Tempo inteiroAt INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.Overview of the role:We are looking for a Site Reliability Engineer to build and maintain highly reliable,...
-
Remote Site Reliability Engineer
Há 7 dias
São Paulo, Brasil Indi Staffing Services Tempo inteiroOverview We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. Approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation and reliability. Responsibilities Build, automate, and maintain...