Platform Operations Team Lead

1 semana atrás


Lagoa Santa, Brasil Mindhive Global Tempo inteiro

About the RoleMindhive builds AI-powered vision systems that transform industrial production. As we scale globally, reliability, observability, and rapid issue response are critical. The Platform Operations Team Lead (Brazil) plays a central role in ensuring our systems remain healthy across LATAM and European time zones.You will lead the Platform Operations function — the customer-adjacent, reliability-focused counterpart to our Platform Engineering team in New Zealand. Your team will ensure our deployed systems are monitored, stable, recoverable, and well-understood by the rest of the business.This role focuses on:monitoring, alert quality, and fast incident responsesupporting on-premise and edge deploymentsimproving operational processes and toolingfollow-the-sun coverage across Brazil and Portugalensuring operational readiness for engineering-led upgradesYou will collaborate closely with the NZ-based Platform Engineering team, who drive deep engineering projects (Puppet 8 rollout, Python 3.13 upgrade, CDK migration, CI/CD consistency, platform hardening, test-rig reliability). Together, you will form Mindhive's global reliability backbone.This is a hands-on leadership role with high impact on customer experience, system uptime, and our ability to scale installations worldwide.About YouYou are a strong operational leader with deep hands-on technical skills. You thrive in live production environments, enjoy solving real-world system issues, and understand how to build reliable systems across time zones. You excel in:structured incident responseobservability and monitoringimproving operational processesleading teams in distributed, multicultural environmentsworking close to customers to ensure uptime and stabilityYou care about technical quality, clarity, and people — and you bring a mindset focused on resilience, collaboration, and steady improvement.Key CompetenciesOperational Excellence - builds systems, processes, and behaviours that improve stability and reliability.Leadership & Mentorship - develops engineers and coordinates distributed teams.Systems Thinking - sees the interplay between cloud, edge hardware, software, people, and processes.Collaboration - works closely and constructively with NZ Platform Engineering and cross-functional teams.Calm Under Pressure - handles incidents and live issues with clarity and good judgement.Continuous Improvement - always looking for ways to automate, simplify, and strengthen operations.Key ResponsibilitiesPlatform Operations LeadershipLead and grow the Platform Operations team across Brazil and Portugal.Build a high-performing follow-the-sun operational capability that supports both internal teams and customers.Establish clear daily operational rhythms, including alert review, ticket management, and incident response.Team Leadership & CultureMentor engineers and technicians across Brazil and Portugal.Create a culture of ownership and continuous improvement.Ensure communication is clear, predictable, and aligned with our values.Build a team that is highly accountable, collaborative, and customer-focused.Observability & MonitoringOwn the quality and accuracy of Datadog dashboards, alerts, service catalog, resource catalog, and operational visibility.Reduce alert noise, improve signal quality, and ensure teams receive actionable information.Develop and maintain runbooks, playbooks, and operational documentation.Incident Response & ReliabilityOversee first-line and second-line incident response during LATAM and EU hours.Ensure fast, structured triage for issues across cloud, on-premise, and edge deployments.Maintain clear escalation paths and strong communication practices during incidents.Partner with Implementation and Customer Success teams to resolve client-facing issues.Collaboration with Platform Engineering (NZ)Act as the operational counterpart to NZ Platform Engineering.Ensure operational readiness for major engineering initiatives, such as:-Puppet 8 migration-Python 3.13 upgrade-CDK migration-CI/CD unification-Platform hardening-Test rig and E2E reliability improvementsProvide field feedback, operational insights, and rollout support for these improvements.System Health & Operational ExcellenceMonitor the health of live systems across sites and proactively identify stability risks.Help drive improvements in:-edge hardware reliability-network stability-server provisioning consistency-observability for both cloud and on-prem componentsWork with teams to reduce operational toil and automate repetitive tasks.Required Skills & ExperienceLeadership & CommunicationExperience leading distributed teams across multiple time zones.Excellent communication in English and Portuguese.Ability to collaborate effectively with engineering, implementation, and customer-facing teams.Strong organisational skills with ability to manage competing priorities.TechnicalStrong background in DevOps, SRE, or Production Engineering environments.Hands-on experience operating hybrid cloud + on-premise / edge systems.Proficiency with:-Datadog (or similar observability platforms)-AWS (IAM, networking, security, monitoring)-Containerization (Docker)-Kubernetes / K3S-IaC tools (AWS CDK ideal)Solid programming skills in Python (TypeScript/JavaScript is a plus).Understanding of security best practices (identity, access, endpoint, and network security).OperationalExperience running incident response, on-call processes, or follow-the-sun operations.Proven ability to write and maintain runbooks, playbooks, and operational documentation.Experience supporting industrial, IoT, or hardware-integrated systems (ideal).About UsMindhive Ltd is a fast-moving AI company using machine learning and computer vision to reimagine industrial systems. Our products run across cloud, on-premise, and edge deployments, bringing AI performance and reliability directly to the factory floor.We care deeply about people, quality, and impact. We work collaboratively, iterate quickly, and tackle meaningful, complex problems.Mindhive is a New Zealand Hi-Tech Awards winner, recognised for innovation and impact in software, AI, and advanced manufacturing.Work Environment & FlexibilityWe support hybrid and remote work, with our people distributed across Brazil, Portugal, Italy, Japan and New Zealand. We trust each other to deliver results in ways that suit our lives while maximising our collective impact. We move quickly, adapt fast, and support each other through the ups and downs that come with building something new and meaningful.Our valuesRelentless Curiosity - we explore deeply, question assumptions, and seek better ways.Authentic Humanity - we support and care for people first.Inclusive Connection - we collaborate openly and build strong relationships with customers and colleagues.Determination to Deliver - we strive to do the right thing, consistently and with purpose.



  • Lagoa Santa, Brasil Mindhive Global Tempo inteiro

    About the RoleMindhive builds AI-powered vision systems that transform industrial production. As we scale globally, reliability, observability, and rapid issue response are critical. The Platform Operations Team Lead (Brazil) plays a central role in ensuring our systems remain healthy across LATAM and European time zones.You will lead the Platform Operations...

  • UI/Visual Lead

    2 semanas atrás


    Lagoa Santa, MG, Brasil Uitify Tempo inteiro

    English fluency is a must for this role Who We Are Hey there! We're Uitify, a creative design studio delivering premium UI/UX and branding for early-stage SaaS startups in the US. We help founders turn vision into beautifully functional design. If you're all about detail, innovation, and creating modern digital experiences, you'll feel right at home here....

  • Operations Manager

    2 semanas atrás


    Nova Santa Rita, Brasil Amazon Tempo inteiro

    This job is with Amazon, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly. DESCRIPTION: This position will be based in Porto Alegre region. Amazon is looking for an Operations Manager to be based in our new Fulfillment Center in Nova Santa Rita -...

  • Senior It Support Engineer

    2 semanas atrás


    Lagoa Santa, Brasil Rain Tempo inteiro

    Job DescriptionRain isthe fastest-growing earned wage access (EWA) fintech in the U.S. , serving3.5 million employeesand backed bytop investors like QED and Prosus .We've raisednearly $400Min funding—includingthe largest Series A in fintech history —andjust closed our Series B to fuel our next stage of hypergrowth.We're seeking an experienced Senior IT...

  • Platform Engineer

    2 semanas atrás


    Santa Cruz do Sul, Brasil Flowmentum, Inc. Tempo inteiro

    Senior DevOps & Platform Engineer (Azure Networking | .NET 4.6 | Terraform | PowerShell | Azure DevOps)Remote |Global Team | Flexible Hours We're hiring a Senior DevOps & Platform Engineer to join our remote-first, results-driven engineering team. If you're an expert in Azure networking and have deep experience with .NET Framework 4.6 , this is your...

  • Site Operations Specialist

    2 semanas atrás


    Santa Rosa, Brasil beBeeReliability Tempo inteiro

    Job Title: Site Operations Specialist We are seeking a skilled Site Operations Specialist to join our team. The successful candidate will be responsible for providing onsite technical support, maintaining communication systems, and ensuring the continuity of business processes. The ideal candidate will have excellent problem-solving skills, with the ability...

  • Site Operations Specialist

    2 semanas atrás


    Santa Rosa, Brasil beBeeReliability Tempo inteiro

    Job Title: Site Operations Specialist We are seeking a skilled Site Operations Specialist to join our team. The successful candidate will be responsible for providing onsite technical support, maintaining communication systems, and ensuring the continuity of business processes. The ideal candidate will have excellent problem-solving skills, with the ability...

  • AEP Platform Engineer

    2 semanas atrás


    Santa Catarina, Brasil Nearsure Tempo inteiro

    Explore the Nearsure experience Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say goodbye to micromanagement We champion autonomy, open communication, and respect for diversity as our core values. Your well-being matters: Our People Care team is here from...

  • Lead AI Engineer

    1 semana atrás


    Santa Luzia, Brasil GeorgiaTEK Systems Inc. Tempo inteiro

    Lead AI Engineer (3 Positions)Location: Brazil (Remote / Hybrid based on project needs)Role Overview We are seeking highly skilled Lead AI Engineers based in Brazil to design, develop, and deploy scalable AI and machine learning solutions across enterprise systems. The ideal candidates will have strong expertise in Generative AI , RAG architectures , LLMs ,...


  • Lagoa Santa, Brasil HCLTech Tempo inteiro

    We are HCLTech, one of the world’s largest and fastest growing technology and DSA companies with over 227,000 professionals across 60 countries, driving progress through industry-leading capabilities focused on Digital, Engineering and Cloud. The driving force behind this work, our people, is a diverse, creative and passionate audience that enables us to...