Platform Operations Team Lead
5 hours ago
About the Role
Mindhive builds AI-powered vision systems that transform industrial production. As we scale globally, reliability, observability, and rapid issue response are critical. The Platform Operations Team Lead (Brazil) plays a central role in ensuring our systems remain healthy across LATAM and European time zones.
You will lead the Platform Operations function — the customer-adjacent, reliability-focused counterpart to our Platform Engineering team in New Zealand. Your team will ensure our deployed systems are monitored, stable, recoverable, and well-understood by the rest of the business.
This role focuses on:
- monitoring, alert quality, and fast incident response
- supporting on-premise and edge deployments
- improving operational processes and tooling
- follow-the-sun coverage across Brazil and Portugal
- ensuring operational readiness for engineering-led upgrades
You will collaborate closely with the NZ-based Platform Engineering team, who drive deep engineering projects (Puppet 8 rollout, Python 3.13 upgrade, CDK migration, CI/CD consistency, platform hardening, test-rig reliability). Together, you will form Mindhive’s global reliability backbone.
This is a hands-on leadership role with high impact on customer experience, system uptime, and our ability to scale installations worldwide.
About You
You are a strong operational leader with deep hands-on technical skills. You thrive in live production environments, enjoy solving real-world system issues, and understand how to build reliable systems across time zones. You excel in:
- structured incident response
- observability and monitoring
- improving operational processes
- leading teams in distributed, multicultural environments
- working close to customers to ensure uptime and stability
You care about technical quality, clarity, and people — and you bring a mindset focused on resilience, collaboration, and steady improvement.
Key Competencies
- Operational Excellence - builds systems, processes, and behaviours that improve stability and reliability.
- Leadership & Mentorship - develops engineers and coordinates distributed teams.
- Systems Thinking - sees the interplay between cloud, edge hardware, software, people, and processes.
- Collaboration - works closely and constructively with NZ Platform Engineering and cross-functional teams.
- Calm Under Pressure - handles incidents and live issues with clarity and good judgement.
- Continuous Improvement - always looking for ways to automate, simplify, and strengthen operations.
Key Responsibilities
Platform Operations Leadership
- Lead and grow the Platform Operations team across Brazil and Portugal.
- Build a high-performing follow-the-sun operational capability that supports both internal teams and customers.
- Establish clear daily operational rhythms, including alert review, ticket management, and incident response.
Team Leadership & Culture
- Mentor engineers and technicians across Brazil and Portugal.
- Create a culture of ownership and continuous improvement.
- Ensure communication is clear, predictable, and aligned with our values.
- Build a team that is highly accountable, collaborative, and customer-focused.
Observability & Monitoring
- Own the quality and accuracy of Datadog dashboards, alerts, service catalog, resource catalog, and operational visibility.
- Reduce alert noise, improve signal quality, and ensure teams receive actionable information.
- Develop and maintain runbooks, playbooks, and operational documentation.
Incident Response & Reliability
- Oversee first-line and second-line incident response during LATAM and EU hours.
- Ensure fast, structured triage for issues across cloud, on-premise, and edge deployments.
- Maintain clear escalation paths and strong communication practices during incidents.
- Partner with Implementation and Customer Success teams to resolve client-facing issues.
Collaboration with Platform Engineering (NZ)
- Act as the operational counterpart to NZ Platform Engineering.
- Ensure operational readiness for major engineering initiatives, such as:
  - Puppet 8 migration
  - Python 3.13 upgrade
  - CDK migration
  - CI/CD unification
  - Platform hardening
  - Test rig and E2E reliability improvements
- Provide field feedback, operational insights, and rollout support for these improvements.
System Health & Operational Excellence
- Monitor the health of live systems across sites and proactively identify stability risks.
- Help drive improvements in:
  - edge hardware reliability
  - network stability
  - server provisioning consistency
  - observability for both cloud and on-prem components
- Work with teams to reduce operational toil and automate repetitive tasks.
Required Skills & Experience
Leadership & Communication
- Experience leading distributed teams across multiple time zones.
- Excellent communication in English and Portuguese.
- Ability to collaborate effectively with engineering, implementation, and customer-facing teams.
- Strong organisational skills and the ability to manage competing priorities.
Technical
- Strong background in DevOps, SRE, or Production Engineering environments.
- Hands-on experience operating hybrid cloud + on-premise / edge systems.
- Proficiency with:
  - Datadog (or similar observability platforms)
  - AWS (IAM, networking, security, monitoring)
  - Containerization (Docker)
  - Kubernetes / K3s
  - IaC tools (AWS CDK ideal)
- Solid programming skills in Python (TypeScript/JavaScript is a plus).
- Understanding of security best practices (identity, access, endpoint, and network security).
Operational
- Experience running incident response, on-call processes, or follow-the-sun operations.
- Proven ability to write and maintain runbooks, playbooks, and operational documentation.
- Experience supporting industrial, IoT, or hardware-integrated systems (ideal).
About Us
Mindhive Ltd is a fast-moving AI company using machine learning and computer vision to reimagine industrial systems. Our products run across cloud, on-premise, and edge deployments, bringing AI performance and reliability directly to the factory floor.
We care deeply about people, quality, and impact. We work collaboratively, iterate quickly, and tackle meaningful, complex problems.
Mindhive is a New Zealand Hi-Tech Awards winner, recognised for innovation and impact in software, AI, and advanced manufacturing.
Work Environment & Flexibility
We support hybrid and remote work, with our people distributed across Brazil, Portugal, Italy, Japan and New Zealand. We trust each other to deliver results in ways that suit our lives while maximising our collective impact. We move quickly, adapt fast, and support each other through the ups and downs that come with building something new and meaningful.
Our values
- Relentless Curiosity - we explore deeply, question assumptions, and seek better ways.
- Authentic Humanity - we support and care for people first.
- Inclusive Connection - we collaborate openly and build strong relationships with customers and colleagues.
- Determination to Deliver - we strive to do the right thing, consistently and with purpose.