ML Data Pipeline Engineer
1 semana atrás
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.What You’ll Do Pipeline Operations & Improvement Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras. Improve video capture software robustness, particularly handling network interruptions and operational monitoring. Deploy and monitor services in remote Linux environments with appropriate DevOps practices. Data Quality & Validation Evolve our Python-based QC engine that validates data pre- and post-annotation Implement checks for IMU-video time synchronization, sensor health, and measurement consistency Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities. Develop validation logic comparing annotations against sensor data to ensure temporal alignment. Analysis & Troubleshooting Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes Tooling and Visualization Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders Create visualizations (Chart.js) for QC metrics and signal analysis Integrate with LabelStudio annotation interface What You Bring Required Strong Python programming skills, particularly for data processing pipelines Experience with time-series data and digital signal processing Comfortable working in Linux environments and deploying/monitoring remote services Ability to debug complex multi-component systems (sensors, video, networks, sync) Data quality mindset: designing validation rules, tracking metrics, investigating anomalies SQL/database experience for managing pipeline metadata Highly Valued Video processing experience (RTSP streams, encoding, OCR) Working with sensor/IoT data and handling connectivity challenges NextJS or modern web frameworks for data tooling DevOps practices: containerization, monitoring, logging, alerting Experience with annotation pipelines and ML training data workflows Background in biomechanics, sports science, or wearable sensors Tech Stack Languages: Python (primary), JavaScript/TypeScript (NextJS UI) Data: IMU sensor streams, video (RTSP), time-series analysis, DSP Tools: LabelStudio, Chart.js, Linux/bash, OCR libraries Infrastructure: Remote deployment, monitoring systems You'll Thrive Here If You Enjoy detective work: diagnosing why data doesn't match expectations Balance pragmatism with quality: shipping improvements while maintaining reliability Communicate well across technical and non-technical stakeholders Can work autonomously in a small, mission-driven team
-
Machine Learning
Há 5 dias
Campinas, Brasil Bizmoni - The Next Gen AI Super APP Tempo inteiroOverviewJoin to apply for the Machine Learning (ML) Ops Engineer role at Bizmoni - The Next Gen AI Super APP . Bizmoni is the worlds first AI Super App designed to help anyone earn, learn, and grow in the AI era. We are building a global, fully remote team to shape the future of financial technology. Our mission is to make powerful AI-driven financial and...
-
Machine Learning
1 semana atrás
Campinas, Brasil Bizmoni Corp. Tempo inteiroLocation: Fully remoteEmployment type: Part-time About Bizmoni Corp.Bizmoni is the worlds first AI Super App designed to help anyone earn, learn, and grow in the AI era. We are building a global, fully remote team to shape the future of financial technology. Our mission is to make powerful AI-driven financial and business tools accessible to everyone from...
-
Machine Learning
Há 4 dias
Campinas, Brasil Bizmoni Tempo inteiroLocation: Fully remoteEmployment type: Part-timeAbout Bizmoni Corp.Bizmoni is the worlds first AI Super App designed to help anyone earn, learn, and grow in the AI era.We are building a global, fully remote team to shape the future of financial technology.Our mission is to make powerful AI-driven financial and business tools accessible to everyone from...
-
Machine Learning
Há 3 dias
Campinas, Brasil Bizmoni Tempo inteiroLocation: Fully remote Employment type: Part-time About Bizmoni Corp. Bizmoni is the worlds first AI Super App designed to help anyone earn, learn, and grow in the AI era. We are building a global, fully remote team to shape the future of financial technology. Our mission is to make powerful AI-driven financial and business tools accessible to everyone from...
-
Machine Learning
Há 6 dias
Campinas, Brasil Bizmoni - The Next Gen AI Super APP Tempo inteiroOverview Join to apply for the Machine Learning (ML) Ops Engineer role at Bizmoni - The Next Gen AI Super APP . Bizmoni is the worlds first AI Super App designed to help anyone earn, learn, and grow in the AI era. We are building a global, fully remote team to shape the future of financial technology. Our mission is to make powerful AI-driven financial and...
-
Data Engineer ID43407
3 semanas atrás
Campinas, Brasil AgileEngine Tempo inteiroOverview AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. About the role As a Senior Data Engineer , you’ll...
-
Principal Machine Learning Engineer
1 semana atrás
Campinas, Brasil Bairesdev Tempo inteiroJoin to apply for the Principal Machine Learning Engineer - Remote Work role at BairesDev.At BairesDev®, we've been leading the way in technology projects for over 15 years.We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley.Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works...
-
Principal Data Engineer
Há 5 dias
Campinas, Brasil AB InBev Tempo inteiroJoin to apply for the Principal Data Engineer role at AB InBev 3 days ago Be among the first 25 applicants Join to apply for the Principal Data Engineer role at AB InBev AB InBev is the leading global brewer and one of the world’s top 5 consumer product companies. With over 500 beer brands, we’re number one or two in many of the world’s top beer...
-
Senior Data Engineer
1 semana atrás
Greater Campinas, Brasil Vertico Tempo inteiro R$90.000 - R$120.000 por anoVaga: Senior Data Engineer (Founding role)Estamos em busca de um(a) Founding Sr. Data Engineer/ Data Lead para uma empresa daindústria pet.O desafio é liderar aconstrução da área de dados do zeroe transformar informações em vantagem competitiva para o negócio. Principais responsabilidadesCriar a fundação de dados: data lake, pipelines, governança...
-
Principal Data Engineer
2 semanas atrás
Campinas, Brasil AB InBev Tempo inteiroJoin to apply for the Principal Data Engineer role at AB InBev 3 days ago Be among the first 25 applicants Join to apply for the Principal Data Engineer role at AB InBev AB InBev is the leading global brewer and one of the world’s top 5 consumer product companies. With over 500 beer brands, we’re number one or two in many of the world’s top beer...