ML Data Pipeline Engineer
6 days ago
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You’ll Do

Pipeline Operations & Improvement
- Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras
- Improve video capture software robustness, particularly handling network interruptions and operational monitoring
- Deploy and monitor services in remote Linux environments with appropriate DevOps practices

Data Quality & Validation
- Evolve our Python-based QC engine that validates data pre- and post-annotation
- Implement checks for IMU-video time synchronization, sensor health, and measurement consistency
- Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities
- Develop validation logic comparing annotations against sensor data to ensure temporal alignment

Analysis & Troubleshooting
- Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes
- Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
- Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes

Tooling and Visualization
- Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
- Create visualizations (Chart.js) for QC metrics and signal analysis
- Integrate with LabelStudio annotation interface

What You Bring

Required
- Strong Python programming skills, particularly for data processing pipelines
- Experience with time-series data and digital signal processing
- Comfortable working in Linux environments and deploying/monitoring remote services
- Ability to debug complex multi-component systems (sensors, video, networks, sync)
- Data quality mindset: designing validation rules, tracking metrics, investigating anomalies
- SQL/database experience for managing pipeline metadata

Highly Valued
- Video processing experience (RTSP streams, encoding, OCR)
- Working with sensor/IoT data and handling connectivity challenges
- NextJS or modern web frameworks for data tooling
- DevOps practices: containerization, monitoring, logging, alerting
- Experience with annotation pipelines and ML training data workflows
- Background in biomechanics, sports science, or wearable sensors

Tech Stack
- Languages: Python (primary), JavaScript/TypeScript (NextJS UI)
- Data: IMU sensor streams, video (RTSP), time-series analysis, DSP
- Tools: LabelStudio, Chart.js, Linux/bash, OCR libraries
- Infrastructure: Remote deployment, monitoring systems

You'll Thrive Here If You
- Enjoy detective work: diagnosing why data doesn't match expectations
- Balance pragmatism with quality: shipping improvements while maintaining reliability
- Communicate well across technical and non-technical stakeholders
- Can work autonomously in a small, mission-driven team
-
ML Data Pipeline Engineer
5 days ago
Brazil Prosigliere Full-time
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a...
-
Senior Data Engineer
1 week ago
Brazil Pride Global Full-time
We're Hiring: Senior Data Engineer | Remote from Brazil | Fluent English required | Location: Remote – Brazil only Contact: Temporary Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular fashion? We're looking for a Senior Data...
-
Senior Data Engineer
7 days ago
Brazil Pride Global Full-time
We're Hiring: Senior Data Engineer | Remote from Brazil | Fluent English required | Location: Remote – Brazil only Contact: Temporary Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular fashion? We're looking for a Senior Data...
-
Data Engineer
6 days ago
Brazil Insight Global Full-time
Insight Global is seeking a Data Engineer to join a Workforce Productivity and Data Engineering team and lead initiatives across Microsoft Azure, Fabric, and Databricks platforms onsite in Costa Rica. You will be responsible for designing, building, and maintaining scalable data pipelines using Azure Data Factory, Azure Synapse Analytics, and Databricks. You...
-
Senior Data Engineer
7 days ago
São Paulo, State of São Paulo, Brazil Grupo Data Full-time R$80.000 - R$120.000 per year
Hi! We are DATA Group and we are searching for the best talent. Our goal is to simplify our clients' lives with innovative IT solutions. We operate at global scale and we are expanding to Portugal. If you are passionate and have the desire to make a difference, we want to get to know you. Join us to be part of this incredible adventure. Who are we looking for?...
-
Software Engineer, Data
5 days ago
Brazil Talent Systems, LLC Full-time
Title: Software Engineer, Data Eng
Level: L2 or L3
Location: São Paulo, Brazil
Work setup: Initially remote until we open the office and then Hybrid (2-3 days/week in the office)
Comp: $3800-$4200 USD per month
Company: Talent Systems, LLC is the leading technology solution provider for casting and auditioning to the entertainment industry. Casting directors and...
-
Machine Learning Engineer
4 weeks ago
Brazil Flatiron Software Full-time
About: Flatiron is a global remote software development company with engineers located around the world. We unite experts from diverse backgrounds and experiences in a collaborative culture to deliver exceptional products and services for our clients. As a forward-thinking software engineering company, we provide industry-leading solutions to complex problems...
-
Lead Data Engineer
4 weeks ago
Brazil Acendeo Full-time
Lead Data Engineer. Description: We are looking for a highly experienced Lead Data Engineer to lead and guide the technical evolution of our enterprise data lake. This role will combine hands-on development, tactical guidance, team leadership, and partner engagement across our technical team. English fluency is a MUST for this role! Only candidates with level...