ML Data Pipeline Engineer
4 days ago
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You'll Do

Pipeline Operations & Improvement
* Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
* Improve video capture software robustness, particularly handling of network interruptions and operational monitoring.
* Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation
* Evolve our Python-based QC engine that validates data pre- and post-annotation.
* Implement checks for IMU-video time synchronization, sensor health, and measurement consistency.
* Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
* Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
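To give a concrete sense of the sensor-health and connectivity checks described above, here is a minimal sketch in Python. It is not the team's actual QC engine: the names `Gap`, `find_gaps`, and `is_flatlined`, the 1.5x tolerance, and the 50 Hz sample rate in the usage lines are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Gap:
    """A stretch of missing samples (hypothetical record type)."""
    start_s: float  # timestamp of the last sample before the gap
    end_s: float    # timestamp of the first sample after the gap


def find_gaps(timestamps_s: Sequence[float], nominal_hz: float,
              tolerance: float = 1.5) -> List[Gap]:
    """Flag intervals where consecutive samples are further apart than
    `tolerance` times the nominal sample period (assumed threshold)."""
    max_dt = tolerance / nominal_hz
    gaps = []
    for prev, cur in zip(timestamps_s, timestamps_s[1:]):
        if cur - prev > max_dt:
            gaps.append(Gap(prev, cur))
    return gaps


def is_flatlined(samples: Sequence[float], min_run: int) -> bool:
    """Return True if the same value repeats for at least `min_run`
    consecutive samples (expects min_run >= 2), a common symptom of a
    stuck sensor."""
    run = 1
    for prev, cur in zip(samples, samples[1:]):
        run = run + 1 if cur == prev else 1
        if run >= min_run:
            return True
    return False


# Usage: a 50 Hz stream with a dropout between 0.04 s and 0.50 s.
gaps = find_gaps([0.00, 0.02, 0.04, 0.50, 0.52], nominal_hz=50.0)
stuck = is_flatlined([0.1, 0.2, 0.2, 0.2, 0.3], min_run=3)
```

A production check would likely work on arrays and account for clock drift and sensor noise; the plain loops here only show the shape of the rule.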
Analysis & Troubleshooting
* Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes.
* Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors.
* Prioritize engineering work based on data quality impact and coordinate with the annotation team on fixes.

Tooling and Visualization
* Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders.
* Create visualizations (Chart.js) for QC metrics and signal analysis.
* Integrate with the LabelStudio annotation interface.

What You Bring

Required
* Strong Python programming skills, particularly for data processing pipelines
* Experience with time-series data and digital signal processing
* Comfortable working in Linux environments and deploying/monitoring remote services
* Ability to debug complex multi-component systems (sensors, video, networks, sync)
* Data quality mindset: designing validation rules, tracking metrics, investigating anomalies
* SQL/database experience for managing pipeline metadata

Highly Valued
* Video processing experience (RTSP streams, encoding, OCR)
* Working with sensor/IoT data and handling connectivity challenges
* NextJS or modern web frameworks for data tooling
* DevOps practices: containerization, monitoring, logging, alerting
* Experience with annotation pipelines and ML training data workflows
* Background in biomechanics, sports science, or wearable sensors

Tech Stack
* Languages: Python (primary), JavaScript/TypeScript (NextJS UI)
* Data: IMU sensor streams, video (RTSP), time-series analysis, DSP
* Tools: LabelStudio, Chart.js, Linux/bash, OCR libraries
* Infrastructure: Remote deployment, monitoring systems

You'll Thrive Here If You
* Enjoy detective work: diagnosing why data doesn't match expectations
* Balance pragmatism with quality: shipping improvements while maintaining reliability
* Communicate well across technical and non-technical stakeholders
* Can work autonomously in a small, mission-driven team
-
Data Engineer
3 weeks ago
São Paulo, SP, Brazil | able.digital | Full-time
About the Role: We are seeking an Intermediate Data Engineer to support our data infrastructure initiatives by connecting analytics systems, managing data pipelines, and enabling our teams with clean, accessible data. This role will focus on integrating key data sources, developing efficient pipelines, and ensuring seamless data flow across platforms like...
-
Software Engineer, Data
4 days ago
Jaguariúna, SP, Brazil | Talent Systems, LLC | Full-time
Title: Software Engineer, Data
Eng Level: L2 or L3
Location: São Paulo, Brazil
Work setup: Initially remote until we open the office, then hybrid (2-3 days/week in the office)
Comp: $3800-$4200 USD per month
Company: Talent Systems, LLC is the leading technology solution provider for casting and auditioning to the entertainment industry. Casting directors...
-
Data Engineer
2 weeks ago
São Paulo, SP, Brazil | Elios Talent | Full-time
Highlights: Build and maintain pipelines that power data-driven insights. Hybrid or remote flexibility. High-growth opportunities in cloud and big data technologies.
Role Summary: We are seeking a Data Engineer to architect and maintain data infrastructure that supports analytics and machine learning. This role ensures that high-quality data is...
-
Software Engineer, Data
3 days ago
São Paulo, SP, Brazil | Talent Systems, LLC | Full-time
Title: Software Engineer, Data
Eng Level: L2 or L3
Location: São Paulo, Brazil
Work setup: Initially remote until we open the office, then hybrid (2-3 days/week in the office)
Comp: $3800-$4200 USD per month
Company: Talent Systems, LLC is the leading technology solution provider for casting and auditioning to the entertainment industry. Casting directors...
-
Lead Data Engineer
4 weeks ago
São Paulo, SP, Brazil | Acendeo | Full-time
Description: We are looking for a highly experienced Lead Data Engineer to lead and guide the technical evolution of our enterprise data lake. This role will combine hands-on development, tactical guidance, team leadership, and partner engagement across our technical team. English fluency is a MUST for this role! Only candidates with level...
-
Senior Data Engineer
3 weeks ago
Guarulhos, SP, Brazil | Sky Systems, Inc. (SkySys) | Full-time
About the Role: We’re partnering with a leading global commerce technology organization to expand their Data Engineering & Analytics team. This is an exciting opportunity for senior-level Data Engineers who are passionate about building modern data solutions, optimizing data pipelines, and driving impactful data-driven decisions within a global enterprise...
-
Senior Data Engineer
3 weeks ago
São Paulo, SP, Brazil | Sinch | Full-time
Sinch is looking for a talented and experienced Senior Data Engineer to join our Data Engineering team. In this crucial role, you will be responsible for building, maintaining, and supporting data pipelines that connect our various products globally. You will use innovative approaches and technologies, designing data architectures that empower the data...
-
Data Engineer
3 weeks ago
Botucatu, SP, Brazil | Tata Consultancy Services | Full-time
Come to one of the biggest IT services companies in the world! Here you can transform your career! Why join TCS? Here at TCS we believe that people make the difference; that's why we live a culture of unlimited learning, full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing...