ML Data Pipeline Engineer
3 weeks ago
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.

What You'll Do

Pipeline Operations & Improvement
- Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
- Improve the robustness of our video capture software, particularly its handling of network interruptions and its operational monitoring.
- Deploy and monitor services in remote Linux environments with appropriate DevOps practices.

Data Quality & Validation
- Evolve our Python-based QC engine that validates data pre- and post-annotation.
- Implement checks for IMU-video time synchronization, sensor health, and measurement consistency.
- Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
- Develop validation logic comparing annotations against sensor data to ensure temporal alignment.

Analysis & Troubleshooting
- Perform ad-hoc analysis on 1,200+ workout tasks to classify failure modes.
- Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors.
- Prioritize engineering work based on data quality impact and coordinate with the annotation team on fixes.

Tooling and Visualization
- Maintain and extend our Next.js UI serving annotators, data scientists, and stakeholders.
- Create visualizations (Chart.js) for QC metrics and signal analysis.
- Integrate with the Label Studio annotation interface.

What You Bring

Required
- Strong Python programming skills, particularly for data processing pipelines
- Experience with time-series data and digital signal processing
- Comfort working in Linux environments and deploying/monitoring remote services
- Ability to debug complex multi-component systems (sensors, video, networks, sync)
- Data quality mindset: designing validation rules, tracking metrics, investigating anomalies
- SQL/database experience for managing pipeline metadata

Highly Valued
- Video processing experience (RTSP streams, encoding, OCR)
- Experience working with sensor/IoT data and handling connectivity challenges
- Next.js or other modern web frameworks for data tooling
- DevOps practices: containerization, monitoring, logging, alerting
- Experience with annotation pipelines and ML training data workflows
- Background in biomechanics, sports science, or wearable sensors

Tech Stack
- Languages: Python (primary), JavaScript/TypeScript (Next.js UI)
- Data: IMU sensor streams, video (RTSP), time-series analysis, DSP
- Tools: Label Studio, Chart.js, Linux/bash, OCR libraries
- Infrastructure: Remote deployment, monitoring systems

You'll Thrive Here If You
- Enjoy detective work: diagnosing why data doesn't match expectations
- Balance pragmatism with quality: shipping improvements while maintaining reliability
- Communicate well across technical and non-technical stakeholders
- Can work autonomously in a small, mission-driven team
-
Senior Data Engineer
7 days ago
Mato Grosso, Brazil Eightpoint Full-time
About Eightpoint: Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving significant growth for partners. With offices in the United States and Cayman Islands, Eightpoint collaborates with partners globally on the next generation of...
-
Software Engineer, Data Integrations
2 weeks ago
Mato Grosso, Brazil BlawkAI Full-time
Blawk is building next-generation infrastructure to automate and analyze financial data at scale across the entertainment and creator economy. We're looking for a Software Engineer (Data Integrations) to help design and develop reliable automation pipelines for ingesting, mapping, and enriching large datasets from multiple external platforms. You'll play a...
-
Data Engineer
7 days ago
Mato Grosso, Brazil Tata Consultancy Services Full-time
Come to one of the biggest IT services companies in the world! Here you can transform your career! Why join TCS? Here at TCS we believe that people make the difference. That's why we live a culture of unlimited learning, full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing...
-
Sr Python Data Engineer
7 days ago
Mato Grosso, Brazil Softensity Inc Full-time
Senior Python Data Engineer
About the Project
Responsibilities
- Design, build, and maintain high-performance data processing pipelines using Python libraries (Pandas, Polars).
- Develop and expose RESTful APIs using FastAPI or similar frameworks.
- Consume and process normalized Parquet files from multiple upstream sources to generate dynamic Excel reports....
-
GCP Data Engineer
7 days ago
Mato Grosso, Brazil Tata Consultancy Services Full-time
Come to one of the biggest IT services companies in the world! Here you can transform your career! Why join TCS? Here at TCS we believe that people make the difference. That's why we live a culture of unlimited learning, full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing...
-
AI Software Engineer
3 weeks ago
Mato Grosso, Brazil Velozient Full-time
We are looking for a remote, full-time AI Software Engineer to join our US client's team. You should have a minimum of 3 to 5+ years of experience developing and delivering commercial software, with a solid background in AI/ML, Python, TypeScript/JavaScript, and C#/.NET. In this role, you will leverage deep expertise in NLP and ML to help build scalable,...
-
Data Lead Engineer
4 days ago
Mato Grosso, Brazil Ampstek Full-time
Title: Data Lead Engineer – Snowflake
Location: Remote Mexico/Brazil
Job Type: Contract
Responsibilities
- Technical Leadership: Provide technical direction and mentorship to a team of data engineers, ensuring best practices in coding, architecture, and data operations.
- End-to-End Ownership: Architect, implement, and optimize end-to-end data pipelines...
-
Data Lead Engineer
6 days ago
Mato Grosso, Brazil Ampstek Full-time
Data Lead Engineer – Snowflake
Remote Contract Brazil/Mexico
Responsibilities
- Technical Leadership: Provide technical direction and mentorship to a team of data engineers, ensuring best practices in coding, architecture, and data operations.
- End-to-End Ownership: Architect, implement, and optimize end-to-end data pipelines that process and transform...
-
Azure Data Engineer
2 days ago
Mato Grosso, Brazil Tata Consultancy Services Full-time
Come to one of the biggest IT services companies in the world! Here you can transform your career! Why join TCS? Here at TCS we believe that people make the difference. That's why we live a culture of unlimited learning, full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing...
-
Data Scientist
3 weeks ago
Mato Grosso, Brazil International Digital Partners Full-time
Data Scientist (Remote)
Our client is looking for a Data Scientist to support the development and implementation of advanced data and AI strategies. The ideal professional is passionate about data-driven innovation and eager to work with cutting-edge technologies in Machine Learning and Generative AI.
Key Responsibilities
Develop and implement data...