Ml Data Pipeline Engineer

Há 3 horas


Cuiabá, Brasil Prosigliere Tempo inteiro

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.
What You'll Do
Pipeline Operations & Improvement
Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
Improve video capture software robustness, particularly handling network interruptions and operational monitoring.
Deploy and monitor services in remote Linux environments with appropriate DevOps practices.
Data Quality & Validation
Evolve our Python-based QC engine that validates data pre- and post-annotation
Implement checks for IMU-video time synchronization, sensor health, and measurement consistency
Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
Analysis & Troubleshooting
Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes
Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes
Tooling and Visualization
Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
Create visualizations (Chart.Js) for QC metrics and signal analysis
Integrate with LabelStudio annotation interface
What You Bring
Required
Strong Python programming skills, particularly for data processing pipelines
Experience with time-series data and digital signal processing
Comfortable working in Linux environments and deploying/monitoring remote services
Ability to debug complex multi-component systems (sensors, video, networks, sync)
Data quality mindset: designing validation rules, tracking metrics, investigating anomalies
SQL/database experience for managing pipeline metadata
Highly Valued
Video processing experience (RTSP streams, encoding, OCR)
Working with sensor/IoT data and handling connectivity challenges
NextJS or modern web frameworks for data tooling
DevOps practices: containerization, monitoring, logging, alerting
Experience with annotation pipelines and ML training data workflows
Background in biomechanics, sports science, or wearable sensors
Tech Stack
Languages: Python (primary), JavaScript/TypeScript (NextJS UI)
Data: IMU sensor streams, video (RTSP), time-series analysis, DSP
Tools: LabelStudio, Chart.Js, Linux/bash, OCR libraries
Infrastructure: Remote deployment, monitoring systems
You'll Thrive Here If You
Enjoy detective work: diagnosing why data doesn't match expectations
Balance pragmatism with quality: shipping improvements while maintaining reliability
Communicate well across technical and non-technical stakeholders
Can work autonomously in a small, mission-driven team


  • Ml Data Pipeline Engineer

    3 semanas atrás


    Cuiabá, Brasil Prosigliere Tempo inteiro

    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a...

  • Data Engineer

    3 semanas atrás


    Cuiabá, Brasil Insight Global Tempo inteiro

    Insight Global is seeking a Data Engineer to join a Workforce Productivity and Data Engineering team and lead initiatives across Microsoft Azure, Fabric, and Databricks platforms onsite in Costa Rica. You will be responsible for designing, building, and maintaining scalable data pipelines using Azure Data Factory, Azure Synapse Analytics, and Databricks. You...

  • Machine Learning Engineer

    2 semanas atrás


    Cuiabá, Brasil UST España & Latam Tempo inteiro

    We are still looking for talent… and we would love for you to join our team! For over 25 years, UST has worked alongside the world’s best companies to make a real impact through business transformation. Driven by technology, inspired by people, and guided by our purpose, UST supports clients from design to implementation. Together, with more than 30,000...

  • Lead Ai Engineer

    1 semana atrás


    Cuiabá, Brasil GeorgiaTEK Systems Inc. Tempo inteiro

    Lead AI Engineer (3 Positions) Location: Brazil (Remote / Hybrid based on project needs) Role Overview We are seeking highly skilled Lead AI Engineers based in Brazil to design, develop, and deploy scalable AI and machine learning solutions across enterprise systems. The ideal candidates will have strong expertise in Generative AI, RAG architectures,...

  • Lead AI Engineer

    Há 5 dias


    Cuiabá, Brasil GeorgiaTEK Systems Inc. Tempo inteiro

    Lead AI Engineer (3 Positions) Location : Brazil (Remote / Hybrid based on project needs) Role Overview We are seeking highly skilled Lead AI Engineers based in Brazil to design, develop, and deploy scalable AI and machine learning solutions across enterprise systems. The ideal candidates will have strong expertise in Generative AI , RAG architectures , LLMs...

  • Senior Data Engineer

    3 semanas atrás


    Cuiabá, Brasil Eightpoint Tempo inteiro

    About Eightpoint Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving significant growth for partners. With offices in the United States and Cayman Islands, Eightpoint collaborates with partners globally on the next generation of...

  • Senior Data Engineer

    Há 5 horas


    Cuiabá, Brasil Eightpoint Tempo inteiro

    About Eightpoint Eightpoint is an internet technology company specializing in the agile development of products and content that address real-world interests, captivating users and driving significant growth for partners. With offices in the United States and Cayman Islands, Eightpoint collaborates with partners globally on the next generation of...

  • Data Engineer

    2 semanas atrás


    Cuiabá, Brasil Tata Consultancy Services Tempo inteiro

    Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a culture of unlimited learning full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing...

  • Data Engineer

    Há 4 horas


    Cuiabá, Brasil Tata Consultancy Services Tempo inteiro

    Come to one of the biggest IT Services companies in the world!! Here you can transform your career! Why to join TCS? Here at TCS we believe that people make the difference, that's why we live a culture of unlimited learning full of opportunities for improvement and mutual development. The ideal scenario to expand ideas through the right tools, contributing...

  • Sr Python Data Engineer

    1 semana atrás


    Cuiabá, Brasil Softensity Inc Tempo inteiro

    Senior Python Data Engineer About the Project Responsibilities - Design, build, and maintain high-performance data processing pipelines using Python libraries (Pandas, Polars). - Develop and expose RESTful APIs using FastAPI or similar frameworks. - Consume and process normalized Parquet files from multiple upstream sources to generate dynamic Excel...