ML Data Pipeline Engineer

3 weeks ago


Cuiabá, Brazil · Prosigliere · Full-time

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.
What You'll Do
Pipeline Operations & Improvement
Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
Improve the robustness of our video capture software, particularly its handling of network interruptions and its operational monitoring.
Deploy and monitor services in remote Linux environments with appropriate DevOps practices.
Data Quality & Validation
Evolve our Python-based QC engine, which validates data before and after annotation.
Implement checks for IMU-video time synchronization, sensor health, and measurement consistency.
Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
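To give a flavor of this kind of work, here is a minimal Python sketch of three such checks: finding dropouts in IMU timestamps, spotting flatlined signal windows, and confirming that an annotated interval is backed by sensor data. It is an illustration only, not our QC engine. The function names, the 1.5x gap tolerance, and the flatline threshold are all assumptions made for the example.

```python
from dataclasses import dataclass
from statistics import pstdev


@dataclass
class Gap:
    start_s: float  # timestamp of the last sample before the gap
    end_s: float    # timestamp of the first sample after the gap


def find_gaps(timestamps_s, sample_rate_hz, tolerance=1.5):
    """Return gaps where consecutive samples are further apart than
    `tolerance` times the nominal sampling period (hypothetical rule)."""
    max_dt = tolerance / sample_rate_hz
    return [
        Gap(prev, cur)
        for prev, cur in zip(timestamps_s, timestamps_s[1:])
        if cur - prev > max_dt
    ]


def find_flatlines(values, window, min_std=1e-3):
    """Return start indices of non-overlapping windows whose standard
    deviation is below `min_std`, a crude sign of a stuck sensor."""
    return [
        i
        for i in range(0, len(values) - window + 1, window)
        if pstdev(values[i:i + window]) < min_std
    ]


def annotation_is_covered(label_start_s, label_end_s, timestamps_s, gaps):
    """True if the labeled interval lies inside the recording and
    does not overlap any detected gap."""
    if not timestamps_s:
        return False
    inside = timestamps_s[0] <= label_start_s and label_end_s <= timestamps_s[-1]
    overlaps_gap = any(
        g.start_s < label_end_s and label_start_s < g.end_s for g in gaps
    )
    return inside and not overlaps_gap
```

For a 50 Hz stream, `find_gaps(timestamps, sample_rate_hz=50)` flags any pair of samples more than 30 ms apart, and the resulting gaps can be passed to `annotation_is_covered` to reject labels that span missing data.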
Analysis & Troubleshooting
Perform ad-hoc analysis on 1,200+ workout tasks to classify failure modes
Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
Prioritize engineering work based on data quality impact and coordinate with the annotation team on fixes
Tooling and Visualization
Maintain and extend our Next.js UI serving annotators, data scientists, and stakeholders
Create visualizations (Chart.js) for QC metrics and signal analysis
Integrate with the Label Studio annotation interface
What You Bring
Required
Strong Python programming skills, particularly for data processing pipelines
Experience with time-series data and digital signal processing
Comfort working in Linux environments and deploying/monitoring remote services
Ability to debug complex multi-component systems (sensors, video, networks, sync)
Data quality mindset: designing validation rules, tracking metrics, investigating anomalies
SQL/database experience for managing pipeline metadata
Highly Valued
Video processing experience (RTSP streams, encoding, OCR)
Working with sensor/IoT data and handling connectivity challenges
Next.js or other modern web frameworks for data tooling
DevOps practices: containerization, monitoring, logging, alerting
Experience with annotation pipelines and ML training data workflows
Background in biomechanics, sports science, or wearable sensors
Tech Stack
Languages: Python (primary), JavaScript/TypeScript (Next.js UI)
Data: IMU sensor streams, video (RTSP), time-series analysis, DSP
Tools: Label Studio, Chart.js, Linux/Bash, OCR libraries
Infrastructure: Remote deployment, monitoring systems
You'll Thrive Here If You
Enjoy detective work: diagnosing why data doesn't match expectations
Balance pragmatism with quality: shipping improvements while maintaining reliability
Communicate well across technical and non-technical stakeholders
Can work autonomously in a small, mission-driven team


  • Data Engineer

    3 weeks ago


    Cuiabá, Brazil · HeartCentrix Solutions · Full-time

    We are seeking a highly skilled Python Data Engineer with an AI / ML focus to join our client’s growing data & analytics team in Brazil. This role is ideal for someone who loves building scalable data pipelines, operationalizing machine learning workflows, and partnering closely with data scientists to bring models into production. You will design,...

  • Data Engineer

    2 days ago


    Cuiabá, Brazil · Zunzun Solutions · Full-time

    Summary: We are seeking a highly skilled Data Engineer (Azure Databricks) to design, implement, and optimize enterprise-grade data pipelines. In this role, you will leverage Azure Databricks, Azure Data Factory, SQL Server, and Python to enable scalable, governed, and performant data solutions. You will play a key role in modernizing our data platform on...

  • Data Engineer

    7 hours ago


    Cuiabá, Brazil · Zunzun Solutions · Full-time

    Summary: We are seeking a highly skilled Data Engineer (Azure Databricks) to design, implement, and optimize enterprise-grade data pipelines. In this role, you will leverage Azure Databricks, Azure Data Factory, SQL Server, and Python to enable scalable, governed, and performant data solutions. You will play a key role in modernizing our data platform on...

  • Lead AI Engineer

    4 weeks ago


    Cuiabá, Brazil · GeorgiaTEK Systems Inc. · Full-time

    Lead AI Engineer (3 Positions). Location: Brazil (Remote / Hybrid based on project needs). Role Overview: We are seeking highly skilled Lead AI Engineers based in Brazil to design, develop, and deploy scalable AI and machine learning solutions across enterprise systems. The ideal candidates will have strong expertise in Generative AI, RAG architectures, LLMs...


  • Cuiabá, Brazil · GlobalSource IT · Full-time

    Databricks Data Engineer (Fully Remote, Contract). We're looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta Lake. This role focuses on ingesting data from multiple sources, transforming it for analytics, and publishing high-quality datasets and...

  • Data Engineer

    5 days ago


    Cuiabá, Brazil · Ascendion · Full-time

    Overview: We are seeking a highly skilled Data Engineer to support the development of personalized search capabilities and data assimilation initiatives within the organization. This is a remote role aligned to the EST time zone. The ideal candidate will have strong experience working with Python, Databricks, and Kafka, and will contribute directly to ongoing...

  • Data Engineer

    6 days ago


    Cuiabá, Brazil · Ascendion · Full-time

    Overview: We are seeking a highly skilled Data Engineer to support the development of personalized search capabilities and data assimilation initiatives within the organization. This is a remote role aligned to the EST time zone. The ideal candidate will have strong experience working with Python, Databricks, and Kafka, and will contribute directly to...

  • Data Engineer

    2 weeks ago


    Cuiabá, Brazil · Design Manager · Full-time

    Job Title: Data Engineer. Location: Remote (Brazil). Employment Type: Full Time. Compensation: Competitive hourly rate, commensurate with experience. About Design Manager: Design Manager (+DesignSpec) is a leading provider of project management and accounting software tailored specifically for interior design firms. For over 30 years, we've helped thousands of...