ML Data Pipeline Engineer

1 semana atrás


Brasília, Brasil Prosigliere Tempo inteiro

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.What You’ll DoPipeline Operations & ImprovementMaintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.Improve video capture software robustness, particularly handling network interruptions and operational monitoring.Deploy and monitor services in remote Linux environments with appropriate DevOps practices.Data Quality & ValidationEvolve our Python-based QC engine that validates data pre- and post-annotationImplement checks for IMU-video time synchronization, sensor health, and measurement consistencyApply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.Develop validation logic comparing annotations against sensor data to ensure temporal alignment.Analysis & TroubleshootingPerform ad-hoc analysis on ~1,200+ workout tasks to classify failure modesIdentify whether issues stem from pipeline bugs, sensor problems, or annotation errorsPrioritize engineering work based on data quality impact and coordinate with annotation team on fixesTooling and VisualizationMaintain and extend our NextJS UI serving annotators, data scientists, and stakeholdersCreate visualizations (Chart.js) for QC metrics and signal analysisIntegrate with LabelStudio annotation interfaceWhat You Bring RequiredStrong Python programming skills, particularly for data processing pipelinesExperience with time-series data and digital signal processingComfortable working in Linux environments and deploying/monitoring remote servicesAbility to debug complex multi-component systems (sensors, video, networks, sync)Data quality mindset: designing validation rules, tracking metrics, investigating anomaliesSQL/database experience for managing pipeline metadataHighly ValuedVideo processing experience (RTSP streams, encoding, OCR)Working with sensor/IoT data and handling connectivity challengesNextJS or modern web frameworks for data toolingDevOps practices: containerization, monitoring, logging, alertingExperience with annotation pipelines and ML training data workflowsBackground in biomechanics, sports science, or wearable sensorsTech StackLanguages: Python (primary), JavaScript/TypeScript (NextJS UI)Data: IMU sensor streams, video (RTSP), time-series analysis, DSPTools: LabelStudio, Chart.js, Linux/bash, OCR librariesInfrastructure: Remote deployment, monitoring systemsYou'll Thrive Here If YouEnjoy detective work: diagnosing why data doesn't match expectationsBalance pragmatism with quality: shipping improvements while maintaining reliabilityCommunicate well across technical and non-technical stakeholdersCan work autonomously in a small, mission-driven team


  • Ml data pipeline engineer

    1 semana atrás


    Brasília, Brasil Prosigliere Tempo inteiro

    We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.This role combines systems engineering, data quality automation, and hands-on problem-solving in a production...

  • Senior Data Engineer

    2 semanas atrás


    Brasília, Brasil Pride Global Tempo inteiro

    We're Hiring: Senior Data Engineer | Remote from Brazil | Fluent English required | Location: Remote – Brazil onlyContact: TemporaryAre you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular fashion?We're looking for a Senior Data...

  • Senior data engineer

    2 semanas atrás


    Brasília, Brasil Pride Global Tempo inteiro

    We're Hiring: Senior Data Engineer | Remote from Brazil | Fluent English required|Location: Remote – Brazil onlyContact:TemporaryAre you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular fashion?We're looking for aSenior Data...


  • Brasília, Brasil Launch Potato Tempo inteiro

    As The Discovery and Conversion Company, our mission is to connect consumers with the world's leading brands through data-driven content and technology.Headquartered in South Florida with a remote-first team spanning over 15 countries, we've built a high-growth, high-performance culture where speed, ownership, and measurable impact drive success.WHY JOIN...


  • Brasília, Brasil Launch Potato Tempo inteiro

    As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology. Headquartered in South Florida with a remote‑first team spanning over 15 countries, we’ve built a high‑growth, high‑performance culture where speed, ownership, and measurable impact drive success....


  • Brasília, Brasil UST España & Latam Tempo inteiro

    We are still looking for talent… and we would love for you to join our team!For over 25 years, UST has worked alongside the world’s best companies to make a real impact through business transformation. Driven by technology, inspired by people, and guided by our purpose, UST supports clients from design to implementation. Together, with more than 30,000...

  • Machine Learning Engineer

    4 semanas atrás


    Brasília, Brasil Flatiron Software Tempo inteiro

    About Flatiron is a global remote software development company with engineers located around the world. We unite experts from diverse backgrounds and experiences in a collaborative culture to deliver exceptional products and services for our clients. As a forward-thinking software engineering company, we provide industry-leading solutions to complex problems...

  • Machine Learning Engineer

    2 semanas atrás


    Brasília, Brasil Flatiron Software Tempo inteiro

    About Flatiron is a global remote software development company with engineers located around the world. We unite experts from diverse backgrounds and experiences in a collaborative culture to deliver exceptional products and services for our clients. As a forward-thinking software engineering company, we provide industry-leading solutions to complex problems...

  • Machine Learning Engineer

    4 semanas atrás


    Brasília, Brasil Flatiron Software Tempo inteiro

    About Flatiron is a global remote software development company with engineers located around the world. We unite experts from diverse backgrounds and experiences in a collaborative culture to deliver exceptional products and services for our clients. As a forward-thinking software engineering company, we provide industry-leading solutions to complex problems...

  • Data Engineer

    Há 2 dias


    Brasília, Brasil Artefact Tempo inteiro

    The current vacancy is for the Brazilian office and we work in a Free Office model. Who we are At Artefact LatAm, we believe in and live a culture based on empathy A healthy work environment is a place where all voices are heard, respected, and valued. Our commitment is to build a more diverse and inclusive environment, because empathy is for everyone,...