ML Data Pipeline Engineer

2 semanas atrás


Manaus, Brasil Prosigliere Tempo inteiro

We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training. This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment. What You'll Do Pipeline Operations & Improvement Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras. Improve video capture software robustness, particularly handling network interruptions and operational monitoring. Deploy and monitor services in remote Linux environments with appropriate DevOps practices. Data Quality & Validation Evolve our Python-based QC engine that validates data pre- and post-annotation. Implement checks for IMU-video time synchronization, sensor health, and measurement consistency. Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities. Develop validation logic comparing annotations against sensor data to ensure temporal alignment. Analysis & Troubleshooting Perform ad-hoc analysis on ~1,200+ workout tasks to classify failure modes. Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors. Prioritize engineering work based on data quality impact and coordinate with annotation team on fixes. Tooling and Visualization Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders. Create visualizations (Chart.js) for QC metrics and signal analysis. Integrate with LabelStudio annotation interface. What You Bring Required Strong Python programming skills, particularly for data processing pipelines. Experience with time-series data and digital signal processing. Comfortable working in Linux environments and deploying / monitoring remote services. Ability to debug complex multi-component systems (sensors, video, networks, sync). Data quality mindset: designing validation rules, tracking metrics, investigating anomalies. SQL / database experience for managing pipeline metadata. Highly Valued Video processing experience (RTSP streams, encoding, OCR). Working with sensor / IoT data and handling connectivity challenges. NextJS or modern web frameworks for data tooling. DevOps practices: containerization, monitoring, logging, alerting. Experience with annotation pipelines and ML training data workflows. Background in biomechanics, sports science, or wearable sensors. Tech Stack Languages: Python (primary), JavaScript / TypeScript (NextJS UI) Data: IMU sensor streams, video (RTSP), time-series analysis, DSP Tools: LabelStudio, Chart.js, Linux / bash, OCR libraries Infrastructure: Remote deployment, monitoring systems You'll Thrive Here If You Enjoy detective work : diagnosing why data doesn't match expectations. Balance pragmatism with quality : shipping improvements while maintaining reliability. Communicate well across technical and non-technical stakeholders. Can work autonomously in a small, mission-driven team. #J-18808-Ljbffr


  • Backend Data Engineer

    2 semanas atrás


    Manaus, Brasil Bebeedataengineer Tempo inteiro

    Backend Data Engineer RoleWe are seeking an experienced Backend Data Engineer to lead the development of our data pipelines and services.This role emphasizes Python-based architecture, AWS data workflows, and integration with AI/ML frameworks to enable advanced media processing capabilities.Key Responsibilities:Design and develop scalable backend services...

  • Backend Data Engineer

    2 semanas atrás


    Manaus, Brasil beBeeDataEngineer Tempo inteiro

    Backend Data Engineer Role We are seeking an experienced Backend Data Engineer to lead the development of our data pipelines and services. This role emphasizes Python-based architecture, AWS data workflows, and integration with AI/ML frameworks to enable advanced media processing capabilities. Key Responsibilities: Design and develop scalable backend...

  • Backend Data Engineer

    1 semana atrás


    Manaus, AM, Brasil beBeeDataEngineer Tempo inteiro

    Backend Data Engineer Role We are seeking an experienced Backend Data Engineer to lead the development of our data pipelines and services. This role emphasizes Python-based architecture, AWS data workflows, and integration with AI/ML frameworks to enable advanced media processing capabilities. Key Responsibilities: Design and develop scalable backend...


  • Manaus, Brasil beBeeData Tempo inteiro

    Data Engineer Position Data engineers play a pivotal role in shaping the modern data-driven enterprise. The primary focus is on creating and maintaining scalable data pipelines that ensure seamless data flow, from ingestion to visualization.

  • Data Engineering Expert

    1 semana atrás


    Manaus, Brasil beBeeDataEngineer Tempo inteiro

    Data Engineer We are seeking a highly skilled Data Engineer with an AI/ML focus to join our data & analytics team. As a key player, you'll build scalable data pipelines, operationalize machine learning workflows, and partner closely with data scientists to bring models into production. This fully remote position involves designing, developing, and...


  • Manaus, AM, Brasil beBeeDataEngineer Tempo inteiro

    Backend Data Engineer We are seeking a seasoned Backend Data Engineer to design, develop and maintain scalable data pipelines using Python. This role will focus on building high-quality software solutions that integrate AI/LLM frameworks into production systems. Key Responsibilities: Design and implement backend data pipelines using Python. Develop AWS...


  • Manaus, AM, Brasil beBeeData Tempo inteiro

    Data Engineer Position Data engineers play a pivotal role in shaping the modern data-driven enterprise. The primary focus is on creating and maintaining scalable data pipelines that ensure seamless data flow, from ingestion to visualization.

  • Principal ML Engineer

    2 semanas atrás


    Manaus, Brasil Launch Potato Tempo inteiro

    Overview As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology. Headquartered in South Florida with a remote-first team spanning over 15 countries, we’ve built a high-growth, high-performance culture where speed, ownership, and measurable impact drive...

  • Principal Ml Engineer

    2 semanas atrás


    Manaus, Brasil Launch Potato Tempo inteiro

    OverviewAs The Discovery and Conversion Company, our mission is to connect consumers with the world's leading brands through data-driven content and technology.Headquartered in South Florida with a remote-first team spanning over 15 countries, we've built a high-growth, high-performance culture where speed, ownership, and measurable impact drive success.WHY...

  • Data Driven Decision Maker

    1 semana atrás


    Manaus, Brasil Bebeemachinelearning Tempo inteiro

    AI Model DeveloperWe are seeking a highly skilled Ai Model Developer to join our AI team.This role involves designing, training and deploying ML models that power autonomous exception resolution, anomaly detection and explainable insights.Design, train and deploy ML models for production.Build pipelines using vector databases and frameworks such as...