ML Data Pipeline Engineer
1 day ago
We're seeking a Data Pipeline Engineer to own and evolve our exercise recognition training data infrastructure. You'll manage the end-to-end pipeline that collects, synchronizes, validates, and prepares IMU sensor and video data for ML model training.
This role combines systems engineering, data quality automation, and hands-on problem-solving in a production environment.
What You’ll Do
Pipeline Operations & Improvement
- Maintain and enhance our multi-source data collection system: IMU sensors (via mobile app) and synchronized video streams from gym-based cameras.
- Improve the robustness of our video capture software, particularly its handling of network interruptions and its operational monitoring.
- Deploy and monitor services in remote Linux environments with appropriate DevOps practices.
Data Quality & Validation
- Evolve our Python-based QC engine that validates data pre- and post-annotation.
- Implement checks for IMU-video time synchronization, sensor health, and measurement consistency.
- Apply digital signal processing techniques to identify sensor failures, connectivity issues, and measurement irregularities.
- Develop validation logic comparing annotations against sensor data to ensure temporal alignment.
Analysis & Troubleshooting
- Perform ad-hoc analysis on 1,200+ workout tasks to classify failure modes
- Identify whether issues stem from pipeline bugs, sensor problems, or annotation errors
- Prioritize engineering work based on data quality impact and coordinate with the annotation team on fixes
Tooling & Visualization
- Maintain and extend our NextJS UI serving annotators, data scientists, and stakeholders
- Create visualizations (Chart.js) for QC metrics and signal analysis
- Integrate with the LabelStudio annotation interface
What You Bring
Required
- Strong Python programming skills, particularly for data processing pipelines
- Experience with time-series data and digital signal processing
- Comfort working in Linux environments and deploying and monitoring remote services
- Ability to debug complex multi-component systems (sensors, video, networks, sync)
- Data quality mindset: designing validation rules, tracking metrics, investigating anomalies
- SQL/database experience for managing pipeline metadata
Highly Valued
- Video processing experience (RTSP streams, encoding, OCR)
- Working with sensor/IoT data and handling connectivity challenges
- NextJS or modern web frameworks for data tooling
- DevOps practices: containerization, monitoring, logging, alerting
- Experience with annotation pipelines and ML training data workflows
- Background in biomechanics, sports science, or wearable sensors
Tech Stack
- Languages: Python (primary), JavaScript/TypeScript (NextJS UI)
- Data: IMU sensor streams, video (RTSP), time-series analysis, DSP
- Tools: LabelStudio, Chart.js, Linux/bash, OCR libraries
- Infrastructure: Remote deployment, monitoring systems
You'll Thrive Here If You
- Enjoy detective work: diagnosing why data doesn't match expectations
- Balance pragmatism with quality: shipping improvements while maintaining reliability
- Communicate well across technical and non-technical stakeholders
- Can work autonomously in a small, mission-driven team