Senior Data Engineer
Há 3 dias
Staff/Senior Data Engineer: AI Training Data (2-4 Months Contract) Location: Remote Role Type: Contract (2-4 Months) Time Commitment: 40 hrs/week (Full-time availability required) Compensation: Hyper-competitive hourly rate (matching Tier-1 Staff engineering bands) Experience: 6-12+ years About BespokeLabs BespokeLabs is a premier, VC-backed AI Research lab with an exceptionally talent-dense team of IIT and Ivy League alumni. We don’t just build tooling around AI—we build the massive-scale data systems and reasoning architectures that directly power next-generation models. Our research shapes the frontier of AI: we’ve published breakthroughs like GEPA, driven foundational datasets like OpenThoughts, and shipped state-of-the-art models including Bespoke-MiniCheck and Bespoke-MiniChart. More on our website :) Role Overview We are looking for a top-tier Senior/Staff Data Engineer for a high-impact, 2-month sprint.
You will leverage your deep expertise in enterprise-grade data platforms to architect and build the complex curation systems required for advanced AI model training. This is not a traditional ETL pipeline role. We need a heavy-hitter who has already operated production data platforms at scale inside large, complex organizations (FAANG, Fortune 100). You will use the mental models, architectural intuition, and coding skills you've developed over your career to generate, transform, and evaluate the data that trains the next generation of AI.
What You Will
Do (The Contract) Architect AI-Scale Systems: Design the overarching data architecture and processing topology needed to programmatically curate and shape datasets at TB/PB scale, ensuring low latency and high consistency. Hands-On Development: Write production-grade code (Python/Scala, Spark) to build custom ingestion logic, highly efficient transformation scripts, and performant data validation checks. Complex Data Logic: Implement advanced filtering, deduplication, and quality-scoring algorithms at scale, ensuring the resulting data objects are optimized for LLM/ML consumption. Quality & Performance Tuning: Rigorously test, benchmark, and optimize processing workloads (CPU/memory tuning, partitioning strategies in Spark/Iceberg) to meet aggressive throughput targets.
Domain Subject Matter Expert: Act as the ultimate technical authority on distributed systems, data processing, and cloud structures to ensure the training data factory meets enterprise-grade accuracy. What You Bring to the Table (Your Past Experience) To be successful in this contract, you must have a track record of: End-to-End Ownership: Designing and owning enterprise data platforms (batch + streaming). High-Throughput Processing: Building and operating Kafka-first streaming pipelines. Lakehouse Architecture: Utilizing Apache Iceberg, Delta Lake, or Hudi for analytics and ML at scale.
Reliability Engineering: Ensuring data reliability through SLAs, monitoring, backfills, and recovery. Scale: Processing billions of events and managing TB–PB scale data systems. Required Qualifications (Non-Negotiable) Experience: 6+ years of Data Engineering experience. Seniority: Demonstrated Senior/Staff-level ownership of production data platforms.
Pedigree: Background at Tier-1 enterprises (FAANG, large SaaS, Fortune 100). Technical Stack: Deep fluency in Python/Scala, Spark, Kafka, Airflow, and Major Cloud Warehouses (Snowflake, BigQuery, Redshift).
-
Senior Data Engineer
Há 2 dias
São Paulo, Brasil Grupo Data Tempo inteiroOverview Hi! Weare DATA Group and we are searching for the best talent! Our goal is tosimplify our clients' lives with innovative IT solutions. We operate at globalscale and we are expanding to Portugal! If youare passionate and have the desire to make the difference, we want to get toknow you! Join us to be part of this incredible adventure! Whoare we...
-
Senior Data Engineer
Há 2 dias
São Paulo, São Paulo, Brasil Grupo Data Tempo inteiroHi Weare DATA Group and we are searching for the best talent Our goal is tosimplify our clients' lives with innovative IT solutions. We operate at globalscale and we are expanding to PortugalIf youare passionate and have the desire to make the difference, we want to get toknow you Join us to be part of this incredible adventureWhoare we looking for?...
-
Senior Data Engineer
Há 2 dias
São Paulo, Brasil TRACTIAN ?? Tempo inteiroSenior Data Engineer TRACTIAN
-
Senior Data Engineer
1 dia atrás
são paulo, Brasil Pride Global Tempo inteiroWe're Hiring: Senior Data Engineer | Remote from Brazil | Fluent English required | Location : Remote – Brazil only Contact: PJ - Long Term Are you passionate about building scalable data platforms and cutting-edge MLOps solutions? Do you want to work with a top-tier US company revolutionizing e-commerce and circular fashion? We're looking for a Senior...
-
Senior Data Engineer
Há 7 dias
São Paulo, Brasil Veraxion Tempo inteiroWe’re looking for a Senior Data Engineer to design, build, and scale modern data platforms on AWS. You’ll work with Python, Spark, DBT, and AWS-native services in an Agile environment to deliver scalable, secure, and high-performance data solutions.What you’ll doDevelop and optimize ETL/ELT pipelines with Python, DBT, and AWS services (Data Ops...
-
Senior Analytics Engineer
Há 2 dias
São Paulo, São Paulo, Brasil vaga para Senior Analytics Engineer na Nubank Tempo inteiroAbout UsNu is one of the largest digital financial platforms in the world, with more than 127 million customers across Brazil, Mexico, and Colombia. Guided by our mission to fight complexity and empower people, we are redefining financial services in Latin America and this is still just the beginning of the purple future we're building. Listed on the New...
-
Senior Data Engineer
Há 2 dias
São Paulo, Brasil Wildlife Studios Tempo inteiroWe're looking for a talented and passionate Senior Data Engineer to join Wildlife's Marketing Tech team in São Paulo, Brazil. Role Wildlife is one of the largest gaming companies in the world. Besides building amazing games, part of the reason it has been super successful has been its ability to acquire new users more efficiently than other companies. As a...
-
Senior Data Engineer
Há 2 dias
São Paulo, Brasil Wildlife Studios Tempo inteiroWe're looking for a talented and passionate Senior Data Engineer to join Wildlife's Marketing Tech team in São Paulo, Brazil. Wildlife is one of the largest gaming companies in the world. Besides building amazing games, part of the reason it has been super successful has been its ability to acquire new users more efficiently than other companies. As a...
-
Senior Data Engineer
Há 2 dias
São Paulo, São Paulo, Brasil Wildlife Studios Tempo inteiroWe're looking for a talented and passionate Senior Data Engineer to join Wildlife's Marketing Tech team in São Paulo, Brazil.RoleWildlife is one of the largest gaming companies in the world. Besides building amazing games, part of the reason it has been super successful has been its ability to acquire new users more efficiently than other companies. As a...
-
Senior Data Engineer
Há 2 dias
São Paulo, Brasil Bespoke Labs Tempo inteiroStaff/Senior Data Engineer: AI Training Data (2-4 Months Contract)Location: RemoteRole Type: Contract (2-4 Months)Time Commitment: 40 hrs/week (Full-time availability required)Compensation: Hyper-competitive hourly rate (matching Tier-1 Staff engineering bands) Experience: 6-12+ yearsAbout BespokeLabsBespokeLabs is a premier, VC-backed AI Research lab with...