Senior Data Engineer

Há 2 dias


Brazil DeepRec.ai Tempo inteiro

Senior Data Engineer – ML Services and Data Pipelines

Location: Remote

Employment Type: Contract

Experience: 5+ years

3 month contract + multiple extensions possible


Our client is based in the USA but employs a remote workforce. You should be available for team meetings at 8am PT and have some cross over for team collaboration in PST/ET timezone. Happy to hire remote worldwide.


We are seeking a skilled and experienced Data Engineer to design and implement scalable and extensible Machine Learning services and data pipelines within our AWS/Kubernetes environment. In this role, you will be responsible for setting up infrastructure to support the ingestion, processing, and indexing of text-based data. You will collaborate closely with our ML and software engineering teams to build robust pipelines and enable efficient data processing and document indexing.


Key Responsibilities


  • ML Infrastructure Setup: Design, implement, and maintain scalable ML infrastructure on AWS and Kubernetes to support current and future project needs. Tools like Amazon SageMaker for model development, deployment, and monitoring, and AWS Lambda for serverless computing will be key.
  • Data Pipeline Development: Develop, deploy, and manage data pipelines that process large volumes of data for machine learning use cases, with a focus on efficient text data processing. Use AWS Glue for ETL jobs, Amazon Kinesis for real-time data streaming, and AWS Step Functions to coordinate workflows.
  • Vector Database Integration: Set up and maintain vector databases to support natural language processing (NLP) models, ensuring efficient and accurate text-based data retrieval and analysis. Amazon OpenSearch Service and Amazon DynamoDB can be leveraged for indexing and storing large volumes of vectorized data.
  • Document Indexing System: Design and implement a system for document ingestion and indexing, providing seamless access to data for downstream ML and analytics processes. Tools like Amazon S3 for scalable storage and AWS Lambda for automation and processing will play a critical role.
  • Development Pipeline Setup: Establish a development pipeline for document ingestion, collaborating with DevOps and data science teams to ensure continuous integration and deployment practices using AWS CodePipeline and AWS CodeBuild.


Required Skills & Qualifications



  • Experience: 5+ years of experience in data engineering with a focus on ML infrastructure and data pipelines.
  • Technical Expertise:
  • Strong background in AWS and Kubernetes for deploying and managing scalable ML and data solutions.
  • Proficiency in data pipeline tools and frameworks such as AWS Glue, Amazon Kinesis, AWS Step Functions, or similar.
  • Experience with text-based data processing, NLP techniques, and vector databases (e.g., Amazon OpenSearch Service, Amazon DynamoDB, or third-party vector databases like Pinecone, Weaviate, or FAISS).
  • Programming Skills: Advanced skills in Python or similar languages for data engineering and ML pipeline development.
  • Data Handling & Storage: Proficiency in data storage solutions, including Amazon S3, AWS Redshift, Amazon RDS, and AWS Data Lake.
  • Problem-Solving Abilities: Ability to troubleshoot and resolve issues within ML pipelines and data processing environments.


Preferred Qualifications


  • Hands-on experience with ML Ops frameworks and practices.
  • Familiarity with continuous integration/continuous deployment (CI/CD) for data engineering workflows using AWS CodePipeline, AWS CodeBuild, and AWS CloudFormation.



  • Brazil, BR DeepRec.ai Tempo inteiro

    Senior Data Engineer – ML Services and Data PipelinesLocation: RemoteEmployment Type: ContractExperience: 5+ years3 month contract + multiple extensions possibleOur client is based in the USA but employs a remote workforce. You should be available for team meetings at 8am PT and have some cross over for team collaboration in PST/ET timezone. Happy to hire...

  • Data Engineer

    4 semanas atrás


    Brazil Premier Group Recruitment Tempo inteiro

    Data EngineerContract$32 - $36 per hour. Premier Group has engaged exclusively with a growing marketing agency who are seeking a Data Engineer to join them on a 3-month contract, with a huge possibility of extension. The company is an English speaking business, so you must have strong English, both written and verbal, as well as your own Sociedade...

  • Data Engineer

    4 semanas atrás


    Brazil Premier Group Recruitment Tempo inteiro

    Data Engineer Contract $32 - $36 per hour. Premier Group has engaged exclusively with a growing marketing agency who are seeking a Data Engineer to join them on a 3-month contract, with a huge possibility of extension. The company is an English speaking business, so you must have strong English, both written and verbal, as well as your own Sociedade...

  • Data Engineer

    4 semanas atrás


    Brazil, BR Premier Group Recruitment Tempo inteiro

    Data EngineerContract$32 - $36 per hour. Premier Group has engaged exclusively with a growing marketing agency who are seeking a Data Engineer to join them on a 3-month contract, with a huge possibility of extension. The company is an English speaking business, so you must have strong English, both written and verbal, as well as your own Sociedade...

  • Senior Data Engineer

    2 meses atrás


    Brazil Luxoft Tempo inteiro

    ResponsibilitiesDesign, build, and maintain scalable data pipelines on Azure Databricks and Google Cloud Platform (GCP) using PySpark and Python.Work with Azure and GCP platforms, specifically managing services like Azure Databricks, Azure SQL Server, BigQuery, DataProc, and Azure Machine Learning.Develop and maintain CI/CD pipelines using GitHub Actions to...

  • Senior Data Engineer

    2 meses atrás


    Brazil, BR Luxoft Tempo inteiro

    ResponsibilitiesDesign, build, and maintain scalable data pipelines on Azure Databricks and Google Cloud Platform (GCP) using PySpark and Python.Work with Azure and GCP platforms, specifically managing services like Azure Databricks, Azure SQL Server, BigQuery, DataProc, and Azure Machine Learning.Develop and maintain CI/CD pipelines using GitHub Actions to...

  • Data Engineer

    2 meses atrás


    Brazil, BR Blueclip Tempo inteiro

    As Blueclip, we started our journey to bring the most creative minds and the know-how together to deliver great works.About Our SearchWe are looking for a multi-talented Big Data Engineer to facilitate the operations of the Data scientists and Engineering team of our partner which is a cryptocurrency investment research platform that’s driven by machine...

  • Data Engineer

    2 meses atrás


    Brazil Blueclip Tempo inteiro

    As Blueclip, we started our journey to bring the most creative minds and the know-how together to deliver great works.About Our SearchWe are looking for a multi-talented Big Data Engineer to facilitate the operations of the Data scientists and Engineering team of our partner which is a cryptocurrency investment research platform that’s driven by machine...

  • Data Engineer

    2 meses atrás


    Brazil Blueclip Tempo inteiro

    As Blueclip, we started our journey to bring the most creative minds and the know-how together to deliver great works. About Our Search We are looking for a multi-talented Big Data Engineer to facilitate the operations of the Data scientists and Engineering team of our partner which is a cryptocurrency investment research platform that’s driven by...

  • Data Engineer

    Há 4 dias


    Brazil, BR Pivotal Solutions Tempo inteiro

    This is a contract position that will start immediately through the end of the year, with possible extension through next year depending on performance.Responsibilities:Collaborate with data analysts, engineers, business stakeholders, data scientists, and other team members to understand detailed data requirementsWrite code in Python and Airflow to build...

  • Data Engineer

    Há 4 dias


    Brazil Pivotal Solutions Tempo inteiro

    This is a contract position that will start immediately through the end of the year, with possible extension through next year depending on performance.Responsibilities:Collaborate with data analysts, engineers, business stakeholders, data scientists, and other team members to understand detailed data requirementsWrite code in Python and Airflow to build...

  • Senior Data Engineer

    3 semanas atrás


    Brazil HCLTech Tempo inteiro

    www.hcltech.comWe are HCLTech, one of the world’s largest and fastest growing technology and DSA companies with over 227,000 professionals across 60 countries, driving progress through industry-leading capabilities focused on Digital, Engineering and Cloud.The driving force behind this work, our people, is a diverse, creative and passionate audience that...

  • Data Engineer

    2 meses atrás


    Brazil Remobi Tempo inteiro

    Data Engineer - (REMOTE) Brazil About Remobi: We are building the world's greatest community of remote technologists! Today, organizations that understand the value of remote working will reap the rewards. It doesn’t just provide team members with a healthier work-life balance, it gives you the opportunity to access the brightest minds in the...

  • Data Engineer

    2 meses atrás


    Brazil Remobi Tempo inteiro

    Data Engineer - (REMOTE) BrazilAbout Remobi:We are building the world's greatest community of remote technologists!Today, organizations that understand the value of remote working will reap the rewards. It doesn’t just provide team members with a healthier work-life balance, it gives you the opportunity to access the brightest minds in the world.Our...

  • Data Engineer

    2 meses atrás


    Brazil, BR Remobi Tempo inteiro

    Data Engineer - (REMOTE) BrazilAbout Remobi:We are building the world's greatest community of remote technologists!Today, organizations that understand the value of remote working will reap the rewards. It doesn’t just provide team members with a healthier work-life balance, it gives you the opportunity to access the brightest minds in the world.Our...

  • Senior Data Engineer

    2 meses atrás


    Brazil, BR Sigma Software Group Tempo inteiro

    Our growing data teams are seeking a savvy Senior Data Engineer to join them and help build and evolve the next generation of Audience & Identity data platforms that handle data at scale and use state-of-the-art technologies to unlock the data’s business value.If you’re excited about working with cutting-edge technology in a fast-paced environment,...

  • Senior Data Engineer

    2 meses atrás


    Brazil Sigma Software Group Tempo inteiro

    Our growing data teams are seeking a savvy Senior Data Engineer to join them and help build and evolve the next generation of Audience & Identity data platforms that handle data at scale and use state-of-the-art technologies to unlock the data’s business value.If you’re excited about working with cutting-edge technology in a fast-paced environment,...

  • Senior Data Engineer

    2 meses atrás


    Brazil Sigma Software Group Tempo inteiro

    Our growing data teams are seeking a savvy Senior Data Engineer to join them and help build and evolve the next generation of Audience & Identity data platforms that handle data at scale and use state-of-the-art technologies to unlock the data’s business value. If you’re excited about working with cutting-edge technology in a fast-paced environment,...

  • Senior Python Engineer

    3 semanas atrás


    Brazil YASH Technologies Tempo inteiro

    Hello We are actively looking for a Senior Python Engineer. If you or your consultant are actively looking for a new job please share your profile. Role: Senior Python Engineer Duration: 2+ Months Location: Brazil ( REMOTE ) Mandatory Skills : Experience with Airflow on Kubernetes or KEDA We are seeking a highly skilled Senior Python Engineer to...

  • Senior Python Engineer

    3 semanas atrás


    Brazil, BR YASH Technologies Tempo inteiro

    Hello We are actively looking for a Senior Python Engineer. If you or your consultant are actively looking for a new job please share your profile.Role: Senior Python EngineerDuration: 2+ Months Location: Brazil ( REMOTE ) Mandatory Skills : Experience with Airflow on Kubernetes or KEDAWe are seeking a highly skilled Senior Python Engineer to join our...