Data Engineer

Há 5 dias


João Pessoa, Brasil Jobgether Tempo inteiro

Get AI-powered advice on this job and more exclusive features. This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Engineer (DBT + Spark + Argo) in Latin America . We are seeking a highly skilled Data Engineer to join a remote-first, collaborative team driving the modernization of large-scale data platforms in the healthcare sector. In this role, you will work on transforming legacy SQL pipelines into modular, scalable, and testable DBT architectures, leveraging Spark for high-performance processing and Argo for workflow orchestration. You will implement modern lakehouse solutions, optimize storage and querying strategies, and enable real-time analytics with ElasticSearch. This position offers the chance to contribute to a cutting-edge, cloud-native data environment, working closely with cross-functional teams to deliver reliable, impactful data solutions. Accountabilities Translate legacy T-SQL logic into modular, scalable DBT models powered by Spark SQL Build reusable, high-performance data transformation pipelines Develop testing frameworks to ensure data accuracy and integrity within DBT workflows Design and orchestrate automated workflows using Argo Workflows and CI/CD pipelines with Argo CD Manage reference datasets and mock data (e.g., ICD-10, CPT), maintaining version control and governance Implement efficient storage and query strategies using Apache Hudi, Parquet, and Iceberg Integrate ElasticSearch for analytics through APIs and pipelines supporting indexing and querying Collaborate with DevOps teams to optimize cloud storage, enforce security, and ensure compliance Participate in Agile squads, contributing to planning, estimation, and sprint reviews Requirements Strong experience with DBT for data modeling, testing, and deployment Hands-on proficiency in Spark SQL, including performance tuning Solid programming skills in Python for automation and data manipulation Familiarity with Jinja templating to build reusable DBT components Practical experience with data lake formats: Apache Hudi, Parquet, Iceberg Expertise in Argo Workflows and CI/CD integration with Argo CD Deep understanding of AWS S3 storage, performance tuning, and cost optimization Experience with ElasticSearch for indexing and querying structured/unstructured data Knowledge of healthcare data standards (e.g., ICD-10, CPT) Ability to work cross-functionally in Agile environments Nice to have: Experience with Docker, Kubernetes, cloud-native data tools (AWS Glue, Databricks, EMR), CI/CD automation, data compliance standards (HIPAA, SOC2), or contributions to open-source DBT/Spark projects Benefits Contractor agreement with payment in USD 100% remote work within LATAM Observance of local public holidays Access to English classes and professional learning platforms Referral program and other growth opportunities Exposure to cutting-edge data engineering projects in a cloud-native environment Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching. When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly. Our AI thoroughly analyzes your CV and LinkedIn profile, evaluating your skills, experience, and achievements It compares your profile against the job's core requirements and past success factors to calculate a match score The top 3 candidates with the highest match are automatically shortlisted


  • Lead Data Engineer

    Há 7 dias


    João Pessoa, Brasil Fusemachines Tempo inteiro

    3 weeks ago Be among the first 25 applicants About FusemachinesFusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...


  • João Pessoa, Brasil Epam Systems Tempo inteiro

    We are seeking a Senior Data DevOps Engineer to join our remote team, working on a cutting-edge project that involves developing and maintaining large-scale big data infrastructure.In this role, you will play a crucial role in ensuring the reliability, scalability, and performance of our big data infrastructure.You will work closely with cross-functional...


  • João Pessoa, Brasil EPAM Systems Tempo inteiro

    We are seeking a Senior Data DevOps Engineer to join our remote team, working on a cutting-edge project that involves developing and maintaining large-scale big data infrastructure. In this role, you will play a crucial role in ensuring the reliability, scalability, and performance of our big data infrastructure. You will work closely with cross-functional...

  • Etl + Data Engineer

    1 semana atrás


    João Pessoa, Brasil Bairesdev Tempo inteiro

    Join to apply for the ETL + Data Engineer - REMOTE WORK | REF#**** role at BairesDevContinue with Google Continue with Google5 months ago Be among the first 25 applicantsJoin to apply for the ETL + Data Engineer - REMOTE WORK | REF#**** role at BairesDevAt BairesDev, we've been leading the way in technology projects for over 15 years.We deliver cutting-edge...

  • Data Platform Engineer

    2 semanas atrás


    João Pessoa, Brasil BairesDev Tempo inteiro

    Overview Data Platform Engineer - Remote Work | REF# at BairesDev. We are looking for an outstanding Data Platform Engineer to join BairesDev’s Research & Development Team (R&D). This professional will be responsible for building the data platform to simplify and accelerate tasks related to ingestion, storage, processing, cataloging, quality, and...


  • João Pessoa, Brasil Speechify Tempo inteiro

    Software Engineer, Data Infrastructure & AcquisitionRemoteThe mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify's text-to-speech products to turn whatever they're reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember...

  • Staff Data Engineer

    1 semana atrás


    João Pessoa, Brasil Loggi Tempo inteiro

    Somos uma empresa de tecnologia que está reinventando a logística e a forma de fazer entregas no Brasil.Queremos empoderar pessoas e transformar negócios com entregas de excelência. Movimentamos mais de 400 mil pacotes por dia com uma tecnologia de ponta a ponta, que aproxima todos os cantos do país.Temos o propósito de que as entregas sejam...


  • João Pessoa, Brasil Canonical Tempo inteiro

    Python and Kubernetes Software Engineer - Data, AI/ML & Analytics Join to apply for the Python and Kubernetes Software Engineer - Data, AI/ML & Analytics role at Canonical Python and Kubernetes Software Engineer - Data, AI/ML & Analytics 4 months ago Be among the first 25 applicants Join to apply for the Python and Kubernetes Software Engineer - Data, AI/ML...

  • Data Processing Specialist

    2 semanas atrás


    João Pessoa, Brasil Bebeebackend Tempo inteiro

    Senior Backend Engineer RoleWe are seeking a highly skilled and experienced Senior Backend Engineer to join our team.This is a 12-month contract opportunity that will challenge you to create innovative solutions.Develop high-quality data processing pipelines using Typescript and Go on AWS.Create and optimize processes for efficiency, scalability, and...


  • João Pessoa, Brasil LAB Tech & Analytics Office Tempo inteiro

    Company Description LAB Tech Analytics is a dynamic and forward-thinking small company dedicated to revolutionizing data-driven solutions. We empower businesses to unlock the true potential of their data assets with our comprehensive suite of services. From our top-notch data recruitment and training hub to our cutting-edge data products and services, we...