Lead Data Engineer

1 semana atrás

Goiânia, Brasil Fusemachines Tempo inteiro

About Fusemachines Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 450 employees). Fusemachines seeks to bring its global expertise in AI to transform companies around the world. Location Location: Remote (Full-time) About The Role This is a remote full-time position, responsible for designing, building, testing, optimizing and maintaining the infrastructure and code required for data integration, storage, processing, pipelines and analytics (BI, visualization and Advanced Analytics) from ingestion to consumption, implementing data flow controls, and ensuring high data quality and accessibility for analytics and business intelligence purposes. This role requires a strong foundation in programming, and a keen understanding of how to integrate and manage data effectively across various storage systems and technologies. We\'re looking for someone who can quickly ramp up, contribute right away and lead the work in Data & Analytics, helping from backlog definition, to architecture decisions, and lead technical the rest of the team with minimal oversight. We are looking for a skilled Sr. Data Engineer/Technical Lead with a strong background in Python, SQL, Pyspark, Redshift and AWS cloud-based large scale data solutions with a passion for data quality, performance and cost optimization. The ideal candidate will develop in an Agile environment, and would have GCP experience too, to contribute to the migration from AWS to GCP. This role is perfect for an individual passionate about leading, leveraging data to drive insights, improve decision-making, and support the strategic goals of the organization through innovative data engineering solutions. Qualifications / Skill Set Must have a full-time Bachelor\'s degree in Computer Science Information Systems, Engineering, or a related field 5+ years of real-world data engineering development experience in AWS and GCP (certifications preferred). Strong expertise in Python, SQL, PySpark and AWS in an Agile environment, with a proven track record of building and optimizing data pipelines, architectures, and datasets, and proven experience in data storage, modeling, management, lake, warehousing, processing/transformation, integration, cleansing, validation and analytics Senior person who can understand requirements and design end to end solutions with minimal oversight Strong programming Skills in one or more languages such as Python, Scala, and proficient in writing efficient and optimized code for data integration, storage, processing and manipulation Strong knowledge SDLC tools and technologies, including project management software (Jira or similar), source code management (GitHub or similar), CI/CD system (GitHub actions, AWS CodeBuild or similar) and binary repository manager (AWS CodeArtifact or similar) Good understanding of Data Modeling and Database Design Principles. Being able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions Strong SQL skills and experience working with complex data sets, Enterprise Data Warehouse and writing advanced SQL queries. Proficient with Relational Databases (RDS, MySQL, Postgres, or similar) and NonSQL Databases (Cassandra, MongoDB, Neo4j, etc.) Skilled in Data Integration from different sources such as APIs, databases, flat files, event streaming. Strong experience in implementing data pipelines and efficient ELT/ETL processes, batch and real-time, in AWS and using open source solutions, being able to develop custom integration solutions as needed, including Data Integration from different sources such as APIs (PoS integrations is a plus), ERP (Oracle and Allegra are a plus), databases, flat files, Apache Parquet, event streaming, including cleansing, transformation and validation of the data Strong experience with scalable and distributed Data Technologies such as Spark/PySpark, DBT and Kafka, to be able to handle large volumes of data Experience with stream-processing systems: Storm, Spark-Streaming, etc. is a plus Strong experience in designing and implementing Data Warehousing solutions in AWS with Redshift. Demonstrated experience in designing and implementing efficient ELT/ETL processes that extract data from source systems, transform it (DBT), and load it into the data warehouse Strong experience in Orchestration using Apache Airflow Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, EMR, ECS/ECR, IAM, CloudWatch, etc Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent Good understanding of BI solutions including Looker and LookML (Looker Modeling Language) Strong knowledge and hands-on experience of DevOps principles, tools and technologies (GitHub and AWS DevOps) including continuous integration, continuous delivery (CI/CD), infrastructure as code (IaC – Terraform), configuration management, automated testing, performance tuning and cost management and optimization Good Problem-Solving skills: being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues Possesses strong leadership skills with a willingness to lead, create Ideas, and be assertive Strong project management and organizational skills Excellent communication skills to collaborate with cross-functional teams, including business users, data architects, DevOps/DataOps/MLOps engineers, data analyst, data scientists, developers, and operations teams. Essential to convey complex technical concepts and insights to non-technical stakeholders effectively Ability to document processes, procedures, and deployment configurations Responsibilities Design, implement, deploy, test and maintain highly scalable and efficient data architectures, defining and maintaining standards and best practices for data management independently with minimal guidance Ensuring the scalability, reliability, quality and performance of data systems Mentoring and guiding junior/mid-level data engineers Collaborating with Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components Evaluating and implementing new technologies and tools to improve data integration, data processing and analysis Design architecture, observability and testing strategies, and building reliable infrastructure and data pipelines Takes ownership of storage layer, data management tasks, including schema design, indexing, and performance tuning Swiftly address and resolve complex data engineering issues, incidents and resolve bottlenecks in SQL queries and database operations Conduct Discovery on existing Data Infrastructure and Proposed Architecture Evaluate and implement cutting-edge technologies and methodologies and continue learning and expanding skills in data engineering and cloud platforms, to improve and modernize existing data systems Evaluate, design, and implement data governance solutions: cataloging, lineage, quality and data governance frameworks that are suitable for a modern analytics solution, considering industry-standard best practices and patterns. Define and document data engineering architectures, processes and data flows Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive) Be an active member of our Agile team, participating in all ceremonies and continuous improvement activities Equal Opportunity Employer Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status. #J-18808-Ljbffr

Data Architect

4 semanas atrás

Goiânia, Brasil buscojobs Brasil Tempo inteiro

Overview At Aravita, we're not just building a product; we're building the future of AI-enabled fresh food operations. We are a fast-paced, innovative startup on a mission to solve complex problems with data. Our culture is built on collaboration, curiosity, and a passion for technology. We're looking for a brilliant Data Engineer / Cloud Architect to join...
Senior Data Engineer Brazil

1 semana atrás

Goiânia, Brasil Artefact Tempo inteiro

Join to apply for the Senior Data Engineer Brazil role at Artefact Who We Are At Artefact LatAm, we believe in and live a culture based on empathy! A healthy work environment is a place where all voices are heard, respected, and valued. Our commitment is to build a more diverse and inclusive environment, because empathy is for everyone, regardless of...
Azure Data Engineer

1 semana atrás

Goiânia, Brasil Aubay Portugal Tempo inteiro

Your connection with Aubay starts in the following lines: Aubay Portugal is a multinational French company, in Portugal since 2007. We have offices in Lisbon and Oporto and we are a specialized consultant in Management, Implementation, Development and Maintenance of Information Systems. We have more than 150 active partners and we operate in sectors such as...
Coordenador de Pré-Vendas

3 semanas atrás

Goiânia, Brasil Open Data Center Tempo inteiro

Overview Você é apaixonado por vendas, processos e liderança ? Gosta de transformar times em alta performance e tem energia para escalar resultados ? Então essa vaga é para você! Modelo de contratação CLT ou PJ Responsabilidades Liderar e desenvolver o time de SDRs, garantindo geração constante de oportunidades qualificadas. Criar e...
Líder Técnico de Desenvolvimento

2 semanas atrás

Goiânia, Brasil Open Data Center Tempo inteiro

Líder Técnico de Desenvolvimento (Tech Lead) Liderar e orientar equipe de desenvolvedores, fornecendo suporte, feedback e direcionamento técnico Tomar decisões arquiteturais, garantir qualidade do código e definir melhores práticas Desenvolver e manter aplicações web fullstack modernas (frontend e backend) Ser referência técnica, acompanhando o...
Security Engineer

Há 6 dias

Goiânia, Brasil LEDN Tempo inteiro

Overview Join to apply for the Security Engineer role at LEDN . Ledn is a global financial services company built for digital assets, helping to improve the everyday lives of Bitcoin holders while building generational wealth for the future. We offer a suite of egalitarian lending, savings and trading products to digital asset holders in over 150 countries...
Senior Machine Learning Engineer

Há 6 dias

Goiânia, Goiás, Brasil CVS Health Tempo inteiro US$83.430 - US$222.480

At CVS Health, we're building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care.As the nation's leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues – caring for...
Senior Vue.js Frontend Engineer

4 semanas atrás

Goiânia, Brasil Lean Tech Tempo inteiro

Company Overview Lean Tech is a rapidly expanding organization situated in Medellín, Colombia. We pride ourselves on possessing one of the most influential networks within software development and IT services for the entertainment, financial, and logistics sectors. Our corporate projections offer a multitude of opportunities for professionals to elevate...
Senior Python Engineer

2 semanas atrás

Goiânia, Brasil FullStack Labs Tempo inteiro

Senior Python Engineer - Remote - Latin AmericaJoin to apply for the Senior Python Engineer - Remote - Latin America role at FullStack Labs About FullStack FullStack is the most transparent IT talent network, connecting highly skilled individuals with top global companies and Silicon Valley startups for remote, on-demand projects. We focus on building a...
iOS Engineer

2 semanas atrás

Goiânia, Brasil AgileEngine Tempo inteiro

Overview Join to apply for the iOS Engineer (Senior) ID41871 role at AgileEngine . ABOUT THE ROLE We are seeking a Senior iOS Engineer to lead a critical modernization of our app’s networking stack. Our current implementation relies on AFNetworking, which was officially deprecated in January 2023, creating long-term risks to stability, security, and...

Américas

Europa

Ásia / Oceania

África

Lead Data Engineer