
Lead Data Engineer
2 weeks ago
About Fusemachines
Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 450 employees, Fusemachines seeks to bring its global expertise in AI to transform companies around the world.

Location: Remote (Full-time)

About The Role
This is a remote full-time position, responsible for designing, building, testing, optimizing and maintaining the infrastructure and code required for data integration, storage, processing, pipelines and analytics (BI, visualization and Advanced Analytics) from ingestion to consumption, implementing data flow controls, and ensuring high data quality and accessibility for analytics and business intelligence purposes.
This role requires a strong foundation in programming and a keen understanding of how to integrate and manage data effectively across various storage systems and technologies.

We're looking for someone who can ramp up quickly, contribute right away, and lead the work in Data & Analytics, from backlog definition to architecture decisions, providing technical leadership to the rest of the team with minimal oversight.

We are looking for a skilled Sr. Data Engineer/Technical Lead with a strong background in Python, SQL, PySpark, Redshift and AWS cloud-based large-scale data solutions, with a passion for data quality, performance and cost optimization. The ideal candidate will develop in an Agile environment and would also have GCP experience, to contribute to the migration from AWS to GCP.

This role is perfect for an individual passionate about leading, leveraging data to drive insights, improve decision-making, and support the strategic goals of the organization through innovative data engineering solutions.

Qualification / Skill Set Requirement:
- Must have a full-time Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field
- 5+ years of real-world data engineering development experience in AWS and GCP (certifications preferred). Strong expertise in Python, SQL, PySpark and AWS in an Agile environment, with a proven track record of building and optimizing data pipelines, architectures, and datasets, and proven experience in data storage, modeling, management, lake, warehousing, processing/transformation, integration, cleansing, validation and analytics
- Senior person who can understand requirements and design end-to-end solutions with minimal oversight
- Strong programming skills in one or more languages such as Python or Scala, and proficiency in writing efficient and optimized code for data integration, storage, processing and manipulation
- Strong knowledge of SDLC tools and technologies, including project management software (Jira or similar), source code management (GitHub or similar), CI/CD systems (GitHub Actions, AWS CodeBuild or similar) and binary repository managers (AWS CodeArtifact or similar)
- Good understanding of data modeling and database design principles. Able to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions
- Strong SQL skills and experience working with complex data sets, Enterprise Data Warehouse and writing advanced SQL queries. Proficient with relational databases (RDS, MySQL, Postgres, or similar) and NoSQL databases (Cassandra, MongoDB, Neo4j, etc.)
- Skilled in data integration from different sources such as APIs, databases, flat files and event streaming
- Strong experience in implementing data pipelines and efficient ELT/ETL processes, batch and real-time, in AWS and using open source solutions, being able to develop custom integration solutions as needed, including data integration from different sources such as APIs (PoS integrations are a plus), ERP (Oracle and Allegra are a plus), databases, flat files, Apache Parquet and event streaming, including cleansing, transformation and validation of the data
- Strong experience with scalable and distributed data technologies such as Spark/PySpark, DBT and Kafka, to handle large volumes of data
- Experience with stream-processing systems (Storm, Spark Streaming, etc.) is a plus
- Strong experience in designing and implementing data warehousing solutions in AWS with Redshift. Demonstrated experience in designing and implementing efficient ELT/ETL processes that extract data from source systems, transform it (DBT), and load it into the data warehouse
- Strong experience in orchestration using Apache Airflow
- Expert in cloud computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, EMR, ECS/ECR, IAM, CloudWatch, etc.
- Good understanding of data quality and governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent
- Good understanding of BI solutions including Looker and LookML (Looker Modeling Language)
- Strong knowledge and hands-on experience of DevOps principles, tools and technologies (GitHub and AWS DevOps), including continuous integration, continuous delivery (CI/CD), infrastructure as code (IaC, Terraform), configuration management, automated testing, performance tuning, and cost management and optimization
- Good problem-solving skills: being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues
- Strong leadership skills, with a willingness to lead, create ideas, and be assertive
- Strong project management and organizational skills
- Excellent communication skills to collaborate with cross-functional teams, including business users, data architects, DevOps/DataOps/MLOps engineers, data analysts, data scientists, developers, and operations teams. Essential to convey complex technical concepts and insights to non-technical stakeholders effectively
- Ability to document processes, procedures, and deployment configurations

Responsibilities:
- Design, implement, deploy, test and maintain highly scalable and efficient data architectures, defining and maintaining standards and best practices for data management independently with minimal guidance
- Ensure the scalability, reliability, quality and performance of data systems
- Mentor and guide junior/mid-level data engineers
- Collaborate with Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components
- Evaluate and implement new technologies and tools to improve data integration, data processing and analysis
- Design architecture, observability and testing strategies, and build reliable infrastructure and data pipelines
- Take ownership of the storage layer and data management tasks, including schema design, indexing, and performance tuning
- Swiftly address and resolve complex data engineering issues and incidents, and resolve bottlenecks in SQL queries and database operations
- Conduct discovery on existing data infrastructure and proposed architecture
- Evaluate and implement cutting-edge technologies and methodologies, and continue learning and expanding skills in data engineering and cloud platforms, to improve and modernize existing data systems
- Evaluate, design, and implement data governance solutions: cataloging, lineage, quality and data governance frameworks that are suitable for a modern analytics solution, considering industry-standard best practices and patterns
- Define and document data engineering architectures, processes and data flows
- Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive)
- Be an active member of our Agile team, participating in all ceremonies and continuous improvement activities

Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

Seniority level: Mid-Senior level
Employment type: Contract
Job function: Information Technology
Industries: Internet Publishing
-
Data Engineer
2 days ago
Joinville, Brazil - AgileEngine - Full-time
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. WHY JOIN US If you're looking for a place to grow, make an...
-
Data Engineer
3 weeks ago
Joinville, Santa Catarina, Brazil - Jeeves - Remote work - Freelance - Full-time
Jeeves is a groundbreaking financial operating system built for global businesses that provides corporate cards, cross-border payments, and spend management software within one unified platform. The company operates across 20+ countries including Brazil, Canada, Colombia, Mexico, the United Kingdom, across Europe, and the United States, and serves over 5,000...
-
Junior Data Engineer
2 weeks ago
Joinville, Brazil - Dev.Pro - Full-time
Overview: Are you in Brazil, Argentina or Colombia? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to you! We invite a Junior Data Engineer to join our dynamic team supporting a major enterprise client in modernizing their data platform. In this role, you’ll assist...
-
Lead Linux Kernel Engineer
3 weeks ago
Joinville, Brazil - Canonical - Full-time
Lead Linux Kernel Engineer - Ubuntu role at Canonical. Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very...
-
Data Engineer | Bees Personalization
2 weeks ago
Joinville, Brazil - Ab Inbev Growth Group - Full-time
Overview: AB InBev is the leading global brewer and one of the top 5 consumer goods companies in the world. With over 500 beer brands, we are the number one or two in many of the main beer markets worldwide, including North America, Latin America, Europe, Asia, and Africa. About ABI Growth Group: Created in 2022, the Growth Group unifies our business-to-business...
-
Golang Software Engineer
3 weeks ago
Joinville, Brazil - AgileEngine - Full-time
Golang Software Engineer (Senior/Lead) ID37218 role at AgileEngine. AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and startups across 17+ industries. We rank among the leaders in application development and AI/ML, and our people-first culture has earned us Best...
-
Golang Software Engineer
3 weeks ago
Joinville, Santa Catarina, Brazil - AgileEngine - Full-time
Golang Software Engineer (Senior/Lead) ID37218 role at AgileEngine. AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and startups across 17+ industries. We rank among the leaders in application development and AI/ML, and our people-first culture has earned us Best...
-
Lead Software Engineer
3 weeks ago
Joinville, Santa Catarina, Brazil - EPAM Systems - Full-time
Overview: We are looking for an innovative and driven Lead Software Engineer - Backend to join Feedonomics' fully remote Engineering team. You will work closely with skilled engineers to design, implement, and optimize mission-critical infrastructure and services for our data-intensive SaaS platform. Responsibilities: Build scalable and highly reliable...
-
Big Data Engineer
3 weeks ago
Joinville, Brazil - BairesDev - Full-time
Big Data Engineer - Remote Work | REF# role at BairesDev. At BairesDev, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the...
-
Sr. Data Engineer
3 weeks ago
Joinville, Brazil - RecargaPay - Full-time
Come Make an Impact on Millions of Brazilians! At RecargaPay, we’re on a mission to deliver the best payment experience for Brazilian consumers and small businesses, by building a powerful digital ecosystem where the banked and unbanked connect, and where consumers and merchants have a one-stop shop for all their financial needs. We serve over 10...