AI/ML Engineer

3 semanas atrás


Região Geográfica Intermediária de Sorocaba, Brasil Zyte Tempo inteiro

AI/ML Engineer - Web Data Quality - Remote 3 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. About Zyte At Zyte, we eat data for breakfast and you can eat your breakfast anywhere and work for Zyte. Founded in 2010, we are a globally distributed team of over 250 Zytans working from over 28 countries who are on a mission to enable our customers to extract the data they need to continue to innovate and grow their businesses. We believe that all businesses deserve a smooth pathway to data. For more than a decade, Zyte has led the way in building powerful, easy-to-use tools to collect, format, and deliver web data, quickly, dependably, and at scale. Today, the data we extract helps thousands of organizations make smarter business decisions, secure competitive advantage, and drive sustainable growth. Today, over 3,000 companies and 1 million developers rely on our tools and services to get the data they need from the web. Data QA is an important function within Zyte. The Data QA team works to ensure that the quality and usability of the data scraped by our web scrapers meets and exceeds the expectations of our enterprise clients. Are you passionate about data and data quality and integrity? Do you enjoy using Python and AI to analyze and manipulate data, detect data quality issues, and visualize your findings? Are you highly customer-focused with excellent attention to detail? Owing to growing business and the need for ever more sophisticated Data QA, we are looking for a talented Data Scientist to join our team. As a Zyte Engineer, you work on AI-based data wrangling, data manipulation, and data visualisation techniques and apply them in the verification and validation of data quality as it pertains to data extracted from the web. Requirements Design and implement AI-driven quality checks: build models to detect anomalies, identify schema drift, and classify data errors in real time Automate and scale QA: replace manual and rule-based validation with ML-powered solutions that continuously improve Leverage GenAI for validation: use embedding models, LLMs, and prompt-driven pipelines to perform semantic checks on scraped data Develop monitoring & alerting pipelines: quantify data quality via KPIs, dashboards, and automated reports for stakeholders Experiment & innovate: research and prototype new AI techniques for QA, e.g. using embeddings, synthetic data, and reinforcement learning to stress-test scrapers Collaborate cross-functionally: work with developers, product managers, and account teams to integrate AI-based QA into production workflows Communicate insights: present findings with clear visualizations, metrics, and evidence-based recommendations to technical and non-technical audiences Requirements Proficiency in Python & PyData stack (NumPy, pandas, scikit-learn, PyTorch/TensorFlow preferred) 3+ years in a data science, applied ML, or data engineering role (ideally with exposure to QA or data validation at scale) Hands-on experience with GenAI tools: LLM APIs (OpenAI, Anthropic, Google), prompt engineering, cost/token optimization Strong ML fundamentals: anomaly detection, classification, clustering, embeddings, evaluation metrics Experience with big data frameworks (Spark, BigQuery, or similar) Ability to work with very large datasets (millions+ of records) Version control skills (GitHub/Bitbucket) Excellent communication in English, both technical and non-technical Desired Skills Prior experience in data quality automation or web data QA Familiarity with LangChain, MCP, Marvin, or similar orchestration frameworks Experience building QA dashboards or visualization layers Background in statistics or applied mathematics Previous remote/distributed work experience Benefits As a new Zytan, you will become part of a self-motivated, progressive, multi-cultural team. Have the freedom and flexibility to work from where you do your best work. Attend conferences and meet with team members from across the globe. Work with cutting-edge open source technologies and tools. Seniority level Mid-Senior level Employment type Full-time Job function Other Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Zyte by 2x Sign in to set job alerts for “Machine Learning Engineer” roles. Other related job postings: Python and Kubernetes Software Engineer - Data, AI/ML & Analytics Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics Senior Machine Learning Engineer, Ad Performance Software Engineer Iii, Fullstack, Quickpack (Remote) Lead Machine Learning Engineer, Ad Performance Principal Machine Learning Engineer, Ad Performance Software Engineer - Solutions Engineering Software Engineer II / Senior Software Engineer We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr



  • Região Geográfica Intermediária de Sorocaba, Brasil Launch Potato Tempo inteiro

    Senior ML Engineer, Recommendation Systems Join to apply for the Senior ML Engineer, Recommendation Systems role at Launch Potato. WHO ARE WE? Launch Potato is a profitable digital media company that reaches over 30M+ monthly visitors through brands such as FinanceBuzz, All About Cookies, and OnlyInYourState. As The Discovery and Conversion Company, our...

  • AI Red Team Engineer

    4 semanas atrás


    Região Geográfica Intermediária de Sorocaba, Brasil LILT AI Tempo inteiro

    AI Red Team Engineer - Brazilian Portuguese 1 day ago Be among the first 25 applicants About LILT AI is changing how the world communicates — and LILT is leading that transformation. We’re on a mission to make the world’s information accessible to everyone, regardless of the language they speak. We use cutting‑edge AI, machine translation, and...

  • AI Engineer

    3 semanas atrás


    Região Geográfica Intermediária de Sorocaba, Brasil Nearsure Tempo inteiro

    1 day ago Be among the first 25 applicants Join our close-knit LATAM remote team : Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management. Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values. ️Your well-being matters : Our People Care...

  • AI Driven Engineer Ruby

    3 semanas atrás


    Região Geográfica Intermediária de Sorocaba, Brasil Promote Project Tempo inteiro

    We are not just another fintech unicorn. We are a pack of dreamers, makers, and tech enthusiasts building the future of payments. With millions of happy customers and a hunger for innovation, we're now expanding our neural network - literally and metaphorically. Now, we are looking for engineers eager to shape the future, not just through code but with...

  • AI Engineer

    4 semanas atrás


    Região Geográfica Intermediária de São Paulo, Brasil BairesDev Tempo inteiro

    2 days ago Be among the first 25 applicants At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting‑edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant...

  • Principal Ai Engineer

    4 semanas atrás


    Região Geográfica Intermediária de São Paulo, Brasil Stealth Healthcare Ai Startup Tempo inteiro

    WHO WE ARE We are a stealth‑stage healthcare AI startup that is the missing link in healthcare revenue management, bridging the gap between clinical care and financial operations. We're transforming how primary care clinics capture revenue through innovative AI technology that operates at the intersection of medical documentation, billing, and...

  • Senior Machine Learning

    3 semanas atrás


    Região Geográfica Intermediária de Sorocaba, Brasil BairesDev Tempo inteiro

    Join to apply for the Senior Machine Learning & LLM Engineer - Remote Work | REF# role at BairesDev 4 months ago Be among the first 25 applicants Join to apply for the Senior Machine Learning & LLM Engineer - Remote Work | REF# role at BairesDev Get AI-powered advice on this job and more exclusive features. At BairesDev, we've been leading the way in...

  • Data/Ai Consultant

    4 semanas atrás


    Região Geográfica Intermediária de Sorocaba, Brasil Able.Digital Tempo inteiro

    We’re looking for a fractional Data / AI Specialist to partner with our sales team on an hourly / contract basis . In this role, you’ll act as the technical expert during sales cycles , helping prospects understand the value of our data-driven solutions and guiding them toward the right implementation approach. This position is primarily remote, but will...

  • AI Engineer

    4 semanas atrás


    Região Geográfica Imediata de Criciúma, Brasil Lumenalta Tempo inteiro

    AI Engineer (FinTech Workflow Automation) At Lumenalta, we partner with forward-thinking organizations to build technology solutions that scale, delight users, and accelerate business growth. Our global teams bring curiosity, commitment, and technical excellence to every project. We value transparency, autonomy, and impact—empowering every team member to...

  • Data Engineer

    3 semanas atrás


    Região Geográfica Intermediária de São Paulo, Brasil Applaudo Tempo inteiro

    1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. You are someone who wants to influence your own development. You’re looking for a company where you have the opportunity to pursue your interests and be able to grow professionally. You bring to Applaudo the following competencies: Bachelor’s degree...