Multimodal AI Evaluation Specialist

7 days ago


Belo Horizonte, Brazil Bebeeevaluation Full-time

Evaluate and analyze multimodal data to improve the accuracy and reliability of large language models (LLMs), vision models (LVMs), and multimodal AI systems.

We are seeking detail-oriented and analytically minded individuals to perform highly nuanced evaluations of AI system outputs across different modalities. Analysts will assess the quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real-world use cases.

Responsibilities:

  • Evaluate outputs generated by LLMs and LVMs across multiple modalities.
  • Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.
  • Identify subtle errors, hallucinations, or biases in AI responses.
  • Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.
  • Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.
  • Collaborate with project managers and quality leads to meet accuracy, reliability, and turnaround benchmarks.

Skills & Competencies:

  • Strong critical reading, observational, and evaluative skills across different modalities.
  • Ability to articulate nuanced judgments with precision and clarity.
  • Familiarity with LLMs, generative AI, and multimodal systems.
  • Strong attention to detail and ability to apply guidelines consistently.
  • Awareness of cultural and linguistic nuances, including potential bias and harm in AI outputs.

Requirements:

  • Bachelor's degree or equivalent educational qualification.
  • 1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains.
  • Demonstrated experience working with data annotation tools and software platforms.
  • Ability to adapt quickly to changing project directions and fast-paced work environments.

What We Offer:

  • Opportunities to shape the evaluation standards for next-generation multimodal AI systems.
  • Innovative and supportive global working environment.
  • Competitive compensation and flexible remote working arrangements.

This is a unique opportunity to contribute to the development and improvement of advanced AI systems.



  • Belo Horizonte, Brazil iMerit Technology Full-time

    Multimodal GenAI Evaluation Analyst. iMerit seeks detail-oriented and analytically minded Multimodal GenAI Evaluation Analysts to perform highly nuanced evaluations of AI system outputs across different modalities: text, image, video, and multimodal interactions. Analysts will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of...


  • Belo Horizonte, Brazil Invisible Expert Marketplace Full-time

    Join to apply for the Korean–Japanese Bilingual Specialist – AI Trainer role at Invisible Expert Marketplace. We're looking for a Korean-Japanese Bilingual Specialist who can bring linguistic precision, cultural knowledge, and critical thinking to training data. You'll work with cutting-edge AI tools, evaluate translations and conversations between Korean...


  • Belo Horizonte, Brazil Invisible Expert Marketplace Full-time

    Maltese Language Specialist - AI Trainer. Join to apply for the Maltese Language Specialist - AI Trainer role at Invisible Expert Marketplace. As an experienced Maltese language professional, you will shape the future of AI by providing linguistic depth, cultural context, and precision to training data. You will work with cutting-edge AI tools, evaluate...


  • Belo Horizonte, Brazil Invisible Expert Marketplace Full-time

    Join to apply for the Malay Trilingual Language Specialist – AI Trainer role at Invisible Expert Marketplace. Are you fluent in Malay, English, and at least one additional language, and eager to apply your linguistic expertise to the future of AI? Large-scale language models are transforming how people communicate, learn, and access information across...

  • AI Agent Evaluation Analyst

    2 weeks ago


    Belo Horizonte, Brazil Mindrift Full-time

    At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world...

  • AI Agent Evaluation Analyst

    2 weeks ago


    Belo Horizonte, Minas Gerais, Brazil Mindrift Full-time R$13,200 - R$78,000 per year

    This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of...


  • Belo Horizonte, Brazil Mindrift Full-time

    Overview: At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into...


  • Language Content Specialist

    1 week ago


    Belo Horizonte, Brazil Bebeelinguistics Full-time

    Job Description: We are seeking highly skilled professionals to contribute to the training and enhancement of Large Language Models (LLMs). In this role, you will leverage your linguistic expertise to evaluate prompts, rate AI-generated outputs, and create high-quality content that helps refine AI performance. The role involves evaluating prompts and responses...