Multimodal Ai Evaluation Specialist
Há 7 dias
Evaluate and analyze multimodal data to improve the accuracy and reliability of large language models (LLMs), vision models (LVMs), and multimodal AI systems.We are seeking detail-oriented and analytically minded individuals to perform highly nuanced evaluations of AI system outputs across different modalities.Analysts will assess the quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real-world use cases.Responsibilities:Evaluate outputs generated by LLMs and LVMs across multiple modalities.Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.Identify subtle errors, hallucinations, or biases in AI responses.Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.Collaborate with project managers and quality leads to meet accuracy, reliability, and turnaround benchmarks.Skills & Competencies:Strong critical reading, observational, and evaluative skills across different modalities.Ability to articulate nuanced judgments with precision and clarity.Familiarity with LLMs, generative AI, and multimodal systems.Strong attention to detail and ability to apply guidelines consistently.Awareness of cultural and linguistic nuances, including potential bias and harm in AI outputs.Requirements:Bachelor's degree or equivalent educational qualification.1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains.Demonstrated experience working with data annotation tools and software platforms.Ability to adapt quickly to changing project directions and fast-paced work environments.What We Offer:Opportunities to shape the evaluation standards for next-generation multimodal AI systems.Innovative and supportive global working environment.Competitive compensation and flexible remote working arrangements.This is a unique opportunity to contribute to the development and improvement of advanced AI systems.
-
English Language Specialist
Há 5 dias
Belo Horizonte, Brasil Imerit Technology Tempo inteiroMultimodal GenAI Evaluation AnalystiMerit seeks detail-oriented and analytically minded Multimodal GenAI Evaluation Analysts to perform highly nuanced evaluations of AI system outputs across different modalities: text, image, video, and multimodal interactions.Analysts will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of...
-
English Language Specialist
Há 7 dias
Belo Horizonte, Brasil iMerit Technology Tempo inteiroMultimodal GenAI Evaluation Analyst iMerit seeks detail‑oriented and analytically minded Multimodal GenAI Evaluation Analysts to perform highly nuanced evaluations of AI system outputs across different modalities: text, image, video, and multimodal interactions. Analysts will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of...
-
Belo Horizonte, Brasil Invisible Expert Marketplace Tempo inteiroJoin to apply for the Korean–Japanese Bilingual Specialist – AI Trainer role at Invisible Expert MarketplaceWe're looking for a Korean-Japanese Bilingual Specialist who can bring linguistic precision, cultural knowledge, and critical thinking to training data.You'll work with cutting-edge AI tools, evaluate translations and conversations between Korean...
-
Maltese Language Specialist
Há 7 dias
Belo Horizonte, Brasil Invisible Expert Marketplace Tempo inteiroMaltese Language Specialist - AI Trainer Join to apply for the Maltese Language Specialist - AI Trainer role at Invisible Expert Marketplace . As an experienced Maltese language professional, you will shape the future of AI by providing linguistic depth, cultural context, and precision to training data. You will work with cutting‑edge AI tools, evaluate...
-
Belo Horizonte, Brasil Invisible Expert Marketplace Tempo inteiroJoin to apply for the Malay Trilingual Language Specialist – AI Trainer role at Invisible Expert Marketplace Are you fluent in Malay, English, and at least one additional language, and eager to apply your linguistic expertise to the future of AI? Large-scale language models are transforming how people communicate, learn, and access information across...
-
Ai Agent Evaluation Analyst
2 semanas atrás
Belo Horizonte, Brasil Mindrift Tempo inteiroAt Mindrift, innovation meets opportunity.We believe in using the power of collective human intelligence to ethically shape the future of AI.The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients.Our mission is to unlock the potential of GenAI by tapping into real-world...
-
AI Agent Evaluation Analyst
2 semanas atrás
Belo Horizonte, Minas Gerais, Brasil Mindrift Tempo inteiro R$13.200 - R$78.000 por anoThis opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of...
-
Quality Assurance for AI Agent Evaluation Analyst
3 semanas atrás
Belo Horizonte, Brasil Mindrift Tempo inteiroOverviewAt Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into...
-
Quality Assurance for AI Agent Evaluation Analyst
3 semanas atrás
Belo Horizonte, Brasil Mindrift Tempo inteiroOverviewAt Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into...
-
Language Content Specialist
1 semana atrás
Belo Horizonte, Brasil Bebeelinguistics Tempo inteiroJob DescriptionWe are seeking highly skilled professionals to contribute to the training and enhancement of Large Language Models (LLMs).In this role, you will leverage your linguistic expertise to evaluate prompts, rate AI-generated outputs, and create high-quality content that helps refine AI performance.The role involves evaluating prompts and responses...