Ai Evaluation Specialist

Há 2 dias


Marília, Brasil Bebeeevaluation Tempo inteiro

Job OverviewiMerit seeks detail-oriented and analytically skilled professionals to evaluate AI system outputs across text, image, video, and multimodal interactions.Evaluations will inform the development of large language models, vision models, and multimodal AI systems.Key ResponsibilitiesEvaluate AI-generated outputs for accuracy, quality, clarity, and cultural alignment against complex guidelines.Assess subtle errors, hallucinations, or biases in AI responses.Apply domain expertise and logical reasoning to resolve ambiguous outputs.Provide detailed feedback and tagging to ensure consistency across the evaluation team.Escalate unclear cases and contribute to refining evaluation guidelines.Requirements and QualificationsBachelor's degree/diploma or equivalent educational qualification.1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains.Experience working with data annotation tools and software platforms.Expected OutcomesImproved AI model performance through accurate and consistent evaluations.Enhanced ability to identify and mitigate potential biases in AI outputs.