Multimodal GenAI Evaluation Specialist

5 days ago


Viana, Brazil Bebeeevaluation Full-time

Job Description:
We are seeking a detail-oriented and analytically minded Multimodal GenAI Evaluation Specialist to perform highly nuanced evaluations of AI system outputs across different modalities: text, image, video, and multimodal interactions. The ideal candidate will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real-world use cases.

Role Responsibilities:
• Evaluate outputs generated by LLMs across multiple modalities (text, image captions, video descriptions, and multimodal prompts).
• Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.
• Identify subtle errors, hallucinations, or biases in AI responses.
• Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.
• Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.
• Escalate unclear cases and contribute to refining evaluation guidelines.
• Collaborate with Project Managers and Quality Leads to meet accuracy, reliability, and turnaround benchmarks.

Required Skills and Qualifications:
• Bachelor's degree, diploma, or equivalent educational qualification.
• 1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains.
• Demonstrated experience working with data annotation tools and software platforms.
• Strong understanding of language and multimodal communication.
• Ability to adapt quickly to changing project directions and fast-paced work environments.

What We Offer:
A competitive compensation package, opportunities for professional growth, and a dynamic work environment that values innovation and collaboration.

How to Apply:
If you are a motivated and detail-oriented individual looking for a challenging role in AI evaluation, please submit your application for consideration.



  • Viana, Brazil Bebeespecialist Full-time

    About the Opportunity: We are seeking a Language Specialist to contribute to the training and enhancement of Large Language Models (LLMs). Key Responsibilities: Evaluate prompts and responses generated by AI models. Provide structured feedback on AI outputs. Develop, edit, and curate high-quality content for training datasets. Collaborate with cross-functional...


  • Viana, Brazil Bebeeapplication Full-time

    AI Application Developer. We are seeking a talented AI Application Developer to join our team. The ideal candidate will have experience in designing, developing, and maintaining scalable and secure AI applications leveraging existing Large Language Model (LLM) APIs. The successful candidate will be responsible for building and maintaining AI applications on top...