AI Evaluation Specialist

3 days ago


Jandira, Brazil Bebeeevaluation Full-time

Job Description:
We are seeking detail-oriented and analytically minded professionals to perform highly nuanced evaluations of AI system outputs across different modalities. Analysts will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real-world use cases.

Key Responsibilities:
• Evaluate outputs generated by LLMs across multiple modalities (text, image captions, video descriptions, and multimodal prompts).
• Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.
• Identify subtle errors, hallucinations, or biases in AI responses.
• Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.
• Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.
• Escalate unclear cases and contribute to refining evaluation guidelines.
• Collaborate with Project Managers and Quality Leads to meet accuracy, reliability, and turnaround benchmarks.

Skills & Competencies:
• Strong critical reading, observational, and evaluative skills across different modalities.
• Ability to articulate nuanced judgments with precision and clarity.
• Excellent English comprehension; additional languages a plus.
• Familiarity with LLMs, generative AI, and multimodal systems.
• Strong attention to detail and ability to apply guidelines consistently.
• Awareness of cultural and linguistic nuances, including potential bias and harm in AI outputs.
• Comfort with evolving workflows, rapid feedback cycles, and complex quality frameworks.

Requirements:
• Bachelor's degree/diploma or equivalent educational qualification.
• 1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains.
• Demonstrated experience working with data annotation tools and software platforms.
• Strong understanding of language and multimodal communication (instruction following in image generation, fact-checking, narrative coherence in video, etc.).
• Ability to adapt quickly to changing project directions and fast-paced work environments.
• Previous experience creating or annotating complex data specifically for Large Language Model (LLM) training.
• Prior exposure to generative AI, prompt engineering, or LLM fine-tuning workflows is a plus.

What We Offer:
• Opportunities to shape the evaluation standards for next-generation multimodal AI systems.
• Innovative and supportive global working environment.
• Competitive compensation and flexible remote working arrangements.
• Continuous learning and growth in applied AI evaluation.

iMerit offers a dynamic and inclusive work environment that fosters innovation, collaboration, and professional growth. Our team members enjoy competitive compensation, flexible remote working arrangements, and opportunities for continuous learning and growth in applied AI evaluation.



  • Jandira, Brazil Bebeelinguistic Full-time

    Job Title: Natural Language Enhancement Specialist. We're seeking highly skilled professionals to contribute to the development and refinement of Large Language Models. As a Natural Language Enhancement Specialist, you'll play a vital role in evaluating prompts, rating AI-generated outputs, and creating high-quality content that enhances AI performance. Key...


  • Jandira, Brazil Bebeeartificialintelligence Full-time

    Job Title: AI Software Engineer. We are seeking a talented AI software engineer to join our team. The ideal candidate will have a strong background in artificial intelligence and software engineering, with experience in building and maintaining large-scale AI applications. Key Responsibilities: Design, develop, and maintain scalable and secure AI applications...


  • Jandira, Brazil Bebeeeducation Full-time

    Job Overview: We are seeking a highly qualified Subject Matter Expert to design, evaluate, and review educational materials that help train and refine advanced AI language models. This is an opportunity to leverage your expertise in STEM, Social Sciences, or Humanities to make a significant impact on the development of AI education. Key Responsibilities: Create...

  • Chemistry Specialist

    3 weeks ago


    Jandira, Brazil Innodata Inc. Full-time

    Overview: Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider of choice for 4 out of 5 of the world's biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine...


  • Jandira, Brazil AMX Healthcare Full-time

    AI Engineer – Data & Automation (Healthcare SaaS). About Us: AMX Healthcare is building a new SaaS platform to transform how healthcare staffing is managed, connecting clients, contractors, and job opportunities with speed and compliance. We’re investing in AI to streamline recruiting, automate data flows, and surface insights from our knowledge base...

  • Golang Engineer

    7 days ago


    Jandira, Brazil Nearsure Full-time

    Explore the Nearsure experience! Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your teammates and management. Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values. Your well-being matters: Our People Care team is here from...