Freelance Agent Evaluation Analyst

2 semanas atrás


Buenos Aires, Brasil Mindrift Tempo inteiro

Job Description 1 week ago – Be among the first 25 applicants. Get AI‑powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. Overview At Mindrift, innovation meets opportunity. Our platform connects domain experts with cutting‑edge AI projects. We aim to unlock the potential of generative AI by tapping into real‑world expertise from across the globe. About the Project We are looking for quality assurance specialists (QAs) for autonomous AI agents to validate and improve complex task structures, policy logic, and agent evaluation frameworks. The role requires balancing quality assurance, research, and logical problem solving. No coding background is required, but curiosity and rigorous evaluation skills are essential. What You’ll Be Doing Review evaluation tasks and scenarios for logic, completeness, and realism Identify inconsistencies, missing assumptions, or unclear decision points Help define clear expected behaviors (gold standards) for AI agents Annotate cause‑effect relationships, reasoning paths, and plausible alternatives Think through complex systems and policies as a human would to ensure agents are tested properly Work closely with QA, writers, or developers to suggest refinements or edge‑case coverage How to Get Started Apply to this post, qualify, and get the chance to contribute on your own schedule. Shape the future of AI while building tools that benefit everyone. Requirements Excellent analytical thinking: Reason about complex systems, scenarios, and logical implications Strong attention to detail: Spot contradictions, ambiguities, and vague requirements Familiarity with structured data formats: Read, not necessarily write JSON/YAML Ability to assess scenarios holistically: Identify missing or unrealistic elements Good communication and clear writing (in English) to document findings We also value applicants who have: Experience with policy evaluation, logic puzzles, case studies, or structured scenario design Background in consulting, academia, olympiads, or research Exposure to LLMs, prompt engineering, or AI‑generated content Familiarity with QA or test‑case thinking (edge cases, failure modes, “what could go wrong”) Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.) Benefits Get paid for your expertise, with rates up to $17/hour depending on skills, experience, and project needs Flexible, remote, freelance project that fits around primary professional or academic commitments Advanced AI project with valuable experience to enhance your portfolio Influence how future AI models understand and communicate in your field of expertise Referrals increase your chances of interviewing at Mindrift by 2x. #J-18808-Ljbffr


  • AI Agent Evaluation Analyst

    2 semanas atrás


    Buenos Aires, Brasil Mindrift Tempo inteiro

    6 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets...

  • Product Analyst

    Há 6 dias


    Buenos Aires, Brasil Agent Careers Tempo inteiro

    Overview Job Title: Product Analyst (Academic Background) Compensation: $3,500- 4,500/month, (salary depending on experience) Location: Remote, LATAM Working hours: 9-5pm PST About The Company Our client is a neuroscience-based technology company dedicated to revolutionizing health behavior change through a positive, iterative approach to habit formation....

  • Freelance Product Manager

    2 semanas atrás


    Buenos Aires, Brasil Hogarth Tempo inteiro

    Join to apply for the Freelance Product Manager role at Hogarth . Hogarth is the Global Content Production Company, part of WPP, supporting brands such as Coca‑Cola, Ford, Rolex, Nestlé and Dyson. Our global team of 7,500+ craft and technology experts delivers engaging content across all channels and media, with a focus on speed, mass personalization and...

  • Offensive Security Analyst

    2 semanas atrás


    Buenos Aires, Brasil Alignerr Tempo inteiro

    Overview At Alignerr, we partner with the world’s leading AI research teams and labs to build and train cutting-edge AI models. This role focuses on structured adversarial reasoning rather than exploit development, modeling how threats move through systems, where defenses fail, and how risk propagates across modern environments. Organization Organization :...

  • Logistics Analyst

    1 semana atrás


    Buenos Aires, Brasil Pfizer Tempo inteiro

    Role Summary The Logistics Analyst, as a core member of the Market’s Logistics Organization and reporting to the Logistics Coordinator, is responsible for the coordination/execution of Logistics/Distribution operations in market, whether in‑house or LSP managed. She/he is responsible for end‑to‑end processes – Inbound, Storage, and Outbound...

  • Freelance Programme Manager

    1 semana atrás


    Buenos Aires, Brasil Hogarth Tempo inteiro

    Hogarth is the Global Content Production Company. Part of WPP, Hogarth partners with one in every two of the world’s top 100 brands including Coca-Cola, Ford, Rolex, Nestlé, Mondelez and Dyson. With a breadth of experience across an extensive range of sectors, Hogarth offers the unrivaled ability to deliver relevant, engaging, and measurable content...


  • Buenos Aires, Brasil Lendbuzz Tempo inteiro

    Overview This role is at Lendbuzz. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range: ARS60,906,721.50/yr - ARS67,672,970.00/yr. At Lendbuzz, we believe financial opportunity should be more personalized and fair. We develop innovative technologies that provide underserved and overlooked...


  • Buenos Aires, Brasil Canonical Tempo inteiro

    Join to apply for the Security Risk Management Specialist role at Canonical Join to apply for the Security Risk Management Specialist role at Canonical In security risk management we're looking to harness the power of industry best practice combined with driving new innovation on how we do security risk assessments and modelling. Our security risk management...

  • Desarrollador Comercial

    4 semanas atrás


    Buenos Aires, Brasil Monster Energy Tempo inteiro

    Position Summary/Resumen De PosiciónThe vision is to achieve the long-term strategic and tactical sales and distribution plan for the Monster brand portfolio in support of the Company’s business objectives. In the position you will be achieve long-term strategic and tactical sales and distribution plan for the Monster brand portfolio in support of the...


  • Buenos Aires / Bahia Blanca (ARG), Brasil Jobs at Vestas Tempo inteiro

    The Global Service LEAN team prioritizes enhancing a culture of continuous improvement and execution excellence within our Service Business. We coordinate Strategy Deployment, establish global concepts, and implement breakthrough improvements using methodologies such as Lean and Six Sigma.What You'll Do:Engage with our Service team across the country and...