AI Research Engineer

Há 6 dias


Brasil Tether Tempo inteiro

AI Research Engineer (Model Evaluation) at Tether.io

About the job: As a member of our AI model team, you will drive innovation across the entire AI lifecycle by developing and implementing rigorous evaluation frameworks and benchmark methodologies for pre-training, post-training, and inference. Your work will focus on designing metrics and assessment strategies that ensure our models are highly responsive, efficient, and reliable across real-world applications. You will work on a wide spectrum of systems, from resource-efficient models designed for limited hardware environments to complex, multi-modal architectures that integrate text, images, and audio.

You will:

  • Develop, test, and deploy integrated frameworks that rigorously assess models during pre-training, post-training, and inference. Define and track key performance indicators such as accuracy, loss metrics, latency, throughput, and memory footprint across diverse deployment scenarios.
  • Curate high-quality evaluation datasets and design standardized benchmarks to reliably measure model quality and robustness. Ensure benchmarks reflect improvements from both pre-training and post-training processes, driving consistency in evaluation practices.
  • Engage with product management, engineering, data science, and operations to align evaluation metrics with business objectives. Present evaluation findings and recommendations through dashboards and reports supporting decision-making.
  • Analyze evaluation data to identify bottlenecks across the model lifecycle. Propose and implement optimizations that enhance performance, scalability, and resource utilization on resource-constrained platforms.
  • Conduct iterative experiments and empirical research to refine evaluation methodologies, staying abreast of emerging techniques and trends to improve benchmarking practices and model reliability.
  • Collaborate with cross-functional teams to share evaluation findings and integrate stakeholder feedback. Build robust evaluation pipelines and performance dashboards that drive continuous improvement in model deployment strategies.
Qualifications
  • A degree in Computer Science or related field; ideally PhD in NLP, Machine Learning, or related field, with a solid track record in AI R&D (publications in A* conferences).
  • Experience designing and evaluating AI models across pre-training, post-training, and inference stages. Proficiency in developing evaluation frameworks that assess accuracy, convergence, loss improvements, and robustness.
  • Strong programming skills with hands-on expertise in evaluation benchmarks and frameworks. Experience building, automating, and scaling complex evaluation and benchmarking pipelines. Familiarity with metrics such as latency, throughput, and memory footprint.
  • Proven ability to conduct iterative experiments and empirical research to refine evaluation methodologies and stay updated with emerging trends.
  • Experience collaborating with product, engineering, and operations teams to align evaluation strategies with organizational goals and translate technical findings into actionable insights.

Employment type: Full-time

Job function: Information Technology

Industries: Technology, Information and Internet

#J-18808-Ljbffr
  • AI Research Manager

    2 semanas atrás


    Brasil Articul8 AI Tempo inteiro US$125.000 - US$175.000 por ano

    About us:At Articul8 AI, we relentlessly pursue excellence and create exceptional AI products that exceed customer expectations. We are a team of dedicated individuals who take pride in our work and strive for greatness in every aspect of our business. We believe in using our advantages to make a positive impact on the world and inspiring others to do the...


  • Brasil AnswerThis Tempo inteiro

    Location: Remote (Applications open worldwide) Compensation: $20,000 – 40,000 / year (based on experience and scope of ownership) Skills: Semantic Search, Vector Databases, Prompt Engineering, Gen AI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs About Answer This Answer This is an AI-powered research platform built to...

  • AI Systems Designer

    Há 7 dias


    Brasil beBeeArtificial Tempo inteiro

    Full-Stack AI Engineer Position Pioneering Enterprise AI for Research and Workflows is a challenging opportunity that we are seeking to fulfill with the help of an experienced Full-Stack AI Agent Engineer. Key Responsibilities: Designing, building, and deploying large and complex maintainable AI apps with consistent production-quality features in days is...

  • AI Systems Designer

    2 semanas atrás


    Brasil beBeeArtificial Tempo inteiro R$150.000 - R$250.000

    Full-Stack AI Engineer PositionPioneering Enterprise AI for Research and Workflows is a challenging opportunity that we are seeking to fulfill with the help of an experienced Full-Stack AI Agent Engineer.Key Responsibilities:Designing, building, and deploying large and complex maintainable AI apps with consistent production-quality features in days is...


  • Brasil Komo Tempo inteiro

    Early-stage startup (ex-Google, Stanford) seeks extremely talented full-time contractor for full-stack AI Agent engineer to help pioneer Enterprise AI for Research and Workflows. You must have 2+ years experience designing and building large and complex (yet maintainable) AI apps, and you consistently ship production-quality features in days, not weeks. You...

  • Full-Stack Ai Agent Engineer

    2 semanas atrás


    Brasil Komo Tempo inteiro

    Early-stage startup (ex-Google, Stanford) seeks extremely talented full-time contractor for full-stack AI Agent engineer to help pioneer Enterprise AI for Research and Workflows.You must have 2+ years experience designing and building large and complex (yet maintainable) AI apps, and you consistently ship production-quality features in days, not weeks.You...

  • Ai Systems Designer

    Há 6 dias


    Brasil Bebeeartificial Tempo inteiro

    Full-Stack AI Engineer PositionPioneering Enterprise AI for Research and Workflows is a challenging opportunity that we are seeking to fulfill with the help of an experienced Full-Stack AI Agent Engineer.Key Responsibilities:Designing, building, and deploying large and complex maintainable AI apps with consistent production-quality features in days is...


  • Brasil AnswerThis Tempo inteiro

    Location: Remote (Applications open worldwide) Compensation: $20,000 – 40,000 / year (based on experience and scope of ownership) Skills: Semantic Search, Vector Databases, Prompt Engineering, GenAI Frameworks, React Agents, Graph Agents, Document Parsing, Python, Scalable APIs About AnswerThis AnswerThis is an AI-powered research platform built to...

  • AI Solutions Engineer

    2 semanas atrás


    Brasil HireHawk Tempo inteiro

    Overview AI Solutions Engineer: Location: Remote (US Time Zones Preferred- PST). Type: Full‑Time Contractor (40hours/week). Start Date: ASAP. Reports to: Head of Growth. About HireHawk HireHawk is on a mission to disrupt the global outsourcing and recruiting space through AI‑powered automation and intelligent systems. Built by a leadership team that...


  • Brasil Menlo Ventures Tempo inteiro

    About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role As a...