AI Engineer

4 weeks ago


Guarulhos, São Paulo, Brazil · Totalperform · Full-time

About the Role

We're hiring an AI Engineer to design and build a production-grade RAG platform that powers our test autoscripting agent. This platform ingests our QA codebase and documentation, transforms them into embeddings, and serves relevant context (page objects, fixtures, helpers, examples) via a retrieval API, enabling high-quality LLM-generated tests. You'll own everything from ingestion to evaluation, including keeping the index fresh via Jenkins and optimizing for token cost and latency.

This role is ideal for someone who thrives at the intersection of LLM tooling, backend engineering, and developer productivity.

What You'll Do

  • Build and maintain a local RAG platform, including:
    • Loaders for Git, Confluence, Drive.
    • Code-aware chunking (AST/semantic) and embedding pipelines.
    • Vector indexing in ChromaDB with metadata and reranking.
    • FastAPI (or similar) retrieval service for the autoscripting agent.
  • Implement metadata filters (e.g., layer=page-object|fixture|helper|test, Git SHA, feature tags) and import-based neighbor expansion to optimize context.
  • Optimize for cost and performance: tune k values, context lengths, reranker thresholds, and cache frequent retrievals.
  • Build retrieval evaluation and telemetry: track recall, faithfulness, token usage, compile success of generated code, and wire alerts into Jenkins CI.
  • Manage access to Claude 4 Sonnet and other model APIs; help deploy self-hosted endpoints if needed (keys, quotas, audit logs).
  • Write runbooks and train the SDET team on how to use and troubleshoot the RAG system.

Tech Stack (Initial Plan)

LLM orchestration: Claude 4 Sonnet

Embeddings: mxbai-embed-large-v1 (text), bge-code-base (code)

Reranker: mxbai-rerank-base-v2

Vector store: ChromaDB (local)

Pipeline orchestration: LangChain (router by MIME/type)

Scheduling: Jenkins (daily delta ingestion)

Retrieval API: FastAPI

Evaluation: Telemetry + basic metrics (compile/run, cost, retrieval quality)

What You Bring

  • 4+ years in ML/AI or platform-oriented backend engineering, including 2+ years building LLM or RAG applications.
  • Strong experience with LangChain, vector DBs (ChromaDB, Qdrant, pgvector), and code-aware embeddings (BGE-code or similar).
  • Solid Python skills (FastAPI or Flask) and comfort reading Java to inform chunking and context design.
  • Experience with Jenkins, secrets management, and basic observability tooling (Grafana, Prometheus, LangSmith, or RAGAS).
  • Comfortable working with OpenAI/Anthropic APIs or deploying self-hosted endpoints, including handling keys, rate limits, and safety controls.

It is an asset if you have:

  • Experience with Claude-specific practices, structured prompting, and cost control techniques.
  • Familiarity with retrieval evaluation tools like RAGAS or LangChain Evaluators, plus A/B testing for prompt or routing strategies.
  • Understanding of security and compliance for developer-facing AI tools (PII handling, audit logging).

Collaboration & Role Scope

  • The SDET team focuses on test quality and final review of autoscripted code.
  • The Automation Agent Engineer tunes prompts and retrieval logic.
  • You own the RAG platform: indexing, retrieval quality, LLM orchestration, and CI integration.