RAG Architect

3 semanas atrás


Buenos Aires, Brasil Perform Tempo inteiro

Join to apply for the RAG Architect role at Perform

We are seeking a Senior AI Engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries to power autoscripting, onboarding search, and long-term knowledge reuse. As a technical leader, you will set the strategic direction, select cutting-edge models, and mentor AI and Automation Agent Engineers to deliver a scalable, secure, and innovative platform.

Key Responsibilities
  • Architect ingestion and retrieval layers, selecting loaders, chunking strategies (AST-aware for Java), embeddings (e.g., BGE-Code, mxbai), vector stores (e.g., Chroma), cross-encoder rerankers, and LangChain router chains.
  • Design CI orchestrations, including daily Jenkins jobs for delta detection, image captioning (e.g., Qwen2-VL, LLaVA), cost/latency guardrails, and rollback strategies.
  • Establish model and prompt governance, including prompt templates, few-shot libraries, safety filters, and evaluation rubrics (faithfulness, coverage, compile success).
  • Lead architecture for a UI onboarding tool, deciding on hosting (FastAPI + React or Streamlit MVP), SSO/auth flows, token streaming, and feedback mechanisms for continuous learning.
  • Oversee data security and compliance, embedding privacy policies, source citations, audit logs, and ensuring Confluence/Qtest credentials are managed in Secrets Manager.
  • Provide technical leadership by reviewing PRs, setting code quality standards, and conducting architecture workshops for AI and Automation Agent Engineers.
Must-Have Qualifications
  • 6–8 years of experience building data or ML platforms, with at least 2 years deploying LLM/RAG systems in production.
  • Deep expertise in LangChain, ChromaDB, Qdrant, or pgvector, and cross-encoder rerankers.
  • Strong proficiency in Python (FastAPI or Flask) and ability to analyze Java codebases for chunking boundaries.
  • Proven experience designing CI/CD pipelines (Jenkins, GitHub Actions) with delta builds and artifact promotion.
  • Hands-on experience managing OpenAI/Anthropic API keys or self-hosting large models.
  • Demonstrated expertise in security and compliance, including PII protection, role-based access, and secret rotation.

We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr
  • RAG Architect

    1 semana atrás


    Buenos Aires, Brasil Perform Tempo inteiro

    Join to apply for the RAG Architect role at Perform We are seeking a Senior AI Engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries to power autoscripting, onboarding search,...

  • RAG Architect

    3 semanas atrás


    Buenos Aires, Espírito Santo, Brazil Perform Tempo inteiro

    Join to apply for the RAG Architect role at PerformWe are seeking a Senior AI Engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries to power autoscripting, onboarding search, and...

  • Chief Data Architect

    2 semanas atrás


    Buenos Aires, Brasil beBeeData Tempo inteiro US$120.000 - US$150.000

    About Our TeamWe are seeking a seasoned software engineer to collaborate with our AI product development team in creating cutting-edge, user-centric products. The primary goal is to integrate large language models (LLMs), retrieval-augmented generation (RAG), and cloud-native infrastructure to deliver intelligent business insights.


  • Buenos Aires, Brasil beBeeEngineer Tempo inteiro R$100.000 - R$125.000

    Senior AI Engineer LeadWe're building a cutting-edge Retrieval-Augmented Generation (RAG) architecture to unlock community knowledge in a new way. As a Senior AI Engineer Lead, you will lead the design and implementation of this innovative solution.Key ResponsibilitiesArchitect ingestion and retrieval layers, selecting loaders, chunking strategies (AST-aware...


  • Buenos Aires, Brasil Google Tempo inteiro

    Cloud AI Consultant, Professional Services, Google Cloud (English) Join to apply for the Cloud AI Consultant, Professional Services, Google Cloud (English) role at Google Cloud AI Consultant, Professional Services, Google Cloud (English) 6 days ago Be among the first 25 applicants Join to apply for the Cloud AI Consultant, Professional Services, Google...


  • Buenos Aires, Brasil Kyndryl Tempo inteiro

    Join to apply for the Backend Developer – GenAI Projects role at Kyndryl Join to apply for the Backend Developer – GenAI Projects role at Kyndryl Get AI-powered advice on this job and more exclusive features. Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So...