
RAG Architect
Há 21 horas
Join to apply for the RAG Architect role at Perform
We are seeking a Senior AI Engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries to power autoscripting, onboarding search, and long-term knowledge reuse. As a technical leader, you will set the strategic direction, select cutting-edge models, and mentor AI and Automation Agent Engineers to deliver a scalable, secure, and innovative platform.
Key Responsibilities- Architect ingestion and retrieval layers, selecting loaders, chunking strategies (AST-aware for Java), embeddings (e.g., BGE-Code, mxbai), vector stores (e.g., Chroma), cross-encoder rerankers, and LangChain router chains.
- Design CI orchestrations, including daily Jenkins jobs for delta detection, image captioning (e.g., Qwen2-VL, LLaVA), cost/latency guardrails, and rollback strategies.
- Establish model and prompt governance, including prompt templates, few-shot libraries, safety filters, and evaluation rubrics (faithfulness, coverage, compile success).
- Lead architecture for a UI onboarding tool, deciding on hosting (FastAPI + React or Streamlit MVP), SSO/auth flows, token streaming, and feedback mechanisms for continuous learning.
- Oversee data security and compliance, embedding privacy policies, source citations, audit logs, and ensuring Confluence/Qtest credentials are managed in Secrets Manager.
- Provide technical leadership by reviewing PRs, setting code quality standards, and conducting architecture workshops for AI and Automation Agent Engineers.
- 6–8 years of experience building data or ML platforms, with at least 2 years deploying LLM/RAG systems in production.
- Deep expertise in LangChain, ChromaDB, Qdrant, or pgvector, and cross-encoder rerankers.
- Strong proficiency in Python (FastAPI or Flask) and ability to analyze Java codebases for chunking boundaries.
- Proven experience designing CI/CD pipelines (Jenkins, GitHub Actions) with delta builds and artifact promotion.
- Hands-on experience managing OpenAI/Anthropic API keys or self-hosting large models.
- Demonstrated expertise in security and compliance, including PII protection, role-based access, and secret rotation.
We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr-
RAG Architect
Há 3 dias
Buenos Aires, Espírito Santo, Brazil Perform Tempo inteiroJoin to apply for the RAG Architect role at PerformWe are seeking a Senior AI Engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries to power autoscripting, onboarding search, and...
-
Chief RAG Innovation Officer
Há 2 dias
Buenos Aires, Espírito Santo, Brazil beBeeRetrieval Tempo inteiro R$90.000 - R$120.000RAG Architect Opportunity">We're seeking a senior AI engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries to power autoscripting, onboarding search, and long-term knowledge...
-
Enterprise AI Solution Developer
Há 5 dias
Buenos Aires, Brasil beBeeSoftware Tempo inteiro US$120.000 - US$180.000GenAI Solution ArchitectWe are seeking an experienced architect to design and develop robust GenAI solutions using prompt engineering, RAG, and custom pipelines. This is a high-impact role supporting an AI Center of Excellence (CoE).Key Responsibilities:Design and develop GenAI solutions using prompt engineering, RAG, and custom pipelines.Develop...
-
Transformative AI Leader
Há 13 horas
Buenos Aires, Brasil beBeeArchitect Tempo inteiro R$100.000 - R$150.000Senior AI Engineer - RAG ArchitectWe are seeking a highly skilled Senior AI Engineer to lead the design and implementation of an end-to-end Retrieval-Augmented Generation (RAG) architecture. This role will drive the ingestion of various data sources, including GitHub repositories, Confluence pages, Qtest artifacts, PRDs, and script libraries, to power...
-
Advanced AI Solution Architect
Há 24 horas
Buenos Aires, Brasil beBeegenerative Tempo inteiro R$90.000 - R$120.000We are seeking an experienced Generative AI Engineer to support enterprise-wide AI initiatives. This role involves building robust GenAI and agentic AI workflows, automating processes, and working across platforms like AWS, Microsoft 365, MS Copilot, and other GenAI platforms and libraries.Key Responsibilities:Design and develop GenAI solutions using prompt...
-
Chief Artificial Intelligence Architect
Há 5 dias
Buenos Aires, Brasil beBeeMachineLearning Tempo inteiro US$120.000 - US$160.000Job SummaryWe are seeking a highly skilled Senior Machine Learning Engineer to lead our Gen-AI project team.The successful candidate will be responsible for effective communication with stakeholders, active participation in pre-sales and discovery phases of projects, and the development of large language models (LLMs) using LangChain, LlamaIndex, Chain of...
-
Technical Expert and Innovator
Há 5 dias
Buenos Aires, Brasil beBeeSoftware Tempo inteiro US$100.000 - US$150.000Job TitleSoftware Architect and EngineerJob Description:As a seasoned technology professional, you will play a key role in designing and delivering market-leading technology products. Your expertise will be utilized to create innovative solutions that meet the needs of our customers.Responsibilities:Design and develop software solutions using innovative...
-
Buenos Aires, Brasil Google Tempo inteiroCloud AI Consultant, Professional Services, Google Cloud (English) Join to apply for the Cloud AI Consultant, Professional Services, Google Cloud (English) role at Google Cloud AI Consultant, Professional Services, Google Cloud (English) 6 days ago Be among the first 25 applicants Join to apply for the Cloud AI Consultant, Professional Services, Google...
-
Backend Developer – GenAI Projects
2 semanas atrás
Buenos Aires, Brasil Kyndryl Tempo inteiroJoin to apply for the Backend Developer – GenAI Projects role at Kyndryl Join to apply for the Backend Developer – GenAI Projects role at Kyndryl Get AI-powered advice on this job and more exclusive features. Who We AreAt Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So...