Course Code: ragbspk
Duration: 21 hours
Prerequisites:

Audience

  • IT professionals 
Overview:

By the end of this training, participants will be able to:

  • implement a Retrieval-Augmented Generation (RAG) workflow for querying a private knowledge base.


Course Outline:
A. Background of Knowledge Base Queries
  1. Overview of textual data search for private knowledge base queries, covering RAG architecture, vector databases and Large Language Models (LLMs)
  2. Key components, their relationships and process

B. Setup and Configuration of LLM
  1. Overview of different on-premises LLMs
  2. Installation of Python, Hugging Face libraries, LlamaIndex, Mistral Large 2 (or a similar model that works with LlamaIndex) and other essential libraries on the local machine
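A typical local setup for B.2 might be installed as follows. These package names are assumptions to be checked against the current PyPI listings, and the exact set of integration packages depends on the LlamaIndex version in use.

```shell
# Core RAG orchestration framework
pip install llama-index
# Hugging Face libraries for running models locally
pip install transformers torch
# LlamaIndex integration for Hugging Face embedding models (assumed package name)
pip install llama-index-embeddings-huggingface
```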

C. Setup of Knowledge Base
  1. Introduction to document loaders and embedding models
  2. Programming exercise: loading documents
  3. Programming exercise: chunking documents
  4. Programming exercise: transforming chunks into embeddings, using BGE-EN-ICL or a similar model that works with LlamaIndex
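The chunking and embedding steps in C.3 and C.4 can be sketched in a library-agnostic way. The window-with-overlap chunker below is a minimal illustration, and `toy_embed` is a hypothetical stand-in for a real embedding model such as BGE-EN-ICL served through LlamaIndex.

```python
from typing import List

def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> List[str]:
    """Split text into overlapping character windows (C.3)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
        if start + chunk_size >= len(text):
            break
    return chunks

def toy_embed(text: str, dim: int = 8) -> List[float]:
    """Toy stand-in for a real embedding model (C.4): maps text to a
    unit-length vector by bucketing character codes."""
    vec = [0.0] * dim
    for i, ch in enumerate(text):
        vec[i % dim] += ord(ch)
    norm = sum(v * v for v in vec) ** 0.5 or 1.0
    return [v / norm for v in vec]

doc = "RAG combines retrieval over a private knowledge base with LLM generation. " * 5
chunks = chunk_text(doc)
embeddings = [toy_embed(c) for c in chunks]
```

The overlap keeps sentences that straddle a chunk boundary retrievable from both sides; in practice the chunk size is tuned to the embedding model's input limit.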

D. Setup and Configuration of Vector Database (VDB)
  1. Overview of VDB options for a Microsoft Windows Server environment
  2. Installation of PostgreSQL with pgvector, or a similar VDB that runs on Microsoft Windows
  3. Programming exercise: loading vector embeddings into the VDB
  4. Programming exercise: populating and updating the VDB for new documents
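The load-and-update pattern in D.3 and D.4 can be illustrated with a hypothetical in-memory stand-in for the VDB. With pgvector the same upsert would typically be an `INSERT ... ON CONFLICT DO UPDATE` against a `vector` column; the class below only mimics that behaviour for teaching purposes.

```python
from typing import Dict, List, Tuple

class InMemoryVectorStore:
    """Minimal stand-in for a vector database such as pgvector."""

    def __init__(self) -> None:
        # doc_id -> (embedding, original chunk text)
        self._rows: Dict[str, Tuple[List[float], str]] = {}

    def upsert(self, doc_id: str, embedding: List[float], text: str) -> None:
        """Insert a new row, or overwrite an existing one (D.3/D.4)."""
        self._rows[doc_id] = (embedding, text)

    def __len__(self) -> int:
        return len(self._rows)

    def search(self, query: List[float], k: int = 3) -> List[Tuple[str, float]]:
        """Return the k nearest rows by cosine similarity."""
        def cos(a: List[float], b: List[float]) -> float:
            dot = sum(x * y for x, y in zip(a, b))
            na = sum(x * x for x in a) ** 0.5
            nb = sum(y * y for y in b) ** 0.5
            return dot / (na * nb) if na and nb else 0.0
        scored = [(doc_id, cos(query, emb)) for doc_id, (emb, _) in self._rows.items()]
        return sorted(scored, key=lambda s: s[1], reverse=True)[:k]

store = InMemoryVectorStore()
store.upsert("doc-1", [1.0, 0.0], "chunk about vector databases")
store.upsert("doc-1", [0.9, 0.1], "updated chunk")  # update, not a duplicate row
store.upsert("doc-2", [0.0, 1.0], "chunk about LLMs")
```

Upserting by a stable document ID is what lets the VDB be refreshed for new or revised documents without accumulating stale duplicates.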

E. RAG Workflow
  1. Programming exercise: setting up the query
  2. Programming exercise: turning the query into embeddings
  3. Programming exercise: retrieving relevant chunks
  4. Programming exercise: creating a prompt from the chunks and passing it to the LLM
  5. Programming exercise: generating the response with the LLM
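Steps E.1 to E.5 can be sketched end to end. The embedder and similarity ranking below are toy stand-ins (a real pipeline would embed with BGE-EN-ICL and use the VDB's nearest-neighbour search), and the final LLM call is left as a comment since it needs a running model.

```python
from typing import List

def embed(text: str, dim: int = 8) -> List[float]:
    """Toy embedder standing in for BGE-EN-ICL (E.2)."""
    vec = [0.0] * dim
    for i, ch in enumerate(text.lower()):
        vec[i % dim] += ord(ch)
    norm = sum(v * v for v in vec) ** 0.5 or 1.0
    return [v / norm for v in vec]

def cosine(a: List[float], b: List[float]) -> float:
    return sum(x * y for x, y in zip(a, b))  # vectors are already unit length

def retrieve(query: str, corpus: List[str], k: int = 2) -> List[str]:
    """E.3: rank chunks by similarity to the query embedding."""
    q = embed(query)
    return sorted(corpus, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

def build_prompt(question: str, context_chunks: List[str]) -> str:
    """E.4: assemble the retrieved chunks and the question into one prompt."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

corpus = [
    "pgvector stores embeddings in PostgreSQL.",
    "Mistral Large 2 is a large language model.",
    "LlamaIndex orchestrates RAG pipelines.",
]
question = "What stores embeddings?"                 # E.1
prompt = build_prompt(question, retrieve(question, corpus))
# E.5: pass the prompt to the LLM, e.g. llm.complete(prompt) with a
# LlamaIndex-wrapped model such as Mistral Large 2.
```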

F. Testing and Optimising
  1. Testing RAG workflow
  2. Optimising retrieval and generation performance
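One simple way to quantify F.1 and F.2 is retrieval hit rate: the fraction of test queries whose known-relevant chunk appears in the top-k results. The retriever and test set below are hypothetical placeholders for illustration.

```python
from typing import Callable, Dict, List

def hit_rate(retriever: Callable[[str, int], List[str]],
             test_set: Dict[str, str], k: int = 3) -> float:
    """Fraction of queries whose expected chunk is among the top-k results."""
    hits = 0
    for query, expected_chunk in test_set.items():
        if expected_chunk in retriever(query, k):
            hits += 1
    return hits / len(test_set)

corpus = ["chunk A", "chunk B", "chunk C"]
# Placeholder retriever that always returns the same ranking.
dummy_retriever = lambda query, k: corpus[:k]
score = hit_rate(dummy_retriever, {"q1": "chunk A", "q2": "chunk C"}, k=2)
```

Re-running the same metric after changing chunk size, overlap, embedding model or k makes the effect of each tuning choice directly comparable.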

G. Hardware Requirements
  1. Overview of hardware requirements for the whole setup

Models

(a) LLM: Mistral Large 2, or a similar model that works with LlamaIndex

(b) VDB: PostgreSQL with pgvector, or a similar VDB that can be installed on Microsoft Windows

(c) Embedding model: BGE-EN-ICL, or a similar model that works with LlamaIndex

(d) LlamaIndex