IMPROVE LLM PERFORMANCE USING
RETRIEVAL AUGMENTED GENERATION
Satyam Sangwan
July 10th, 2024
What is Retrieval Augmented Generation?
An LLM’s knowledge is static
LLMs may have an insufficient “understanding” of niche and specialised
information that was not prominent in their training data.
One way we can mitigate these limitations is to augment a model with a specialised
and mutable knowledge base.
RAG does not fundamentally change how we use an LLM; it's still prompt-in and
response-out.
What is RAG?
RAG adds a retrieval step to this basic prompt-and-response process: based on the user's prompt, relevant information is retrieved from an external knowledge base and injected into the prompt before being passed to the LLM.
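A minimal sketch of this injection step, assuming a hypothetical retrieve_context helper and an illustrative prompt template (neither is from the original deck):

```python
# Minimal sketch of the RAG prompt-augmentation step.
# `retrieve_context` is a hypothetical placeholder standing in for the
# retriever described later in this deck.

def retrieve_context(query: str, knowledge_base: list[str], top_k: int = 3) -> list[str]:
    """Placeholder retriever: return the top_k most relevant items.
    A real system would rank by embedding similarity (see below)."""
    return knowledge_base[:top_k]

def build_augmented_prompt(query: str, knowledge_base: list[str]) -> str:
    context = "\n".join(retrieve_context(query, knowledge_base))
    # Retrieved passages are injected into the prompt before it is
    # passed to the LLM; this template wording is an assumption.
    return (
        "Use the following context to answer the question.\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )
```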
RAG is a flexible and (relatively) straightforward way to improve LLM-based systems.
How it works
A RAG system has two core components: a retriever and a knowledge base.
A retriever takes a user prompt and returns relevant items from a knowledge base. This typically works using so-called text embeddings: numerical representations of text in concept space.
With text embeddings, we can compute a similarity score between the user's query and each item in the knowledge base. The result of this process is a ranking of each item's relevance to the input query.
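One way to sketch such a retriever, assuming the open-source sentence-transformers library and the all-MiniLM-L6-v2 model (both illustrative choices, not prescribed by this deck), with made-up knowledge-base snippets:

```python
# Sketch of an embedding-based retriever: embed the query and every
# knowledge-base item, then rank items by cosine similarity.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice

# Toy knowledge base; contents are made up for this example.
knowledge_base = [
    "RAG augments an LLM with an external knowledge base.",
    "Text embeddings map text to vectors in concept space.",
    "Fine-tuning updates a model's weights on new data.",
]

query = "How does RAG give an LLM new knowledge?"

# Embed the query and every knowledge-base item.
kb_embeddings = model.encode(knowledge_base)
query_embedding = model.encode(query)

# Cosine similarity between the query and each item yields the ranking.
scores = util.cos_sim(query_embedding, kb_embeddings)[0]
ranked = sorted(zip(scores.tolist(), knowledge_base), reverse=True)
for score, item in ranked:
    print(f"{score:.3f}  {item}")
```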
What are Text Embeddings?
Text embeddings are numerical representations of text in concept space: semantically similar pieces of text map to nearby vectors.
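A toy illustration of the idea, using made-up three-dimensional vectors (real embeddings typically have hundreds of dimensions):

```python
# Toy illustration: embeddings are just vectors, and closeness in
# "concept space" is measured with cosine similarity.
# These 3-dimensional vectors are made up for illustration.
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

cat = np.array([0.9, 0.1, 0.3])       # hypothetical embedding of "cat"
kitten = np.array([0.85, 0.15, 0.35])  # hypothetical embedding of "kitten"
car = np.array([0.1, 0.9, 0.2])        # hypothetical embedding of "car"

print(cosine_similarity(cat, kitten))  # close to 1: similar concepts
print(cosine_similarity(cat, car))     # smaller: dissimilar concepts
```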
How it works
Knowledge base
This houses all the information you want to make available to the LLM.
Building it can be broken down into four key steps:
1. Load your source documents.
2. Chunk the documents into pieces small enough to inject into a prompt.
3. Embed each chunk with a text embedding model.
4. Store the chunks and their embeddings in a vector database.
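A minimal sketch of these four steps, reusing the sentence-transformers model from above; the file name, chunk size, and in-memory store are assumptions, with a plain list standing in for a real vector database:

```python
# Sketch of the four knowledge-base steps: load, chunk, embed, store.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice

# 1. Load the source documents (file name is an assumption).
with open("docs.txt", encoding="utf-8") as f:
    document = f.read()

# 2. Chunk them into pieces small enough to inject into a prompt.
chunk_size = 500  # characters; a real pipeline would usually chunk by tokens
chunks = [document[i:i + chunk_size] for i in range(0, len(document), chunk_size)]

# 3. Embed each chunk into concept space.
embeddings = model.encode(chunks)

# 4. Store chunk/embedding pairs (here a plain list stands in for a vector DB).
vector_store = list(zip(chunks, embeddings))
```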
References
https://www.shawhintalebi.com/
Thank you