
RAG

The document discusses the stages and key concepts in RAG, including loading, indexing, storing, querying, and evaluation. It also covers application types such as query engines, chat engines, and agents. Techniques and tools for data ingestion, chunk size optimization, metadata filtering, and fine-tuning embeddings are presented, and challenges around missing data, ranking, consolidation, and formatting are addressed.

Uploaded by

Rakesh Shindhe

Stages in RAG:

1. Loading:
Import your data (text files, PDFs, databases, APIs) using LlamaHub's extensive range of connectors.
2. Indexing:
Create searchable data structures, primarily through vector embeddings and metadata strategies, enabling efficient context retrieval.
3. Storing:
Securely store your indexed data and metadata for quick access without the need to re-index.
4. Querying:
Utilize LLMs and LlamaIndex data structures for diverse querying techniques, including sub-queries and hybrid strategies.
5. Evaluation:
Continuously assess the effectiveness of your pipeline to ensure accuracy, faithfulness, and response speed.

Key Concepts:

1. Nodes and Documents:
Fundamental units in LlamaIndex, where Documents encapsulate data sources and Nodes represent data "chunks" with associated metadata.
2. Connectors:
Bridge various data sources into the RAG framework, transforming them into Nodes and Documents.
3. Indexes:
The backbone of RAG, enabling the storage of vector embeddings in a vector store along with crucial metadata.
4. Embeddings:
Numerical representations of data, facilitating the relevance filtering process.
5. Retrievers:
Define efficient retrieval strategies, ensuring the relevancy and efficiency of data retrieval.
6. Routers:
Manage the selection of appropriate retrievers based on query specifics and metadata.
7. Node Postprocessors:
Apply transformations or re-ranking logic to refine the set of retrieved nodes.
8. Response Synthesizers:
Craft responses from the LLM, utilizing user queries and retrieved text chunks for enriched answers.

Application Types:

1. Query Engines:
For direct question-answering over your data.
2. Chat Engines:
Enable conversations with your data for an interactive experience.
3. Agents:
Automated decision-makers that interact with external tools, adaptable for complex tasks.
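The loading → indexing → storing → querying flow above can be sketched as a toy pipeline. This is a standard-library illustration, not the LlamaIndex API: the "embedding" is a bag-of-words counter standing in for a real embedding model, and all names (embed, cosine, query) are made up for the example.

```python
# Toy end-to-end sketch of the RAG stages: load documents, "embed" them,
# keep the vectors in an in-memory index, then rank chunks against a query.
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    """Bag-of-words 'embedding' -- a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# 1. Loading: documents from any source (files, APIs, databases).
documents = [
    "LlamaIndex connectors load data from files and APIs.",
    "Vector stores hold embeddings together with metadata.",
    "Chat engines enable conversations with your data.",
]

# 2. Indexing + 3. Storing: embed each chunk and keep it with its vector.
index = [(doc, embed(doc)) for doc in documents]

# 4. Querying: embed the question and rank chunks by similarity.
def query(question: str, top_k: int = 1) -> list[str]:
    q = embed(question)
    ranked = sorted(index, key=lambda pair: cosine(q, pair[1]), reverse=True)
    return [doc for doc, _ in ranked[:top_k]]

print(query("conversations with my data"))
```

Stage 5 (evaluation) would wrap this loop in relevance and faithfulness checks rather than live inside it.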

Diagram credit: LangChain

Steve Nouri
Indexing:

https://github.com/langchain-ai/rag-from-scratch/blob/main/rag_from_scratch_1_to_4.ipynb
Generation:

https://github.com/langchain-ai/rag-from-scratch/blob/main/rag_from_scratch_1_to_4.ipynb
Multi Query:

https://python.langchain.com/docs/modules/data_connection/retrievers/MultiQueryRetriever
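The multi-query idea can be sketched without LangChain: an LLM generates several rewrites of the question, each rewrite is retrieved independently, and the deduplicated union becomes the context. Here the LLM rewrites are passed in as a fixed list and the retriever is a keyword stub; every name is illustrative, not the MultiQueryRetriever API.

```python
# Multi-query retrieval sketch: run each rewrite through the retriever and
# take the unique union of results, preserving first-seen order.

def retrieve(query: str) -> list[str]:
    # Stand-in retriever: naive keyword match over a tiny corpus.
    corpus = [
        "Task decomposition splits a problem into sub-problems.",
        "Agents plan by breaking tasks into steps.",
        "Vector stores index embeddings for similarity search.",
    ]
    return [d for d in corpus if any(w in d.lower() for w in query.lower().split())]

def multi_query_retrieve(question: str, rewrites: list[str]) -> list[str]:
    seen, union = set(), []
    for q in [question] + rewrites:
        for doc in retrieve(q):
            if doc not in seen:          # dedupe across queries
                seen.add(doc)
                union.append(doc)
    return union

docs = multi_query_retrieve(
    "How does task decomposition work?",
    rewrites=["What is task splitting?", "How do agents break down tasks?"],
)
```

The payoff is recall: documents that match only one phrasing of the question still make it into the final context.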

RAG-Fusion:

https://github.com/langchain-ai/langchain/blob/master/cookbook/rag_fusion.ipynb
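RAG-Fusion merges the ranked lists produced by multiple query rewrites with Reciprocal Rank Fusion (RRF): each document scores the sum of 1/(k + rank) over the lists it appears in. A minimal sketch (k = 60 is the smoothing constant commonly used in the RAG-Fusion cookbook; the function name is ours):

```python
# Reciprocal Rank Fusion: documents that appear in several rankings
# accumulate score across lists and float to the top of the fused order.

def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# "B" is mid-ranked in both lists yet beats documents that top only one
# list -- the typical RRF effect.
fused = reciprocal_rank_fusion([["A", "B", "C"], ["D", "B", "E"]])
```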

Decomposition:

https://arxiv.org/pdf/2205.10625.pdf
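Decomposition in the least-to-most style answers sub-questions in order, feeding each answer into the prompt for the next. A sketch with the LLM stubbed by a lookup table; in practice both the decomposition and the answering are LLM calls, and all names here are invented for the example:

```python
# Query decomposition sketch: sequentially answer sub-questions, carrying
# earlier answers forward as context for later ones.

def stub_llm(prompt: str) -> str:
    """Stand-in for an LLM call: canned answers for known sub-questions."""
    answers = {
        "How many chunks fit in the context window?": "About 10 chunks fit.",
        "Which chunks are most relevant?": "The top-ranked chunks by similarity.",
    }
    for question, answer in answers.items():
        if question in prompt:
            return answer
    return "Final answer synthesized from: " + prompt

def decompose_and_answer(sub_questions: list[str], final_question: str) -> str:
    context = ""
    for sq in sub_questions:
        context += stub_llm(context + sq) + " "   # answer using prior answers
    return stub_llm(context + final_question)

result = decompose_and_answer(
    ["How many chunks fit in the context window?",
     "Which chunks are most relevant?"],
    "So which chunks should we send?",
)
```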
Step Back:

https://arxiv.org/pdf/2310.06117.pdf

HyDE:

https://arxiv.org/abs/2212.10496
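HyDE (Hypothetical Document Embeddings) retrieves with the embedding of an LLM-drafted hypothetical answer rather than the query itself, since a fake answer is often closer in embedding space to real answers than the short question is. A toy sketch, assuming a stubbed generator and the same bag-of-words embedding stand-in as above; none of these names come from the paper:

```python
# HyDE sketch: generate a hypothetical answer, embed *that*, then retrieve.
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = lambda c: math.sqrt(sum(v * v for v in c.values()))
    return dot / (norm(a) * norm(b)) if a and b else 0.0

def stub_generator(query: str) -> str:
    # Stand-in for the LLM that drafts a hypothetical answer to the query.
    return "Reciprocal rank fusion merges ranked lists by summing rank scores."

corpus = [
    "Reciprocal rank fusion merges multiple ranked lists into one.",
    "Chunk overlap keeps sentences intact across boundaries.",
]

def hyde_retrieve(query: str) -> str:
    hypothetical = stub_generator(query)   # generate first, retrieve second
    q_vec = embed(hypothetical)            # embed the fake answer, not the query
    return max(corpus, key=lambda d: cosine(q_vec, embed(d)))

best = hyde_retrieve("How do I combine rankings?")
```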
Techniques and Tools:

1. Data Ingestion and Querying:
Using tools like LlamaIndex for processing and querying data from various sources into the model's prompt.
2. Chunk Size Optimization:
Adjusting the size of data chunks for efficient processing and retrieval, improving response quality.
3. Metadata Filtering:
Enhancing retrieval by adding structured context to data, utilizing vector database capabilities for more relevant results.
4. Fine-Tuning Embeddings:
Customizing embedding models to better match query context with relevant data, improving precision and recall.
5. Advanced Retrieval Algorithms:
Implementing sophisticated retrieval methods like recursive retrieval and parent-child chunk retrieval to enhance context understanding and response accuracy.

Challenges and Solutions:

Missing Data:
Addressed by expanding the document corpus or integrating external knowledge bases.
Ranking Issues:
Overcome by using advanced retrieval techniques like rerankers.
Consolidation Issues:
Solved by employing strategies that ensure relevant documents are included in the final context.
Formatting Issues:
Addressed by ensuring the system correctly interprets and responds to format-specific queries.
Incorrect Specifics and Incomplete Answers:
Mitigated by adjusting the detail level of responses to match user queries.
Extraction Challenges:
Overcome by refining the system's ability to accurately extract information from the selected context.
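Chunk size optimization usually comes down to two knobs: chunk size and chunk overlap, where overlap keeps context that straddles a boundary present in both neighboring chunks. A simplified sliding-window splitter (counting whitespace words rather than model tokens, which real splitters use; the function name is ours):

```python
# Sliding-window text splitter: the two tuning knobs are chunk_size and
# chunk_overlap. Overlapping windows repeat boundary words in both chunks.

def split_text(text: str, chunk_size: int, chunk_overlap: int) -> list[str]:
    if chunk_overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk size")
    words = text.split()
    step = chunk_size - chunk_overlap      # how far the window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break                          # last window already covers the tail
    return chunks

parts = split_text("one two three four five six seven eight",
                   chunk_size=4, chunk_overlap=1)
```

Note how "four" and "seven" appear in two chunks each: that duplication is the price paid so a sentence crossing a boundary is never split away from its context.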

Self-RAG
Self-reflection can enhance RAG, enabling correction of poor-quality retrievals or generations.

https://arxiv.org/abs/2310.11511

Corrective RAG
Corrective-RAG (CRAG) is a recent paper that introduces an interesting approach for self-reflective RAG.

https://arxiv.org/pdf/2401.15884.pdf
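The corrective pattern CRAG describes can be sketched as: grade each retrieved document for relevance, keep the ones that pass, and fall back to a secondary source (CRAG uses web search) when nothing does. Here the grader is a keyword stub and the fallback is a placeholder string; in Self-RAG and CRAG the grading is done by an LLM or a trained critic, and all names are invented for the example:

```python
# Corrective retrieval sketch: filter retrieved documents through a
# relevance grade, with a fallback source when nothing survives.

def stub_grade(query: str, doc: str) -> bool:
    """Stand-in relevance grader: any query word appears in the document."""
    return any(word in doc.lower() for word in query.lower().split())

def fallback_search(query: str) -> list[str]:
    return [f"(web result for: {query})"]   # stand-in for a web search tool

def corrective_retrieve(query: str, retrieved: list[str]) -> list[str]:
    relevant = [d for d in retrieved if stub_grade(query, d)]
    return relevant if relevant else fallback_search(query)

kept = corrective_retrieve(
    "embedding models",
    ["Embedding models map text to vectors.", "Unrelated release notes."],
)
fallback = corrective_retrieve("quantum widgets", ["Unrelated release notes."])
```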
