LangChain From 0 To 1
https://github.com/Stell0/fosdem2024
- Presentation
- Code
- Useful links
Our Journey
1. Introduction to LangChain
2. Document loaders
3. Text Splitters
4. Embeddings
5. Vectorstores
6. Retrievers
7. Prompts and Templates
8. Large Language Models
9. Chains
10. RAG - Retrieval Augmented Generation
11. Demo
Retrieval Augmented Generation (RAG) 🔥🔥🔥
[Diagram: Question + data → LLM → Answer]
Example of RAG use case: QA over unstructured data
[Diagram: YouTube video → transcript → chunks → embeddings (e.g. [0.2, 0.3, 2.1, 0.2, …]) → vectorstore; Question → embedding → retrieved chunks → prompt template (Instructions + {Context} + {Question}) → LLM → Answer]
LangChain
[Diagram: HTML, PDF, JSON, TXT, … documents → chunks → embeddings (e.g. [0.2, 0.3, 2.1, 0.2, …]) → vectorstore]
Document Loaders
HTML, PDF, JSON, TXT, CSV, Markdown, XML, EPub, Email,
Microsoft Word, Open Document Format (ODT), Pandas DataFrame, MongoDB, Snowflake,
Arxiv, PubMed, Wikipedia, MediaWiki Dump, ReadTheDocs Documentation,
Git, GitHub, URL, RSS Feeds,
Discord, Reddit, Slack, Telegram, WhatsApp Chat, Facebook Chat, X, Mastodon,
EverNote, Figma, YouTube audio, YouTube transcripts, …
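A minimal sketch of how a loader is used, assuming the langchain-community package (plus youtube-transcript-api for the YouTube loader); the file name and video URL are placeholders:

```python
from langchain_community.document_loaders import TextLoader, YoutubeLoader

# Load a plain-text file into Document objects (page_content + metadata)
docs = TextLoader("notes.txt").load()

# Load the transcript of a YouTube video the same way
yt_docs = YoutubeLoader.from_youtube_url(
    "https://www.youtube.com/watch?v=dQw4w9WgXcQ",  # placeholder video URL
).load()

print(docs[0].page_content[:200], docs[0].metadata)
```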
Text Splitters
Approaches to text splitting:
- Characters / Tokens
- Document structure
- Semantic Chunker
- Agent-like Splitting
RecursiveCharacterTextSplitter
Visualize chunking: https://chunkviz.up.railway.app
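A minimal sketch of the recursive splitter (chunk sizes are arbitrary), reusing the documents from the loader step:

```python
from langchain.text_splitter import RecursiveCharacterTextSplitter

# Split on paragraph, sentence and word boundaries, falling back recursively
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(docs)  # docs from the loader step
print(len(chunks), chunks[0].page_content[:80])
```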
Embeddings
- Numerical representation
- Vectors in High-dimensional space
- Each dimension reflects an aspect
- Similarity = Proximity in embedding space
[Diagram: text chunks mapped to vectors such as [0.2, 0.3, 2.1, 0.2, …]]
Embeddings
- Complexity is hidden
- We rely on an external provider
- Note: data is sent to the external provider
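A minimal sketch, assuming the langchain-openai package and an OPENAI_API_KEY in the environment (any other embedding integration works the same way):

```python
from langchain_openai import OpenAIEmbeddings

embeddings = OpenAIEmbeddings()  # data is sent to the provider's API

# A sentence becomes a high-dimensional vector of floats
vector = embeddings.embed_query("LangChain makes RAG pipelines easy")
print(len(vector), vector[:4])
```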
Vectorstore
Storing embeddings
- Stores
- Search
- Retrieve
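A minimal sketch using the FAISS integration (assuming the langchain-community and faiss-cpu packages; Chroma, Qdrant, pgvector, … work the same way):

```python
from langchain_community.vectorstores import FAISS

# Embed each chunk and store the vectors
vectorstore = FAISS.from_documents(chunks, embeddings)

# Similarity search: embed the query and return the closest chunks
hits = vectorstore.similarity_search("What is the talk about?", k=3)
for doc in hits:
    print(doc.page_content[:80])
```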
Vectorstore
https://blog.langchain.dev/langchain-state-of-ai-2023/
Using data
[Diagram: retrieved chunks fill {Context} in a prompt template (Instructions + {Context} + {Question}); the resulting Prompt goes to the LLM, which returns the Answer]
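A minimal sketch of such a template (the wording of the instructions is illustrative):

```python
from langchain_core.prompts import ChatPromptTemplate

prompt = ChatPromptTemplate.from_template(
    "Answer the question using only the context below.\n\n"
    "Context:\n{context}\n\n"
    "Question: {question}"
)

# Filling the placeholders produces the final prompt sent to the LLM
print(prompt.format(context="…retrieved chunks…", question="What is LangChain?"))
```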
Retriever Prompt/Template LLM Chain
Retriever
Relevant Documents
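A minimal sketch: any vectorstore can be wrapped as a retriever that returns the relevant documents for a query (k is arbitrary):

```python
# Wrap the vectorstore as a retriever returning the k most similar chunks
retriever = vectorstore.as_retriever(search_kwargs={"k": 4})

relevant_docs = retriever.get_relevant_documents("What is LangChain?")
print(len(relevant_docs))
```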
Retriever
Another Retriever
[Diagram: Question + retrieved Documents ⬇ combined into the prompt context]
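One way to do that step, using a hypothetical format_docs helper that joins the retrieved chunks into a single context string:

```python
def format_docs(docs):
    # Join the page_content of each retrieved Document into one context block
    return "\n\n".join(doc.page_content for doc in docs)

context = format_docs(relevant_docs)
```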
Prompt
Prompt from Hub
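A minimal sketch, assuming the langchainhub package; rlm/rag-prompt is a commonly used community RAG prompt on the LangChain Hub:

```python
from langchain import hub

# Pull a ready-made RAG prompt instead of writing your own template
hub_prompt = hub.pull("rlm/rag-prompt")
print(hub_prompt)
```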
LLM
https://python.langchain.com/docs/integrations/llms/
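A minimal sketch of two interchangeable choices: a hosted chat model (assuming langchain-openai and an API key) and a local open-source model served by Ollama (assuming langchain-community and a running Ollama instance; the model name is an example):

```python
from langchain_openai import ChatOpenAI
from langchain_community.llms import Ollama

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)  # hosted provider
local_llm = Ollama(model="mistral")                     # local OSS model

print(llm.invoke("Say hello in one word").content)
print(local_llm.invoke("Say hello in one word"))
```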
LLM
“Nobody Gets Fired For Buying IBM” → “Nobody Gets Fired For Buying OpenAI”
Most Used LLM Providers
https://blog.langchain.dev/langchain-state-of-ai-2023/
Most Used OSS Model Providers
https://blog.langchain.dev/langchain-state-of-ai-2023/
Put everything together
Chains
Sequence of calls
- Advantages:
  - Simple
  - Modular
  - Efficient
- Compose your own
- Off-the-shelf
- Legacy Class
- LCEL
  - Streaming
  - Async (and sync) support
  - Optimized parallel execution
  - Integrated with LangSmith and LangServe
  - …
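A minimal sketch of LCEL composition with the pipe operator, reusing the prompt, llm and context from the previous steps:

```python
from langchain_core.output_parsers import StrOutputParser

# LCEL: each component is a Runnable; "|" pipes the output into the next one
chain = prompt | llm | StrOutputParser()

answer = chain.invoke({"context": context, "question": "What is LangChain?"})
print(answer)
```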
Put everything together using LCEL
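Putting the previous pieces together into one RAG chain, a sketch of the standard LCEL pattern (retriever, format_docs, prompt and llm as defined above):

```python
from langchain_core.runnables import RunnablePassthrough
from langchain_core.output_parsers import StrOutputParser

# The dict branch retrieves and formats the context while the question passes through
rag_chain = (
    {"context": retriever | format_docs, "question": RunnablePassthrough()}
    | prompt
    | llm
    | StrOutputParser()
)

print(rag_chain.invoke("What is the video about?"))
```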
Other use cases