
LLM Models and RAG

Hands-on guide

Mohamed El-Zahaby
V 0.1
April 2024
This guide is primarily for technical teams developing a basic conversational AI solution
with RAG. It offers an introduction to the technical aspects.

It helps anyone with a basic technical background get started in the AI domain.

The guide combines theoretical, foundational knowledge with code implementation.

Note that most of the content is compiled from various online resources; considerable
effort went into curating and organizing this information from numerous sources.
Contents
INTRODUCTION ........................................................................................................................................8
What is Conversational AI? ..........................................................................................................................9
The Technology Behind Conversational AI .................................................................................................9
1. Speech-to-text: ..........................................................................................................................................9
2. Language processing: ...............................................................................................................................9
3. Text-To-Speech (TTS): ..........................................................................................................................10
4.Context and Multi-turn conversations: ....................................................................................................10
5. Dialogue policy: .....................................................................................................................................10
LLM Basics.................................................................................................................................................12
What is a large language model (LLM)? ....................................................................................................13
How do LLMs work? .................................................................................................................................13
Machine learning and deep learning ...........................................................................................................13
Neural networks..........................................................................................................................................14
Transformer models....................................................................................................................................14
What are the Relations and Differences between LLMs and Transformers? .............................................15
Transformers...............................................................................................................................................15
LLM (Large Language Model)...................................................................................................................15
Relation and Differences between LLMs and Transformers ......................................................................16
What are Pipelines in Transformers?..........................................................................................................17
What are Hugging Face Transformers? ......................................................................................................18
Hugging Face provides: ..............................................................................................................................18
Chains .........................................................................................................................................................19
What are chains?.........................................................................................................................................20
Some reasons you may want to use chains: ................................................................................................20
Foundational chain types in LangChain .....................................................................................................20
LLMChain ..................................................................................................................................................21
Creating an LLMChain...............................................................................................................................22
Sequential Chains .......................................................................................................................................26
SimpleSequentialChain ..............................................................................................................................26
SequentialChain ..........................................................................................................................................28
Transformation ...........................................................................................................................................30
Prompt Engineering.....................................................................................................................................36
What is Prompt Engineering? .....................................................................................................................37
Prompt ........................................................................................................................................................37
Types of Prompts ........................................................................................................................................38
Instruction Prompting .................................................................................................................................38
Role Prompting ...........................................................................................................................................39
“Standard” Prompting.................................................................................................................................41
Chain of Thought (CoT) Prompting ...........................................................................................................41
Recommendations and Tips for Prompt Engineering with OpenAI API ...................................................43
Embeddings.................................................................................................................................................48
A problem with semantic search.................................................................................................................49
What are embeddings?................................................................................................................................50
What is a vector in machine learning?........................................................................................................51
How do embeddings work? ........................................................................................................................53
How are embeddings used in large language models (LLMs)?..................................................................54
Vector Stores ...............................................................................................................................................55
What Are Vector Databases? ......................................................................................................................56
The Benefits of Using Open Source Vector Databases ..............................................................................56
Open Source Vector Databases Comparison: Chroma Vs. Milvus Vs. Weaviate ......................................57
1. Chroma ...................................................................................................................................................57
2. Milvus .....................................................................................................................................................58
3. Weaviate .................................................................................................................................................58
4.Faiss: ........................................................................................................................................................59
Chunking.....................................................................................................................................................63
Document Splitting .....................................................................................................................................64
Chunking Methods .....................................................................................................................................65
Character Splitting ......................................................................................................................................66
Recursive Character Text Splitting.............................................................................................................71
Split by Tokens ...........................................................................................................................................75
Tiktoken Tokenizer.....................................................................................................................................75
Hugging Face Tokenizer ............................................................................................................................75
Other Tokenizer ..........................................................................................................................................76
Things to Keep in Mind ..............................................................................................................................76
Quantization ................................................................................................................................................77
What is Quantization? ................................................................................................................................78
How does quantization work? ....................................................................................................................78
Hugging Face and Bitsandbytes Uses.........................................................................................................78
Loading a Model in 4-bit Quantization ......................................................................................................79
Loading a Model in 8-bit Quantization ......................................................................................................80
Changing the Compute Data Type .............................................................................................................80
Using NF4 Data Type .................................................................................................................................81
Nested Quantization for Memory Efficiency .............................................................................................81
Loading a Quantized Model from the Hub .................................................................................................81
Exploring Advanced techniques and configuration ....................................................................................82
Fine-Tuning a Model Loaded in 8-bit ........................................................................................................82
Temperature ................................................................................................................................................83
Top P and Temperature ..............................................................................................................................84
Temperature ................................................................................................................................................84
Top p ...........................................................................................................................................................84
Token length ...............................................................................................................................................85
Max tokens .................................................................................................................................................85
Stop tokens .................................................................................................................................................86
Langchain Memory .....................................................................................................................................87
What is Conversational memory?...............................................................................................................88
ConversationChain .....................................................................................................................................89
Forms of Conversational Memory ..............................................................................................................91
ConversationBufferMemory .......................................................................................................................91
ConversationSummaryMemory..................................................................................................................96
ConversationBufferWindowMemory .......................................................................................................103
ConversationSummaryBufferMemory .....................................................................................................108
Other Memory Types................................................................................................................................110
Agents & Tools .........................................................................................................................................111
Tools .........................................................................................................................................................112
Agents .......................................................................................................................................................112
Chains .......................................................................................................................................................113
Memory ....................................................................................................................................................114
Callback Handlers.....................................................................................................................................116
Walkthrough — Project Utilizing Langchain ...........................................................................................116
RAG ..........................................................................................................................................................122
The Curse Of The LLMs ..........................................................................................................................123
The Challenge ...........................................................................................................................................123
What is RAG?...........................................................................................................................................123
How does RAG help? ...............................................................................................................................125
New RAG techniques :- ........................................................................................................................126
groq ...........................................................................................................................................................128
What is groq? ............................................................................................................................................129
What is LPU?............................................................................................................................................129
How Groq's LPU Works ...........................................................................................................................129
How LPU is different from GPU ..............................................................................................................131
Groq Tools ................................................................................................................................................133
Groq and RAG Architecture Example ......................................................................................................137
What is LlamaParse ? ...............................................................................................................................139
Use Case – 1..............................................................................................................................................157
Conversational AI chatbot ........................................................................................................................158
implementation-1-A4000..........................................................................................................................158
implementation-2-A100............................................................................................................................175
implementation-3-groq .............................................................................................................................175
implementation-4-llama3-A4000 .............................................................................................................189
Use Case – 2..............................................................................................................................................192
Action integration with chatbot (google calendar booking) .....................................................................193
Source Code ..............................................................................................................................................198
INTRODUCTION
What is Conversational AI?
Conversational AI means using technology with artificial intelligence to make machines
talk to people. Basically, it figures out what someone says or writes and responds
naturally to keep the conversation going. Thanks to recent improvements, machines can
now have smart and natural conversations with humans.

The Technology Behind Conversational AI


Conversational AI relies on various components to function, spanning from speech
recognition to intent detection and concluding with a spoken or written response. The
following components constitute the core of the conversational AI technology stack:

1. Speech-to-text:
- This technology converts spoken words into text transcriptions.

2. Language processing:
2.1 Natural Language Understanding (NLU):

- NLU is the process by which technology comprehends natural human language.

- Especially crucial in voice interactions where speakers may not use specific keywords or
share longer stories.

2.2. Intent:

- Intents within conversational AI determine the actions triggered based on conversational inputs.

2.3. Intent detection:

- This process involves the bot correctly identifying the intent behind an utterance.

- More challenging in voice compared to text due to the tendency for longer stories in
speech.
2.4. Value extraction:

- AI agents extract relevant information from customer queries and store them against
corresponding 'slots.'

- Vital for handling multiple values in a single speech, ensuring natural conversations.

3. Text-To-Speech (TTS):
- This technology converts written text into spoken utterances.

- Off-the-shelf solutions may sound robotic, but voice actors can be used for natural
responses.

4. Context and Multi-turn conversations:


- Conversational bots need to maintain context across multiple turns for natural-feeling
conversations.

- Particularly important in voice interactions where chat history isn't displayed to the
customer.

- Each back-and-forth interaction in a conversation is a 'turn.'

- Multi-turn conversations involve more than one interaction, contributing to a comprehensive dialogue.

5. Dialogue policy:
- Dialogue policy guides the flow of a conversation, allowing the bot to intelligently
navigate a transaction.

- A robust dialogue policy accommodates interruptions, such as clarifying questions, and enhances the user experience.
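
To make the flow above concrete, here is a hypothetical sketch of one conversational turn. The function names (speech_to_text, detect_intent, extract_values, dialogue_policy, text_to_speech) are placeholders for the components described above, not a real library API.

def handle_turn(audio, context):
    # 1. Speech-to-text: transcribe the user's spoken input
    text = speech_to_text(audio)
    # 2. Language processing: NLU, intent detection, and value extraction
    intent = detect_intent(text, context)
    slots = extract_values(text, intent)
    # 4. Context and multi-turn: remember what happened in this turn
    context.update(intent=intent, slots=slots)
    # 5. Dialogue policy: decide what the bot should do or say next
    reply = dialogue_policy(intent, slots, context)
    # 3. Text-to-speech: speak the response back to the user
    return text_to_speech(reply), context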

ref:

https://spotintelligence.com/2024/01/30/conversational-ai-explained-top-9-tools-how-to-guide-including-gpt/

https://i0.wp.com/spotintelligence.com/wp-content/uploads/2024/01/key-components-of-conversational-ai-1024x576.webp?resize=1024,576&ssl=1
LLM Basics
What is a large language model (LLM)?
A large language model (LLM) is a type of [artificial intelligence
(AI)](https://www.cloudflare.com/learning/ai/what-is-artificial-intelligence/ ) program
that can recognize and generate text, among other tasks. LLMs are trained on [huge sets
of data](https://www.cloudflare.com/learning/ai/big-data/ ) — hence the name "large."
LLMs are built on [machine learning](https://www.cloudflare.com/learning/ai/what-is-
machine-learning/ ): specifically, a type of [neural
network](https://www.cloudflare.com/learning/ai/what-is-neural-network/ ) called a
transformer model.

In simpler terms, an LLM is a computer program that has been fed enough examples to be
able to recognize and interpret human language or other types of complex data. Many
LLMs are trained on data that has been gathered from the Internet — thousands or
millions of gigabytes' worth of text. But the quality of the samples impacts how well
LLMs will learn natural language, so an LLM's programmers may use a more curated data
set.

LLMs use a type of machine learning called [deep learning](https://www.cloudflare.com/learning/ai/what-is-deep-learning/ ) in order to
understand how characters, words, and sentences function together. Deep learning
involves the probabilistic analysis of unstructured data, which eventually enables the deep
learning model to recognize distinctions between pieces of content without human
intervention.

LLMs are then further trained via tuning: they are fine-tuned or prompt-tuned to the
particular task that the programmer wants them to do, such as interpreting questions and
generating responses, or translating text from one language to another.

How do LLMs work?


Machine learning and deep learning

At a basic level, LLMs are built on machine learning. Machine learning is a subset of AI,
and it refers to the practice of feeding a program large amounts of data in order to train the
program how to identify features of that data without human intervention.

LLMs use a type of machine learning called deep learning. Deep learning models can
essentially train themselves to recognize distinctions without human intervention,
although some human fine-tuning is typically necessary.
Deep learning uses probability in order to "learn." For instance, in the sentence "The quick
brown fox jumped over the lazy dog," the letters "e" and "o" are the most common,
appearing four times each. From this, a deep learning model could conclude (correctly)
that these characters are among the most likely to appear in English-language text.

Realistically, a deep learning model cannot actually conclude anything from a single
sentence. But after analyzing trillions of sentences, it could learn enough to predict how to
logically finish an incomplete sentence, or even generate its own sentences.
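
As a quick sanity check of the letter-counting example above, a few lines of Python reproduce the counts:

from collections import Counter

sentence = "The quick brown fox jumped over the lazy dog"
counts = Counter(c for c in sentence.lower() if c.isalpha())
print(counts["e"], counts["o"])  # both letters appear 4 times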

Neural networks
In order to enable this type of deep learning, LLMs are built on neural networks. Just as
the human brain is constructed of neurons that connect and send signals to each other, an
artificial neural network (typically shortened to "neural network") is constructed of
network nodes that connect with each other. They are composed of several "layers": an
input layer, an output layer, and one or more layers in between. The layers only pass
information to each other if their own outputs cross a certain threshold.

Transformer models
The specific kind of neural networks used for LLMs are called transformer models.
Transformer models are able to learn context — especially important for human language,
which is highly context-dependent. Transformer models use a mathematical technique
called self-attention to detect subtle ways that elements in a sequence relate to each other.
This makes them better at understanding context than other types of machine learning. It
enables them to understand, for instance, how the end of a sentence connects to the
beginning, and how the sentences in a paragraph relate to each other.

This enables LLMs to interpret human language, even when that language is vague or
poorly defined, arranged in combinations they have not encountered before, or
contextualized in new ways. On some level they "understand" semantics in that they can
associate words and concepts by their meaning, having seen them grouped together in that
way millions or billions of times.
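
To give a feel for what self-attention computes, here is a minimal NumPy sketch. It is a toy single-head version that reuses the token vectors as queries, keys, and values; a real transformer learns separate projection matrices and uses many attention heads.

import numpy as np

def self_attention(X):
    # X holds one vector per token (n_tokens x d)
    d = X.shape[-1]
    scores = X @ X.T / np.sqrt(d)  # how strongly each token attends to every other token
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)  # softmax over each row
    return weights @ X  # each output is a context-aware mix of all token vectors

tokens = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])  # three toy "token" embeddings
print(self_attention(tokens))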

ref: https://www.cloudflare.com/learning/ai/what-is-large-language-model/
What are the Relations and Differences
between LLMs and Transformers?

Transformers
Transformers have gained a lot of popularity in the field of natural language processing (NLP). They are
good at understanding the relationships between words in a sentence or sequence of text.
Unlike traditional models such as RNNs, transformers do not rely on sequential processing,
which allows them to compute in parallel and process sentences more efficiently.
Overall, they are powerful models that excel at capturing relationships between words
and have modernized the NLP field.

Imagine a sentence: "The cat sat on the mat." A transformer breaks down this sentence
into smaller units called "tokens" (e.g., "The," "cat," "sat," "on," "the," "mat," and
punctuation marks). Each token is represented as a vector, capturing its meaning and
context. The transformer then learns to analyse the relationships between these tokens to
understand the sentence's overall meaning.
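
As a small illustration of this tokenization step, the snippet below runs the example sentence through a standard BERT tokenizer from Hugging Face; the exact tokens depend on the chosen vocabulary.

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
tokens = tokenizer.tokenize("The cat sat on the mat.")
print(tokens)  # e.g. ['the', 'cat', 'sat', 'on', 'the', 'mat', '.']
print(tokenizer.convert_tokens_to_ids(tokens))  # each token maps to an integer id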

Example models,

- BERT (Bidirectional Encoder Representations from Transformers)

- GPT (Generative Pre-trained Transformer)

- T5 (Text-to-Text Transfer Transformer)

- DialoGPT

LLM (Large Language Model)


An LLM is a specific type of transformer that has been trained on vast amounts of text data. It has
learned to predict the next word in a sentence given the context of the previous words.
This ability allows LLMs to generate contextually correct text.

For instance, if you provide the prompt "Once upon a time in a land far" an LLM can
generate the next words as "away." The LLM bases its predictions on the patterns and
context it has learned during training on massive amounts of text. This makes LLMs
useful for various applications, such as auto-completion, translation, summarization, and
even creative writing.

- GPT-3.5 Turbo & GPT-4 by OpenAI

- BLOOM by BigScience

- LaMDA by Google

- MT-NLG by Nvidia/Microsoft

- LLaMA by Meta AI

Relation and Differences between LLMs and Transformers
Transformers and LLMs (large language models) are related concepts: LLMs are a
specific type of model built on the transformer architecture. While transformers,
in general, can be used for various tasks beyond language modeling, LLMs are
specifically trained to generate text and understand natural language (there can be
exceptions, as this field is evolving quickly and the pace of research and funding is
unprecedented).

The main differences between transformers and LLMs lie in their specific purposes and
training objectives. Transformers are a broader class of models that can be applied to
various tasks, including language translation, speech recognition, and image captioning,
while LLMs are focused on language modeling and text generation (there are some
exceptions). Transformers serve as the underlying architecture that enables LLMs to
understand and generate text by capturing contextual relationships and long-range
dependencies. Transformers are more general-purpose models, whereas LLMs are
specifically trained and optimized for language modeling and generation tasks.

Transformer models can also be divided into three categories: encoders, decoders, and
encoder-decoder architectures. This categorization is based on the different roles these
components play in the model's overall function. Encoders aim to understand the input
sequence. They focus on processing the input and capturing its meaning and
context. Decoders, on the other hand, generate output based on the information learned
by the encoder. They take the encoded representations and produce the desired output
sequence. Encoder-decoder models combine both encoder and decoder components.
They are used in tasks where the input and output sequences have different lengths or
meanings. The encoder understands the input sequence, and the decoder generates the
corresponding output sequence.
ref: https://www.linkedin.com/pulse/transformers-llms-next-frontier-ai-vijay-chaudhary/

What are Pipelines in Transformers?


- They provide an easy-to-use API, through the pipeline() method, for performing inference
over a variety of tasks.

- They are used to encapsulate the overall process of every Natural Language Processing
task, such as text cleaning, tokenization, embedding, etc.

The pipeline() method has the following structure:

from transformers import pipeline


# To use a default model & tokenizer for a given task (e.g. question-answering)
pipeline("task-name")

# To use an existing model
pipeline("task-name", model="model_name")

# To use a custom model and tokenizer
pipeline("task-name", model="model_name", tokenizer="tokenizer_name")

>This code snippet is using the transformers library to create a pipeline for natural
language processing tasks such as question-answering.

- The first line imports the pipeline function from the transformers library.

- The next three lines show how to use the pipeline function for different scenarios.

- The first scenario uses a default model and tokenizer for a given task, which is specified
in the placeholder "task-name".

- The second scenario uses an existing model, which is specified in the placeholder
"model_name", for the same task as in the first scenario.

- The third scenario uses a custom model and tokenizer, which are specified in the
placeholders "model_name" and "tokenizer_name", respectively, for the same task as in
the first two scenarios.

- Overall, the pipeline function allows for easy implementation of natural language
processing tasks with various models and tokenizers.
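
For a concrete, runnable illustration (the model name below is just one example checkpoint from the Hugging Face Hub, not something mandated by the text above):

from transformers import pipeline

# Sentiment analysis with an explicitly named model from the Hub
classifier = pipeline(
    "sentiment-analysis",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)
print(classifier("This guide makes transformers easy to follow."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]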

ref: https://www.datacamp.com/tutorial/an-introduction-to-using-transformers-and-hugging-face

What are Hugging Face Transformers?


[Hugging Face Transformers](https://huggingface.co/docs/transformers/index ) is an
open-source framework for deep learning created by Hugging Face. It provides APIs and
tools to download state-of-the-art pre-trained models and further tune them to maximize
performance. These models support common tasks in different modalities, such as natural
language processing, computer vision, audio, and multi-modal applications.

For many applications, such as sentiment analysis and text summarization, pre-trained
models work well without any additional model training.

Hugging Face Transformers pipelines encode best practices and have default models
selected for different tasks, making it easy to get started. Pipelines make it easy to use
GPUs when available and allow batching of items sent to the GPU for better throughput
performance.

Hugging Face provides:


- A [model hub](https://huggingface.co/models ) containing many pre-trained models.

- The [🤗 Transformers library](https://huggingface.co/docs/transformers/index ) that supports the download and use of these models for NLP applications and fine-tuning. It is common to need both a tokenizer and a model for natural language processing tasks.

- [🤗 Transformers pipelines](https://huggingface.co/docs/transformers/v4.26.1/en/pipeline_tutorial ) that have a simple interface for most natural language processing tasks.
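
A minimal sketch of loading both a tokenizer and a model from the model hub; the checkpoint name is an example, and any compatible hub model would work:

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

model_name = "distilbert-base-uncased-finetuned-sst-2-english"  # example hub checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

inputs = tokenizer("Hugging Face makes model reuse simple.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print(model.config.id2label[int(logits.argmax())])  # e.g. POSITIVE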

ref: https://docs.databricks.com/en/machine-learning/train-model/huggingface/index.html
Chains
What are chains?
A chain is an end-to-end wrapper around multiple individual components executed in a
defined order.

Chains are one of the core concepts of LangChain. Chains allow you to go beyond just a
single API call to a language model and instead chain together multiple calls in a logical
sequence.

They allow you to combine multiple components to create a coherent application.

Some reasons you may want to use chains:


- To break down a complex task into smaller steps that can be handled sequentially by
different models or utilities. This allows you to leverage the different strengths of different
systems.

- To add state and memory between calls. The output of one call can be fed as input to
the next call to provide context and state.

- To add additional processing, filtering or validation logic between calls.

- For easier debugging and instrumentation of a sequence of calls.

Foundational chain types in LangChain


The `LLMChain`, `RouterChain`, `SimpleSequentialChain`, and `TransformChain` are
considered the core foundational building blocks that many other more complex chains
build on top of. They provide basic patterns like chaining LLMs, conditional logic,
sequential workflows, and data transformations.

• `LLMChain`: Chains together multiple calls to language models. Useful for breaking
down complex prompts.

• `RouterChain`: Allows conditionally routing between different chains based on logic. Enables branching logic.

• `SimpleSequentialChain`: Chains together multiple chains in sequence. Useful for linear workflows.

• `TransformChain`: Applies a data transformation between chains. Helpful for data munging and preprocessing.

Other key chain types like `Agents` and `RetrievalChain` build on top of these
foundations to enable more advanced use cases like goal-oriented conversations and
knowledge-grounded generation.

However, the foundational four provide the basic patterns for chain construction in
LangChain.

LLMChain

The most commonly used type of chain is an LLMChain.

The LLMChain consists of a PromptTemplate, a language model, and an optional output
parser. For example, you can create a chain that takes user input, formats it with a
PromptTemplate, and then passes the formatted response to an LLM. You can build more
complex chains by combining multiple chains, or by combining chains with other
components.

The main differences between using an LLMChain versus directly passing a prompt to an
LLM are:

- LLMChain allows chaining multiple prompts together, while directly passing a prompt
only allows one. With LLMChain, you can break down a complex prompt into multiple
more straightforward prompts and chain them together.

- LLMChain maintains state and memory between prompts. The output of one prompt
can be fed as input to the following prompt to provide context. Directly passing prompts
lacks this memory.

- LLMChain makes adding preprocessing logic, validation, and instrumentation between
prompts easier. This helps with debugging and quality control.

- LLMChain provides some convenience methods like `apply` and `generate` that make
it easy to run the chain over multiple inputs.
Creating an LLMChain
To create an LLMChain, you need to specify:

- The language model to use

- The prompt template

Code Example:

from langchain import PromptTemplate, OpenAI, LLMChain


# the language model
llm = OpenAI(temperature=0)
# the prompt template
prompt_template = "Act like a comedian and write a super funny two-sentence short story about {thing}?"
llm_chain = LLMChain(
llm=llm,
prompt=PromptTemplate.from_template(prompt_template)
)
llm_chain("A toddler hiding his dad's laptop")

{'thing': "A toddler hiding his dad's laptop",


'text': '\n\nThe toddler thought he was being sneaky, but little did he know his
dad was watching the whole time from the other room, laughing.'}

Use `apply` when you have a list of inputs and want to get the LLM to generate text for
each one, it will run the LLMChain for every input dictionary in the list and return a list of
outputs.

input_list = [
{"thing": "a Punjabi rapper who eats too many samosas"},
{"thing": "a blind eye doctor"},
{"thing": "a data scientist who can't do math"}
]
llm_chain.apply(input_list)

[{'text': "\n\nThe Punjabi rapper was so famous that he was known as the 'Samosa
King', but his fame was short-lived when he ate so many samosas that he had to be
hospitalized for a stomachache!"},
{'text': "\n\nA blind eye doctor was so successful that he was able to cure his own
vision - but he still couldn't find his glasses."},
{'text': '\n\nA data scientist was so bad at math that he had to hire a calculator
to do his calculations for him. Unfortunately, the calculator was even worse at math
than he was!'}]

`generate` is similar to `apply`, except it returns an `LLMResult` instead of a string. Use
this when you want the entire `LLMResult` object returned, not just the generated text.
This gives you access to metadata like the number of tokens used.
llm_chain.generate(input_list)

LLMResult(generations=
[[Generation(text="\n\nThe Punjabi rapper was so famous that he was known as the
'Samosa King',
but his fame was short-lived when he ate so many samosas that he had to be
hospitalized for a stomachache!",
generation_info={'finish_reason': 'stop', 'logprobs': None})],

[Generation(text="\n\nA blind eye doctor was so successful that he was able to cure
his own vision - but he still couldn't find his glasses.",
generation_info={'finish_reason': 'stop', 'logprobs': None})],

[Generation(text='\n\nA data scientist was so bad at math that he had to hire a


calculator to do his calculations for him. Unfortunately, the calculator was even
worse at math than he was!', generation_info={'finish_reason': 'stop', 'logprobs':
None})]],

llm_output={'token_usage': {'prompt_tokens': 75, 'total_tokens': 187, 'completion_tokens': 112}, 'model_name': 'text-davinci-003'},
run=[RunInfo(run_id=UUID('b638d2c6-77d9-4346-8494-866892e36bc5')),
RunInfo(run_id=UUID('427f9e51-4848-49d3-83c1-e96131f2b34f')),
RunInfo(run_id=UUID('4201eea9-1616-42e7-8cb2-a5b26128decd'))])

Use `predict` when you want to pass inputs as keyword arguments instead of a dictionary.
This can be convenient if you don’t want to construct an input dictionary.

llm_chain.predict(thing="colorful socks")

The socks were so colorful that when the washing machine finished its cycle, the socks
had formed a rainbow in the laundry basket!

Use `LLMChain.run` when you want to pass the input as a dictionary and get the raw
text output from the LLM.

`LLMChain.run` is convenient when your LLMChain has a single input key and a single
output key.

llm_chain.run("the red hot chili peppers")

['1. Wear a Hawaiian shirt\n2. Sing along to the wrong lyrics\n3. Bring a beach ball
to the concert\n4. Try to start a mosh pit\n5. Bring a kazoo and try to join in on
the music']
Parsing output

To parse the output, you simply pass an output parser directly to `LLMChain`.

from langchain.output_parsers import CommaSeparatedListOutputParser


llm = OpenAI(temperature=0)
# the prompt template

prompt_template = "Act like a Captain Obvious and list 5 funny things to not do at {place}?"

output_parser=CommaSeparatedListOutputParser()
llm_chain = LLMChain(
llm=llm,
prompt=PromptTemplate.from_template(prompt_template),
output_parser= output_parser
)

llm_chain.predict(place='Disneyland')

['1. Wear a costume of a Disney villain.\n2. Bring your own food and drinks into the
park.\n3. Try to ride the roller coasters without a ticket.\n4. Try to sneak into
the VIP area.\n5. Try to take a selfie with a Disney character without asking
permission.']

Router Chains

Router chains allow routing inputs to different destination chains based on the input text.
This allows the building of chatbots and assistants that can handle diverse requests.

- Router chains examine the input text and route it to the appropriate destination chain

- Destination chains handle the actual execution based on the input

- Router chains are powerful for building multi-purpose chatbots/assistants

The following example will show routing chains used in a `MultiPromptChain` to create a
question-answering chain that selects the prompt which is most relevant for a given
question and then answers the question using that prompt.
from langchain.chains.router import MultiPromptChain
from langchain.llms import OpenAI
from langchain.chains import ConversationChain
from langchain.chains.llm import LLMChain
from langchain.prompts import PromptTemplate

llm = OpenAI(temperature=0)  # the language model shared by the destination chains below

physics_template = """You are a very smart physics professor. \


You are great at answering questions about physics in a concise and easy to
understand manner. \
When you don't know the answer to a question you admit that you don't know.

Here is a question:
{input}"""

math_template = """You are a very good mathematician. You are great at answering
math questions. \
You are so good because you are able to break down hard problems into their
component parts, \
answer the component parts, and then put them together to answer the broader
question.

Here is a question:
{input}"""

prompt_infos = [
{
"name": "physics",
"description": "Good for answering questions about physics",
"prompt_template": physics_template,
},
{
"name": "math",
"description": "Good for answering math questions",
"prompt_template": math_template,
},
]

destination_chains = {}

for p_info in prompt_infos:
    name = p_info["name"]
    prompt_template = p_info["prompt_template"]
    prompt = PromptTemplate(template=prompt_template, input_variables=["input"])
    chain = LLMChain(llm=llm, prompt=prompt)
    destination_chains[name] = chain

default_chain = ConversationChain(llm=llm, output_key="text")

default_chain.run("What is math?")

Math is the study of numbers, shapes, and patterns. It is used to solve problems and
understand the world around us. It is a fundamental part of our lives and is used in many
different fields, from engineering to finance.
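
The snippet above defines the destination chains and a default chain but stops short of wiring up the router itself. A hedged sketch of that final step is below, assuming the `MultiPromptChain.from_prompts` helper available in legacy LangChain versions; if your version lacks it, the router can also be built manually from `LLMRouterChain` and `RouterOutputParser`.

# Build the router on top of the destination chains defined above.
chain = MultiPromptChain.from_prompts(
    llm,
    prompt_infos,
    default_chain=default_chain,
    verbose=True,
)

print(chain.run("What is black body radiation?"))    # should route to the physics chain
print(chain.run("What is the derivative of x**2?"))  # should route to the math chain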
Sequential Chains
Sometimes, you might want to make a series of calls to a language model, take the output
from one call and use it as the input to another. Sequential chains allow you to connect
multiple chains and compose them into pipelines executing a specific scenario.

There are two types of sequential chains:

1) `SimpleSequentialChain`: The simplest form of sequential chains, where each step has
a singular input/output, and the output of one step is the input to the next.

2) `SequentialChain`: A more general form of sequential chain that allows multiple inputs/outputs.

SimpleSequentialChain
The simplest form of a sequential chain is where each step has a single input and output.

The output of one step is passed as input to the next step in the chain. You would use
`SimpleSequentialChain` it when you have a linear pipeline where each step has a single
input and output. `SimpleSequentialChain` implicitly passes the output of one step as
input to the next.

This is great for composing a precise sequence of LLMChains where each builds directly
on the previous output.

### When to use:

- You have a clear pipeline of steps, each with a single input and output

- Each step builds directly off the previous step’s output

- Useful for simple linear pipelines with one input and output per step

### How to use:

1) Define each step as an `LLMChain` with a single input and output

2) Create a `SimpleSequentialChain` passing a list of the `LLMChain` steps

3) Call `run()` on the `SimpleSequentialChain` with the initial input

from langchain.llms import OpenAI


from langchain.chains import LLMChain
from langchain.prompts import PromptTemplate

# This is an LLMChain to write a rap.


llm = OpenAI(temperature=.7)

template = """

You are a Punjabi Jatt rapper, like AP Dhillon or Sidhu Moosewala.

Given a topic, it is your job to spit bars on of pure heat.

Topic: {topic}
"""
prompt_template = PromptTemplate(input_variables=["topic"], template=template)

rap_chain = LLMChain(llm=llm, prompt=prompt_template)

# This is an LLMChain to write a diss track

llm = OpenAI(temperature=.7)

template = """

You are an extremely competitive Punjabi Rapper.

Given the rap from another rapper, it's your job to write a diss track which
tears apart the rap and shames the original rapper.

Rap:
{rap}
"""

prompt_template = PromptTemplate(input_variables=["rap"], template=template)

diss_chain = LLMChain(llm=llm, prompt=prompt_template)

# This is the overall chain where we run these two chains in sequence.
from langchain.chains import SimpleSequentialChain

overall_chain = SimpleSequentialChain(chains=[rap_chain, diss_chain], verbose=True)

review = overall_chain.run("Drinking Crown Royal and mobbin in my red Challenger")


SequentialChain
A more general form of sequential chain allows multiple inputs and outputs per step.

You would use `SequentialChain` when you have a more complex pipeline where steps
might have multiple inputs and outputs.

`SequentialChain` allows you to explicitly specify all the input and output variables at
each step and map outputs from one step to inputs of the next. This provides more
flexibility when steps might have multiple dependencies or produce multiple results to
pass along.

### When to use:

- You have a sequence of steps but with more complex input/output requirements

- You need to track multiple variables across steps in the chain

### How to use

- Define each step as an LLMChain, specifying multiple input/output variables

- Create a SequentialChain specifying all input/output variables

- Map outputs from one step to inputs of the next

- Call run() passing a dict of all input variables

- The key difference is `SimpleSequentialChain` handles implicit variable passing, whereas `SequentialChain` allows explicit variable specification and mapping.

### When you would use SequentialChain vs SimpleSequentialChain

Use `SimpleSequentialChain` for linear sequences with a single input/output. Use
`SequentialChain` for more complex sequences with multiple inputs/outputs.

### The key difference

`SimpleSequentialChain` is for linear pipelines with a single input/output per step. It implicitly passes variables.

`SequentialChain` handles more complex pipelines with multiple inputs/outputs per step.
Allows explicitly mapping variables.
The following example uses a standard OpenAI model and prompt templates, chaining two
`LLMChain` steps with `SequentialChain` so that the output of the first (the rap) feeds the
second (the review) and both are returned.

llm = OpenAI(temperature=.7)

template = """

You are a Punjabi Jatt rapper, like AP Dhillon or Sidhu Moosewala.

Given two topics, it is your job to create a rhyme of two verses and one chorus
for each topic.

Topic: {topic1} and {topic2}

Rap:

"""

prompt_template = PromptTemplate(input_variables=["topic1", "topic2"], template=template)

rap_chain = LLMChain(llm=llm, prompt=prompt_template, output_key="rap")

template = """

You are a rap critic from the Rolling Stone magazine and Metacritic.

Given a rap, it is your job to write a review for that rap.

Your review style should be scathing, critical, and no holds barred.

Rap:

{rap}

Review from the Rolling Stone magazine and Metacritic critic of the above rap:

"""

prompt_template = PromptTemplate(input_variables=["rap"], template=template)

review_chain = LLMChain(llm=llm, prompt=prompt_template, output_key="review")

# This is the overall chain where we run these two chains in sequence.
from langchain.chains import SequentialChain

overall_chain = SequentialChain(
chains=[rap_chain, review_chain],
input_variables=["topic1", "topic2"],
# Here we return multiple variables
output_variables=["rap", "review"],
verbose=True)

overall_chain({"topic1":"Tractors and sugar canes", "topic2": "Dasuya, Punjab"})

> Entering new SequentialChain chain...

> Finished chain.

{'topic1': 'Tractors and sugar canes',

'topic2': 'Dasuya, Punjab',

'rap': "Verse 1\nI come from a place with lots of fame\nDasuya, Punjab, where the
tractors reign\nI'm a Jatt rapper with a game to play\nSo I'm gonna take it up and make it
my way\n\nChorus\nTractors and sugar canes, that's what I'm talking about\nTractors and
sugar canes, it's all about\nDasuya, Punjab, a place so grand\nTractors and sugar canes,
that's our jam\n\nVerse 2\nFrom Punjab's beauty I derive my pride\nMy heart belongs to
the place, where the sugar canes reside\nWhere the soil is my home, I'm never
apart\nFrom the tractors and sugar canes of Dasuya, Punjab\n\nChorus\nTractors and
sugar canes, that's what I'm talking about\nTractors and sugar canes, it's all
about\nDasuya, Punjab, a place so grand\nTractors and sugar canes, that's our jam",

'review': "\nThis rap artist hails from the small town of Dasuya, Punjab, and takes pride in
his hometown's culture and agricultural way of life. While the lyrical content of this rap is
filled with references to tractors and sugar canes, unfortunately the artist's delivery falls
flat and fails to capture the unique essence of his home. The basic rhyme scheme,
repetitive chorus, and lack of originality make this a forgettable track. The artist's
enthusiasm for his hometown is admirable, but unfortunately it is not enough to make this
rap stand out from the crowd."}

Transformation
Transformation chains allow you to define custom data transformation logic as a step in
your LangChain pipeline. This is useful when you must preprocess or transform data
before passing it to the next step.
from langchain.chains import TransformChain, LLMChain, SimpleSequentialChain
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate

!wget https://www.gutenberg.org/files/2680/2680-0.txt

with open("/content/2680-0.txt") as f:
meditations = f.read()

def transform_func(inputs: dict) -> dict:
    """
    Extracts specific sections from a given text based on newline separators.

    The function assumes the input text is divided into sections or paragraphs
    separated by single newline characters (`\n`). It extracts lines 922 to 950
    of the text and returns them in a dictionary.

    Parameters:
    - inputs (dict): A dictionary containing the key "text" with the input text as
      its value.

    Returns:
    - dict: A dictionary containing the key "output_text" with the extracted
      sections as its value.
    """
    text = inputs["text"]
    shortened_text = "\n".join(text.split("\n")[921:950])
    return {"output_text": shortened_text}

transform_chain = TransformChain(
input_variables=["text"], output_variables=["output_text"],
transform=transform_func, verbose=True
)

transform_chain.run(meditations)

II. Let it be thy earnest and incessant care as a Roman and a man to

perform whatsoever it is that thou art about, with true and unfeigned

gravity, natural affection, freedom and justice: and as for all other

cares, and imaginations, how thou mayest ease thy mind of them. Which

thou shalt do; if thou shalt go about every action as thy last action,

free from all vanity, all passionate and wilful aberration from reason,

and from all hypocrisy, and self-love, and dislike of those things,

which by the fates or appointment of God have happened unto thee. Thou

seest that those things, which for a man to hold on in a prosperous

course, and to live a divine life, are requisite and necessary, are not

many, for the gods will require no more of any man, that shall but keep

and observe these things.


III. Do, soul, do; abuse and contemn thyself; yet a while and the time

for thee to respect thyself, will be at an end. Every man's happiness

depends from himself, but behold thy life is almost at an end, whiles

affording thyself no respect, thou dost make thy happiness to consist in

the souls, and conceits of other men.

IV. Why should any of these things that happen externally, so much

distract thee? Give thyself leisure to learn some good thing, and cease

roving and wandering to and fro. Thou must also take heed of another

kind of wandering, for they are idle in their actions, who toil and

labour in this life, and have no certain scope to which to direct all

their motions, and desires. V. For not observing the state of another

man's soul, scarce was ever any man known to be unhappy. Tell whosoever

they be that intend not, and guide not by reason and discretion the

motions of their own souls, they must of necessity be unhappy.

template = """

Rephrase this text:

{output_text}

In the style of a 90s gangster rapper speaking to his homies.

Rephrased:"""

prompt = PromptTemplate(input_variables=["output_text"], template=template)

llm_chain = LLMChain(llm=OpenAI(), prompt=prompt)

sequential_chain = SimpleSequentialChain(chains=[transform_chain, llm_chain], verbose=True)

sequential_chain.run(meditations)
> Entering new SimpleSequentialChain chain...

> Entering new TransformChain chain...

> Finished chain.

II. Let it be thy earnest and incessant care as a Roman and a man to

perform whatsoever it is that thou art about, with true and unfeigned

gravity, natural affection, freedom and justice: and as for all other

cares, and imaginations, how thou mayest ease thy mind of them. Which

thou shalt do; if thou shalt go about every action as thy last action,

free from all vanity, all passionate and wilful aberration from reason,

and from all hypocrisy, and self-love, and dislike of those things,

which by the fates or appointment of God have happened unto thee. Thou

seest that those things, which for a man to hold on in a prosperous

course, and to live a divine life, are requisite and necessary, are not

many, for the gods will require no more of any man, that shall but keep

and observe these things.

III. Do, soul, do; abuse and contemn thyself; yet a while and the time

for thee to respect thyself, will be at an end. Every man's happiness


depends from himself, but behold thy life is almost at an end, whiles

affording thyself no respect, thou dost make thy happiness to consist in

the souls, and conceits of other men.

IV. Why should any of these things that happen externally, so much

distract thee? Give thyself leisure to learn some good thing, and cease

roving and wandering to and fro. Thou must also take heed of another

kind of wandering, for they are idle in their actions, who toil and

labour in this life, and have no certain scope to which to direct all

their motions, and desires. V. For not observing the state of another

man's soul, scarce was ever any man known to be unhappy. Tell whosoever

they be that intend not, and guide not by reason and discretion the

motions of their own souls, they must of necessity be unhappy.

Yo, listen up my homies, it's time to get serious. We gotta take care of our business and
act with true gravity, natural affection, freedom, and justice. So forget all those other cares
and worries, and just do every action like it's your last, stayin' away from vanity and all
that phony stuff. We don't need much for true happiness. All the gods ask is that we keep
it real and show some respect for ourselves. Don't let nothin' from the outside distract you.
Take time to learn something good and make sure you got a goal to get to. Don't worry
'bout anybody else, 'cause if you don't look after your own soul, you gonna end up real
unhappy.

> Finished chain.

Yo, listen up my homies, it's time to get serious. We gotta take care of our business and
act with true gravity, natural affection, freedom, and justice. So forget all those other cares
and worries, and just do every action like it's your last, stayin' away from vanity and all
that phony stuff. We don't need much for true happiness. All the gods ask is that we keep
it real and show some respect for ourselves. Don't let nothin' from the outside distract you.
Take time to learn something good and make sure you got a goal to get to. Don't worry
'bout anybody else, 'cause if you don't look after your own soul, you gonna end up real
unhappy.

ref: https://www.comet.com/site/blog/chaining-the-future-an-in-depth-dive-into-
langchain/
Prompt Engineering
What is Prompt Engineering?
The ability to provide a good starting point for the model and guide it to produce the right
output plays a key role for applications that can integrate into daily work and make life
easier. The output produced by language models varies significantly with the prompt
served.

“Prompt Engineering” is the practice of guiding the language model with a clear,
detailed, well-defined, and optimized prompt in order to achieve a desired output.

There are two basic elements of a prompt. The language model needs a user-supplied
instruction to generate a response. In other words, when a user provides an instruction, the
language model produces a response.

Prompt

ref: Prompt Engineering [Guide](https://github.com/dair-ai/Prompt-Engineering-Guide/blob/main/lecture/Prompt-Engineering-Lecture-Elvis.pdf)

- Instructions: This is the section where the task description is expressed. The task to be
done must be clearly stated.

- Context: A task can be understood differently depending on its context. For this
reason, providing the command without its context can cause the language model to
output something other than what is expected.

- Input data: Indicates which and what kind of data the command will be executed on.
Presenting it clearly to the language model in a structured format increases the quality of
the response.
- Output indicator: This is an indicator of the expected output. Here, what the expected
output is can be defined structurally, so that output in a certain format can be produced.

Types of Prompts
It is a well-known fact that the better the prompt, the better the output! So, what kinds
of prompts are there? Let’s try to understand the different types of prompts! Before you
know it, you’ll be a prompt engineer yourself!

Many advanced prompting techniques have been designed to improve performance on


complex tasks, but first let’s get acquainted with simpler prompt types, starting with the
most basic.

Instruction Prompting
Simple instructions provide some guidance for producing useful outputs. For example, an
instruction can express a clear and simple mathematical operation such as “Add the
numbers 1 to 99.”

Or, you could try your hand at a slightly more complicated command. For example,
maybe you want to analyze customer reviews for a restaurant separately according to
taste, location, service, speed and price. You can easily do this with the command below:
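
An illustrative version of such a command (the exact wording and the `{customer_reviews}` placeholder are assumptions, not from the source) might be:

Classify each of the customer reviews below into one or more of the following categories: taste, location, service, speed, price. Return one line per review in the format "review number: categories".

Reviews:
{customer_reviews}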
Role Prompting
Another approach is to assign a role to the artificial intelligence entity before the
instructions. This technique generates somewhat more successful, or at least specific,
outputs.

Now, let’s observe the difference when first assigning a role within the prompt. Let’s
imagine a user who needs help to relieve tooth sensitivity to cold foods.

First, we try a simple command: “I need help addressing my sensitivity to cold foods.”
Now, let’s ask for advice again but this time we’ll assign the artificial intelligence a
dentist role.
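
For illustration, the two prompts might look something like this (the exact wording is an assumption):

Simple prompt: I need help addressing my sensitivity to cold foods.

Role prompt: You are an experienced dentist. A patient tells you: "I need help addressing my sensitivity to cold foods." Provide professional advice.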
You can clearly see a difference in both the tone and content of the response, given the
role assignment.

“Standard” Prompting
Prompts are considered “standard” when they consist of only one question. For example,
‘Ankara is the capital of which country?’ would qualify as a standard prompt.

Few shot standard prompts

Few shot standard prompts can be thought of as standard prompts in which a few samples
are presented first. This approach is beneficial in that it facilitates learning in context. It is
an approach that allows us to provide examples in the prompts to guide model
performance and improvement.
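
A minimal illustrative few-shot prompt for sentiment classification (the examples are assumptions, not from the source) could be:

Text: The staff were friendly and the food arrived quickly. // Sentiment: Positive
Text: We waited an hour and the pasta was cold. // Sentiment: Negative
Text: Great view, but the prices are far too high for the portion size. // Sentiment:

The model is expected to continue the pattern and label the final example.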

Chain of Thought (CoT) Prompting


Chain of Thought prompting is a way of simulating the reasoning process while answering
a question, similar to the way the human mind might think about it. If this reasoning process is
explained with examples, the AI can generally achieve more accurate results.
Comparison of models on the GSM8K benchmark

Now let’s try to see the difference through an example.

Source: [Chain of Thought Prompting Elicits Reasoning in Large Language Models (2022)](https://ai.googleblog.com/2022/05/language-models-perform-reasoning-via.html)

Above, an example of how the language model should think step-by-step is first presented
to demonstrate how the AI should “think” through the problem or interpret it.

“Zero Shot Chain of Thought (Zero-Shot CoT)”

“Zero Shot Chain of Thought (Zero-Shot CoT)” differs slightly from this
approach to prompt engineering. This time, it is seen that the model's reasoning ability can be
increased by adding a directive command like “Let’s think step by step” without
presenting an example to the language model.
Source: [Zero Shot Chain of Thought](https://github.com/dair-ai/Prompt-Engineering-
Guide/blob/main/lecture/Prompt-Engineering-Lecture-Elvis.pdf )
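
An illustrative zero-shot CoT prompt (wording assumed, in the spirit of the cited paper) simply appends the trigger phrase to the question:

Q: A juggler has 16 balls. Half of the balls are golf balls, and half of the golf balls are blue. How many blue golf balls are there?
A: Let's think step by step.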

In the experiments, it is seen that the “Zero Shot Chain of Thought” approach alone is
not as effective as the Chain of Thought Prompting approach. On the other hand, the choice of
directive command matters greatly, and at this point it has been observed that
the “Let’s think step by step” command produces more successful results than many other
commands.

Recommendations and Tips for Prompt Engineering with OpenAI API
Let’s try to summarize some of OpenAI’s suggested tips and usage recommendations on
how to give clear and effective instructions to GPT-3 and Codex when prompt
engineering.

Use latest models for the best results:

If you are going to use it to generate text, the most current model is “text-davinci-003”
and to generate code, it is “code-davinci-002” (November, 2022). You can [check
here](https://platform.openai.com/docs/models/gpt-3) to follow the current models and for
more detailed information about the models.
Instructions must be at the beginning of the prompt, and the instruction
and content must be separated by separators such as ### or """:
First of all, we must clearly state the instructions to the language model, and then use
various separators to define the instruction and its content. Thus, it is presented to the
language model in a more understandable way.

Source: [Best practices for OpenAI](https://help.openai.com/en/articles/6654000-best-practices-for-prompt-engineering-with-openai-api)
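
An illustrative prompt following this structure (the summarization task is an assumption) could be:

Summarize the text below as a bullet point list of the most important points.

Text: """
{text input here}
"""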

Give instructions that are specific, descriptive and as detailed as possible:

By typing clear commands on topics such as context, text length, format, style, you can
get better outputs. For example, instead of an open-ended command like `Write a poem
about OpenAI.` , you could write a more detailed command like `Write a short inspiring
poem about OpenAI, focusing on the recent DALL-E product launch (DALL-E is a text to
image ML model) in the style of a famous poet`

Provide the output format expressed with examples:

If you have a preferred output format in mind, we recommend providing a format example, as shown below:

Less effective :

Extract the entities mentioned in the text below.

Extract the following 4 entity types: company names, people names, specific topics and
themes.
Text: {text}

Better :

Extract the important entities mentioned in the text below.

First extract all company names, then extract all people names, then extract specific topics
which fit the content and finally extract general overarching themes

Desired format:

Company names: comma_separated_list_of_company_names

People names: -||-

Specific topics: -||-

General themes: -||-

Text: {text}

Try zero-shot first, then continue with few-shot examples and fine-tune
if you still don’t get the output you want:

You can try zero-shot prompt engineering for your command without providing any
examples to the language model. If you don’t get as successful output as you want, you
can try few-shot methods by guiding the model with a few examples. If you still don’t
produce as good an output as you intended, you can try fine-tuning.

There are examples of both zero-shot and few-shot prompts in the previous sections. You
can check out [this best practices](https://docs.google.com/document/d/1h-
GTjNDDKPKU_Rsd0t1lXCAnHltaXTAzQ8K2HRhQf9U/edit ) for fine-tune.

Avoid imprecise explanations:

When presenting a command to the language model, use clear and understandable
language. Avoid unnecessary clarifications and details.
Less effective :

The description for this product should be fairly short, a few sentences only,

and not too much more.

Better :

Use a 3 to 5 sentence paragraph to describe this product.

Tell what to do rather than what not to do:

Avoiding negative sentences and emphasizing intent will lead to better results.

Less effective :

The following is a conversation between an Agent and a Customer. DO NOT ASK


USERNAME OR PASSWORD. DO NOT REPEAT.

Customer: I can't log in to my account.

Agent:

Better :

The following is a conversation between an Agent and a Customer. The agent will attempt
to diagnose the problem and suggest a solution, whilst refraining from asking any
questions related to PII. Instead of asking for PII, such as username or password, refer the
user to the help article www.samplewebsite.com/help/faq

Customer: I can’t log in to my account.

Agent:

Code Generation Specific — Use “leading words” to nudge the model toward
a particular pattern:

It may be necessary to provide some hints to guide the language model when asking it to
generate a piece of code. For example, a starting point can be provided, such as “import”
when it needs to start writing code in Python, or “SELECT” when it needs to write an
SQL query.
ref: https://www.comet.com/site/blog/prompt-engineering/
Embeddings
A problem with semantic search

The basic design of a semantic search system, as pitched by most vector search vendors,
has two _easy_ (this is irony) steps:

1. Compute embeddings for your documents and queries. Somewhere. Somehow. Figure
it out by yourself.

2. Upload them to a vector search engine and enjoy a better semantic search.

A good embedding model is essential for semantic search. Image by author.

Your semantic search is as good as your embedding model, but choosing the model is
often considered out of scope for most early adopters. So everyone just takes a [sentence-
transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-
MiniLM-L6-v2 ) and hopes for the best.
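
As a concrete starting point, here is a minimal sketch of step 1 using that model (the example sentences are made up; it assumes the `sentence-transformers` package is installed):

```
from sentence_transformers import SentenceTransformer, util

# Load the default model most early adopters reach for
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

docs = [
    "The cat sat on the mat.",
    "A feline was resting on the rug.",
    "Quarterly revenue grew by 8 percent.",
]
query = "Where did the cat sit?"

doc_embeddings = model.encode(docs)      # shape: (3, 384)
query_embedding = model.encode(query)    # shape: (384,)

# Cosine similarity between the query and every document
scores = util.cos_sim(query_embedding, doc_embeddings)
print(scores)  # the first two documents should score noticeably higher than the third
```

The resulting vectors are what would then be uploaded to the vector search engine in step 2.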

But this approach has more open questions than answers:

- Is there a difference between embedding models? Are paid models from OpenAI and
Cohere better?

- How do they handle multiple languages? Is there a benefit in large 1B+ models?

- Dense retrieval using embeddings is one of many semantic search methods. Is it better
than new-age sparse approaches like [SPLADEv2](https://arxiv.org/abs/2109.10086) and [ELSER](https://www.elastic.co/guide/en/machine-learning/8.8/ml-nlp-elser.html)?
What are embeddings?
Embeddings are representations of values or objects like text, images, and audio that are
designed to be consumed by [machine
learning](https://www.cloudflare.com/learning/ai/what-is-machine-learning/ ) models and
semantic search algorithms. They translate objects like these into a mathematical form
according to the factors or traits each one may or may not have, and the categories they
belong to.

Essentially, embeddings enable machine learning models to find similar objects. Given a
photo or a document, a machine learning model that uses embeddings could find a similar
photo or document. Since embeddings make it possible for computers to understand the
relationships between words and other objects, they are foundational for [artificial
intelligence (AI)](https://www.cloudflare.com/learning/ai/what-is-artificial-intelligence/ ).

For example, the documents in the upper right of this two-dimensional space may be
relevant to each other:
Technically, embeddings are _vectors_ created by machine learning models for the
purpose of capturing meaningful data about each object.

What is a vector in machine learning?


In mathematics, a vector is an array of numbers that define a point in a dimensional space.
In more practical terms, a vector is a list of numbers — like 1989, 22, 9, 180. Each
number indicates where the object is along a specified dimension.

In machine learning, the use of vectors makes it possible to search for similar objects. A
vector-searching algorithm simply has to find two vectors that are close together in
a [vector database](https://www.cloudflare.com/learning/ai/what-is-vector-database/ ).

To understand this better, think about latitude and longitude. These two dimensions —
north-south and east-west, respectively — can indicate the location of any place on Earth.
The city of Vancouver, British Columbia, Canada can be represented as the latitude and
longitude coordinates 49°15'40"N, 123°06'50"W. This list of two values is a simple
vector.
Now, imagine trying to find a city that is very near Vancouver. A person would just look
at a map, while a machine learning model could instead look at the latitude and longitude
(or vector) and find a place with a similar latitude and longitude. The city of Burnaby is at
49°16'N, 122°58'W — very close to 49°15'40"N, 123°06'50"W. Therefore, the model can
conclude, correctly, that Burnaby is located near Vancouver.

Adding more dimensions to vectors

Now, imagine trying to find a city that is not only close to Vancouver, but of similar size.
To this model of locations, let us add a third "dimension" to latitude and longitude:
population size. Population can be added to each city's vector, and population size can be
treated like a Z-axis, with latitude and longitude as the Y- and X-axes.

The vector for Vancouver is now 49°15'40"N, 123°06'50"W, 662,248*. With this third
dimension added, Burnaby is no longer particularly close to Vancouver, as its population
is only 249,125*. The model might instead find the city of Seattle, Washington, US,
which has a vector of 47°36'35"N 122°19'59"W, 749,256**.

_* As of 2021. ** As of 2022._

This is a fairly simple example of how vectors and similarity search work. But to be of
use, machine learning models may want to generate more than three dimensions, resulting
in much more complex vectors.

Even more multi-dimensional vectors

For instance, how can a model tell which TV shows are similar to each other, and
therefore likely to be watched by the same people? There are any number of factors to
take into account: episode length, number of episodes, genre classification, number of
viewers in common, actors in each show, year each show debuted, and so on. All of these
can be "dimensions," and each show represented as a point along each of these
dimensions.

Multi-dimensional vectors can help us determine if the sitcom _Seinfeld_ is similar to
the horror show _Wednesday_. _Seinfeld_ debuted in 1989, _Wednesday_ in 2022.
The two shows have different episode lengths, with _Seinfeld_ at 22-24 minutes
and _Wednesday_ at 46-57 minutes — and so on. By looking at their vectors, we can see
that these shows likely occupy very different points in a dimensional representation of TV
shows.

| TV show | Genre | Year debuted | Episode length | Seasons (through 2023) | Episodes (through 2023) |
| --- | --- | --- | --- | --- | --- |
| _Seinfeld_ | Sitcom | 1989 | 22-24 min | 9 | 180 |
| _Wednesday_ | Horror | 2022 | 46-57 min | 1 | 8 |
We can express these as vectors, just as we did with latitude and longitude, but with more
values:

_Seinfeld_ vector: [Sitcom], 1989, 22-24, 9, 180

_Wednesday_ vector: [Horror], 2022, 46-57, 1, 8

A machine learning model might identify the sitcom _Cheers_ as being much more
similar to _Seinfeld_. It is of the same genre, debuted in 1982, features an episode length
of 21-25 minutes, has 11 seasons, and has 275 episodes.

_Seinfeld_ vector: [Sitcom], 1989, 22-24, 9, 180

_Cheers_ vector: [Sitcom], 1982, 21-25, 11, 275

In our examples above, a city was a point along the two dimensions of latitude and
longitude; we then added a third dimension of population. We also analyzed the location
of these TV shows along five dimensions.

Instead of two, three, or five dimensions, a TV show within a machine learning model is a
point along perhaps a hundred or a thousand dimensions — however many the model
wants to include.
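
To make this concrete, here is a toy sketch (not from the source) that encodes the three shows as numeric vectors and compares them; the one-hot genre encoding and min-max scaling are assumptions made purely for illustration:

```
import numpy as np

# [is_sitcom, is_horror, year_debuted, avg_episode_minutes, seasons, episodes]
seinfeld  = np.array([1, 0, 1989, 23, 9, 180], dtype=float)
wednesday = np.array([0, 1, 2022, 51, 1, 8], dtype=float)
cheers    = np.array([1, 0, 1982, 23, 11, 275], dtype=float)

shows = np.stack([seinfeld, wednesday, cheers])

# Scale each dimension to [0, 1] so years and episode counts don't swamp the genre flags
mins, maxs = shows.min(axis=0), shows.max(axis=0)
scaled = (shows - mins) / np.where(maxs - mins == 0, 1, maxs - mins)

def distance(a, b):
    return np.linalg.norm(a - b)  # Euclidean distance: smaller means more similar

print("Seinfeld vs Wednesday:", distance(scaled[0], scaled[1]))
print("Seinfeld vs Cheers:   ", distance(scaled[0], scaled[2]))
# Seinfeld comes out much closer to Cheers than to Wednesday, as the text suggests
```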

How do embeddings work?


Embedding is the process of creating vectors using [deep
learning](https://www.cloudflare.com/learning/ai/what-is-deep-learning/ ). An
"embedding" is the output of this process — in other words, the vector that is created by a
deep learning model for the purpose of similarity searches by that model.

Embeddings that are close to each other — just as Seattle and Vancouver have latitude
and longitude values close to each other and comparable populations — can be considered
similar. Using embeddings, an algorithm can suggest a relevant TV show, find similar
locations, or identify which words are likely to be used together or similar to each other,
as in language models.
How are embeddings used in large language
models (LLMs)?
For LLMs, embedding is taken a step further. The context of every word becomes an
embedding, in addition to the word itself. The meanings of entire sentences, paragraphs,
and articles can be searched and analyzed. Although this takes quite a bit of computational
power, the context for queries can be stored as embeddings, saving time and compute
power for future queries.

ref: https://www.cloudflare.com/learning/ai/what-are-embeddings/

https://medium.com/the-ai-forum/rag-on-complex-pdf-using-llamaparse-langchain-and-
groq-5b132bd1f9f3
Vector Stores
What Are Vector Databases?

In its most simplistic definition, a vector database stores information as vectors (vector
embeddings), which are a numerical version of a data object.

As such, vector embeddings are a powerful method of indexing and searching across very
large and unstructured or semi-
unstructured [datasets](https://www.kdnuggets.com/datasets/index.html ). These datasets
can consist of text, images, or sensor data and a vector database orders this information
into a manageable format.

Vector databases work using high-dimensional vectors which can contain hundreds of
different dimensions, each linked to a specific property of a data object. Thus creating an
unrivaled level of complexity.

Not to be confused with a vector index or a vector search library, a vector database is a
complete management solution to store and filter metadata in a way that:

- Is completely scalable

- Can be easily backed up

- Enables dynamic data changes

- Provides a high level of security

The Benefits of Using Open Source Vector Databases
Open source vector databases provide numerous benefits over licensed alternatives, such
as:

- They are a flexible solution that can be easily modified to suit specific needs, unlike
licensed options which are typically designed for a particular project.

- Open source vector databases are supported by a large community of developers who are
ready to assist with any issues or provide advice on how projects could be improved.

- An open-source solution is budget-friendly, with no licensing fees, subscription fees,
or any unexpected costs during the project.

- Due to the transparent nature of open-source vector databases, developers can work
more effectively, understanding every component and how the database was built.

- Open source products are constantly being improved and evolving with changes in
technology, as they are backed by active communities.

Open Source Vector Databases Comparison: Chroma Vs. Milvus Vs. Weaviate
Now that we have an understanding of what a vector database is and the benefits of an
open-source solution, let’s consider some of the most popular options on the market. We
will focus on the strengths, features, and uses of Chroma, Milvus, and Weaviate, before
moving on to a direct head-to-head comparison to determine the best option for your
needs.

1. Chroma
- Focus: ChromaDB is specifically designed for managing and searching large-scale
color data, particularly in the context of computer vision and image processing. It is
optimized for working with color histograms and other color-based representations.

- Features:

  _Color-specific indexing:_ ChromaDB provides indexing methods tailored for color
  data, allowing for efficient storage and retrieval of color information.

  _Querying by color similarity:_ It’s designed to quickly find similar colors based on
  certain criteria, which is useful in applications like image retrieval or analysis.

Use Cases: ChromaDB is commonly used in applications where color plays a crucial role,
such as image and video processing, where similarity searches based on color are
essential.

One of Chroma’s key strengths is its support for audio data, making it a top choice for
audio-based search engines, music recommendation applications, and other sound-based
projects.
2. Milvus
Milvus has gained a strong reputation in the world of ML and [data
science](https://www.kdnuggets.com/tag/data-science ), boasting impressive capabilities
in terms of vector indexing and querying. Utilizing powerful algorithms, Milvus offers
lightning-fast processing and data retrieval speeds [and GPU
support](https://milvus.io/blog/unveiling-milvus-2-3-milestone-release-offering-support-
for-gpu-arm64-cdc-and-other-features.md), even when working with very large datasets.
Milvus can also be integrated with other popular frameworks such as PyTorch and
TensorFlow, allowing it to be added to existing ML workflows.

Use Cases

Milvus is renowned for its capabilities in similarity search and analytics, with extensive
support for multiple programming languages. This flexibility means developers aren't
limited to backend operations and can even perform tasks typically reserved for server-
side languages on the front end. For example, you could [generate PDFs with
JavaScript](http://apryse.com/blog/javascript/how-to-generate-pdfs-with-javascript) while
leveraging real-time data from Milvus. This opens up new avenues for application
development, especially for educational content and apps focusing on accessibility.

This open-source vector database can be used across a wide range of industries and in a
large number of applications. Another prominent example involves eCommerce, where
Milvus can power accurate recommendation systems to suggest products based on a
customer’s preferences and buying habits.

It’s also suitable for image/ video analysis projects, assisting with image similarity
searches, object recognition, and content-based image retrieval. Another key use case
is [natural language processing](https://www.kdnuggets.com/tag/natural-language-
processing ) (NLP), providing document clustering and semantic search capabilities, as
well as providing the backbone to question and answer systems.

3. Weaviate
The third open source vector database in our honest comparison is Weaviate, which is
available in [both a self-hosted and fully-managed
solution](https://weaviate.io/blog/weaviate-1-21-release). Countless businesses are using
Weaviate to handle and manage large datasets due to its excellent level of performance, its
simplicity, and its highly scalable nature.

Capable of managing a range of data types, Weaviate is very flexible and can store both
vectors and data objects which makes it ideal for applications that need a range of search
techniques (E.G. vector searches and keyword searches).
Use Cases

In terms of its use, Weaviate is perfect for projects like Data classification in enterprise
resource planning software or applications that involve:

- Similarity searches

- Semantic searches

- Image searches

- eCommerce product searches

- Recommendation engines

- Cybersecurity threat analysis and detection

- Anomaly detection

- Automated data harmonization

Now we have a brief understanding of what each vector database can offer, let’s consider
the finer details that set each open source solution apart in our handy comparison table.

4. Faiss
- Focus: Faiss (Facebook AI Similarity Search) is a more general-purpose library
designed for similarity search in large-scale vector databases. It is not limited to any
specific type of data and can be applied to a wide range of applications.

- Features:

  _Versatility:_ Faiss supports various indexing methods and similarity metrics,
  making it flexible for different types of vector data.

  _Efficiency:_ It is highly optimized for speed and memory usage, making it suitable
  for handling large datasets efficiently.

  _Integration with deep learning frameworks:_ Faiss is often used in conjunction
  with deep learning models to perform similarity searches on learned embeddings.

Use Cases:

Faiss is widely used in applications where similarity search is critical, such as
recommendation systems, natural language processing, and image retrieval. Its versatility
makes it suitable for handling different types of vector data.
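
A minimal, self-contained Faiss usage sketch (random vectors and dimensions chosen purely for illustration) might look like this:

```
import numpy as np
import faiss  # pip install faiss-cpu

d = 384                                              # embedding dimensionality
xb = np.random.random((1000, d)).astype("float32")   # "document" vectors to index
xq = np.random.random((5, d)).astype("float32")      # query vectors

index = faiss.IndexFlatL2(d)   # exact L2 (brute-force) index, no training needed
index.add(xb)                  # add the database vectors

distances, ids = index.search(xq, 3)   # top-3 nearest neighbours for each query
print(ids.shape)                       # (5, 3)
```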
|  | Chroma | Milvus | Weaviate |
| --- | --- | --- | --- |
| Open Source Status | Yes - Apache-2.0 license | Yes - Apache-2.0 license | Yes - BSD-3-Clause license |
| Publication Date | Feb-23 | Oct-19 | Jan-21 |
| Use Cases | Suitable for a wide range of applications, with support for multiple data types and formats. Specializes in audio-based search projects and image/video retrieval. | Suitable for a wide range of applications, with support for a plethora of data types and formats. Perfect for eCommerce recommendation systems, natural language processing, and image/video-based analysis. | Suitable for a wide range of applications, with support for multiple data types and formats. Ideal for data classification in enterprise resource planning software. |
| Key Features | Impressive ease of use. Development, testing, and production environments all use the same API on a Jupyter Notebook. Powerful search, filter, and density estimation functionality. | Uses both in-memory and persistent storage to provide high-speed query and insert performance. Provides automatic data partitioning, load balancing, and fault tolerance for large-scale vector data handling. Supports a variety of vector similarity search algorithms. | Offers a GraphQL-based API, providing flexibility and efficiency when interacting with the knowledge graph. Supports real-time data updates to ensure the knowledge graph remains up-to-date with the latest changes. Its schema inference feature automates the process of defining data structures. |
| Supported Programming Languages | Python or JavaScript | Python, Java, C++, and Go | Python, JavaScript, and Go |
| Community and Industry Recognition | Strong community with a Discord channel available to answer live queries. | Active community on GitHub, Slack, Reddit, and Twitter. Over 1000 enterprise users. Extensive documentation. | Dedicated forum and active Slack, Twitter, and LinkedIn communities. Plus regular podcasts and newsletters. Extensive documentation. |
| GitHub Stars | 9k | 23.5k | 7.8k |
In summary, the choice between ChromaDB and Faiss depends on the nature of your data
and the specific requirements of your application. If your primary concern is efficient color-
based similarity search, ChromaDB might be more suitable. If you need a general-purpose
library for similarity search on large-scale vector data, Faiss is a versatile and powerful
option.

ref: https://medium.com/@sujathamudadla1213/chromadb-vsfaiss-65cdae3012ab

https://www.kdnuggets.com/an-honest-comparison-of-open-source-vector-databases
Chunking
Document Splitting
Once the data is loaded, the next step in the indexing pipeline is splitting the documents
into manageable chunks. The question arises around the need of this step. Why is splitting
of documents necessary? There are two reasons for that:

- Ease of Search

Large chunks of data are harder to search over. Splitting data into smaller chunks
therefore helps in better indexation.

- Context Window Size

LLMs allow only a finite number of tokens in prompts and completions. The context
therefore cannot be larger than what the context window permits.

## Chunking Strategies

While splitting documents into chunks might sound like a simple concept, there are certain
best practices that researchers have discovered. There are a few considerations that may
influence the overall chunking strategy.

- Nature of Content

Consider whether you are working with lengthy documents, such as articles or books, or
shorter content like tweets or instant messages. The chosen model for your goal and,
consequently, the appropriate chunking strategy depend on your response.

- Embedding Model being Used

We will discuss embeddings in detail in the next section but the choice of embedding
model also dictates the chunking strategy. Some models perform better with chunks of
specific length

- Expected Length and Complexity of User Queries

Determine whether the content will be short and specific or long and complex. This factor
will influence the approach to chunking the content, ensuring a closer correlation between
the embedded query and the embedded chunks

- Application Specific Requirements

The application use case, such as semantic search, question answering, summarization, or
other purposes will also determine how text should be chunked. If the results need to be
input into another language model with a token limit, it is crucial to factor this into your
decision-making process.

Chunking Methods
Depending on the aforementioned considerations, a number of `text splitters` are
available. At a broad level, text splitters operate in the following manner:

- Divide the text into compact, `semantically meaningful units`, often sentences.

- Merge these smaller units into larger chunks until a specific size is achieved, measured
by a `length function`.

- Upon reaching the predetermined size, treat that chunk as an independent segment of
text. Thereafter, start creating a new text chunk with `some degree of overlap` to maintain
contextual continuity between chunks.

Two areas to focus on, therefore are:

- How the text is split?

- How the chunk size is measured?

Levels Of Text Splitting

- [Character Splitting](https://github.com/FullStackRetrieval-
com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/#CharacterSplitting ) - Simple static character chunks of data

- [Recursive Character Text Splitting](https://github.com/FullStackRetrieval-


com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/#RecursiveCharacterSplitting ) - Recursive chunking based on a list
of separators

- [Document Specific Splitting](https://github.com/FullStackRetrieval-


com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/#DocumentSpecific ) - Various chunking methods for different
document types (PDF, Python, Markdown)

- [Semantic Splitting](https://github.com/FullStackRetrieval-
com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/#SemanticChunking ) - Embedding walk based chunking

- [Agentic Splitting](https://github.com/FullStackRetrieval-
com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/#AgenticChunking ) - Experimental method of splitting text with an
agent-like system. Good for if you believe that token cost will trend to $0.00

- [Alternative Representation Chunking + Indexing]


(https://github.com/FullStackRetrieval-
com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/#BonusLevel ) - Derivative representations of your raw text that will
aid in retrieval and indexing

A very common approach is where we `pre-determine` the size of the text chunks.

Additionally, we can specify the `overlap between chunks` (Remember, overlap is


preferred to maintain contextual continuity between chunks).

This approach is simple and cheap and is, therefore, widely used. Let’s look at

some examples:

Split by Character

In this approach, the text is split based on a character and the chunk size is measured by
the number of characters.

Example text: alice_in_wonderland.txt (the book in .txt format) using LangChain’s `CharacterTextSplitter`

Character Splitting
Character splitting is the most basic form of splitting up your text. It is the process of
simply dividing your text into N-character sized chunks regardless of their content or
form.

This method isn't recommended for any applications - but it's a great starting point for us
to understand the basics.

- Pros: Easy & Simple

- Cons: Very rigid and doesn't take into account the structure of your text

Concepts to know:

- Chunk Size - The number of characters you would like in your chunks. 50, 100,
100,000, etc.
- Chunk Overlap - The amount you would like your sequential chunks to overlap. This
is to try to avoid cutting a single piece of context into multiple pieces. This will create
duplicate data across chunks.

First let's get some sample text

In [1]:

text = "This is the text I would like to chunk up. It is the example text for this
exercise"

Then let's split this text manually

In [2]:

# Create a list that will hold your chunks
chunks = []

chunk_size = 35 # Characters

# Run through a range over the length of your text, stepping by chunk_size
for i in range(0, len(text), chunk_size):
    chunk = text[i:i + chunk_size]
    chunks.append(chunk)
chunks

Out[2]:

['This is the text I would like to ch',

'unk up. It is the example text for ',

'this exercise']

Congratulations! You just split your first text. We have long way to go but you're already
making progress. Feel like a language model practitioner yet?

When working with text in the language model world, we don't deal with raw strings. It is
more common to work with documents. Documents are objects that hold the text you're
concerned with, but also additional metadata which makes filtering and manipulation
easier later.

We could convert our list of strings into documents, but I'd rather start from scratch and
create the docs.
Let's load up LangChain's `CharacterTextSplitter` to do this for us

In [3]:

from langchain.text_splitter import CharacterTextSplitter

Then let's load up this text splitter. I need to specify `chunk_overlap` and `separator` or
else we'll get funky results. We'll get into those next.

In [4]:

text_splitter = CharacterTextSplitter(chunk_size = 35, chunk_overlap=0,
                                      separator='', strip_whitespace=False)

Then we can actually split our text via `create_documents`.


Note: `create_documents` expects a list of texts, so if you just have a string (like we do)
you'll need to wrap it in `[]`

In [5]:
text_splitter.create_documents([text])

Out[5]:

[Document(page_content='This is the text I would like to ch'),

Document(page_content='unk up. It is the example text for '),

Document(page_content='this exercise')]

Notice how this time we have the same chunks, but they are in documents. These will play
nicely with the rest of the LangChain world. Also notice how the trailing whitespace on
the end of the 2nd chunk is missing. This is because LangChain removes it, see [this
line](https://github.com/langchain-
ai/langchain/blob/f36ef0739dbb548cabdb4453e6819fc3d826414f/libs/langchain/langchai
n/text_splitter.py#L167) for where they do it. You can avoid this
with `strip_whitespace=False`

Chunk Overlap & Separators

Chunk overlap will blend together our chunks so that the tail of Chunk #1 will be the
same thing as the head of Chunk #2, and so on and so forth.

This time I'll load up my overlap with a value of 4; this means 4 characters of overlap.

In [6]:

```
text_splitter = CharacterTextSplitter(chunk_size = 35, chunk_overlap=4,
separator='')

```

In [7]:

```
text_splitter.create_documents([text])

```

Out[7]:

```

[Document(page_content='This is the text I would like to ch'),

Document(page_content='o chunk up. It is the example text'),

Document(page_content='ext for this exercise')]

```

Notice how we have the same chunks, but now there is overlap between 1 & 2 and 2 & 3.
The 'o ch' on the tail of Chunk #1 matches the 'o ch' of the head of Chunk #2.

Check [ChunkViz.com](https://github.com/FullStackRetrieval-
com/RetrievalTutorials/blob/8a30b5710b3dd99ef2239fb60c7b54bc38d3613d/tutorials/Le
velsOfTextSplitting/www.chunkviz.com ) to help show it. Here's what the same text
looks like.
Check out how we have three colors, with two overlapping sections.

Separators are character(s) sequences you would like to split on. Say you wanted to
chunk your data at `ch`, you can specify it.

In [8]:

```
text_splitter = CharacterTextSplitter(chunk_size = 35, chunk_overlap=0,
separator='ch')

```

In [9]:

```
text_splitter.create_documents([text])

```

Out[9]:

```

[Document(page_content='This is the text I would like to'),

Document(page_content='unk up. It is the example text for this exercise')]


```

Recursive Character Text Splitting


Let's jump a level of complexity.

The problem with Level #1 is that we don't take into account the structure of our
document at all. We simply split by a fixed number of characters.

The Recursive Character Text Splitter helps with this. With it, we'll specify a series of
separators which will be used to split our docs.

You can see the default separators for LangChain [here](https://github.com/langchain-


ai/langchain/blob/9ef2feb6747f5a69d186bd623b569ad722829a5e/libs/langchain/langchai
n/text_splitter.py#L842 ). Let's take a look at them one by one.

- "\n\n" - Double new line, or most commonly paragraph breaks

- "\n" - New lines

- " " - Spaces

- "" - Characters

I'm not sure why a period (".") isn't included on the list, perhaps it is not universal
enough? If you know, let me know.

This is the swiss army knife of splitters and my first choice when mocking up a quick
application. If you don't know which splitter to start with, this is a good first bet.

Let's try it out

In [16]:

```
from langchain.text_splitter import RecursiveCharacterTextSplitter

```

Then let's load up a larger piece of text


In [17]:
```
text = """
One of the most important things I didn't understand about the world when I was a
child is the degree to which the returns for performance are superlinear.

Teachers and coaches implicitly told us the returns were linear. "You get out," I
heard a thousand times, "what you put in." They meant well, but this is rarely true.
If your product is only half as good as your competitor's, you don't get half as
many customers. You get no customers, and you go out of business.

It's obviously true that the returns for performance are superlinear in business.
Some think this is a flaw of capitalism, and that if we changed the rules it would
stop being true. But superlinear returns for performance are a feature of the world,
not an artifact of rules we've invented. We see the same pattern in fame, power,
military victories, knowledge, and even benefit to humanity. In all of these, the
rich get richer. [1]
"""

```

Now let's make our text splitter

In [18]:

```
text_splitter = RecursiveCharacterTextSplitter(chunk_size = 65, chunk_overlap=0)

```

In [19]:

```
text_splitter.create_documents([text])

```

Out[19]:

```

[Document(page_content="One of the most important things I didn't understand about the"),
 Document(page_content='world when I was a child is the degree to which the returns for'),
 Document(page_content='performance are superlinear.'),
 Document(page_content='Teachers and coaches implicitly told us the returns were linear.'),
 Document(page_content='"You get out," I heard a thousand times, "what you put in." They'),
 Document(page_content='meant well, but this is rarely true. If your product is only'),
 Document(page_content="half as good as your competitor's, you don't get half as many"),
 Document(page_content='customers. You get no customers, and you go out of business.'),
 Document(page_content="It's obviously true that the returns for performance are"),
 Document(page_content='superlinear in business. Some think this is a flaw of'),
 Document(page_content='capitalism, and that if we changed the rules it would stop being'),
 Document(page_content='true. But superlinear returns for performance are a feature of'),
 Document(page_content="the world, not an artifact of rules we've invented. We see the"),
 Document(page_content='same pattern in fame, power, military victories, knowledge, and'),
 Document(page_content='even benefit to humanity. In all of these, the rich get richer.'),
 Document(page_content='[1]')]

```

Notice how now there are more chunks that end with a period ".". This is because those
likely are the end of a paragraph and the splitter first looks for double new lines
(paragraph break).

Once paragraphs are split, then it looks at the chunk size, if a chunk is too big, then it'll
split by the next separator. If the chunk is still too big, then it'll move onto the next one
and so forth.
For text of this size, let's split on something bigger.

In [20]:

```
text_splitter = RecursiveCharacterTextSplitter(chunk_size = 450, chunk_overlap=0)
text_splitter.create_documents([text])

```

Out[20]:

```

[Document(page_content="One of the most important things I didn't understand about the world when I was a child is the degree to which the returns for performance are superlinear."),
 Document(page_content='Teachers and coaches implicitly told us the returns were linear. "You get out," I heard a thousand times, "what you put in." They meant well, but this is rarely true. If your product is only half as good as your competitor\'s, you don\'t get half as many customers. You get no customers, and you go out of business.'),
 Document(page_content="It's obviously true that the returns for performance are superlinear in business. Some think this is a flaw of capitalism, and that if we changed the rules it would stop being true. But superlinear returns for performance are a feature of the world, not an artifact of rules we've invented. We see the same pattern in fame, power, military victories, knowledge, and even benefit to humanity. In all of these, the rich get richer. [1]")]

```

For this text, 450 splits the paragraphs perfectly. You can even switch the chunk size to
469 and get the same splits. This is because this splitter builds in a bit of cushion and
wiggle room to allow your chunks to 'snap' to the nearest separator.

Let's view this visually


Split by Tokens
For those well versed with Large Language Models, tokens is not a new concept.

All LLMs have a token limit in their respective context windows which we cannot exceed.
It is therefore a good idea to count the tokens while creating chunks. All LLMs also have
their tokenizers.

Tiktoken Tokenizer
The Tiktoken tokenizer was created by OpenAI for its family of models. Using this
strategy, the split still happens based on characters; however, the length of each chunk is
determined by the number of tokens.

example: LangChain’s `TokenTextSplitter` (a brief sketch follows below)

Tokenizers are helpful in creating chunks that sit well within the context window of an
LLM.
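
A brief sketch of what this could look like (the chunk sizes are arbitrary, `text` is assumed to hold the document loaded earlier, and `tiktoken` must be installed):

```
from langchain.text_splitter import TokenTextSplitter

# chunk_size and chunk_overlap are now measured in tokens, not characters
text_splitter = TokenTextSplitter(chunk_size=100, chunk_overlap=10)

docs = text_splitter.create_documents([text])  # `text` holds the document loaded earlier
print(len(docs), docs[0].page_content[:80])
```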

Hugging Face Tokenizer


Hugging Face has become the go-to platform for anyone building apps using LLMs or
even other models. All models available via Hugging Face are also accompanied by their
tokenizers.

example: `GPT2TokenizerFast`

https://huggingface.co/docs/transformers/tokenizer_summary
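
A possible sketch of combining a Hugging Face tokenizer with a LangChain splitter (the values are illustrative, and `text` is again assumed to hold the loaded document):

```
from transformers import GPT2TokenizerFast
from langchain.text_splitter import CharacterTextSplitter

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")

# Chunk length is measured by the number of GPT-2 tokens instead of characters
text_splitter = CharacterTextSplitter.from_huggingface_tokenizer(
    tokenizer, chunk_size=100, chunk_overlap=10
)
docs = text_splitter.create_documents([text])
```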

Other Tokenizer
Other libraries like Spacy, NLTK and SentenceTransformers also provide splitters.

Things to Keep in Mind


- Ensure data quality by preprocessing it before determining the optimal chunk size.
Examples include removing HTML tags or eliminating specific elements that contribute
noise, particularly when data is sourced from the web.

- Consider factors such as content nature (e.g., short messages or lengthy documents),
embedding model characteristics, and capabilities like token limits in choosing chunk
sizes. Aim for a balance between preserving context and maintaining accuracy.

- Test different chunk sizes. Create embeddings for the chosen chunk sizes and store them
in your index or indices. Run a series of queries to evaluate quality and compare the
performance of different chunk sizes.

ref: https://github.com/FullStackRetrieval-
com/RetrievalTutorials/blob/main/tutorials/LevelsOfTextSplitting/5_Levels_Of_Text_Spl
itting.ipynb

https://www.linkedin.com/in/abhinav-kimothi/
Quantization
What is Quantization?

Quantization is a compression technique that involves mapping high precision values to
lower precision ones. For an LLM, that means modifying the precision of its weights and
activations, making it less memory intensive. This does have an impact on the
capabilities of the model, including its accuracy. Whether to use a quantized model is
often a trade-off based on the use case; in some cases it is possible to
achieve comparable results with significantly lower precision. Quantization improves
performance by reducing memory bandwidth requirements and increasing cache utilization.

Instead of using high-precision data types, such as 32-bit floating-point numbers,


quantization represents values using lower-precision data types, such as 8-bit integers.
This process significantly reduces memory usage and can speed up model execution while
maintaining acceptable accuracy.

With an LLM model, quantization process at different precision levels enables a model to
be run on wider range of devices.

How does quantization work?


LLMs are generally trained with full (float32) or half (float16) precision floating point
numbers. One float16 value occupies 16 bits (2 bytes), so a one-billion-parameter model
stored in FP16 requires about two gigabytes of memory.

The process of quantization therefore comes down to finding a way to map the range
([min, max] of the datatype) of FP32 weight values onto a lower precision type like FP16
or even INT4 (4-bit integer). The typical case is going from FP32 to INT8.

The overall impact on the quality of the LLM depends on the technique used.
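
To build intuition, here is a minimal sketch of naive symmetric FP32-to-INT8 quantization of a weight matrix (this is an illustration only, not how production libraries such as bitsandbytes implement it):

```
import numpy as np

def quantize_int8(weights):
    # Map the observed FP32 range symmetrically onto the INT8 range [-127, 127]
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover an approximation of the original FP32 weights
    return q.astype(np.float32) * scale

w = np.random.randn(4, 4).astype(np.float32)   # toy "weight matrix"
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print(np.abs(w - w_hat).max())  # small reconstruction (quantization) error
```

Each INT8 weight now takes 1 byte instead of 4, at the cost of the small rounding error printed above.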

Hugging Face and Bitsandbytes Uses


Hugging Face’s Transformers library is a go-to choice for working with pre-trained
language models. To make the process of model quantization more accessible, Hugging
Face has seamlessly integrated with the Bitsandbytes library. This integration simplifies
the quantization process and empowers users to achieve efficient models with just a few
lines of code.
Install latest accelerate from source:

pip install git+https://github.com/huggingface/accelerate.git

Install latest transformers from source and bitsandbytes:

pip install git+https://github.com/huggingface/transformers.git

pip install bitsandbytes

Hugging Face and Bitsandbytes Integration Uses

Loading a Model in 4-bit Quantization


One of the key features of this integration is the ability to load models in 4-bit
quantization. This can be done by setting the `load_in_4bit=True` argument when calling
the `.from_pretrained` method. By doing so, you can reduce memory usage by
approximately fourfold.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bigscience/bloom-1b7"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto",
load_in_4bit=True)
Loading a Model in 8-bit Quantization
For further memory optimization, you can load a model in 8-bit quantization. This can be
achieved by using the `load_in_8bit=True` argument when calling `.from_pretrained`.
This reduces the memory footprint by approximately half.

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bigscience/bloom-1b7"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto",
load_in_8bit=True)

You can even check the memory footprint of your model using
the `get_memory_footprint` method:

print(model.get_memory_footprint())

Other Use cases:

The Hugging Face and Bitsandbytes integration goes beyond basic quantization
techniques. Here are some use cases you can explore:

Changing the Compute Data Type


You can modify the data type used during computation by setting
the `bnb_4bit_compute_dtype` to a different value, such as `torch.bfloat16`. This can
result in speed improvements in specific scenarios. Here's an example:

import torch
from transformers import BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(load_in_4bit=True,
                                         bnb_4bit_compute_dtype=torch.bfloat16)
Using NF4 Data Type
The NF4 data type is designed for weights initialized using a normal distribution. You can
use it by specifying `bnb_4bit_quant_type="nf4"`:

from transformers import BitsAndBytesConfig

nf4_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4")

model_nf4 = AutoModelForCausalLM.from_pretrained(model_id,
quantization_config=nf4_config)

Nested Quantization for Memory Efficiency


The integration also recommends using the nested quantization technique for even greater
memory efficiency without sacrificing performance. This technique has proven beneficial,
especially when fine-tuning large models:

from transformers import BitsAndBytesConfig

double_quant_config = BitsAndBytesConfig(load_in_4bit=True,
bnb_4bit_use_double_quant=True)

model_double_quant = AutoModelForCausalLM.from_pretrained(model_id,
quantization_config=double_quant_config)

Loading a Quantized Model from the Hub


A quantized model can be loaded with ease using the `from_pretrained` method. Make
sure the saved weights are quantized by checking the `quantization_config` attribute in
the model configuration:

model = AutoModelForCausalLM.from_pretrained("model_name", device_map="auto")

In this case, you don’t need to specify the `load_in_8bit=True` argument, but you must
have both Bitsandbytes and Accelerate library installed.
Exploring Advanced techniques and
configuration

There are additional techniques and configurations to consider:

Offloading Between CPU and GPU

One advanced use case involves loading a model and distributing weights between the
CPU and GPU. This can be achieved by
setting `llm_int8_enable_fp32_cpu_offload=True`. This feature is beneficial for users
who need to fit large models and distribute them between the GPU and CPU.

Adjusting Outlier Threshold

Experiment with the `llm_int8_threshold` argument to change the threshold for outliers.
This parameter impacts inference speed and can be fine-tuned to suit your specific use
case.

Skipping the Conversion of Some Modules

In certain situations, you may want to skip the conversion of specific modules to 8-bit.
You can do this using the `llm_int8_skip_modules` argument.

Fine-Tuning a Model Loaded in 8-bit


With the support of adapters in the Hugging Face ecosystem, you can fine-tune models loaded
in 8-bit quantization, enabling the fine-tuning of large models with ease.

ref: https://medium.com/@rakeshrajpurohit/model-quantization-with-hugging-face-
transformers-and-bitsandbytes-integration-b4c9983e8996

https://medium.com/@techresearchspace/what-is-quantization-in-llm-
01ba61968a51#:~:text=Quantization%20is%20a%20compression%20technique,the%20m
odel%20including%20the%20accuracy.
Temperature
Top P and Temperature
Large Language Models(LLMs) are essential tools in natural language processing (NLP)
and have been used in a variety of applications, such as text completion, translation, and
question answering.

The output of large language models can be affected by various hyperparameters


including temperature, top p, token length, max tokens and stop tokens.

Temperature
Temperature is a hyperparameter that controls the randomness of language model output.

A high temperature produces more unpredictable and creative results, while a low
temperature produces more deterministic and conservative output. In other words, a lower
temperature setting causes the model to be more “confident” in its output, while a higher
temperature setting yields more varied and creative output.

For example, if you adjust the temperature to 0.5, the model will generate text that is
more predictable and less creative than if you set the temperature to 1.0.

temperature: Controls the randomness of responses. A lower temperature leads to more


predictable outputs, while a higher temperature results in more varied and sometimes
more creative outputs

Top p
Top p: also known as nucleus sampling, is another hyperparameter that controls the
randomness of language model output.

It sets a threshold probability and selects the top tokens whose cumulative probability
exceeds the threshold. The model then randomly samples from this set of tokens to
generate output. This method can produce more diverse and interesting output than
traditional methods that randomly sample the entire vocabulary.

For example, if you set top p to 0.9, the model will only consider the most likely words
that make up 90% of the probability mass.
top_p: can be considered a method of text generation that selects the next token from the
probability distribution of the top p most likely tokens. This balances exploration and
exploitation during generation.

Token length
This is the number of words or characters in a sequence or text that is fed to the LLM.

It varies depending on the language and the tokenization method used for the particular
LLM.

The length of the input text affects the output of the LLM.

A very short input may not have enough context to generate a meaningful completion.

Conversely, a rather long input may make the model inefficiently process or it may cause
the model to generate an irrelevant output.

Max tokens
This is the maximum number of tokens that the LLM generates.

Within this, is the token limit; the maximum number of tokens that can be used in the
prompt and the completion of the model. Determined by the architecture of the model
LLM, it refers to the maximum tokens that can be processed at once.

The computational cost and the memory requirements are directly proportional to the max
tokens. Set a longer max token, and you will have greater context and coherent output
text. Set a shorter max token, and you will use less memory and have a faster response but
your output is prone to errors and inconsistencies.

During the training and fine-tuning of the LLM, the max token is set.

Contrary to fine-tuning token length during the generation of output, the coherence and
length of the output is carefully set at inception, based on the specific task &
requirements, without affecting other parameters that will likely need adjusting.

max_tokens: The maximum number of tokens that the model can process in a single
response. This limit ensures computational efficiency and resource management
Stop tokens
In simple terms, it is the length of the output or response of an LLM.

So it signifies the end of a sequence in terms of either a paragraph or a sentence.

Similar to max tokens, the inference budget is reduced when the stop tokens are set low.

For example, when the stop tokens are set at 2, the generated text or output will be
limited to a paragraph. If the stop tokens are set at 1, the generated text will be limited to a
sentence.
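
To show where these parameters typically appear in practice, here is a sketch using the legacy (pre-1.0) OpenAI Python client; the values are illustrative rather than recommendations, and `stop` here is the API's stop-sequence parameter, which halts generation when the given string is produced:

```
import openai

openai.api_key = "YOUR_API_KEY"

response = openai.Completion.create(
    model="text-davinci-003",
    prompt="Write a one-paragraph summary of what vector databases are.",
    temperature=0.7,    # higher -> more varied and creative output
    top_p=0.9,          # nucleus sampling over the top 90% probability mass
    max_tokens=150,     # upper bound on the number of generated tokens
    stop=["\n\n"],      # generation stops when this sequence is produced
)
print(response["choices"][0]["text"])
```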

ref: https://medium.com/@dixnjakindah/top-p-temperature-and-other-parameters-
1a53d2f8d7d7
Langchain Memory
What is Conversational memory?
Conversational memory is how a chatbot can respond to multiple queries in a chat-like
manner. It enables a coherent conversation, and without it, every query would be treated
as an entirely independent input without considering past interactions.

The LLM with and without conversational memory. The blue boxes are user prompts and
in grey are the LLMs responses. Without conversational memory (right), the LLM cannot
respond using knowledge of previous interactions.

The memory allows LLM to remember previous interactions with the user. By default,
LLMs are _stateless_ — meaning each incoming query is processed independently of
other interactions. The only thing that exists for a stateless agent is the current input,
nothing else.

There are many applications where remembering previous interactions is very important,
such as chatbots. Conversational memory allows us to do that.

There are several ways that we can implement conversational memory. In the context of
[LangChain](/learn/langchain-intro/), they are all built on top of the `ConversationChain`.
ConversationChain
We can start by initializing the ConversationChain. We will use OpenAI’s text-davinci-
003 as the LLM, but other models like gpt-3.5-turbo can be used.

```
from langchain import OpenAI
from langchain.chains import ConversationChain

# first initialize the large language model
llm = OpenAI(
    temperature=0,
    openai_api_key="OPENAI_API_KEY",
    model_name="text-davinci-003"
)

# now initialize the conversation chain
conversation = ConversationChain(llm=llm)

```

We can see the prompt template used by the ConversationChain like so:

In[8]:

```
print(conversation.prompt.template)

```

Out[8]:

```

The following is a friendly conversation between a human and an AI. The AI is talkative
and provides lots of specific details from its context. If the AI does not know the answer
to a question, it truthfully says it does not know.

Current conversation:
{history}
Human: {input}
AI:
```

Here, the prompt primes the model by telling it that the following is a conversation
between a human (us) and an AI (text-davinci-003). The prompt attempts to
reduce _hallucinations_ (where a model makes things up) by stating:

"If the AI does not know the answer to a question, it truthfully says it does not know."

This can help but does not solve the problem of hallucinations — but we will save this for
the topic of a future chapter.

Following the initial prompt, we see two parameters: history and input. The input is
where we’d place the latest human query; it is the input entered into a chatbot text box.

The history is where conversational memory is used. Here, we feed in information about
the conversation history between the human and AI.

These two parameters — history and input — are passed to the LLM within the prompt
template we just saw, and the output that we (hopefully) return is simply the predicted
continuation of the conversation.
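
To make this concrete, here is a small illustrative sketch (not taken from LangChain's
source) showing how a template with history and input placeholders is filled in using
PromptTemplate; the example strings are assumptions for demonstration:

```
from langchain.prompts import PromptTemplate

template = """The following is a friendly conversation between a human and an AI.

Current conversation:
{history}
Human: {input}
AI:"""

prompt = PromptTemplate(input_variables=["history", "input"], template=template)

# the memory supplies `history`, the chatbot text box supplies `input`
print(prompt.format(
    history="Human: Good morning AI!\nAI: Good morning! How can I help you?",
    input="What did I just say to you?"
))
```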
Forms of Conversational Memory
We can use several types of conversational memory with the ConversationChain. They
modify the text passed to the history parameter.

ConversationBufferMemory
_(Follow along with our_ _[Jupyter notebooks](https://github.com/pinecone-
io/examples/blob/master/learn/generation/langchain/handbook/03-langchain-
conversational-memory.ipynb ))_

The ConversationBufferMemory is the most straightforward conversational memory in
LangChain. As we described above, the past conversation between the human and AI is
passed — in its raw form — to the history parameter.

In[11]:

```
from langchain.chains.conversation.memory import ConversationBufferMemory

conversation_buf = ConversationChain(
llm=llm,
memory=ConversationBufferMemory()
)

```

In[32]:

```
conversation_buf("Good morning AI!")

```

Out[32]:

```

{'input': 'Good morning AI!',
 'history': '',
 'response': " Good morning! It's a beautiful day today, isn't it? How can I help you?"}
```

We return the first response from the conversational agent. Let’s continue the
conversation, writing prompts that the LLM can only answer _if_ it considers the
conversation history. We also add a count_tokens function so we can see how many
tokens are being used by each interaction.

In[6]:

```
from langchain.callbacks import get_openai_callback

def count_tokens(chain, query):
    with get_openai_callback() as cb:
        result = chain.run(query)
        print(f'Spent a total of {cb.total_tokens} tokens')
    return result

```

In[33]:

```
count_tokens(
conversation_buf,
"My interest here is to explore the potential of integrating Large Language
Models with external knowledge"
)

```

Out[33]:

```

Spent a total of 179 tokens

```

Out[33]:

```

' Interesting! Large Language Models are a type of artificial intelligence that can process
natural language and generate text. They can be used to generate text from a given
context, or to answer questions about a given context. Integrating them with external
knowledge can help them to better understand the context and generate more accurate
results. Is there anything else I can help you with?'

```

In[34]:

```
count_tokens(
conversation_buf,
"I just want to analyze the different possibilities. What can you think of?"
)

```

Out[34]:

```

Spent a total of 268 tokens

```

Out[34]:

```

' Well, integrating Large Language Models with external knowledge can open up a lot of
possibilities. For example, you could use them to generate more accurate and detailed
summaries of text, or to answer questions about a given context more accurately. You
could also use them to generate more accurate translations, or to generate more accurate
predictions about future events.'

```

In[35]:

```
count_tokens(
conversation_buf,
"Which data source types could be used to give context to the model?"
)

```
Out[35]:

```

Spent a total of 360 tokens

```

Out[35]:

```

' There are a variety of data sources that could be used to give context to a Large
Language Model. These include structured data sources such as databases, unstructured
data sources such as text documents, and even audio and video data sources. Additionally,
you could use external knowledge sources such as Wikipedia or other online
encyclopedias to provide additional context.'

```

In[36]:

```
count_tokens(
conversation_buf,
"What is my aim again?"
)

```

Out[36]:

```

Spent a total of 388 tokens

```

Out[36]:

```

' Your aim is to explore the potential of integrating Large Language Models with external
knowledge.'

```
The LLM can clearly remember the history of the conversation. Let’s take a look
at _how_ this conversation history is stored by the ConversationBufferMemory:

In[37]:

```
print(conversation_buf.memory.buffer)

```

Out[37]:

```

Human: Good morning AI!

AI: Good morning! It's a beautiful day today, isn't it? How can I help you?

Human: My interest here is to explore the potential of integrating Large Language Models
with external knowledge

AI: Interesting! Large Language Models are a type of artificial intelligence that can
process natural language and generate text. They can be used to generate text from a given
context, or to answer questions about a given context. Integrating them with external
knowledge can help them to better understand the context and generate more accurate
results. Is there anything else I can help you with?

Human: I just want to analyze the different possibilities. What can you think of?

AI: Well, integrating Large Language Models with external knowledge can open up a lot
of possibilities. For example, you could use them to generate more accurate and detailed
summaries of text, or to answer questions about a given context more accurately. You
could also use them to generate more accurate translations, or to generate more accurate
predictions about future events.

Human: Which data source types could be used to give context to the model?

AI: There are a variety of data sources that could be used to give context to a Large
Language Model. These include structured data sources such as databases, unstructured
data sources such as text documents, and even audio and video data sources. Additionally,
you could use external knowledge sources such as Wikipedia or other online
encyclopedias to provide additional context.

Human: What is my aim again?


AI: Your aim is to explore the potential of integrating Large Language Models with
external knowledge.

```

We can see that the buffer saves every interaction in the chat history directly. There are a
few pros and cons to this approach. In short, they are:

| Pros | Cons |
| --- | --- |
| Storing everything gives the LLM the maximum amount of information | More tokens mean slower response times and higher costs |
| Storing everything is simple and intuitive | Long conversations cannot be remembered as we hit the LLM token limit (4096 tokens for text-davinci-003 and gpt-3.5-turbo) |

The `ConversationBufferMemory` is an excellent option to get started with but is limited
by the storage of every interaction. Let’s take a look at other options that help remedy this.

ConversationSummaryMemory
Using `ConversationBufferMemory`, we very quickly use _a lot_ of tokens and even
exceed the context window limit of even the most advanced LLMs available today.

To avoid excessive token usage, we can use `ConversationSummaryMemory`. As the
name would suggest, this form of memory _summarizes_ the conversation history before
it is passed to the history parameter.

We initialize the `ConversationChain` with the summary memory like so:

```
from langchain.chains.conversation.memory import ConversationSummaryMemory

conversation_sum = ConversationChain(
    llm=llm,
    memory=ConversationSummaryMemory(llm=llm)
)

```
When using ConversationSummaryMemory, we need to pass an LLM to the object
because the summarization is powered by an LLM. We can see the prompt used to do this
here:

In[19]:

```
print(conversation_sum.memory.prompt.template)

```

Out[19]:

```

Progressively summarize the lines of conversation provided, adding onto the previous
summary returning a new summary.

EXAMPLE

Current summary:

The human asks what the AI thinks of artificial intelligence. The AI thinks artificial
intelligence is a force for good.

New lines of conversation:

Human: Why do you think artificial intelligence is a force for good?

AI: Because artificial intelligence will help humans reach their full potential.

New summary:

The human asks what the AI thinks of artificial intelligence. The AI thinks artificial
intelligence is a force for good because it will help humans reach their full potential.

END OF EXAMPLE
Current summary:
{summary}

New lines of conversation:
{new_lines}

New summary:

```

Using this, we can summarize every new interaction and append it to a “running
summary” of all past interactions. Let’s have another conversation utilizing this approach.

In[40]:

```
# without count_tokens we'd call `conversation_sum("Good morning AI!")`
# but let's keep track of our tokens:
count_tokens(
conversation_sum,
"Good morning AI!"
)

```

Out[40]:

```

Spent a total of 290 tokens

```

Out[40]:
```

" Good morning! It's a beautiful day today, isn't it? How can I help you?"

```

In[41]:

```
count_tokens(
conversation_sum,
"My interest here is to explore the potential of integrating Large Language
Models with external knowledge"
)

```

Out[41]:

```

Spent a total of 440 tokens

```

Out[41]:

```

" That sounds like an interesting project! I'm familiar with Large Language Models, but
I'm not sure how they could be integrated with external knowledge. Could you tell me
more about what you have in mind?"

```

In[42]:

```
count_tokens(
conversation_sum,
"I just want to analyze the different possibilities. What can you think of?"
)

```
Out[42]:

```

Spent a total of 664 tokens

```

Out[42]:

```

' I can think of a few possibilities. One option is to use a large language model to generate
a set of candidate answers to a given query, and then use external knowledge to filter out
the most relevant answers. Another option is to use the large language model to generate a
set of candidate answers, and then use external knowledge to score and rank the answers.
Finally, you could use the large language model to generate a set of candidate answers,
and then use external knowledge to refine the answers.'

```

In[43]:

```
count_tokens(
conversation_sum,
"Which data source types could be used to give context to the model?"
)

```

Out[43]:

```

Spent a total of 799 tokens

```

Out[43]:

```

' There are many different types of data sources that could be used to give context to the
model. These could include structured data sources such as databases, unstructured data
sources such as text documents, or even external APIs that provide access to external
knowledge. Additionally, the model could be trained on a combination of these data
sources to provide a more comprehensive understanding of the context.'

```

In[44]:

```
count_tokens(
conversation_sum,
"What is my aim again?"
)

```

Out[44]:

```

Spent a total of 853 tokens

```

Out[44]:

```

' Your aim is to explore the potential of integrating Large Language Models with external
knowledge.'

```

In this case the summary contains enough information for the LLM to “remember” our
original aim. We can see this summary in its raw form like so:

In[45]:

```
print(conversation_sum.memory.buffer)

```

Out[45]:

```
The human greeted the AI with a good morning, to which the AI responded with a good
morning and asked how it could help. The human expressed interest in exploring the
potential of integrating Large Language Models with external knowledge, to which the AI
responded positively and asked for more information. The human asked the AI to think of
different possibilities, and the AI suggested three options: using the large language model
to generate a set of candidate answers and then using external knowledge to filter out the
most relevant answers, score and rank the answers, or refine the answers. The human then
asked which data source types could be used to give context to the model, to which the AI
responded that there are many different types of data sources that could be used, such as
structured data sources, unstructured data sources, or external APIs. Additionally, the
model could be trained on a combination of these data sources to provide a more
comprehensive understanding of the context. The human then asked what their aim was
again, to which the AI responded that their aim was to explore the potential of integrating
Large Language Models with external knowledge.

```

The number of tokens being used for this conversation is greater than when using
the ConversationBufferMemory, so is there any advantage to
using ConversationSummaryMemory over the buffer memory?

Token count (y-axis) for the buffer memory vs. summary memory as the number of
interactions (x-axis) increases.
For longer conversations, yes. [Here](https://github.com/pinecone-
io/examples/blob/master/learn/generation/langchain/handbook/03a-token-counter.ipynb ),
we have a longer conversation. As shown above, the summary memory initially uses far
more tokens. However, as the conversation progresses, the summarization approach grows
more slowly. In contrast, the buffer memory continues to grow linearly with the number
of tokens in the chat.

We can summarize the pros and cons of `ConversationSummaryMemory` as follows:

| Pros | Cons |
| --- | --- |
| Shortens the number of tokens for long conversations | Can result in higher token usage for smaller conversations |
| Enables much longer conversations | Memorization of the conversation history is wholly reliant on the summarization ability of the intermediate summarization LLM |
| Relatively straightforward implementation, intuitively simple to understand | Also requires token usage for the summarization LLM; this increases costs (but does not limit conversation length) |

Conversation summarization is a good approach for cases where long conversations are
expected. Yet, it is still fundamentally limited by token limits. After a certain amount of
time, we still exceed context window limits.

ConversationBufferWindowMemory
The `ConversationBufferWindowMemory` acts in the same way as our earlier _“buffer
memory”_ but adds a _window_ to the memory. This means that we only keep a given
number of past interactions before _“forgetting”_ them. We use it like so:

```
from langchain.chains.conversation.memory import ConversationBufferWindowMemory

conversation_bufw = ConversationChain(
    llm=llm,
    memory=ConversationBufferWindowMemory(k=1)
)

```
In this instance, we set k=1 — this means the window will remember the single latest
interaction between the human and AI. That is the latest human response and the latest AI
response. We can see the effect of this below:

In[61]:

```
count_tokens(
conversation_bufw,
"Good morning AI!"
)

```

Out[61]:

```

Spent a total of 85 tokens

```

Out[61]:

```

" Good morning! It's a beautiful day today, isn't it? How can I help you?"

```

In[62]:

```
count_tokens(
conversation_bufw,
"My interest here is to explore the potential of integrating Large Language
Models with external knowledge"
)

```

Out[62]:

```
Spent a total of 178 tokens

```

Out[62]:

```

' Interesting! Large Language Models are a type of artificial intelligence that can process
natural language and generate text. They can be used to generate text from a given
context, or to answer questions about a given context. Integrating them with external
knowledge can help them to better understand the context and generate more accurate
results. Do you have any specific questions about this integration?'

```

In[63]:

```
count_tokens(
conversation_bufw,
"I just want to analyze the different possibilities. What can you think of?"
)

```

Out[63]:

```

Spent a total of 233 tokens

```

Out[63]:

```

' There are many possibilities for integrating Large Language Models with external
knowledge. For example, you could use external knowledge to provide additional context
to the model, or to provide additional training data. You could also use external
knowledge to help the model better understand the context of a given text, or to help it
generate more accurate results.'

```
In[64]:

```
count_tokens(
conversation_bufw,
"Which data source types could be used to give context to the model?"
)

```

Out[64]:

```

Spent a total of 245 tokens

```

Out[64]:

```

' Data sources that could be used to give context to the model include text corpora,
structured databases, and ontologies. Text corpora provide a large amount of text data that
can be used to train the model and provide additional context. Structured databases
provide structured data that can be used to provide additional context to the model.
Ontologies provide a structured representation of knowledge that can be used to provide
additional context to the model.'

```

In[65]:

```
count_tokens(
conversation_bufw,
"What is my aim again?"
)

```
Out[65]:

```

Spent a total of 186 tokens

```

Out[65]:

```

' Your aim is to use data sources to give context to the model.'

```

By the end of the conversation, when we ask "What is my aim again?", the answer to this
was contained in the human response _three_ interactions ago. As we only kept the most
recent interaction (k=1), the model had forgotten and could not give the correct answer.

We can see the effective “memory” of the model like so:

In[66]:

```
bufw_history = conversation_bufw.memory.load_memory_variables(
inputs=[]
)['history']

```

In[67]:

```
print(bufw_history)

```

Out[67]:

```

Human: What is my aim again?

AI: Your aim is to use data sources to give context to the model.
```

Although this method isn’t suitable for remembering distant interactions, it is good at
limiting the number of tokens being used — a number that we can increase/decrease
depending on our needs. For the [longer conversation](https://github.com/pinecone-
io/examples/blob/master/learn/generation/langchain/handbook/03a-token-counter.ipynb
) used in our earlier comparison, we can set k=6 and reach ~1.5K tokens per interaction
after 27 total interactions:

Token count including the ConversationBufferWindowMemory at k=6 and k=12.

If we only need memory of recent interactions, this is a great option. However, for a mix
of both distant and recent interactions, there are other options.

ConversationSummaryBufferMemory
The ConversationSummaryBufferMemory is a mix of
the ConversationSummaryMemory and the ConversationBufferWindowMemory. It
summarizes the earliest interactions in a conversation while maintaining
the max_token_limit most recent tokens in their conversation. It is initialized like so:

```
from langchain.chains.conversation.memory import ConversationSummaryBufferMemory

conversation_sum_bufw = ConversationChain(
    llm=llm,
    memory=ConversationSummaryBufferMemory(
        llm=llm,
        max_token_limit=650
    )
)
```

When applying this to our earlier conversation, we can set max_token_limit to a small
number and yet the LLM can remember our earlier “aim”.

This is because that information is captured by the “summarization” component of the
memory, despite being missed by the “buffer window” component.

Naturally, the pros and cons of this component are a mix of the earlier components on
which this is based.

| Pros | Cons |
| --- | --- |
| Summarizer means we can remember distant interactions | Summarizer increases token count for shorter conversations |
| Buffer prevents us from missing information from the most recent interactions | Storing the raw interactions (even if just the most recent interactions) increases token count |

Although requiring more tweaking on what to summarize and what to maintain within the
buffer window, the ConversationSummaryBufferMemory does give us plenty of
flexibility and is the only one of our memory types (so far) that allows us to remember
distant interactions _and_ store the most recent interactions in their raw — and most
information-rich — form.
Token count comparisons including the ConversationSummaryBufferMemory type with
max_token_limit values of 650 and 1300.

We can also see that despite including a summary of past interactions _and_ the raw
form of recent interactions — the increase in token count
of ConversationSummaryBufferMemory is competitive with other methods.

Other Memory Types
The memory types we have covered here are great for getting started and give a good
balance between remembering as much as possible and minimizing tokens.

However, we have other options — particularly
the `ConversationKnowledgeGraphMemory` and `ConversationEntityMemory`.
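
As a hedged sketch of the knowledge-graph option, the snippet below exercises the class
exposed in older LangChain releases as `ConversationKGMemory` on its own via
`save_context` and `load_memory_variables`; the example facts and strings are illustrative
assumptions:

```
from langchain import OpenAI
from langchain.memory import ConversationKGMemory

llm = OpenAI(temperature=0, openai_api_key="OPENAI_API_KEY")

# the knowledge-graph memory uses the LLM to extract (subject, relation, object) triples
kg_memory = ConversationKGMemory(llm=llm)

kg_memory.save_context(
    {"input": "Sam is a data engineer who works at Acme"},
    {"output": "Got it, Sam is a data engineer at Acme."},
)

# retrieve what the graph knows that is relevant to a new query
print(kg_memory.load_memory_variables({"input": "What does Sam do?"}))
```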

That’s it for this introduction to conversational memory for LLMs using LangChain. As
we’ve seen, there are plenty of options for helping _stateless_ LLMs interact as if they
were in a _stateful_ environment — able to consider and refer back to past interactions.

ref: https://www.pinecone.io/learn/series/langchain/langchain-conversational-memory/
Agents & Tools
Agents are like characters or personas with specific capabilities. They use chains and tools
to perform their functions.

Chains are sequences of processing steps for prompts. They are used within agents to
define how the agent processes information.

Tools are specialized functionalities that can be used by agents or within chains for
specific tasks.

Tools
Tools are functions that agents can use to interact with the world and perform specific
duties. These tools can be generic utilities (e.g. Google search, database lookups,
mathematical operations etc.), other chains, or even other agents.

Tools allow the LLM to interact with the outside world, and since they are customizable
they can pretty much be coded to do anything you like, not just a limited set of pre-defined
operations.
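
As a minimal sketch of a custom tool (the helper function and names here are hypothetical,
not from any project in this guide), a plain Python function can be wrapped as a LangChain
tool like so:

```
from langchain.agents import Tool

def word_count(text: str) -> str:
    """Hypothetical helper: counts the words in the input text."""
    return str(len(text.split()))

word_count_tool = Tool.from_function(
    func=word_count,
    name="word_counter",
    description="Useful for counting how many words are in a piece of text.",
)
```

The description matters: the agent relies on it to decide when the tool is appropriate.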

Agents
Some applications will require not just a predetermined chain of calls to LLMs/other
tools, but potentially an unknown chain that depends on the user’s input. In these types of
chains, there is an “agent” which has access to a suite of tools. Depending on the user input,
the agent can then decide which, if any, of these tools to call.

The core idea of agents is to use an LLM to choose a sequence of actions to take. In
chains, a sequence of actions is hardcoded (in code). In agents, a language model is used
as a reasoning engine to determine which actions to take and in which order.

Simply put, Agent = Tools + Memory


Looking at the diagram, when receiving a request, Agents make use of an LLM to decide
on which Action to take.

After an Action is completed, the Agent enters the Observation step.

From Observation step Agent shares a Thought; if a final answer is not reached, the Agent
cycles back to another Action in order to move closer to a Final Answer.

There is a whole array of Action options available to the LangChain Agent.

Actions are taken by the agent via various tools. The more tools are available to an Agent,
the more actions can be taken by the Agent.

There are many types of agents such as — Conversations, ReAct etc. Custom agents can
be made as well.
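
Below is a hedged, self-contained sketch of this idea using a ReAct-style agent; the toy
tool, model settings, and prompt are illustrative assumptions rather than a recommended
setup:

```
from langchain import OpenAI
from langchain.agents import AgentType, Tool, initialize_agent

llm = OpenAI(temperature=0, openai_api_key="OPENAI_API_KEY")

# a trivial illustrative tool; real agents would use search, SQL, math tools, etc.
def reverse_text(text: str) -> str:
    return text[::-1]

tools = [
    Tool.from_function(
        func=reverse_text,
        name="text_reverser",
        description="Useful for reversing a string of text.",
    )
]

# the agent uses the LLM as a reasoning engine to pick which tool to call, if any
agent = initialize_agent(
    tools,
    llm,
    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,
    verbose=True,
)

print(agent.run("Reverse the phrase 'hello world' for me."))
```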

Chains
Using an LLM in isolation is fine for simple applications, but more complex applications
require chaining LLMs — either with each other or with other components.

LangChain provides the Chain interface for such “chained” applications. We define a
Chain very generically as a sequence of calls to components, which can include other
chains.

In the sample project explained in the walkthrough below, the Sequential Chain is used,
which will give very clear insight into how these chains work.

Langchain has 4 types of foundational chains -

1. **LLM** — A simple chain with a prompt template that can process multiple inputs.

2. **Router** — A gateway that uses the large language model (LLM) to select the
most suitable processing chain.

3. **Sequential** — A family of chains which processes input in a sequential manner.
This means that the output of the first node in the chain becomes the input of the second
node, the output of the second becomes the input of the third, and so on (see the sketch
after this list).

4. **Transformation** — A type of chain that allows Python function calls for
customizable text manipulation.
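
A minimal sketch of a sequential chain under assumed toy prompts (the prompt templates
and variable names are illustrative, not from the project described later):

```
from langchain import OpenAI
from langchain.chains import LLMChain, SimpleSequentialChain
from langchain.prompts import PromptTemplate

llm = OpenAI(temperature=0, openai_api_key="OPENAI_API_KEY")

# chain 1: produce a title from a topic
title_chain = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["topic"],
        template="Write a blog post title about {topic}.",
    ),
)

# chain 2: turn the title produced by chain 1 into an outline
outline_chain = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        input_variables=["title"],
        template="Write a three-bullet outline for a post titled: {title}",
    ),
)

# the output of the first chain becomes the input of the second
overall_chain = SimpleSequentialChain(chains=[title_chain, outline_chain], verbose=True)
print(overall_chain.run("vector databases"))
```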

Memory
You can attach memory to your chain or agent so that it remembers the context of the
conversation and responds accordingly.

1. **Buffer Memory:** The Buffer Memory in Langchain is a simple memory buffer
that stores the history of the conversation. It has a buffer property that returns the list of
messages in the chat memory. The load_memory_variables function returns the history
buffer. This type of memory is useful for storing and retrieving the immediate history of a
conversation.

2. **Buffer Window Memory:** Buffer Window Memory is a variant of Buffer
Memory. It also stores the conversation history but with a twist. It has a property k which
determines the number of previous interactions to be stored. The buffer property returns
the last k*2 messages from the chat memory. This type of memory is useful when you
want to limit the history to a certain number of previous interactions.

3. **Entity Memory:** The Entity Memory in Langchain is a more complex type of
memory. It not only stores the conversation history but also extracts and summarizes
entities from the conversation. It uses the Langchain Language Model (LLM) to predict
and extract entities from the conversation. The extracted entities are then stored in an
entity store which can be either in-memory or Redis-backed. This type of memory is
useful when you want to extract and store specific information from the conversation.
Each of these memory types has its own use cases and trade-offs. Buffer Memory and
Buffer Window Memory are simpler and faster but they only store the conversation
history. Entity Memory, on the other hand, is more complex and slower but it provides
more functionality by extracting and summarizing entities from the conversation.

As for the data structures and algorithms used, it seems that Langchain primarily uses lists
and dictionaries to store the memory. The algorithms are mostly related to text processing
and entity extraction, which involve the use of the Langchain Language Model.

1. **Conversation Knowledge Graph Memory:** The Conversation Knowledge Graph
Memory is a sophisticated memory type that integrates with an external knowledge graph
to store and retrieve information about knowledge triples in the conversation. It uses the
Langchain Language Model (LLM) to predict and extract entities and knowledge triples
from the conversation. The extracted entities and knowledge triples are then stored in a
NetworkxEntityGraph, which is a type of graph data structure provided by the NetworkX
library. This memory type is useful when you want to extract, store, and retrieve
structured information from the conversation in the form of a knowledge graph.

2. **ConversationSummaryMemory:** The ConversationSummaryMemory is a type
of memory that summarizes the conversation history. It uses the LangChain Language
Model (LLM) to generate a summary of the conversation. The summary is stored in a
buffer and is updated every time a new message is added to the conversation. This
memory type is useful when you want to maintain a concise summary of the conversation
that can be used for reference or to provide context for future interactions.

3. **ConversationSummaryBufferMemory:** ConversationSummaryBufferMemory
is similar to the ConversationSummaryMemory but with an added feature of pruning. If
the conversation becomes too long (exceeds a specified token limit), the memory prunes
the conversation by summarizing the pruned part and adding it to a moving summary
buffer. This ensures that the memory does not exceed its capacity while still retaining the
essential information from the conversation.

4. **ConversationTokenBufferMemory:** ConversationTokenBufferMemory is a
type of memory that stores the conversation history in a buffer. It also has a pruning
feature similar to the ConversationSummaryBufferMemory. If the conversation exceeds a
specified token limit, the memory prunes the earliest messages until it is within the limit.
This memory type is useful when you want to maintain a fixed-size memory of the most
recent conversation history.

5. **VectorStore-Backed Memory:** The VectorStore-Backed Memory is a memory
type that is backed by a VectorStoreRetriever. The VectorStoreRetriever is used to
retrieve relevant documents based on a query. The retrieved documents are then stored in
the memory. This memory type is useful when you want to store and retrieve information
in the form of vectors, which is particularly useful for tasks such as semantic search or
similarity computation (a brief sketch follows this list).
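
A hedged sketch of the vector-store-backed variant, assuming OpenAI embeddings and a
local FAISS index (requires the faiss package) purely for illustration:

```
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.memory import VectorStoreRetrieverMemory
from langchain.vectorstores import FAISS

embeddings = OpenAIEmbeddings(openai_api_key="OPENAI_API_KEY")

# seed the index with a placeholder document so it can be created
vectorstore = FAISS.from_texts(["initial placeholder"], embedding=embeddings)
retriever = vectorstore.as_retriever(search_kwargs={"k": 2})

memory = VectorStoreRetrieverMemory(retriever=retriever)

# past exchanges are embedded and stored in the vector store
memory.save_context({"input": "My favourite sport is padel"}, {"output": "Noted!"})

# later, the most similar past exchanges are retrieved for the new query
print(memory.load_memory_variables({"prompt": "what sport do I like?"}))
```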
Callback Handlers
LangChain provides a callbacks system that allows you to hook into the various stages of
your LLM application. This is useful for logging, monitoring, streaming, and other tasks.
The BaseCallbackHandler class is used to define the actions to be performed inside the
hook functions.

Some available hooks are — on_llm_start, on_agent_end, on_chain_start. The names of
these hooks are self explanatory. Code can be written inside these functions which has to
be performed when those functions are called.

The object of the BaseCallbackHandler class can be provided to the appropriate agent,
chain, tool etc.

Walkthrough — Project Utilizing Langchain

The following image displays the architecture I’ve used in a project that helps in
answering questions on data available in a large SQL database, by creating SQL queries to
fetch relevant data, analyzing the fetched data, and then returning a response in the form
of an answer.
In the image above it can be seen that the agent has two chains available to it as tools
which are -

1. Analysis Chain (For doing analysis on data in memory)

2. Sequential Chain (For writing SQL queries)

**NOTE: While there are predefined and configured agents, tools and chains available,
custom versions of all of these can be made.**

**NOTE: Chains can be provided as tools to the agent. Similarly, Tools can be made
available as a chain segment in chains as well. The user has a lot of freedom to customize
these agents, tools, chains and can plug, sequence them according to their needs.**

```
tools = [
    Tool.from_function(
        func=sequentialchain._run,
        name="tool1",
        description="Useful when user wants information about revenue, margin, "
                    "employee and projects. Input is a descriptive plain text formed "
                    "using user question and chat history and output is the result."
    ),
    Tool.from_function(
        func=analysis._run,
        name="tool2",
        description="Useful when you want to do some calculations and statistical "
                    "analysis using the memory. Input is a list of numbers with "
                    "description of what is to be done to it or a mathematical "
                    "equation of numbers and output is the result."
    )
]
```

The code snippet above shows a tools array in which two chains, namely — sequentialchain and
analysis chain are provided as tools.

```
memory = ConversationBufferWindowMemory(memory_key="chat_history", return_messages=True, k=7)
llm = AzureChatOpenAI(
temperature=0,
deployment_name="********************",
model_name="gpt-35-turbo-16k",
openai_api_base="***************************",
openai_api_version="2023-07-01-preview",
openai_api_key="**************",
openai_api_type="azure"
)
agent_chain=initialize_agent(
tools,
llm,
agent=AgentType.OPENAI_FUNCTIONS,
verbose=True,
agent_kwargs=agent_kwargs,
memory=memory,
callbacks=[MyCustomHandler()]
)

```

The initialize_agent function creates an agent object with the specifications you have
entered in the function as arguments.

This agent is what manages the whole interaction with the LLM. The agent is run like this
→ answer=agent_chain.run(“the query put in by the user”)

The tools and memory are provided to the agent. I have used the
ConversationBufferWindowMemory() which allows me to specify the value k as 7. This
means that the last 7 conversations (input and output) are available to the LLM when you
ask a new question.

```
class sequentialchain(BaseTool):
    def _run(self, run_manager: Optional[CallbackManagerForToolRun] = None) -> str:
        tables = similarity_search(self)
        print(tables)
        sql_chain = SQLAgent(tables)
        querycheckchain = querycheckfunc(tables)
        executorchainobj = QueryExecutorChain(user_query=self)
        overall_chain = SimpleSequentialChain(
            chains=[sql_chain, querycheckchain, executorchainobj], verbose=True
        )
        review = overall_chain.run(self)
        return review
```

The similarity_search() function gets the appropriate table descriptions from the vector db
and provides them as input variables for the chains so they can write proper SQL queries.

The SimpleSequentialChain() has 3 chains passed to it (sql_chain, querycheckchain,
executorchainobj) which are run in succession. The output of the first chain is passed to
the second chain as an input variable and the output of the second chain is passed to the
third chain as an input variable.

The **sql_chain** — based on a prompt on how to create SQL queries and table
descriptions makes SQL queries.

The **querycheckchain** — Receives the SQL query from sql_chain, then corrects all
the errors, syntax, adds missing elements if any and makes it compliant to the standards
described in prompt.

The **executorchainobj** — This chain segment is actually a tool passed as a chain. It
receives the SQL query that is ready to be run on the database.

The output or fetched data after running the SQL query is then received by the agent
which had called the sequentialchain. The agent interprets the fetched data in accordance
with the user’s input question, formats it and provides the final answer/response to the
user. If the agent wants to do some analysis on the fetched data, it can send this data to the
analysis chain, the output of which can then be formatted into a final answer/response.

If the question asked by a user is a follow-up question, the agent can look at the memory;
if it finds the necessary data there, it can formulate the answer based on the memory alone,
or, if some analysis is needed, it can send that data directly to the analysis chain.

Agent decides when to use the memory, which tool to use or if to use any tool at all.

**NOTE: I have used a custom chain (analysis chain) provided as a tool to the agent.
There are predefined tools for all sorts of purposes like math, SQL connections, google
drive connections, AWS Lambda connections etc.**

The analysis chain is a normal LLM call chain and has prompt instructions to do various
types of statistical analysis (mean, median, standard deviation, variance etc.), calculate
growth, percentages and other mathematical operations.
Callback Handlers can also be added to perform various tasks at certain defined stages of
the application run cycle.

```
class MyCustomHandler(BaseCallbackHandler):
    def on_llm_new_token(self, token: str, **kwargs) -> None:
        print(f"My custom handler, token: {token}")
        for key, value in kwargs.items():
            print("%s == %s" % (key, value))

    def on_llm_end(self, outputs, *, run_id, parent_run_id, **kwargs):
        """Run when llm call ends running."""
        print(run_id)

    def on_chain_end(self, outputs, *, run_id, parent_run_id, **kwargs):
        """Run when chain ends running."""
        print(run_id)

```

The CallbackHandler MyCustomHandler() has been configured with a certain set of
code that runs on on_chain_end and on_llm_end. The names of these hooks are self
explanatory. When the object of this class is provided to the appropriate agent, tool, chain
etc., the code inside these hooks runs as their names suggest.

All sorts of hooks such as on_chain_start, on_chain_end, on_tool_start, on_tool_end are
available which can be specified to do certain tasks under the BaseCallbackHandler class.

```

prompt_template = PromptTemplate(input_variables=["query"], template=template)

query_check_chain = LLMChain(
    llm=llm,
    prompt=prompt_template,
    output_key="review",
    callbacks=[MyCustomHandler()]
)

```
The handler in this case, MyCustomHandler(), can be provided to the appropriate agent,
tool or chain in the callbacks argument.

When all of this is set up and the agent is run (agent_chain.run("user's input question")),
the application can write the SQL queries itself, run them to fetch data from the database,
analyse the data, and give proper information as output to the user. The user never has to
open the database, write SQL queries, fetch the data, or dig through it for analysis.
Everything happens automatically from start to finish.

ref: https://medium.com/@saumitra1joshi/langchain-agents-tools-chains-memory-for-
utilizing-the-full-potential-of-llms-211e5dfee3fa

https://community.deeplearning.ai/t/agents-vs-chains-vs-tools/516148/2
RAG
The Curse Of The LLMs
As usage exploded, so did the expectations. Many users started using ChatGPT as a
source of information, like an alternative to Google. As a result, they also started
encountering prominent weaknesses of the system. Concerns around copyright, privacy,
security, ability to do mathematical calculations etc. aside, people realised that there are
two major limitations of Large Language Models.

Curse of the LLMs

> _Users look at LLMs for knowledge and wisdom, yet LLMs are sophisticated predictors
of what word comes next._

The Challenge
- Make LLMs respond with up-to-date information

- Make LLMs not respond with factually inaccurate information

- Make LLMs aware of proprietary information

What is RAG?
In 2023, RAG became one of the most used techniques in the domain of Large Language
Models.

Retrieval Augmented Generation

- _User writes a prompt or a query that is passed to an orchestrator_

- _Orchestrator sends a search query to the retriever_

- _Retriever fetches the relevant information from the knowledge sources and sends it back_

- _Orchestrator augments the prompt with the retrieved context and sends it to the LLM_

- _LLM responds with the generated text which is displayed to the user via the orchestrator_
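
The orchestration flow above can be sketched in plain Python; `retriever` and `llm` below
are hypothetical callables standing in for a real vector-store search and a real model call:

```
def rag_answer(query, retriever, llm):
    # 1. the retriever fetches the most relevant chunks from the knowledge sources
    context_chunks = retriever(query)

    # 2. the orchestrator augments the prompt with the retrieved context
    prompt = (
        "Answer the question using only the context below.\n\n"
        "Context:\n" + "\n".join(context_chunks) + "\n\n"
        "Question: " + query
    )

    # 3. the LLM generates a grounded response, which is returned to the user
    return llm(prompt)
```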

How does RAG help?

Unlimited Knowledge

The Retriever of an RAG system can have access to external sources of information.
Therefore, the LLM is not limited to its internal knowledge. The external sources can be
proprietary documents and data or even the internet.

[Figure: Expanding LLM Memory with RAG]

Confidence in Responses

With the context (extra information that is retrieved) made available to the LLM, the
confidence in LLM responses is increased.

[Figure: Increasing Confidence in LLM Responses]

As the RAG technique evolves and becomes accessible with frameworks
like [LangChain](https://www.linkedin.com/company/langchain/) and
[LlamaIndex](https://www.linkedin.com/company/llamaindex/), it is finding more and
more applications in LLM-powered systems like QnA with documents, conversational
agents, recommendation systems and content generation.

ref: https://www.linkedin.com/pulse/context-key-significance-rag-language-models-
abhinav-kimothi-nebnc/

**New RAG techniques:**

1. **Chain of Note (CoN)** - CoN generates notes for the documents that have been
retrieved, which results in a more factually correct answer; and because the notes are
generated at the intermediate steps used to break down the problem, the trustworthiness of
the final answer also increases. https://cobusgreyling.medium.com/chain-of-note-con-retrieval-for-llms-763ead1ae5c5

2. **Corrective RAG** - This technique adds a binary decision step: if the retrieved
answer is ambiguous, the query is passed to web search, the search results are collected,
and the LLM is triggered again to answer the query while considering both the RAG
documents and the search results. https://medium.com/the-ai-forum/implementing-a-flavor-of-corrective-rag-using-langchain-chromadb-zephyr-7b-beta-and-openai-30d63e222563

3. **RAG Fusion** - A query is broken into small sub-queries in this approach. These
sub-queries are then given to a vector DB to retrieve the most relevant documents for each
query. Finally, using the Reciprocal Rank Fusion algorithm, the most relevant information
is prioritized. (In [LlamaIndex](https://www.linkedin.com/company/llamaindex/), when I
used the combination of Recursive Retrieval and Semantic Chunking
+ [Pinecone](https://www.linkedin.com/company/pinecone-io/) as the vector DB, results
came out best for our RAG application.)

- RAG-Fusion improves traditional search systems by overcoming their limitations
through a multi-query approach. It expands user queries into multiple diverse perspectives
using a Language Model (LLM). This strategy goes beyond capturing explicit information
and delves into uncovering deeper, transformative knowledge. The fusion process
involves conducting parallel vector searches for both the original and expanded queries,
intelligently re-ranking to optimize results, and pairing the best outcomes with new
queries.

https://medium.com/@kbdhunga/advanced-rag-rag-fusion-using-langchain-772733da00b7

4. **Self-RAG** - A technique where the LLM performs self-reflection for dynamic
retrieval, critique, and generation. https://github.com/run-llama/llama-hub/blob/main/llama_hub/llama_packs/self_rag/self_rag.ipynb

https://cobusgreyling.medium.com/self-reflective-retrieval-augmented-generation-self-rag-f5cbad4412d5

ref:
https://www.linkedin.com/feed/update/urn:li:activity:7185147270554681344/?commentU
rn=urn%3Ali%3Acomment%3A(ugcPost%3A7183502198595629058%2C718616583672
8979456)&dashCommentUrn=urn%3Ali%3Afsd_comment%3A(7186165836728979456
%2Curn%3Ali%3AugcPost%3A7183502198595629058)&dashReplyUrn=urn%3Ali%3A
fsd_comment%3A(7186182955751354368%2Curn%3Ali%3AugcPost%3A71835021985
95629058)&replyUrn=urn%3Ali%3Acomment%3A(ugcPost%3A7183502198595629058
%2C7186182955751354368)

https://www.linkedin.com/feed/update/urn:li:activity:7180436217006600194/
groq
What is groq?
Groq, founded in 2016 by Jonathan Ross, has built chips specifically designed for
inference, that is, running generative AI models. It says its chips, dubbed "language
processing units" (LPUs), are not only quicker but also one-tenth the cost of conventional
AI hardware.

What is LPU?
**Groq's Language Processing Unit (LPU)** represents a paradigm shift in processor
architecture, designed to revolutionize high-performance computing (HPC) and artificial
intelligence (AI) workloads. This section will delve into the components, architecture, and
workings of the LPU, highlighting its potential to transform the landscape of HPC and AI.

How Groq's LPU Works

The LPU's unique architecture enables it to outperform traditional CPUs and GPUs in
HPC and AI workloads. Here's a step-by-step breakdown of how the LPU works:

**1. Data Input:** Data is fed into the LPU, triggering the Centralized Control Unit to
issue instructions to the Processing Elements (PEs).

**2. Massively Parallel Processing:** The PEs, organized in SIMD arrays, execute the
same instruction on different data points concurrently, resulting in massively parallel
processing.

**3. High-Bandwidth Memory Hierarchy:** The LPU's memory hierarchy, including
on-chip SRAM and off-chip memory, ensures high-bandwidth, low-latency data access.

**4. Centralized Control Unit:** The Centralized Control Unit manages the flow of
data and instructions, coordinating the execution of thousands of operations in a single
clock cycle.
**5. Network-on-Chip (NoC):** A high-bandwidth Network-on-Chip (NoC)
interconnects the PEs, the CU, and the memory hierarchy, enabling fast, efficient
communication between different components of the LPU.

**6. Processing Elements:** The Processing Elements consist of Arithmetic Logic
Units, Vector Units, and Scalar Units, executing operations on large data sets
simultaneously.

**7. Data Output:** The LPU outputs data based on the computations performed by the
Processing Elements.
How LPU is different from GPU
**1. Architecture:**

**- LPU:** An LPU is designed specifically for natural language processing tasks, with
a multi-stage pipeline that includes tokenization, parsing, semantic analysis, feature
extraction, machine learning models, and inference/prediction.

**- GPU:** A GPU has a more complex architecture, consisting of multiple streaming
multiprocessors (SMs) or compute units, each containing multiple CUDA cores or stream
processors.
**2. Instruction Set:**

**- LPU:** The LPU's instruction set is optimized for natural language processing tasks,
with support for tokenization, parsing, semantic analysis, and feature extraction.

**- GPU:** A GPU has a more general-purpose instruction set, designed for high-
throughput, high-bandwidth data processing.

**3. Memory Hierarchy:**

**- LPU:** The LPU's memory hierarchy is optimized for natural language processing
tasks, with a focus on efficient data access and processing.

**- GPU:** A GPU has a more complex memory hierarchy, including registers, shared
memory, L1/L2 caches, and off-chip memory. The memory hierarchy in GPUs is designed
for high-throughput, high-bandwidth data access, but may have higher latency compared
to the LPU for specific NLP tasks.

**4. Power Efficiency and Performance:**

**- LPU:** The LPU is designed for high power efficiency and performance, with a
focus on natural language processing tasks. It can deliver superior performance per watt
compared to GPUs for specific NLP workloads.

**- GPU:** GPUs are designed for high throughput and performance, particularly for
graphics rendering and parallel computations. However, they may consume more power
than an LPU for the same NLP workload due to their more complex architecture and
larger number of processing units.

**5. Applications:**

**- LPU:** The LPU is well-suited for natural language processing tasks, such as
tokenization, parsing, semantic analysis, feature extraction, and machine learning model
inference.

**- GPU:** GPUs are widely used in applications such as gaming, computer-aided
design (CAD), scientific simulations, and machine learning. However, they are not
optimized for natural language processing tasks, and an LPU would generally provide
better performance and power efficiency for such tasks.

In summary, the LPU and GPU have different architectural designs and use cases. The
LPU is designed specifically for natural language processing tasks, while GPUs are
designed for high-throughput, high-bandwidth data processing, particularly for graphics
rendering and parallel computations. The LPU offers a more streamlined, power-efficient
architecture for natural language processing tasks, while GPUs provide a more complex,
feature-rich architecture for a broader range of applications.

ref: https://www.linkedin.com/pulse/groqs-lpu-revolutionary-leap-processing-computing-
ai-abhijit-singh-y0rdc/

Groq Tools
Groq API endpoints support tool use for programmatic execution of specified operations
through requests with explicitly defined operations. With tool use, Groq API model
endpoints deliver structured JSON output that can be used to directly invoke functions
from desired codebases.

[Models](https://console.groq.com/docs/tool-use#models )

These following models powered by Groq all support tool use:

- **llama3-70b**

- **llama3-8b**

- **llama2-70b**

- **mixtral-8x7b**

- **gemma-7b-it**

Parallel tool calling is enabled for both Llama3 models.

[Use Cases](https://console.groq.com/docs/tool-use#use-cases )

- **Convert natural language into API calls:** Interpreting user queries in natural
language, such as “What’s the weather in Palo Alto today?”, and translating them into
specific API requests to fetch the requested information.

- **Call external API:** Automating the process of periodically gathering stock prices
by calling an API, comparing these prices with predefined thresholds and automatically
sending alerts when these thresholds are met.
- **Resume parsing for recruitment:** Analyzing resumes in natural language to
extract structured data such as candidate name, skillsets, work history, and education, that
can be used to populate a database of candidates matching certain criteria.

[Example](https://console.groq.com/docs/tool-use#example )

```
from groq import Groq
import os
import json

client = Groq(api_key=os.getenv('GROQ_API_KEY'))

MODEL = 'mixtral-8x7b-32768'

# Example dummy function hard coded to return the score of an NBA game
def get_game_score(team_name):
    """Get the current score for a given NBA game"""
    if "warriors" in team_name.lower():
        return json.dumps({"game_id": "401585601", "status": 'Final', "home_team": "Los Angeles Lakers", "home_team_score": 121, "away_team": "Golden State Warriors", "away_team_score": 128})
    elif "lakers" in team_name.lower():
        return json.dumps({"game_id": "401585601", "status": 'Final', "home_team": "Los Angeles Lakers", "home_team_score": 121, "away_team": "Golden State Warriors", "away_team_score": 128})
    elif "nuggets" in team_name.lower():
        return json.dumps({"game_id": "401585577", "status": 'Final', "home_team": "Miami Heat", "home_team_score": 88, "away_team": "Denver Nuggets", "away_team_score": 100})
    elif "heat" in team_name.lower():
        return json.dumps({"game_id": "401585577", "status": 'Final', "home_team": "Miami Heat", "home_team_score": 88, "away_team": "Denver Nuggets", "away_team_score": 100})
    else:
        return json.dumps({"team_name": team_name, "score": "unknown"})

def run_conversation(user_prompt):
    # Step 1: send the conversation and available functions to the model
    messages = [
        {
            "role": "system",
            "content": "You are a function calling LLM that uses the data extracted from the get_game_score function to answer questions around NBA game scores. Include the team and their opponent in your response."
        },
        {
            "role": "user",
            "content": user_prompt,
        }
    ]
    tools = [
        {
            "type": "function",
            "function": {
                "name": "get_game_score",
                "description": "Get the score for a given NBA game",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "team_name": {
                            "type": "string",
                            "description": "The name of the NBA team (e.g. 'Golden State Warriors')",
                        }
                    },
                    "required": ["team_name"],
                },
            },
        }
    ]
    response = client.chat.completions.create(
        model=MODEL,
        messages=messages,
        tools=tools,
        tool_choice="auto",
        max_tokens=4096
    )

    response_message = response.choices[0].message
    tool_calls = response_message.tool_calls

    # Step 2: check if the model wanted to call a function
    if tool_calls:
        # Step 3: call the function
        # Note: the JSON response may not always be valid; be sure to handle errors
        available_functions = {
            "get_game_score": get_game_score,
        }  # only one function in this example, but you can have multiple
        messages.append(response_message)  # extend conversation with assistant's reply

        # Step 4: send the info for each function call and function response to the model
        for tool_call in tool_calls:
            function_name = tool_call.function.name
            function_to_call = available_functions[function_name]
            function_args = json.loads(tool_call.function.arguments)
            function_response = function_to_call(
                team_name=function_args.get("team_name")
            )
            messages.append(
                {
                    "tool_call_id": tool_call.id,
                    "role": "tool",
                    "name": function_name,
                    "content": function_response,
                }
            )  # extend conversation with function response
        second_response = client.chat.completions.create(
            model=MODEL,
            messages=messages
        )  # get a new response from the model where it can see the function response
        return second_response.choices[0].message.content

user_prompt = "What was the score of the Warriors game?"

print(run_conversation(user_prompt))

```

[Sequence of Steps](https://console.groq.com/docs/tool-use#sequence-of-steps )

- **Initialize the API client**: Set up the Groq Python client with your API key and
specify the model to be used for generating [conversational
responses](https://console.groq.com/docs/text-chat#streaming-a-chat-completion).

- **Define the function and conversation parameters**: Create a user query and
define a function (`get_game_score`) that can be called by the model, detailing its
purpose, input parameters, and expected output format.

- **Process the model’s request**: Submit the initial conversation to the model, and if
the model requests to call the defined function, extract the necessary parameters from the
model’s request and execute the function to get the response.

- **Incorporate function response into conversation**: Append the function’s output


to the conversation and a structured message and resubmit to the model, allowing it to
generate a response that includes or reacts to the information provided by the function
call.

[Tools Specifications](https://console.groq.com/docs/tool-use#tools-specifications )

- `tools`: an array with each element representing a tool

- `type`: a string indicating the category of the tool

- `function`: an object that includes:

- `description` - a string that describes the function’s purpose, guiding the model on
when and how to use it

- `name`: a string serving as the function’s identifier

- `parameters`: an object that defines the parameters the function accepts


[Tool Choice](https://console.groq.com/docs/tool-use#tool-choice )

- `tool_choice`: A parameter that dictates if the model can invoke functions.

- `auto`: The default setting where the model decides between sending a text response
or calling a function

- `none`: Equivalent to not providing any tool specification; the model won't call any
functions

- Specifying a Function:

- To mandate a specific function call, use `{"type": "function", "function":


{"name":"get_financial_data"}}`

- The model is constrained to utilize the specified function

[Known limitations](https://console.groq.com/docs/tool-use#known-limitations )

- Parallel tool use is disabled because of limitations of the Mixtral model. The endpoint
will always return at most a single `tool_call` at a time.

ref: https://console.groq.com/docs/tool-use

Groq and RAG Architecture Example

**Retrieval-Augmented Generation (RAG)** is a new approach that leverages Large
Language Models (LLMs) to automate knowledge search, synthesis, extraction, and
planning from unstructured data sources. This method has gained prominence over the
past year due to its ability to enhance LLM applications with contextual information. The
RAG data stack consists of several key components:

- **Loading Data**: Initially, data is ingested from various sources, such as text
documents, websites, or databases. This data can be in a raw or preprocessed format.

- **Processing Data**: The data undergoes preprocessing steps to clean and structure it
for further analysis. This may include tasks like tokenization, stemming, and removing
stop words.

- **Embedding Data**: Each piece of data is converted into a numerical representation
called an embedding. This embedding captures semantic information about the data,
making it easier for the LLM to understand and process.

- **Vector Database:** The embeddings are stored in a vector database, which allows
for efficient retrieval based on similarity metrics. This database enables quick access to
relevant data points during the generation process.

- **Retrieval and Prompting:** During the generation process, the LLM can retrieve
relevant data points from the vector database based on the context of the current input.
This retrieval mechanism helps the LLM provide more accurate and contextually relevant
outputs.

Overall, the RAG approach enhances the capabilities of LLMs by enabling them to
leverage external knowledge sources in a systematic and efficient manner. This can lead to
more powerful and contextually aware applications in various domains, such as natural
language understanding, information retrieval, and decision-making.

Building a production-grade RAG system remains a complex and subtle problem. Some of the associated challenges are as follows:

- **Results aren’t accurate enough:** the application is not able to produce satisfactory results for a long tail of input tasks/queries.

- **The number of parameters to tune is overwhelming:** it’s not clear which parameters to tune across data parsing, ingestion, and retrieval.

- **PDFs are specifically a problem:** complex documents often have lots of messy formatting. How do we represent this in the right way so the LLM can understand it?

- **Data syncing is a challenge:** production data often updates regularly, and continuously syncing new data brings a new set of challenges.

With the sole intent of solving the above problems, on February 20, 2024 LlamaIndex launched LlamaCloud and LlamaParse, a new generation of managed parsing, ingestion, and retrieval services designed to bring **production-grade** **context-augmentation** to our LLM and RAG applications.

The main intuition behind LlamaCloud is to let teams focus on writing the business logic rather than on data wrangling: process large volumes of production data and immediately get better response quality. It has the following two components:

1. **LlamaParse:** Proprietary parsing for complex documents with embedded objects such as tables and figures. LlamaParse integrates directly with LlamaIndex ingestion and retrieval, letting you build retrieval over complex, semi-structured documents. It promises to answer complex questions that simply weren’t possible previously.

2. **Managed Ingestion and Retrieval API:** An API which allows you to easily
load, process, and store data for your RAG app and consume it in any language. Backed
by data sources in [LlamaHub](https://llamahub.ai/), including LlamaParse, and data
storage integrations.

What is LlamaParse?
LlamaParse is a proprietary parsing service that is incredibly good at parsing PDFs with complex tables into a well-structured markdown format.
The service is available in **public preview mode**: open to everyone, with a usage limit of 1k pages per day (7,000 free pages per week); beyond that, $0.003 per page ($3 per 1,000 pages). It operates as a standalone service that can also be plugged into the managed ingestion and retrieval API.

```
from llama_parse import LlamaParse

parser = LlamaParse(
api_key="llx-...", # can also be set in your env as LLAMA_CLOUD_API_KEY
result_type="markdown", # "markdown" and "text" are available
verbose=True
)

```

Currently LlamaParse primarily supports PDFs with tables, but better support for figures and an expanded set of the most popular document types (.docx, .pptx, .html) is being built out as part of the next enhancements.

Rich table support

Since we first released LlamaParse it has featured [industry-leading table extraction](https://github.com/run-llama/llama_parse/blob/main/examples/demo_advanced.ipynb) capabilities. Under the hood, this has been using LLM intelligence since the start. It seamlessly integrates with the advanced indexing/retrieval capabilities that the open-source framework offers, enabling users to build state-of-the-art document RAG. Now with JSON mode (see below) and parsing instructions, you can take this even further.
Example 2: parsing comic books

Parsing translated manga presents a particular challenge for a parser since a regular parser
interprets the panels as cells in a table, and the reading order is right-to-left even though
the book is in English, as shown in this extract from "The manga guide to calculus", by
Hiroyuki Kojima:

Using LlamaParse, you can give the parser plain, English-language instructions on what to
do:

```
The provided document is a manga comic book.
Most pages do NOT have title. It does not contain tables.
Try to reconstruct the dialogue happening in a cohesive way.

```
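
These instructions are passed to the parser through the `parsing_instruction` parameter (listed in the parameters section further below); a minimal sketch, where the input file path is a hypothetical placeholder:

```
from llama_parse import LlamaParse

parsing_instruction = """The provided document is a manga comic book.
Most pages do NOT have title. It does not contain tables.
Try to reconstruct the dialogue happening in a cohesive way."""

parser = LlamaParse(
    api_key="llx-...",                        # or set LLAMA_CLOUD_API_KEY in your env
    result_type="markdown",
    parsing_instruction=parsing_instruction,  # plain-English guidance for the parser
)
documents = parser.load_data("./manga_guide_to_calculus.pdf")  # hypothetical path
```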

(You can see the full code in our [demonstration notebook](https://colab.research.google.com/drive/1dO2cwDCXjj9pS9yQDZ2vjg-0b5sRXQYo), including what it looks like to parse this without the instructions.)

The result is a perfect parse!

```
# The Asagake Times

Sanda-Cho Distributor

A newspaper distributor?

Do I have the wrong map?

```

Example 3: mathematical equations

Another challenging format for parsing is complex mathematical equations (by coincidence, the manga we picked as an example is all about how to do mathematics). To parse this, we take the same instructions as before and add one sentence: `Output any math equation in LATEX markdown (between $$)`. The result of parsing is clean LaTeX, which renders the equations perfectly.

For local deployment using Docker, see: https://blog.gopenai.com/running-pdf-parsers-in-docker-containers-5e7a7ed829c8

Code Implementation

The code is implemented in Google Colab (cpu)

Install required dependencies


```
%%writefile requirements.txt
langchain
langchain-community
llama-parse
fastembed
chromadb
python-dotenv
langchain-groq
chainlit
unstructured[md]
```

Then install them in a separate cell (the `%%writefile` magic writes the whole cell to the file, so the install command must not share that cell):

```
!pip install -r requirements.txt
```

Set up the environment variables

```
from google.colab import userdata

llamaparse_api_key = userdata.get('LLAMA_CLOUD_API_KEY')
groq_api_key = userdata.get("GROQ_API_KEY")
```

Import required dependencies

```
##### LLAMAPARSE #####
from llama_parse import LlamaParse

from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.embeddings.fastembed import FastEmbedEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_community.document_loaders import DirectoryLoader
from langchain_community.document_loaders import UnstructuredMarkdownLoader
from langchain.prompts import PromptTemplate
from langchain.chains import RetrievalQA
#
from groq import Groq
from langchain_groq import ChatGroq
#
import joblib
import os
import nest_asyncio  # noqa: E402
nest_asyncio.apply()
```

LlamaParse Parameters

```
* api_key: str = Field(
      default="",
      description="The API key for the LlamaParse API.",
  )
* base_url: str = Field(
      default=DEFAULT_BASE_URL,
      description="The base URL of the Llama Parsing API.",
  )
* result_type: ResultType = Field(
      default=ResultType.TXT, description="The result type for the parser."
  )
* num_workers: int = Field(
      default=4,
      gt=0,
      lt=10,
      description="The number of workers to use when sending API requests for parsing.",
  )
* check_interval: int = Field(
      default=1,
      description="The interval in seconds to check if the parsing is done.",
  )
* max_timeout: int = Field(
      default=2000,
      description="The maximum timeout in seconds to wait for the parsing to finish.",
  )
* verbose: bool = Field(
      default=True, description="Whether to print the progress of the parsing."
  )
* language: Language = Field(
      default=Language.ENGLISH, description="The language of the text to parse."
  )
* parsing_instruction: Optional[str] = Field(
      default="",
      description="The parsing instruction for the parser.",
  )

```

Helper function to load and parse the input data

```
!mkdir data
#
def load_or_parse_data():
    data_file = "./data/parsed_data.pkl"

    if os.path.exists(data_file):
        # Load the parsed data from the file
        parsed_data = joblib.load(data_file)
    else:
        # Perform the parsing step and store the result in llama_parse_documents
        parsingInstructionUber10k = """The provided document is a quarterly report filed by Uber Technologies, Inc. with the Securities and Exchange Commission (SEC).
This form provides detailed financial information about the company's performance for a specific quarter.
It includes unaudited financial statements, management discussion and analysis, and other relevant disclosures required by the SEC.
It contains many tables.
Try to be precise while answering the questions"""
        parser = LlamaParse(api_key=llamaparse_api_key,
                            result_type="markdown",
                            parsing_instruction=parsingInstructionUber10k,
                            max_timeout=5000,)
        llama_parse_documents = parser.load_data("./data/uber_10q_march_2022 (1).pdf")

        # Save the parsed data to a file
        print("Saving the parse results in .pkl format ..........")
        joblib.dump(llama_parse_documents, data_file)

        # Set the parsed data to the variable
        parsed_data = llama_parse_documents

    return parsed_data

```

Helper function to load chunks into vectorstore.

```
# Create vector database
def create_vector_database():
    """
    Creates a vector database using document loaders and embeddings.

    This function loads the parsed documents, splits them into chunks,
    transforms them into embeddings using FastEmbedEmbeddings,
    and finally persists the embeddings into a Chroma vector database.
    """
    # Call the function to either load or parse the data
    llama_parse_documents = load_or_parse_data()
    print(llama_parse_documents[0].text[:300])

    with open('data/output.md', 'a') as f:  # Open the file in append mode ('a')
        for doc in llama_parse_documents:
            f.write(doc.text + '\n')

    markdown_path = "/content/data/output.md"
    loader = UnstructuredMarkdownLoader(markdown_path)
    #loader = DirectoryLoader('data/', glob="**/*.md", show_progress=True)
    documents = loader.load()

    # Split loaded documents into chunks
    text_splitter = RecursiveCharacterTextSplitter(chunk_size=2000, chunk_overlap=100)
    docs = text_splitter.split_documents(documents)

    #len(docs)
    print(f"length of documents loaded: {len(documents)}")
    print(f"total number of document chunks generated :{len(docs)}")
    #docs[0]

    # Initialize Embeddings
    embed_model = FastEmbedEmbeddings(model_name="BAAI/bge-base-en-v1.5")

    # Create and persist a Chroma vector database from the chunked documents
    vs = Chroma.from_documents(
        documents=docs,
        embedding=embed_model,
        persist_directory="chroma_db_llamaparse1",  # local mode, persisted to disk
        collection_name="rag"
    )

    #query it
    #query = "what is the agenda of Financial Statements for 2022 ?"
    #found_doc = qdrant.similarity_search(query, k=3)
    #print(found_doc[0][:100])
    #print(qdrant.get())

    print('Vector DB created successfully !')
    return vs, embed_model

```

Process the data and create Vector Store

```
vs,embed_model = create_vector_database()

```

Instantiate LLM

```
chat_model = ChatGroq(temperature=0,
model_name="mixtral-8x7b-32768",
api_key=userdata.get("GROQ_API_KEY"),)

```

The above code does the following:

- Creates a new ChatGroq object named chat_model

- Sets the temperature parameter to 0, indicating that the responses should be more
predictable

- Sets the model_name parameter to “mixtral-8x7b-32768“, specifying the language model to use

- Passes the Groq API key retrieved from the Colab user data

Instantiate Vectorstore

```
vectorstore = Chroma(embedding_function=embed_model,
persist_directory="chroma_db_llamaparse1",
collection_name="rag")
#
retriever=vectorstore.as_retriever(search_kwargs={'k': 3})

```

Create a Custom Prompt Template

```
custom_prompt_template = """Use the following pieces of information to answer the
user's question.
If you don't know the answer, just say that you don't know, don't try to make up an
answer.

Context: {context}
Question: {question}

Only return the helpful answer below and nothing else.


Helpful answer:
"""

```

Helper Function to format the prompt

```
def set_custom_prompt():
"""
Prompt template for QA retrieval for each vectorstore
"""
prompt = PromptTemplate(template=custom_prompt_template,
input_variables=['context', 'question'])
return prompt
#
prompt = set_custom_prompt()
prompt

########################### RESPONSE ###########################

PromptTemplate(input_variables=['context', 'question'], template="Use the following pieces of information to answer the user's question.\nIf you don't know the answer, just say that you don't know, don't try to make up an answer.\n\nContext: {context}\nQuestion: {question}\n\nOnly return the helpful answer below and nothing else.\nHelpful answer:\n")
```

Instantiate the Retrieval Question Answering Chain

```
qa = RetrievalQA.from_chain_type(llm=chat_model,
chain_type="stuff",
retriever=retriever,
return_source_documents=True,
chain_type_kwargs={"prompt": prompt})

```

Invoke the Retrieval QA Chain

```
response = qa.invoke({"query": "what is the Balance of UBER TECHNOLOGIES, INC.as of
December 31, 2021?"})

```

Response Synthesized

```
response['result']

########################### RESPONSE ###########################

Based on the provided balance sheet of Uber Technologies, Inc. as of December 31, 2021,
the total assets are $38,774 million, total liabilities are $23,425 million, and total equity is
$9,613 million.

```
Question 2

```
response = qa.invoke({"query": "What is the Cash flows from operating activities
associated with bad expense specified in the document ?"})
response['result']

######################## RESPONSE ###############################

The Cash flows from operating activities associated with bad debt expense is 23 for the
year 2021 and 18 for the year 2022.

```

Question 3

```
response = qa.invoke({"query": "what is Loss (income) from equity method
investments, net ?"})
response["result"]

############################### RESPONSE #############################

The loss from equity method investments, net, is calculated as the sum of the impairment
of equity method investment and the revaluation of MLU B.V. call option, which
amounted to $182 million and $181 million, respectively. This results in a total loss from
equity method investments, net, of $363 million. This loss is included in the net loss
attributable to Uber Technologies, Inc. of $5.9 billion.
```

Question 4

```
response = qa.invoke({"query": "What is the Total cash and cash equivalents, and
restricted cash and cash equivalents for reconciliation ?"})
response['result']

######################## RESPONSE ####################################

The total cash and cash equivalents, and restricted cash and cash equivalents for
reconciliation is $6,607 million. This amount is obtained by adding the cash and cash
equivalents of $4,836 million and the restricted cash and cash equivalents - current of
$247 million, and the restricted cash and cash equivalents - non-current of $1,524 million.

```

Question 5
```
response = qa.invoke({"query":"Based on the CONDENSED CONSOLIDATED STATEMENTS OF
REDEEMABLE NON-CONTROLLING INTERESTS AND EQUITY what is the Balance as of March 31,
2021?"})
print(response['result'])

############# RESPONSE ##################

The balance as of March 31, 2021 was $473 for Redeemable Non-Controlling Interests,
1,867,369 shares for Common Stock, $— for Additional Paid-In Capital, $36,182 for
Other Comprehensive Income (Loss), $654 for Non-Controlling Interests, and $654 for
Total Equity.

```

Question 6

```
response = qa.invoke({"query":"Based on the condensed consolidated statements of
comprehensive Income(loss) what is the Comprehensive income (loss) attributable to
Uber Technologies, Inc.for the three months ended March 31, 2022"})
response['result']

######################### RESPONSE####################################

The Comprehensive income (loss) attributable to Uber Technologies, Inc. for the three
months ended March 31, 2022 was $(5,911) million. This information can be found on the
Uber Technologies, Inc. - Condensed Consolidated Statements of Comprehensive Income
(Loss) provided in the quarterly report.

```

Question 7
```
response = qa.invoke({"query":"Based on the condensed consolidated statements of
comprehensive Income(loss) what is the Comprehensive income (loss) attributable to
Uber Technologies?"})
response['result']

##################### RESPONSE #################################

The Comprehensive income (loss) attributable to Uber Technologies, Inc. for the three
months ended March 31, 2021 is $1,081 million, and for the three months ended March
31, 2022 is -$5,911 million.

```

Question 8

```
response = qa.invoke({"query":"Based on the condensed consolidated statements of
comprehensive Income(loss) what is the Net loss including non-controlling
interests"})
response['result']

################ RESPONSE #######################################

The Net loss including non-controlling interests is $(122) million for the three months
ended March 31, 2021 and $(5,918) million for the three months ended March 31, 2022.

```

Question 9

```
response = qa.invoke({"query":"what is the Net cash used in operating activities for
Mrach 31,2021? "})
response['result']

############## RESPONSE ###############################

Net cash used in operating activities for March 31, 2021 was $611 million.

```

Question 10

```
query = "Based on the CONDENSED CONSOLIDATED STATEMENTS OF CASH FLOWS What is the
value of Purchases of property and equipment ?"
response = qa.invoke({"query":query})
response['result']

####################### RESPONSE #####################################


The value of purchases of property and equipment for the three months ended March 31,
2021 and 2022 can be found in the 'Cash flows from investing activities' section of the
condensed consolidated statements of cash flows.

For the three months ended March 31, 2021: $71 million

For the three months ended March 31, 2022: $62 million

```

Question 11

```
query = "Based on the CONDENSED CONSOLIDATED STATEMENTS OF CASH FLOWS what is the
Purchases of property and equipment for the year 2022?"
response = qa.invoke({"query":query})
response['result']

########### RESPONSE #####################################

The purchases of property and equipment for the year 2022 based on the CONDENSED
CONSOLIDATED STATEMENTS OF CASH FLOWS is -62.

```

From the above implementation we can conclude that LlamaParse is comparatively good at parsing complex PDF documents, although we still have to experiment with more tabular structures. A comparison of LlamaParse with PyPDF can be found in the references below.

ref:

https://www.llamaindex.ai/blog/launching-the-first-genai-native-document-parsing-
platform

https://medium.com/the-ai-forum/rag-on-complex-pdf-using-llamaparse-langchain-and-
groq-5b132bd1f9f3

https://wow.groq.com/retrieval-augmented-generation-with-groq-api/

Use Case – 1
Conversational AI chatbot

implementation-1-A4000

- We use 2XA4000 GPUs with low memory and the Mistral 7B model in this experiment.

- The first big challenge is running the model within this limited memory; for this we had to use quantization.


# Code Implementation

**Import required dependencies**

```
# import dependencies
import pysqlite3
import sys
sys.modules["sqlite3"] = sys.modules.pop("pysqlite3")

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig,
pipeline, AutoConfig, TextStreamer, TextIteratorStreamer

import os
import gradio as gr

from langchain.llms import HuggingFacePipeline


from langchain.document_loaders import TextLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter, CharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain import HuggingFacePipeline
from langchain.document_loaders import PyPDFDirectoryLoader #for pdf
from langchain.chains import ConversationalRetrievalChain
from langchain.memory import ConversationBufferMemory, ConversationSummaryMemory
from langchain.document_loaders import UnstructuredURLLoader #for html
from langchain_community.vectorstores import FAISS
from IPython.display import Audio, display
from gtts import gTTS
from io import BytesIO
import base64
import time
from langchain_community.document_loaders import DirectoryLoader
from langchain.chains import RetrievalQA
from langchain.prompts import PromptTemplate

```

**define template:**

```
template = """You are an assistant for question-answering tasks. Use the following
pieces of retrieved context to answer the question. If you don't know the answer,
just say that you don't know. Use two sentences maximum and keep the answer concise.
Question: {question}
Context: {context}
Answer:"""

prompt = PromptTemplate.from_template(template)

```

This code defines a template for a question-answering system using the `langchain.prompts` module. The template is a string that contains placeholders for a question and context. The system will use this context to answer the given question.

Here's a breakdown of the template string:

1. `You are an assistant for question-answering tasks.` - This line introduces the purpose
of the system.

2. `Use the following pieces of retrieved context to answer the question.` - This line
instructs the system to use the provided context to generate an answer.

3. `If you don't know the answer, just say that you don't know.` - This line sets an
expectation for the system to respond honestly when it can't answer a question.

4. `Use two sentences maximum and keep the answer concise.` - This line encourages the
system to provide brief and to-the-point answers.

5. `Question: {question}` - This placeholder will be replaced with the actual question at runtime.

6. `Context: {context}` - This placeholder will be replaced with the actual context at runtime.

7. `Answer:` - This line separates the context from the system-generated answer.
The `prompt` variable is created using the `PromptTemplate.from_template()` function,
which converts the template string into a `PromptTemplate` object. This object can then
be used to generate prompts for the question-answering system.

**ASR:**

```
import whisper
model_whisper = whisper.load_model("base")

def transcribe(audio):

    start_time = time.time()

    language = 'en'

    # load audio and pad/trim it to fit 30 seconds
    audio = whisper.load_audio(audio)
    audio = whisper.pad_or_trim(audio)

    # make log-Mel spectrogram and move to the same device as the model
    mel = whisper.log_mel_spectrogram(audio).to(model_whisper.device)

    # detect the spoken language
    _, probs = model_whisper.detect_language(mel)
    print(f"Detected language: {max(probs, key=probs.get)}")

    # decode the audio
    options = whisper.DecodingOptions()
    result = whisper.decode(model_whisper, mel, options)

    print("---ASR: %s seconds ---" % (time.time() - start_time))

    return result.text
#################################################

```

This code is using the `whisper` library for automatic speech recognition (ASR) to
transcribe audio files into text.

1. `import whisper` - Import the `whisper` library, which is a Python library for speech
recognition.

2. `model_whisper = whisper.load_model("base")` - Load the pre-trained base model


from the `whisper` library.

3. `def transcribe(audio):` - Define a function called `transcribe` that takes an audio file
as input.

4. `start_time = time.time()` - Record the start time for measuring the transcription time.
5. `language = 'en'` - Set the language for the transcription to English.

6. `audio = whisper.load_audio(audio)` - Load the audio file using


the `whisper.load_audio()` function.

7. `audio = whisper.pad_or_trim(audio)` - Pad or trim the audio to fit a length of 30


seconds.

8. `mel = whisper.log_mel_spectrogram(audio).to(model_whisper.device)` - Convert the


audio into a log-Mel spectrogram and move it to the same device as the model.

9. `_, probs = model_whisper.detect_language(mel)` - Detect the language spoken in the


audio using the `detect_language()` function.

10. `options = whisper.DecodingOptions()` - Create an instance of


the `DecodingOptions` class for decoding the audio.

11. `result = whisper.decode(model_whisper, mel, options)` - Decode the audio using


the `decode()` function, which returns a `DecodingResult` object.

12. `print("---ASR: %s seconds ---" % (time.time() - start_time))` - Calculate and print


the time taken for the transcription.

13. `return result.text` - Return the transcribed text from the `DecodingResult` object.

The `transcribe()` function can be called by passing an audio file path as an argument to
transcribe the audio into text.
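
For example, assuming a local recording at a hypothetical path such as `sample.wav`:

```
text = transcribe("sample.wav")  # hypothetical audio file path
print(text)
```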

**utilize the two GPUs**

```
device_ids = [0, 1] # Modify this list according to your GPU configuration
primary_device = f'cuda:{device_ids[1]}' # Primary device
torch.cuda.set_device(primary_device)

```

This code sets the primary GPU device for PyTorch to use for computations.

1. `device_ids = [0, 1]` - Define a list of GPU device IDs available for use. In this case,
both devices with IDs 0 and 1 are included. Modify this list according to your GPU
configuration.

2. `primary_device = f'cuda:{device_ids[1]}'` - Set the primary device to the second


GPU in the list (index 1). In this case, it is set to 'cuda:1' assuming that the GPU at index 1
is available.

3. `torch.cuda.set_device(primary_device)` - Set the primary device for PyTorch to use


for computations using the `torch.cuda.set_device()` function.

After running this code, PyTorch will use the specified GPU as the primary device for
computations. If you have multiple GPUs and want to utilize them for parallel processing,
you can modify the `device_ids` list and the `primary_device` assignment accordingly.

**initialize tokenizer:**

```
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "right"

```

This code initializes a tokenizer for a pre-trained model and sets padding configurations.

1. `tokenizer = AutoTokenizer.from_pretrained(model_name,
trust_remote_code=True)` - Initialize a tokenizer for a pre-trained model using
the `AutoTokenizer.from_pretrained()` function.
The `trust_remote_code=True` argument allows the function to download and execute
pre-trained model scripts from a remote location if necessary.

2. `tokenizer.pad_token = tokenizer.eos_token` - Set the padding token for the tokenizer


to be the end-of-sentence token. This ensures that sequences are padded with the
appropriate token during tokenization.

3. `tokenizer.padding_side = "right"` - Set the padding side to the right. This means that
when sequences are padded, the padding tokens will be added to the right side of the
sequence.

After running this code, you will have a tokenizer object configured for a pre-trained
model with padding settings applied. This tokenizer can be used for tokenizing input
sequences and preparing them for input into a pre-trained model.

**quantization:**

```
config = BitsAndBytesConfig(
load_in_4bit=True,
bnb_4bit_use_double_quant=True,
bnb_4bit_quant_type="nf4",
bnb_4bit_compute_dtype=torch.bfloat16)

```

This code initializes a `BitsAndBytesConfig` object for configuring quantization settings


for a model.

1. `BitsAndBytesConfig(...)` - Create a `BitsAndBytesConfig` object to configure


quantization settings for a model.

2. `load_in_4bit=True` - Enable loading the model in 4-bit precision.

3. `bnb_4bit_use_double_quant=True` - Enable double quantization for 4-bit models.

4. `bnb_4bit_quant_type="nf4"` - Set the quantization type to "nf4" (4-bit NormalFloat, the data type introduced in the QLoRA paper).

5. `bnb_4bit_compute_dtype=torch.bfloat16` - Set the compute dtype


to `torch.bfloat16` for 4-bit models.

After running this code, you will have a `BitsAndBytesConfig` object with the specified
quantization settings. This object can be used for configuring a model to use 4-bit
quantization during training or inference.

**initialize LLM:**

```
model_name = 'mistralai/Mistral-7B-Instruct-v0.2'
model_config = AutoConfig.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name,
                                             quantization_config=config,
                                             config=model_config,
                                             device_map='auto')

```

This code initializes a pre-trained language model with quantization settings.

1. `model_name='mistralai/Mistral-7B-Instruct-v0.2'` - Specify the name of the pre-


trained model.

2. `model_config = AutoConfig.from_pretrained(model_name,
trust_remote_code=True)` - Initialize a model configuration for the pre-trained model
using the `AutoConfig.from_pretrained()` function.

3. `model =
AutoModelForCausalLM.from_pretrained(model_name,quantization_config=config,confi
g=model_config,device_map='auto')` - Initialize the pre-trained model using
the `AutoModelForCausalLM.from_pretrained()` function.
The `quantization_config` argument is set to the `config` object created earlier, which
enables quantization for the model. The `config` argument is set to
the `model_config` object, which specifies the model's configuration.
The `device_map` argument is set to 'auto', which automatically maps the model's layers
to the available GPUs.

After running this code, you will have a pre-trained language model initialized with the
specified quantization settings. This model can be used for natural language processing
tasks such as text generation, question answering, and more.

**Use both GPUs together:**

```
# Move model to GPUs
model = torch.nn.DataParallel(model, device_ids=device_ids)

```

This code moves the model to the specified GPUs using


PyTorch's `DataParallel` module.

1. `model = torch.nn.DataParallel(model, device_ids=device_ids)` - Wrap the model with


PyTorch's `DataParallel` module. This module replicates the model across the specified
GPUs and handles data distribution and synchronization during training.

- `model` - The model to be parallelized.

- `device_ids` - A list of GPU IDs to use for parallel processing.

After running this code, the model will be parallelized across the specified GPUs,
allowing for efficient data distribution and computation during training.

**initialize pipeline:**

```
#streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
streamer = TextIteratorStreamer(tokenizer, skip_prompt=True,
skip_special_tokens=True)

pipeline = pipeline(task='text-generation',
model=model.module,
tokenizer=tokenizer,
# temperature=0.1,
repetition_penalty=1.1,
return_full_text=True,
max_new_tokens=1500,
do_sample=False,
pad_token_id = tokenizer.eos_token_id,
eos_token_id = tokenizer.eos_token_id,
streamer = streamer
)

llm = HuggingFacePipeline(pipeline=pipeline)

```

This code creates a Hugging Face pipeline for text generation using a pre-trained language
model.

1. `streamer = TextIteratorStreamer(tokenizer, skip_prompt=True,


skip_special_tokens=True)` - Initialize a `TextIteratorStreamer` object for streaming
tokenized input and output.

- `tokenizer` - The tokenizer associated with the pre-trained model.

- `skip_prompt=True` - Skip the prompt token during streaming.

- `skip_special_tokens=True` - Skip special tokens during streaming.

2. `pipeline = pipeline(task='text-generation', ...)` - Initialize a Hugging Face pipeline for


text generation.

- `task='text-generation'` - Specify the task for the pipeline.

- `model=model.module` - The pre-trained model to use for text generation.

- `tokenizer=tokenizer` - The tokenizer associated with the pre-trained model.

- `repetition_penalty=1.1` - Apply a repetition penalty to discourage repeating the


same phrases.

- `return_full_text=True` - Return the full text instead of individual tokens.

- `max_new_tokens=1500` - Set the maximum number of new tokens to generate.

- `do_sample=False` - Disable sampling and use greedy decoding.

- `pad_token_id = tokenizer.eos_token_id` - Set the padding token ID to the end-of-


sentence token ID.

- `eos_token_id = tokenizer.eos_token_id` - Set the end-of-sentence token ID.

- `streamer = streamer` - Set the `TextIteratorStreamer` object for streaming


tokenized input and output.

3. `llm = HuggingFacePipeline(pipeline=pipeline)` - Wrap the pipeline in


a `HuggingFacePipeline` object for easier use.

After running this code, you will have a Hugging Face pipeline for text generation using a
pre-trained language model. You can use the `llm` object to generate text based on input
prompts.
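
As a quick sanity check you can prompt the wrapped model directly; a minimal sketch (depending on your LangChain version the call is `llm.invoke(prompt)` or simply `llm(prompt)`):

```
print(llm.invoke("Explain retrieval-augmented generation in one sentence."))
```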

**loading RAG data:**

```
text_loader_kwargs = {'autodetect_encoding': True}
loader_txt = DirectoryLoader("txt/", glob="./*.txt", loader_cls=TextLoader,
                             loader_kwargs=text_loader_kwargs)
documents_txt = loader_txt.load()

if 1 == 0:
    # load pdfs
    loader_pdfs = PyPDFDirectoryLoader('pdfs/')
    documents_pdfs = loader_pdfs.load()
    #print(documents)

if 1 == 0:
    urls = [
        "https://url1/",
        "https://url2/",
        "https://url3/"
    ]

    loader_urls = UnstructuredURLLoader(urls=urls)
    documents_htmls = loader_urls.load()

```

This code loads text documents from different sources, such as text files, PDFs, and web
pages.

1. `text_loader_kwargs={'autodetect_encoding': True}` - Set


the `autodetect_encoding` option to `True` for the text loader.

2. `loader_txt = DirectoryLoader("txt/", glob="./*.txt", loader_cls=TextLoader,


loader_kwargs=text_loader_kwargs)` - Initialize a `DirectoryLoader` object for loading
text files from the "txt" directory.

- `DirectoryLoader` - A loader for loading documents from a directory.

- `"txt/"` - The directory path for the text files.


- `glob="./*.txt"` - The glob pattern for matching text files.

- `loader_cls=TextLoader` - The loader class for loading text files.

- `loader_kwargs=text_loader_kwargs` - The loader arguments for the text loader.

3. `documents_txt = loader_txt.load()` - Load the text documents from the specified


directory.

4. The disabled code block guarded by `if 1==0:` loads PDFs from the "pdfs" directory using `PyPDFDirectoryLoader`.

5. The second disabled code block guarded by `if 1==0:` loads web pages from a list of URLs using `UnstructuredURLLoader`.

After running this code, you will have the text documents loaded into memory as a list
of `Document` objects. You can then use these documents for further processing, such as
text classification, information extraction, or other natural language processing tasks.

**initialize embeddings:**

```
#################################################
##### Embeddings Model setup
##### Vectorization

text_splitter = CharacterTextSplitter(chunk_size=50, chunk_overlap=5)

all_splits = text_splitter.split_documents(documents_txt)

# specify embedding model (using huggingface sentence transformer)


embedding_model_name = "sentence-transformers/all-mpnet-base-v2"
model_kwargs = {"device": "cuda"}
embeddings = HuggingFaceEmbeddings(model_name=embedding_model_name,
model_kwargs=model_kwargs)

```

This code sets up the embedding model for vectorization of text documents.

1. `text_splitter = CharacterTextSplitter(chunk_size=50, chunk_overlap=5)` - Initialize


a `CharacterTextSplitter` object for splitting documents into smaller chunks.

2. `all_splits = text_splitter.split_documents(documents_txt)` - Split


the `documents_txt` list of `Document` objects into smaller chunks.

3. `embedding_model_name = "sentence-transformers/all-mpnet-base-v2"` - Specify the


embedding model name using Hugging Face Sentence Transformer.
4. `model_kwargs = {"device": "cuda"}` - Set up the model arguments for the embedding
model.

5. `embeddings = HuggingFaceEmbeddings(model_name=embedding_model_name,
model_kwargs=model_kwargs)` - Initialize the `HuggingFaceEmbeddings` object for
generating embeddings using the specified embedding model.

After running this code, you will have a `HuggingFaceEmbeddings` object that can be
used for generating embeddings for the text chunks. These embeddings can then be used
for various natural language processing tasks such as clustering, classification, or
similarity search.
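
For instance, a single query string can be embedded directly through the standard LangChain embeddings interface; a minimal sketch:

```
vector = embeddings.embed_query("What topics do the documents cover?")
print(len(vector))  # all-mpnet-base-v2 produces 768-dimensional vectors
```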

**initialize vectorstore:**

```
# document chunks and embeddings
vectordb = FAISS.from_documents(all_splits, embeddings)

retriever = vectordb.as_retriever()

```

This code creates a vector database using the FAISS library and a retriever for the
document chunks and their corresponding embeddings.

1. `vectordb = FAISS.from_documents(all_splits, embeddings)` - Create a vector


database using the FAISS library with the `all_splits` list of text chunks and their
corresponding embeddings generated by the `embeddings` object.

2. `retriever = vectordb.as_retriever()` - Create a retriever object from the vector


database for efficient similarity search.

After running this code, you will have a vector database and a retriever object that can be
used for efficient similarity search and retrieval of document chunks based on their
embeddings.
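
A quick sketch of querying the retriever on its own (older LangChain versions expose `get_relevant_documents`, newer ones also accept `retriever.invoke(...)`):

```
docs = retriever.get_relevant_documents("What topics do the documents cover?")
for d in docs:
    print(d.page_content[:100])  # preview the top matching chunks
```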

**initialize chain**

```
qa_chain = RetrievalQA.from_chain_type(llm=llm, retriever=retriever,
chain_type_kwargs={"prompt": prompt})

```

This code creates a RetrievalQA chain from the specified language model (`llm`), retriever (`retriever`), and prompt.
1. `RetrievalQA.from_chain_type(llm=llm, retriever=retriever, chain_type_kwargs={"prompt": prompt})` - Build the chain, wiring the language model and retriever together and passing the custom prompt through `chain_type_kwargs`.

- `llm` - The language model to be used for generating answers.

- `retriever` - The retriever object for efficient similarity search and retrieval of
document chunks based on their embeddings.

- `chain_type_kwargs` - A dictionary of keyword arguments for the RetrievalQA


chain.

- `"prompt"` - A prompt for the RetrievalQA chain.

After running this code, you will have a RetrievalQA chain that can be used for question
answering tasks by combining the language model's ability to generate answers and the
retriever's ability to efficiently search and retrieve relevant document chunks based on
their embeddings.

## RetrievalQA Chain

We will first see how to do question answering after multiple relevant splits have been
retrieved from the vector store. We may also need to compress the relevant splits to fit
into the LLM context. Finally, we send these splits along with a system prompt and
human question to the language model to get the answer.

By default, we pass all the chunks into the same context window, into the same call of the
language model. But, we can also use other methods in case the number of documents is
high and if we can't pass them all in the same context window. MapReduce, Refine, and
MapRerank are three methods that can be used if the number of documents is high. Now,
we will look into these methods in detail.
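
Switching between these strategies is done through the `chain_type` argument of `RetrievalQA.from_chain_type`; a minimal sketch, leaving the rest of the setup unchanged (the custom prompt above is written for the "stuff" chain and is therefore omitted here):

```
qa_chain_mr = RetrievalQA.from_chain_type(
    llm=llm,
    chain_type="map_reduce",   # alternatives: "refine", "map_rerank"
    retriever=retriever,
)
result = qa_chain_mr({"query": "Summarize the main topics of the documents."})
print(result["result"])
```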

**handle conversation**

```
# create conversation using rag in memory
def create_conversation(query: str, chat_history: list) -> tuple:
    try:
        start_time = time.time()
        result = qa_chain(query)
        chat_history.append((query, result["result"]))

        # return '', chat_history, text_to_speech(result['answer'])
        return '', chat_history, text_to_speech(result['result'])

    except Exception as e:
        chat_history.append((query, e))
        return '', chat_history, ''
```

This code defines a function `create_conversation` that takes a user query and a chat
history as input and returns a tuple containing an empty string, the updated chat history,
and a text-to-speech converted response.

1. `def create_conversation(query: str, chat_history: list) -> tuple:` - Define a


function `create_conversation` that takes a user query (`query`) and a chat history
(`chat_history`) as input and returns a tuple.

2. `try:` - Begin a try block for error handling.

3. `start_time = time.time()` - Record the start time for calculating the response time.

4. `result = qa_chain(query)` - Run the RetrievalQA chain on the user query to retrieve relevant chunks and generate an answer.

5. `chat_history.append((query, result["result"]))` - Append the user query and the corresponding response to the chat history.

6. `return '', chat_history, text_to_speech(result['result'])` - Return an empty string, the updated chat history, and a text-to-speech converted response.

7. `except Exception as e:` - Catch any exceptions that occur during the execution of the function.

8. `chat_history.append((query, e))` - Append the user query and the corresponding error message to the chat history.

9. `return '', chat_history, ''` - Return an empty string, the updated chat history, and an empty string.

The function `create_conversation` is designed to handle user queries and update the chat
history with the corresponding responses or error messages. The text-to-speech converted
response is also returned along with the chat history.
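
The `text_to_speech` helper used above is not shown in this section; a minimal sketch of what it could look like, assuming gTTS (already imported earlier) and an HTML `<audio>` tag that the Gradio `html` component can render:

```
def text_to_speech(text: str) -> str:
    """Convert the answer text to speech and return an autoplaying HTML audio tag."""
    mp3_buffer = BytesIO()
    gTTS(text=text, lang="en").write_to_fp(mp3_buffer)  # synthesize speech into memory
    mp3_buffer.seek(0)
    b64 = base64.b64encode(mp3_buffer.read()).decode()
    return f'<audio controls autoplay src="data:audio/mp3;base64,{b64}"></audio>'
```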

## RetrievalQA chain with Prompt

Let’s try to understand a little bit better what’s going on underneath the hood. First, we
define the prompt template. The prompt template has instructions about how to use the
context. It also has a placeholder for a context variable. We will use prompts to get
answers to a question. Here, the prompt takes in the documents and the question and
passes it to a language model.

**Using Gradio to speed up chatbot building**

```
def bot(history):
    print("Question: ", history[-1][0])
    llm_chain.run(question=history[-1][0])
    history[-1][1] = ""
    for character in llm.streamer:
        print(character)
        history[-1][1] += character
        yield history

# build gradio ui
with gr.Blocks() as bot_interface:

    with gr.Row():
        chatbot = gr.Chatbot()
    with gr.Row():
        with gr.Column():
            html = gr.HTML()
    with gr.Row():
        with gr.Column():
            msg = gr.Textbox()
    with gr.Row():
        with gr.Column():
            audio_input = gr.Audio(type="filepath")
            user_input = gr.Textbox()
            gr.Interface(
                fn=transcribe,
                inputs=[audio_input],
                outputs=[user_input],
                live=True)

    # TEXT INPUT
    msg.submit(create_conversation, [msg, chatbot], [msg, chatbot, html])

```

This code defines a function `bot` that takes a chat history as input and generates a
response using a language model and updates the chat history. It also builds a Gradio user
interface for the chatbot.

1. `def bot(history):` - Define a function `bot` that takes a chat history as input.

2. `print("Question: ", history[-1][0])` - Print the user's question.

3. `llm_chain.run(question=history[-1][0])` - Run the language model with the user's


question.

4. `history[-1][1] = ""` - Clear the previous response.


5. `for character in llm.streamer:` - Iterate over the characters in the language model's
response.

6. `history[-1][1] += character` - Append each character to the response.

7. `yield history` - Yield the updated chat history.

8. `with gr.Blocks() as bot_interface:` - Define a Gradio user interface for the chatbot.

9. `with gr.Row():` - Define a row in the user interface.

10. `chatbot = gr.Chatbot()` - Define a chatbot component.

11. `with gr.Row():` - Define a row in the user interface.

12. `with gr.Column():` - Define a column in the user interface.

13. `html = gr.HTML()` - Define an HTML component.

14. `with gr.Row():` - Define a row in the user interface.

15. `with gr.Column():` - Define a column in the user interface.

16. `msg = gr.Textbox()` - Define a textbox for user input.

17. `audio_input=gr.Audio(type="filepath")` - Define an audio input component.

18. `user_input = gr.Textbox()` - Define a textbox for the transcription of the user's audio
input.

19. `gr.Interface(fn=transcribe, inputs=[audio_input], outputs=[user_input], live=True)` -


Define an interface for transcribing the user's audio input.

20. `msg.submit(create_conversation, [msg, chatbot], [msg, chatbot, html])` - Define a


submit button for the textbox that triggers the `create_conversation` function.

The `bot` function generates a response using a language model and updates the chat
history. The Gradio user interface allows the user to interact with the chatbot through text
or audio input. The `create_conversation` function is called when the user submits a text
input, updating the chat history with the user's question and the language model's
response. The HTML component can be used to display additional information, such as
the response time or the confidence score of the language model's response.
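
Finally, the interface is served with Gradio's `launch` method (pass `share=True` if a temporary public URL is needed):

```
bot_interface.launch(server_name="0.0.0.0", server_port=7860)
```
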
## limitations:

- This code uses the RetrievalQA chain, which is not the best option for dialogue and
conversation; we used this chain due to the server's limited resources. (RetrievalQA is
faster than other chains).

- As per the results, we will not be able to use the server in production.

## RetrievalQA limitations

One of the biggest disadvantages of RetrievalQA chain is that the QA chain fails to
preserve conversational history. This can be checked as follows:

```
# Create a QA Chain
qa_chain = RetrievalQA.from_chain_type(
llm,
retriever=vectordb.as_retriever()
)

```

We will now ask a question to the chain.

```
question = "Is probability a class topic?"
result = qa_chain({"query": question})
result["result"]

```

Now, we will ask a second question to the chain.

```
question = "why are those prerequesites needed?"
result = qa_chain({"query": question})
result["result"]

```

We were able to get a reply from the chain which was not related to the previous answer. Basically, the RetrievalQA chain doesn’t have any concept of state: it doesn’t remember what the previous questions or answers were. In order for the chain to remember the previous question or answer, we need to introduce the concept of memory. This ability to remember previous questions and answers is required for chatbots, where users ask follow-up questions or ask for clarification about previous answers.
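
One way to introduce such memory, using classes already imported above, is LangChain's `ConversationalRetrievalChain` combined with `ConversationBufferMemory`; a minimal sketch:

```
memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True)

conv_chain = ConversationalRetrievalChain.from_llm(
    llm=llm,
    retriever=vectordb.as_retriever(),
    memory=memory,
)

print(conv_chain({"question": "Is probability a class topic?"})["answer"])
print(conv_chain({"question": "Why are those prerequisites needed?"})["answer"])
```
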
implementation-2-A100
- We use 1XA100 GPUs @GCP with the Mistral 7B model in this experiment.

# Code Implementation

I will list only the differences here:

- Handling of **multiple GPUs** has been **removed**, as we are using a single GPU.

- **Quantization** has been **removed**, as the full model now fits in GPU memory.

implementation-3-groq
- In this implementation, we use remote calls for ASR, TTS, and the LLM.

- We are utilizing Deepgram for ASR and TTS, and Groq for inference.

- We need to create an API key for Groq here: https://console.groq.com/keys

- We need to create an API key for Deepgram as per the doc: https://developers.deepgram.com/docs/create-additional-api-keys

# Code Implementation

**Import required dependencies**

```
import asyncio
from dotenv import load_dotenv
import shutil
import subprocess
import requests
import time
import os
from langchain_community.vectorstores import FAISS
from langchain_community.document_loaders import DirectoryLoader
from langchain.document_loaders import TextLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter, CharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.prompts import PromptTemplate
from langchain.chains import ConversationalRetrievalChain, ConversationChain
from langchain.chains.qa_with_sources import load_qa_with_sources_chain
from langchain.chains import create_history_aware_retriever
from langchain.chains import create_retrieval_chain
from langchain.chains.combine_documents import create_stuff_documents_chain
from langchain_core.messages import HumanMessage
from langchain_core.prompts import ChatPromptTemplate
from langchain_groq import ChatGroq
from langchain_openai import ChatOpenAI
from langchain.memory import ConversationBufferMemory, VectorStoreRetrieverMemory
from langchain.prompts import (
ChatPromptTemplate,
MessagesPlaceholder,
SystemMessagePromptTemplate,
HumanMessagePromptTemplate,
)
from langchain.chains import LLMChain
from langchain_core.runnables import RunnablePassthrough
from langchain_core.output_parsers import StrOutputParser
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig, pipeline
from langchain.llms import HuggingFacePipeline
from langchain_core.chat_history import BaseChatMessageHistory
from langchain_community.chat_message_histories import ChatMessageHistory
from langchain_core.runnables.history import RunnableWithMessageHistory
import sys
from deepgram import (
DeepgramClient,
DeepgramClientOptions,
LiveTranscriptionEvents,
LiveOptions,
Microphone,
)

```

**Import the ChatGroq class and initialize it with a model:**

```
self.llm = ChatGroq(temperature=0, model_name="mixtral-8x7b-32768",
groq_api_key=os.getenv("GROQ_API_KEY"))

```

This code snippet initializes a LangChain `ChatGroq` chat model that talks to the Groq API. It retrieves the API key from an environment variable named GROQ_API_KEY and passes it via the `groq_api_key` argument, enabling API calls to the Large Language Models hosted on Groq servers.

**Load docs for RAG the same way as in the previous implementations**

**Use the same embedding model (sentence-transformers/all-mpnet-base-v2)**

**Use the same vector DB (FAISS)**


# Add chat history

In many Q&A applications we want to allow the user to have a back-and-forth conversation,
meaning the application needs some sort of “memory” of past questions and answers, and
some logic for incorporating those into its current thinking.

In this guide we focus on **adding logic for incorporating historical messages.** Further
details on chat history management is [covered
here](https://python.langchain.com/docs/expression_language/how_to/message_history/).

We’ll work off of the Q&A app we built over the [LLM Powered Autonomous
Agents](https://lilianweng.github.io/posts/2023-06-23-agent/ ) blog post by Lilian Weng in
the
[Quickstart](https://python.langchain.com/docs/use_cases/question_answering/quickstart/ ).
We’ll need to update two things about our existing app:

1. **Prompt**: Update our prompt to support historical messages as an input.

2. **Contextualizing questions**: Add a sub-chain that takes the latest user question and
reformulates it in the context of the chat history. This is needed in case the latest question
references some context from past messages. For example, if a user asks a follow-up
question like “Can you elaborate on the second point?”, this cannot be understood without
the context of the previous message. Therefore we can’t effectively perform retrieval with a
question like this.

## [Contextualizing the question](https://python.langchain.com/docs/use_cases/question_answering/chat_history/#contextualizing-the-question)

First we’ll need to define a sub-chain that takes historical messages and the latest user
question, and reformulates the question if it makes reference to any information in the
historical information.

We’ll use a prompt that includes a `MessagesPlaceholder` variable under the name
“chat_history”. This allows us to pass in a list of Messages to the prompt using the
“chat_history” input key, and these messages will be inserted after the system message and
before the human message containing the latest question.

Note that we leverage a helper function


[create_history_aware_retriever](https://api.python.langchain.com/en/latest/chains/langchain
.chains.history_aware_retriever.create_history_aware_retriever.html ) for this step, which
manages the case where `chat_history` is empty, and otherwise applies `prompt | llm |
StrOutputParser() | retriever` in sequence.

`create_history_aware_retriever` constructs a chain that accepts keys `input` and


`chat_history` as input, and has the same output schema as a retriever.

```
contextualize_q_system_prompt = """Given a chat history and the latest user question \
which might reference context in the chat history, formulate a standalone question \
which can be understood without the chat history. Do NOT answer the question, \
just reformulate it if needed and otherwise return it as is."""
contextualize_q_prompt = ChatPromptTemplate.from_messages(
[
("system", contextualize_q_system_prompt),
MessagesPlaceholder("chat_history"),
("human", "{input}"),
]
)
history_aware_retriever = create_history_aware_retriever(
llm, retriever, contextualize_q_prompt
)

```

This chain prepends a rephrasing of the input query to our retriever, so that the retrieval
incorporates the context of the conversation.

## [Chain with chat history](https://python.langchain.com/docs/use_cases/question_answering/chat_history/#chain-with-chat-history)

And now we can build our full QA chain.

Here we use
[create_stuff_documents_chain](https://api.python.langchain.com/en/latest/chains/langchain.
chains.combine_documents.stuff.create_stuff_documents_chain.html ) to generate a
`question_answer_chain`, with input keys `context`, `chat_history`, and `input`– it accepts
the retrieved context alongside the conversation history and query to generate an answer.
We build our final `rag_chain` with
[create_retrieval_chain](https://api.python.langchain.com/en/latest/chains/langchain.chains.r
etrieval.create_retrieval_chain.html ). This chain applies the `history_aware_retriever` and
`question_answer_chain` in sequence, retaining intermediate outputs such as the retrieved
context for convenience. It has input keys `input` and `chat_history`, and includes `input`,
`chat_history`, `context`, and `answer` in its output.

```
qa_system_prompt = """You are an assistant for question-answering tasks. \
Use the following pieces of retrieved context to answer the question. \
If you don't know the answer, just say that you don't know. \
Use three sentences maximum and keep the answer concise.\

{context}"""
qa_prompt = ChatPromptTemplate.from_messages(
[
("system", qa_system_prompt),
MessagesPlaceholder("chat_history"),
("human", "{input}"),
]
)

question_answer_chain = create_stuff_documents_chain(llm, qa_prompt)

rag_chain = create_retrieval_chain(history_aware_retriever, question_answer_chain)

```

## Code Flow:

Here we’ve gone over how to add application logic for incorporating historical outputs, but
we’re still manually updating the chat history and inserting it into each input. In a real Q&A
application we’ll want some way of persisting chat history and some way of automatically
inserting and updating it.

For this we can use:

-
[BaseChatMessageHistory](https://python.langchain.com/docs/modules/memory/chat_messa
ges/ ): Store chat history.

-
[RunnableWithMessageHistory](https://python.langchain.com/docs/expression_language/ho
w_to/message_history/ ): Wrapper for an LCEL chain and a `BaseChatMessageHistory`
that handles injecting chat history into inputs and updating it after each invocation.

For a detailed walkthrough of how to use these classes together to create a stateful
conversational chain, head to the [How to add message history
(memory)](https://python.langchain.com/docs/expression_language/how_to/message_histor
y/ ) LCEL page.

Below, we implement a simple example of the second option, in which chat histories are
stored in a simple dict.

Full Code:

```
# Note: this snippet is taken from inside a class method, hence the `self.` references.

### Contextualize question ###
contextualize_q_system_prompt = """Given a chat history and the latest user question \
which might reference context in the chat history, formulate a standalone question \
which can be understood without the chat history. Do NOT answer the question, \
just reformulate it if needed and otherwise return it as is."""
contextualize_q_prompt = ChatPromptTemplate.from_messages(
    [
        ("system", contextualize_q_system_prompt),
        MessagesPlaceholder("chat_history"),
        ("human", "{input}"),
    ]
)

history_aware_retriever = create_history_aware_retriever(
    self.llm, self.retriever, contextualize_q_prompt
)

### Answer question ###
qa_system_prompt = """You are an assistant for question-answering tasks. \
Use the following pieces of retrieved context to answer the question. \
If you don't know the answer, just say that you don't know. \
Use three sentences maximum and keep the answer concise.\

{context}"""
qa_prompt = ChatPromptTemplate.from_messages(
    [
        ("system", qa_system_prompt),
        MessagesPlaceholder("chat_history"),
        ("human", "{input}"),
    ]
)
question_answer_chain = create_stuff_documents_chain(self.llm, qa_prompt)

self.rag_chain = create_retrieval_chain(history_aware_retriever, question_answer_chain)

def get_session_history(session_id: str) -> BaseChatMessageHistory:
    if session_id not in self.store:
        self.store[session_id] = ChatMessageHistory()
    return self.store[session_id]

self.conversational_rag_chain = RunnableWithMessageHistory(
    self.rag_chain,
    get_session_history,
    input_messages_key="input",
    history_messages_key="chat_history",
    output_messages_key="answer",
)

start_time = time.time()

# Go get the response from the LLM
response = self.conversational_rag_chain.invoke(
    {"input": text},
    config={"configurable": {"session_id": "abc123"}},
)
end_time = time.time()
elapsed_time = int((end_time - start_time) * 1000)
print(f"LLM ({elapsed_time}ms): {response['answer']}")
self.chat_history.extend([HumanMessage(content=text), response["answer"]])
return response["answer"]

```

## ASR with Deepgram:

```
class TranscriptCollector:
    def __init__(self):
        self.reset()

    def reset(self):
        self.transcript_parts = []

    def add_part(self, part):
        self.transcript_parts.append(part)

    def get_full_transcript(self):
        return ' '.join(self.transcript_parts)

transcript_collector = TranscriptCollector()
```

`TranscriptCollector` is a class that is used to collect and manage parts of a transcript. A transcript is typically a written or printed record of what was said during a conversation, meeting, or interview.

Here's a breakdown of the class:

- `__init__`: This is a special method that is automatically called when an object of the
class is created. In this case, it calls the `reset` method, which initializes an empty list
`transcript_parts`.

- `reset`: This method resets the `transcript_parts` list to an empty list.

- `add_part`: This method adds a new part to the `transcript_parts` list.

- `get_full_transcript`: This method returns the full transcript by joining all the parts in the
`transcript_parts` list with a space.

The last line creates an instance of the `TranscriptCollector` class and assigns it to the
variable `transcript_collector`.
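A quick usage sketch (not part of the original code) showing how the collector accumulates partial results and is reset between sentences:

```
collector = TranscriptCollector()
collector.add_part("Hello,")            # interim result from the ASR stream
collector.add_part("how are you?")      # final part of the sentence

print(collector.get_full_transcript())  # -> "Hello, how are you?"

collector.reset()                       # start fresh for the next sentence
print(collector.get_full_transcript())  # -> ""
```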

```
import asyncio

# deepgram-sdk v3.x imports
from deepgram import (
    DeepgramClient,
    DeepgramClientOptions,
    LiveTranscriptionEvents,
    LiveOptions,
    Microphone,
)

async def get_transcript(callback):
    transcription_complete = asyncio.Event()  # Event to signal transcription completion

    try:
        # Example of setting up a client config. Logging values: WARNING, VERBOSE, DEBUG, SPAM
        config = DeepgramClientOptions(options={"keepalive": "true"})
        deepgram: DeepgramClient = DeepgramClient("", config)

        dg_connection = deepgram.listen.asynclive.v("1")
        print("Listening...")

        async def on_message(self, result, **kwargs):
            sentence = result.channel.alternatives[0].transcript

            if not result.speech_final:
                transcript_collector.add_part(sentence)
            else:
                # This is the final part of the current sentence
                transcript_collector.add_part(sentence)
                full_sentence = transcript_collector.get_full_transcript()
                # Check if the full_sentence is not empty before printing
                if len(full_sentence.strip()) > 0:
                    full_sentence = full_sentence.strip()
                    print(f"Human: {full_sentence}")
                    callback(full_sentence)  # Call the callback with the full_sentence
                    transcript_collector.reset()
                    transcription_complete.set()  # Signal to stop transcription and exit

        dg_connection.on(LiveTranscriptionEvents.Transcript, on_message)

        options = LiveOptions(
            model="nova-2",
            punctuate=True,
            language="en-US",
            encoding="linear16",
            channels=1,
            sample_rate=16000,
            endpointing=300,
            smart_format=True,
        )

        await dg_connection.start(options)

        # Open a microphone stream on the default input device
        microphone = Microphone(dg_connection.send)
        microphone.start()

        # Wait for the transcription to complete instead of looping indefinitely
        await transcription_complete.wait()

        # Wait for the microphone to close
        microphone.finish()

        # Indicate that we've finished
        await dg_connection.finish()

    except Exception as e:
        print(f"Could not open socket: {e}")
        return
```

This code is an implementation of a speech-to-text system using the Deepgram API. Here's a
breakdown of what the code does:

1. Creates an instance of `TranscriptCollector`.

2. The `get_transcript` function is an asynchronous function that starts a transcription process. It uses the Deepgram API to listen to the user's microphone and transcribe the audio in real-time.

3. The function sets up a Deepgram client with a configuration that keeps the connection alive.

4. It then starts a live transcription session with the `listen.asynclive.v("1")` method. This method returns a `dg_connection` object that is used to send and receive data.

5. The function defines an `on_message` function that is called whenever a new message is
received from the Deepgram API. This function is responsible for processing the
transcription data.

6. The `on_message` function checks if the received message is the final part of a sentence.
If it is, it adds the sentence to the `TranscriptCollector` and resets it. It then prints the full
transcript and calls the `callback` function with the full transcript.

7. The function then starts the transcription process by calling `dg_connection.start(options)`. This method starts the transcription with the specified options.

8. It then opens a microphone stream and starts it.

9. The function waits for the transcription to complete by calling `transcription_complete.wait()`. This call blocks until the transcription is complete.

10. After the transcription is complete, the function waits for the microphone to close and
then finishes the transcription process by calling `dg_connection.finish()`.

11. If any exceptions occur during the transcription process, the function catches them and prints an error message.

The `callback` function is not defined in this code snippet, but it is likely a function that is called when the transcription is complete. It is passed the full transcript as an argument.
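For illustration only, here is a minimal sketch (not from the original code) of what such a callback could look like and how `get_transcript` might be driven on its own; the `ConversationManager` shown later wires this up properly:

```
# Hypothetical standalone driver for get_transcript
def print_sentence(full_sentence):
    # Receives the finished sentence emitted by on_message
    print(f"Callback received: {full_sentence}")

if __name__ == "__main__":
    import asyncio
    asyncio.run(get_transcript(print_sentence))
```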

## TTS with Deepgram:

```
import os
import shutil
import subprocess
import time

import requests

class TextToSpeech:
    # Set your Deepgram API Key and desired voice model
    DG_API_KEY = os.getenv("DEEPGRAM_API_KEY")
    MODEL_NAME = "aura-helios-en"  # Example model name, change as needed

    @staticmethod
    def is_installed(lib_name: str) -> bool:
        lib = shutil.which(lib_name)
        return lib is not None

    def speak(self, text):
        if not self.is_installed("ffplay"):
            raise ValueError("ffplay not found, necessary to stream audio.")

        DEEPGRAM_URL = (
            f"https://api.deepgram.com/v1/speak"
            f"?model={self.MODEL_NAME}&performance=some&encoding=linear16&sample_rate=24000"
        )
        headers = {
            "Authorization": f"Token {self.DG_API_KEY}",
            "Content-Type": "application/json"
        }
        payload = {
            "text": text
        }

        # Stream the returned audio straight into ffplay
        player_command = ["ffplay", "-autoexit", "-", "-nodisp"]
        player_process = subprocess.Popen(
            player_command,
            stdin=subprocess.PIPE,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )

        start_time = time.time()  # Record the time before sending the request
        first_byte_time = None    # Time when the first audio byte is received

        with requests.post(DEEPGRAM_URL, stream=True, headers=headers, json=payload) as r:
            for chunk in r.iter_content(chunk_size=1024):
                if chunk:
                    if first_byte_time is None:  # First chunk received
                        first_byte_time = time.time()
                        ttfb = int((first_byte_time - start_time) * 1000)  # Time to first byte
                        print(f"TTS Time to First Byte (TTFB): {ttfb}ms\n")
                    player_process.stdin.write(chunk)
                    player_process.stdin.flush()

        if player_process.stdin:
            player_process.stdin.close()
        player_process.wait()
```

`TextToSpeech` is a class that uses the Deepgram API to convert text to speech. Here's a
breakdown of the code:

**Class variables**

- `DG_API_KEY`: The Deepgram API key, set using the `os.getenv` function to retrieve
an environment variable named `DEEPGRAM_API_KEY`.

- `MODEL_NAME`: The name of the Deepgram model to use for text-to-speech conversion, set to `"aura-helios-en"` (English).

**`is_installed` method**

- This method checks if a given library (e.g., `ffplay`) is installed on the system.

- It uses the `shutil.which` function to search for the executable in the system's PATH.

- If the executable is found, the method returns `True`, otherwise it returns `False`.

**`speak` method**

- This method takes a `text` parameter and converts it to speech using the Deepgram API.

- It checks if `ffplay` is installed using the `is_installed` method. If not, it raises a `ValueError`.

- It sets the Deepgram API URL, headers, and payload for the request.

- It uses the `requests` library to send a POST request to the Deepgram API with the text
to be converted to speech.

- It uses the `ffplay` command-line tool to play the audio stream.

- It records the time before sending the request and calculates the time to first byte (TTFB) by measuring the time between sending the request and receiving the first byte of the audio stream.

- It writes the audio stream to the `ffplay` process's stdin and flushes the buffer.

- Finally, it closes the stdin stream and waits for the `ffplay` process to finish.

**Notes**

- The `speak` method assumes that `ffplay` is installed and available on the system.

- The `MODEL_NAME` variable can be changed to use a different Deepgram model.

- The `DG_API_KEY` variable should be set to a valid Deepgram API key.

- The `speak` method returns no value, but it prints the TTFB time to the console.

Overall, this code provides a simple way to convert text to speech using the Deepgram API
and play the audio stream using `ffplay`.
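A short usage sketch (assuming the `DEEPGRAM_API_KEY` environment variable is set and `ffplay` is installed):

```
tts = TextToSpeech()
tts.speak("Hello! This sentence is synthesized by Deepgram and streamed to ffplay.")
```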

## Manage the Conversation:

```
class ConversationManager:
    def __init__(self):
        self.transcription_response = ""
        self.llm = LanguageModelProcessor()

    async def main(self):
        def handle_full_sentence(full_sentence):
            self.transcription_response = full_sentence

        # Loop indefinitely until "goodbye" is detected
        while True:
            await get_transcript(handle_full_sentence)

            # Check for "goodbye" to exit the loop
            if "goodbye" in self.transcription_response.lower():
                break

            llm_response = self.llm.process(self.transcription_response)

            tts = TextToSpeech()
            tts.speak(llm_response)

            # Reset transcription_response for the next loop iteration
            self.transcription_response = ""

if __name__ == "__main__":
    manager = ConversationManager()
    asyncio.run(manager.main())
```

1. The `ConversationManager` class is initialized with an empty string `transcription_response` and an instance of `LanguageModelProcessor` (LLM) for language processing.

2. The `main` method is an asynchronous function that runs indefinitely until the user says
"goodbye".

3. Inside the `main` method, it calls the `get_transcript` function with a callback function
`handle_full_sentence`. This function is called whenever a full sentence is transcribed.

4. The `handle_full_sentence` function updates the `transcription_response` with the full sentence.

5. The code then checks if the `transcription_response` contains the word "goodbye" (case-insensitive). If it does, the loop breaks and the program exits.

6. If the `transcription_response` does not contain "goodbye", the code processes the
`transcription_response` using the `LanguageModelProcessor` (LLM) to generate a
response.

7. The LLM response is then converted to speech using a `TextToSpeech` object and
spoken to the user.

8. Finally, the `transcription_response` is reset to an empty string for the next iteration of the loop.

# Implementation 4: Llama3 on A4000

- In this experiment we use 2× A4000 GPUs (low memory) with the Llama 3 model.

# Code Implementation

- You first have to be granted access to the gated model (request it and wait for the approval email) here: https://huggingface.co/meta-llama/Meta-Llama-3-8B

- Then log in with a Hugging Face access token (https://huggingface.co/settings/tokens) using `huggingface-cli login` (see https://huggingface.co/welcome ); a Python alternative is sketched below.
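If you prefer to authenticate from Python instead of the CLI, a minimal sketch using the `huggingface_hub` package (the token value below is a placeholder):

```
from huggingface_hub import login

# Paste the access token generated at https://huggingface.co/settings/tokens
login(token="hf_xxx_your_token_here")
```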

- Use the model

```
model_name = "meta-llama/Meta-Llama-3-8B-Instruct"

```

- Set up quantization (to be able to run the model with limited GPU memory)

```
import torch
import transformers
from transformers import (
    AutoConfig,
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
)

device_ids = [0, 1]  # Modify this list according to your GPU configuration
primary_device = f'cuda:{device_ids[0]}'  # Primary device
torch.cuda.set_device(primary_device)

# 4-bit quantization so the 8B model fits on low-memory GPUs
config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_use_double_quant=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_config = AutoConfig.from_pretrained(model_name, trust_remote_code=True)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=config,
    config=model_config,
    device_map='auto',
)

model = torch.nn.DataParallel(model, device_ids=device_ids)

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)

pipeline = transformers.pipeline(
    "text-generation",
    tokenizer=tokenizer,
    model=model.module,
    model_kwargs={
        "torch_dtype": torch.float16,
        "quantization_config": {"load_in_4bit": True},
    },
)
```
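As a quick sanity check (not in the original code), you can confirm that 4-bit loading actually shrank the model; `get_memory_footprint()` is a standard `transformers` helper on the underlying model:

```
# Rough check that quantization worked: the 8B model should report a few GB, not ~16 GB
footprint_gb = model.module.get_memory_footprint() / (1024 ** 3)
print(f"Quantized model memory footprint: {footprint_gb:.2f} GB")
```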

- This model uses different terminator tokens

```
terminators = [
    pipeline.tokenizer.eos_token_id,
    pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

```

- Set up the prompt (Llama 3 uses a chat template, so the prompt is built differently here)

```
class Llama3_8B_gen:
    def __init__(self, pipeline):
        self.pipeline = pipeline

    def generate_prompt(self, query, retrieved_text):
        # Build the chat messages and render them with the Llama 3 chat template
        messages = [
            {"role": "system", "content": "Answer the Question for the Given below context and information and not prior knowledge, only give the output result \n\ncontext:\n\n{}".format(retrieved_text)},
            {"role": "user", "content": query},
        ]
        return self.pipeline.tokenizer.apply_chat_template(
            messages, tokenize=False, add_generation_prompt=True
        )

    def generate(self, query, retrieved_context):
        prompt = self.generate_prompt(query, retrieved_context)
        output = self.pipeline(
            prompt,
            max_new_tokens=512,
            eos_token_id=terminators,
            do_sample=False,
        )
        # Return only the newly generated text, not the echoed prompt
        return output[0]["generated_text"][len(prompt):]

```

- Set up RAG

```
class Langchain_RAG:
    def __init__(self):
        # self.embeddings = HuggingFaceEmbeddings(model_name="BAAI/bge-small-en-v1.5")
        self.embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")

        # Load all .txt files from the txt/ directory
        text_loader_kwargs = {'autodetect_encoding': True}
        loader_txt = DirectoryLoader("txt/", glob="./*.txt", loader_cls=TextLoader, loader_kwargs=text_loader_kwargs)
        documents_txt = loader_txt.load()
        text_splitter = CharacterTextSplitter(chunk_size=50, chunk_overlap=5)

        self.texts = text_splitter.split_documents(documents_txt)
        self.get_vec_value = FAISS.from_documents(self.texts, self.embeddings)
        self.retriever = self.get_vec_value.as_retriever(search_kwargs={"k": 4})

    def __call__(self, query):
        rev = self.retriever.get_relevant_documents(query)
        return "".join([i.page_content for i in rev])

text_gen = Llama3_8B_gen(pipeline=pipeline)
retriever = Langchain_RAG()

```

- Finally, query the model + RAG pipeline

```
query = "what is blacklist feature?"
start_time = time.time()
retriever_context = retriever(query)
result = text_gen.generate("what is blacklist feature?",retriever_context)
print(result)
print("--- answering in: %s seconds ---" % (time.time() - start_time))

```
# Use Case – 2: Action integration with chatbot (Google Calendar booking)

- Here we utilize the same stack (Groq + Deepgram) to build a calendar-scheduler POC.

- We integrate the Google (Gmail) API, using API credentials, to access Google Calendar.

- We use Groq tool calling to perform actions based on the conversation.

# Code Implementation

```
def run_conversation_book(user_prompt):
    # Step 1: send the conversation and available functions to the model
    messages = [
        {
            "role": "system",
            "content": "Today is 17 April 2024, You are a function calling LLM that Book an event on the calendar at the provided datetime in ISO format with the provided period. if the provided period is not an integer, set the default period to 1"
        },
        {
            "role": "user",
            "content": user_prompt,
        }
    ]
    tools = [
        {
            "type": "function",
            "function": {
                "name": "book_event",
                "description": "Set calendar events",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "str_datetime": {
                            "type": "string",
                            "description": "The event date and time",
                        },
                        "period": {
                            "type": "integer",
                            "description": "The event period",
                        }
                    },
                    "required": ["str_datetime", "period"],
                },
            },
        }
    ]
    response = client.chat.completions.create(
        model=MODEL,
        messages=messages,
        tools=tools,
        tool_choice="auto",
        max_tokens=4096
    )

    response_message = response.choices[0].message
    tool_calls = response_message.tool_calls

    # Step 2: check if the model wanted to call a function
    if tool_calls:
        # Step 3: call the function
        # Note: the JSON response may not always be valid; be sure to handle errors
        available_functions = {
            "book_event": book_event,
        }  # only one function in this example, but you can have multiple
        messages.append(response_message)  # extend conversation with assistant's reply

        # Step 4: send the info for each function call and function response to the model
        for tool_call in tool_calls:
            function_name = tool_call.function.name
            function_to_call = available_functions[function_name]
            function_args = json.loads(tool_call.function.arguments)
            print(function_args)

            function_response = function_to_call(
                str_datetime=function_args.get("str_datetime"),
                period=function_args.get("period")
            )
            messages.append(
                {
                    "tool_call_id": tool_call.id,
                    "role": "tool",
                    "name": function_name,
                    "content": function_response,
                }
            )  # extend conversation with function response
        second_response = client.chat.completions.create(
            model=MODEL,
            messages=messages
        )  # get a new response from the model where it can see the function response
        return second_response.choices[0].message.content

print("[AI] : hi, please pick a date and time for the meeting")
```

- Defining the prompt here is very important:

  "content": "Today is 17 April 2024, You are a function calling LLM that Book an event on the calendar at the provided datetime in ISO format with the provided period. if the provided period is not an integer, set the default period to 1"

- The model is not aware of the current date and time, so I am informing the model about today's date (see the sketch below for computing this dynamically).

- I am also telling the model via the prompt about the expected datetime format, so any date-time conversion is limited to a specific (ISO) format.

- Finally, I am telling the model via the prompt to use a default period of 1 hour for the meeting in case none is provided.
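Since the date is hard-coded in this POC, one simple improvement (not in the original code) is to inject today's real date into the system prompt at runtime:

```
from datetime import date

# Build the system prompt with the actual current date instead of a hard-coded one
today_str = date.today().strftime("%d %B %Y")  # e.g. "17 April 2024"
system_content = (
    f"Today is {today_str}, you are a function calling LLM that books an event on the "
    "calendar at the provided datetime in ISO format with the provided period. "
    "If the provided period is not an integer, set the default period to 1."
)
```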

- Then I define the Groq tool template:

```
tools = [
    {
        "type": "function",
        "function": {
            "name": "book_event",
            "description": "Set calendar events",
            "parameters": {
                "type": "object",
                "properties": {
                    "str_datetime": {
                        "type": "string",
                        "description": "The event date and time",
                    },
                    "period": {
                        "type": "integer",
                        "description": "The event period",
                    }
                },
                "required": ["str_datetime", "period"],
            },
        },
    }
]
```

- Make sure to define the correct data types.

- Here, I am informing the model that I expect two mandatory params (`str_datetime`, `period`) with types (string, integer), respectively.

- I am telling the model that the action function is the `book_event` function.

- Finally, the extracted params are passed to the function; an example of what the model returns is sketched below.
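For illustration (a hedged example, the exact values depend on the conversation), the model's tool call arrives as a JSON string that `json.loads` turns into a dict before the params are handed to `book_event`:

```
import json

# Example of what tool_call.function.arguments might contain (values are illustrative)
raw_arguments = '{"str_datetime": "2024-04-18T15:00:00", "period": 2}'

function_args = json.loads(raw_arguments)
book_event(
    str_datetime=function_args.get("str_datetime"),  # "2024-04-18T15:00:00"
    period=function_args.get("period"),              # 2 (hours)
)
```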

## book_event function and Google Calendar API

```
# Create a Calendar event
def book_event(str_datetime, period):
    # period already arrives as an integer number of hours
    hours = period

    # Authenticate Google Calendar API (authenticate_google is defined elsewhere in the repo)
    oauth2_client_secret_file = './cred.json'
    scopes = ['https://www.googleapis.com/auth/calendar']
    service = authenticate_google(scopes=scopes, oauth2_client_secret_file=oauth2_client_secret_file)

    # Get email-ids of all subscribed calendars
    calendars_result = service.calendarList().list().execute()
    calendars = calendars_result.get('items', [])

    print(str_datetime)
    # Normalize the datetime string to "YYYY-MM-DD HH:MM"
    str_datetime = str_datetime.replace('AM', '').replace('PM', '').replace('T', ' ')
    if (len(str_datetime.split(':'))) > 2:
        str_datetime = str_datetime[:-3]  # drop seconds if present
    print(str_datetime)

    # Insert an event
    event = {
        'summary': 'AI-Reserved Meeting',
        'location': 'Zoom meeting',
        'description': 'A meeting scheduled by AI.',
        'start': {
            'dateTime': (datetime.strptime(str_datetime, '%Y-%m-%d %H:%M')).isoformat(),
            'timeZone': 'America/Los_Angeles',
        },
        'end': {
            'dateTime': (datetime.strptime(str_datetime, '%Y-%m-%d %H:%M') + timedelta(hours=hours)).isoformat(),
            'timeZone': 'America/Los_Angeles',
        },
    }
    created_event = service.events().insert(calendarId="zahaby@gmail.com", body=event).execute()
    print(f"Created event: {created_event['id']}")
    return json.dumps({"Created event": created_event['description']})

```
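`authenticate_google` is not shown in this snippet. A possible implementation (a sketch based on the standard Google API Python quickstart; adjust paths and token caching to your setup) could look like this:

```
import os

from google.auth.transport.requests import Request
from google.oauth2.credentials import Credentials
from google_auth_oauthlib.flow import InstalledAppFlow
from googleapiclient.discovery import build

def authenticate_google(scopes, oauth2_client_secret_file):
    creds = None
    # token.json caches the user's access/refresh tokens after the first login
    if os.path.exists('token.json'):
        creds = Credentials.from_authorized_user_file('token.json', scopes)
    if not creds or not creds.valid:
        if creds and creds.expired and creds.refresh_token:
            creds.refresh(Request())
        else:
            flow = InstalledAppFlow.from_client_secrets_file(oauth2_client_secret_file, scopes)
            creds = flow.run_local_server(port=0)
        with open('token.json', 'w') as token:
            token.write(creds.to_json())
    # Build the Calendar v3 service used by book_event
    return build('calendar', 'v3', credentials=creds)
```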

## TODO:

- Check whether the requested date-time is already reserved (by querying the calendar first); if it is, suggest other free slots. A sketch of such a check follows this list.

- This is a stateless calendar-reservation POC; it doesn't handle any kind of conversation or stateful flow yet. The POC still needs to be placed in a full conversational context.

- Handle exceptions, as there is a wide window for different error scenarios.
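For the first TODO item, a hedged sketch (not in the original code) using the Calendar API's `freebusy` endpoint to test whether a slot is taken before booking:

```
def is_slot_free(service, calendar_id, time_min_rfc3339, time_max_rfc3339):
    """Return True if the calendar has no busy period between the two RFC3339 timestamps."""
    body = {
        "timeMin": time_min_rfc3339,   # e.g. "2024-04-18T15:00:00-07:00"
        "timeMax": time_max_rfc3339,   # e.g. "2024-04-18T16:00:00-07:00"
        "items": [{"id": calendar_id}],
    }
    result = service.freebusy().query(body=body).execute()
    busy_periods = result["calendars"][calendar_id]["busy"]
    return len(busy_periods) == 0
```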

### useful links:

https://developers.google.com/gmail/api/quickstart/python#authorize_credentials_for_a_desktop_application

https://python.langchain.com/docs/integrations/toolkits/gmail/

# Source Code

The source code for this tutorial is available in this repo: https://github.com/zahaby/llm-rag
