
DnT Infotech

LARGE LANGUAGE MODEL
Presented by Atharva Virkar
FLOW
1 Introduction
2 Traditional vs Transformers
3 LLMs
4 Gen AI
5 Introduction to Agentic AI
6 What's the Hype
7 Workflow
8 Concept Breakdown
9 Real-life Adaptation and Example
INTRODUCTION
In recent years, we've entered a new era of artificial intelligence, one that is driven by Large Language Models (LLMs) and Generative AI (GenAI). These technologies are reshaping how we interact with machines, automate tasks, and generate content across industries.

It all started with a breakthrough research paper published by Google Brain in 2017, titled "Attention Is All You Need." This paper introduced the Transformer architecture, the building block of all modern LLMs and GenAI systems.
TRADITIONAL METHODS
VS
TRANSFORMERS
Before Transformers, it was the era of RNNs & LSTMs.

RNN (Recurrent Neural Network): a neural network designed to handle sequential data by having connections that form directed cycles. It remembers previous inputs using a hidden state.
- It struggles with long-term memory.
- Used for: time series prediction, speech recognition, language modeling.

LSTM (Long Short-Term Memory): a type of RNN with a more complex architecture using gates (input, forget, output) to better remember and manage long-term and short-term dependencies.
- Used for: text generation, machine translation, sentiment analysis.

Major setbacks of RNNs and LSTMs:
- Sequence handling
- Memory limitations
- Bottleneck with the context vector
- Training challenges
- Limited scalability
EXAMPLE OF WORKING: RNN & LSTM

For example, suppose we are building an email autocompletion assistant that needs to predict the next word in "I wanted to follow up on the meeting we had last...": words such as week, Friday, or month.

The RNN's take on it:
- The RNN reads the sentence word by word and tries to remember what came before.
- It might forget the early context ("I wanted to follow up") and focus only on recent words like "we had last".
- Because it has limited memory, by the time we reach "last...", the memory of "meeting" is faint or lost, so it might predict something vague like "time".
- The RNN fails when the important clue was far back in the sentence.

The LSTM's take on it:
- The LSTM also reads word by word, but it has gates that help it remember that the sentence started with "follow up on the meeting".
- It has 3 gates: the forget gate, input gate, and output gate, which decide what to ignore, what to remember, and what to show to the next step.
- So even if "meeting" was several words ago, it remembers that it's important.
- It is more likely to predict "week" or "Friday" because it knows we're referring to a meeting.

A toy sketch of both models follows.
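To make the contrast concrete, here is a minimal PyTorch sketch (not from the slides; the vocabulary size and dimensions are made up) of a next-word predictor built on either cell:

    import torch
    import torch.nn as nn

    vocab_size, embed_dim, hidden_dim = 1000, 64, 128

    class NextWord(nn.Module):
        def __init__(self, cell="rnn"):
            super().__init__()
            self.embed = nn.Embedding(vocab_size, embed_dim)
            # nn.RNN carries one shared hidden state; nn.LSTM adds a gated cell state
            rnn_cls = nn.RNN if cell == "rnn" else nn.LSTM
            self.rnn = rnn_cls(embed_dim, hidden_dim, batch_first=True)
            self.out = nn.Linear(hidden_dim, vocab_size)

        def forward(self, token_ids):
            x = self.embed(token_ids)        # (batch, seq, embed_dim)
            h, _ = self.rnn(x)               # hidden state at every time step
            return self.out(h[:, -1, :])     # logits for the next word

    tokens = torch.randint(0, vocab_size, (1, 10))   # "I wanted to follow up ..."
    logits = NextWord(cell="lstm")(tokens)           # scores over the vocabulary

Both variants read the sequence step by step; the LSTM simply manages its memory better, which is why it has a better chance of keeping "meeting" alive until "last...".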
RNN Structure: think of it as a relay race.
- The RNN processes the sentence one word at a time.
- Each word passes its meaning to the next, like a baton in a relay race.
- There's one shared memory (hidden state) passed along the chain.
- Early information (like "meeting") has to travel through every step to reach the end.
- The message gets weaker with each step, like a story passed from person to person. So by the time we reach "last...", the memory of "meeting" is faint or lost.

To summarise: the RNN is one linear path, with memory passed forward step by step, prone to forgetting earlier parts.
STRUCTURE OF RNN

LSTM Structure: like a smart assistant with a notepad.

It still processes word-by-word, but with internal gates:
- Forget gate: decides what to ignore
- Input gate: decides what to remember
- Output gate: decides what to show to the next step

It maintains a separate memory cell, like a notepad, so it can hold onto key facts. It remembers important words (like "meeting") for a long time without fading, which helps it make better decisions even many steps later.

To summarise: the LSTM is a step-by-step model with memory management; it uses gates, like a person jotting down key info, to decide what to remember or forget. The standard gate equations are sketched below.

STRUCTURE OF LSTM
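For reference, the LSTM update in the usual textbook notation (the slide does not spell it out) is:

    f_t = \sigma(W_f [h_{t-1}, x_t] + b_f)                                % forget gate
    i_t = \sigma(W_i [h_{t-1}, x_t] + b_i)                                % input gate
    o_t = \sigma(W_o [h_{t-1}, x_t] + b_o)                                % output gate
    c_t = f_t \odot c_{t-1} + i_t \odot \tanh(W_c [h_{t-1}, x_t] + b_c)   % the "notepad" cell state
    h_t = o_t \odot \tanh(c_t)                                            % shown to the next step

Here \sigma is the sigmoid, \odot is element-wise multiplication, and [h_{t-1}, x_t] is the previous hidden state concatenated with the current input.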
The Transformer doesn't pass memory step by step. Instead, it lets every word look at every other word directly, using self-attention.

STRUCTURE OF TRANSFORMER
- Input sentence: ["I", "wanted", ..., "meeting", ..., "last"]
- Each word is turned into a vector x_i, with positional encoding added.

To summarise: all words interact simultaneously, using attention to decide what's important, like a group of experts cross-checking notes in one meeting. A minimal sketch of this self-attention step follows.
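Here is a minimal NumPy sketch of that idea (toy dimensions and random weights; real models add multiple heads, masking, and learned parameters):

    import numpy as np

    def self_attention(X, Wq, Wk, Wv):
        Q, K, V = X @ Wq, X @ Wk, X @ Wv             # queries, keys, values
        scores = Q @ K.T / np.sqrt(K.shape[-1])      # every word vs. every word
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))
        w /= w.sum(axis=-1, keepdims=True)           # softmax over each row
        return w @ V                                 # attention-weighted mix

    seq_len, d = 8, 16                # e.g. 8 words, 16-dim vectors (positions added)
    X = np.random.randn(seq_len, d)   # the embedded sentence
    out = self_attention(X, np.random.randn(d, d),
                         np.random.randn(d, d), np.random.randn(d, d))

Because the score matrix compares every word with every other word in a single step, "meeting" can influence the prediction at "last..." directly, with no relay chain in between.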
TRANSFORMERS

Transformer: a type of model that helps computers understand and generate human language by paying attention to all the words in a sentence at once, rather than one by one.

Unlike earlier models, Transformers use a self-attention mechanism that can see the whole sentence at once, making them better at understanding meaning.

It follows an Encoder–Decoder architecture:
- Encoder: reads and understands the input (e.g., a sentence).
- Decoder: generates the output (e.g., the translated sentence).

Transformers led to the birth of PLMs (Pretrained Language Models): transformer-powered models pretrained on large text data to learn general language patterns, such as:
- BERT (2018) – reads whole sentences for understanding.
- GPT (2018) – generates text left-to-right with transformer decoders.

As transformer-based models were scaled up with more data and more powerful hardware, they evolved into GPT-2, GPT-3, and GPT-4: models trained on massive text corpora with billions of parameters. This scaling marked the transition from PLMs to Large Language Models (LLMs), laying the foundation for today's Generative AI.
GEN AI
Generative AI (Gen AI) is essentially built on Large Language Models
(LLMs) or similar foundational models.

While LLMs are the underlying engine trained to predict and generate
language, Gen AI refers to the broader suite of tools and systems using
these models to create new, original outputs.

Gen AI systems can be multimodal, combining text, image, audio, or video generation, but the underlying principle remains the use of large, pretrained models similar to LLMs.

In summary, Gen AI = LLM (or related foundational model) + task-specific tuning + generation capabilities to produce creative, usable content.

Some examples of Gen AI are:
- ChatGPT
- DALL·E
- GitHub Copilot
- Music generation models
- Text-to-Speech (TTS) systems
LLM
LLM: an LLM is a type of artificial intelligence (AI) that has been trained on a massive amount of text (like books, websites, and articles) so that it can understand and generate human-like language.

Examples of LLMs:
- GPT-3, GPT-3.5, GPT-4
- LLaMA
- Claude
- Gemini

Parameters are the learnable values (weights) in the model that help it understand patterns in data; they decide how the model processes and generates text.

Every input is embedded into vectors, and these vectors are multiplied by weight matrices (and other learned matrices); by fine-tuning these weights, we steer the model to generate the desired text. A toy sketch follows.
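A minimal NumPy sketch of what those parameters do (sizes and values are made up; real LLMs stack many such matrices with attention in between):

    import numpy as np

    vocab_size, d_model = 1000, 32
    embedding = np.random.randn(vocab_size, d_model)   # learnable lookup table
    W_out = np.random.randn(d_model, vocab_size)       # learnable output weights

    token_ids = [12, 7, 431]              # an input sentence as token ids
    x = embedding[token_ids]              # tokens -> vectors
    logits = x[-1] @ W_out                # vector x weight matrix -> word scores
    next_token = int(np.argmax(logits))   # highest-scoring next word

Training (and fine-tuning) nudges the numbers in embedding and W_out so that the highest-scoring next word is the desired one.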
WHAT IS AGENTIC AI

INTRODUCTION TO AGENTIC AI
Agentic AI refers to AI systems that go beyond completing tasks or answering prompts and actually behave like autonomous agents, capable of planning, making decisions, and acting over time to achieve a goal.

AGENTIC AI = LLM + Memory + Goals + Autonomy

Unlike passive assistants such as Siri or ChatGPT, it doesn't just answer questions like a chatbot; it plans, takes action, uses tools, and adapts autonomously to achieve a given objective.
WHAT'S THE HYPE?

Endorsements from Tech Leaders
- Mark Zuckerberg (CEO, Meta): predicts that AI agents will replace mid-level engineers by 2025, allowing human engineers to focus on higher-level problem-solving and creativity.
- Jensen Huang (CEO, NVIDIA): stated that "IT departments will become the HR of AI agents," indicating a paradigm shift in organizational structures to accommodate AI agents as digital employees.

Industry Transformation
- Automating complex tasks: AI agents can autonomously perform tasks that traditionally required human intervention, increasing efficiency and reducing errors.
- Enhancing productivity: by handling routine and time-consuming activities, AI agents free up human workers to focus on strategic and creative endeavors.
- Driving innovation: the integration of AI agents fosters innovation by enabling rapid prototyping, data analysis, and decision-making processes.
WORKFLOW
An Agentic AI workflow has four phases:
a. Goal understanding
b. Planning
c. Tool usage & execution
d. Memory + adaptation

- It takes a human-written prompt/task and understands the desired outcome.
- It breaks the task into substeps (e.g., Search → Read → Extract → Draft).
- It uses APIs, web tools, or software to complete those substeps.
- It remembers what it did, checks for failures, and adjusts the plan if needed for another iteration.

A minimal sketch of this loop follows.
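A minimal, hypothetical sketch of the four phases (plan(), the tools dict, and the memory list are illustrative stand-ins, not a real framework):

    def plan(goal):
        # phases a-b: understand the goal and break it into (tool, argument) substeps
        return [("search", goal), ("read", "top result"), ("draft", "summary")]

    tools = {                                    # phase c: the agent's tools
        "search": lambda q: f"results for {q!r}",
        "read":   lambda r: f"contents of {r!r}",
        "draft":  lambda c: f"draft based on {c!r}",
    }

    def run_agent(goal):
        memory = []                              # phase d: memory + adaptation
        for tool_name, arg in plan(goal):
            result = tools[tool_name](arg)       # execute the substep
            memory.append((tool_name, result))   # remember what was done
            # a real agent would check the result here and re-plan on failure
        return memory[-1][1]

    print(run_agent("follow up on last week's meeting"))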
CONCEPT BREAKDOWN

Agent: a system that can plan, reason, and act on its own to accomplish goals.
- It's not just answering questions like a chatbot; it's deciding what to do, in what order, and how.
- LangChain and AutoGen are popular open-source libraries for orchestrating agents.

LLM (Large Language Model): the brain of the agent, used for planning and creating substeps.
- It understands what the user wants, decides what steps to take, interprets results from tools, and generates code, summaries, decisions, etc.
- LLMs like GPT-3.5/4, Claude, Gemini, LLaMA, Mistral, and Zephyr are used.

(Diagram: an Agent built from an LLM plus Tools.)
CONCEPT BREAKDOWN

Tools: anything the agent can "use" to get work done. They are like the arms of the agent; they let it take action.
- Tools are used for tasks like searching, coding, file parsing, browsing, databases, and APIs.
- DuckDuckGo, Tavily, a Python REPL, a code interpreter, SQL, MongoDB, and a PDF reader can be plugged into the agent as tools so it can act in the real world.

Agent workflow example:
- Prompt: the user says "Summarize this PDF and send it to my email." The LLM (within the agent) interprets the request.
- The agent decides: use the PDF tool → summarize → call the email API. Tools are used one by one, and the final output is sent to the user.

The LLM reasons, the Agent coordinates, and the Tools act. A minimal sketch of this flow follows.
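A minimal, hypothetical sketch of that example (read_pdf, summarize, and send_email are stand-ins for the real plugged-in tools):

    def read_pdf(path):                    # Tool: PDF reader acts
        return f"(text extracted from {path})"

    def summarize(text):                   # LLM: reasons over the content
        return f"summary of {text}"

    def send_email(to, body):              # Tool: email API acts
        print(f"to {to}: {body}")

    def handle(pdf_path, address):
        # the Agent coordinates: PDF tool -> summarize -> email API, one by one
        send_email(address, summarize(read_pdf(pdf_path)))

    handle("report.pdf", "me@example.com")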
WORKING OF AGENTIC AI

Trigger – "When chat message received": this is how it starts. The system listens for a chat message (e.g., from a user or a chatbot).

AI Agent: this is the core "brain."
- It decides what to do next, just like a virtual assistant.
- It doesn't just follow steps; it thinks, plans, and uses tools to act.

Chat Model (OpenAI): this is the "language understanding" (LLM) part.
- It understands user intent (e.g., "Schedule a meeting with the team").

Memory (Simple Memory): helps the AI remember context across chats.
- For example, it can recall who "John" is or what the last task was.

Tools:
- Google Sheets (read) – the agent reads data (e.g., gets a task or meeting list)
- Google Sheets1 (update) – the agent writes new info (e.g., logs a completed task)
- Google Calendar (create event) – the agent creates meetings automatically

Diagram caption: "This diagram represents an AI agent that acts like a smart digital assistant. When someone sends a message, this assistant understands it, remembers useful information, and uses external tools like Google Sheets or Google Calendar to take actions — automatically."
REAL LIFE ADAPTATION & EXAMPLE

Goose.ai (Agent-Driven Automation Platform)
- A platform by Jack Dorsey's Block that lets businesses build and deploy AI agents to automate tasks like customer support, document analysis, and workflow management.
- Agents are powered by LLMs and integrated with tools (email, web search, databases).
- Example: a "Legal AI Agent" that reads 20-page contracts, highlights key clauses, and sends compliance reports automatically.

Microsoft AutoGen – Enterprise Email Assistants
- Microsoft uses an AutoGen (multi-agent) setup internally and externally.
- One agent reads incoming email, another plans the task (e.g., schedule a meeting, approve a document).
- Another executes the task or generates a reply, and finally a checker agent reviews and sends it.
- Enterprise use: auto-pilot mode for inboxes, especially for executives and managers.
DnT Infotech

THANK YOU
for your time and attention

Presented by Atharva Virkar
