
Some core concepts

Navigating the landscape of generative AI models is, for most of us, like steering a ship through uncharted waters. Understanding these models and their core concepts is not just about mastering jargon; it is about wielding tools that can redefine how businesses innovate, communicate, and stay ahead as their industries evolve.

Here are some of the most popular model families used in the generative space.

Foundational Models
Foundational models are advanced AI frameworks transforming language, image generation, and comprehension tasks across diverse industries. They are large, multipurpose machine learning models pre-trained at scale on diverse data to learn broad representations and patterns. Those learned representations let a single model be adapted to many downstream tasks through transfer learning rather than training bespoke models from scratch, which accelerates development and improves performance.
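
To make the transfer-learning point concrete, here is a minimal sketch of adapting a pre-trained model to a downstream classification task using the Hugging Face transformers library. The checkpoint name, label count, and example sentence are illustrative assumptions, not specifics from this text.

```python
# A minimal sketch of transfer learning with a pre-trained foundational model,
# using the Hugging Face transformers library. The checkpoint, label count,
# and example sentence are illustrative assumptions.
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Load weights pre-trained on a large, diverse text corpus.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=2,  # a fresh task head; only this part starts untrained
)

# Instead of training a bespoke model from scratch, you would now fine-tune
# briefly on a small labeled dataset (training loop elided for brevity).
inputs = tokenizer("This product exceeded my expectations.", return_tensors="pt")
logits = model(**inputs).logits
print(logits.shape)  # torch.Size([1, 2]): one score per class
```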

Let’s review some of them.

The following table summarizes the main types of foundational models:

| Type | Definition | Use Cases | Examples |
|---|---|---|---|
| Language Models | Models trained on large text corpora to generate human-like text and power natural language tasks. | Content generation, chatbots, search, document analysis | GPT-3, BERT, T5 |
| Image Generators | Models that synthesize realistic images from text descriptions. | Creative tools, media, advertising, ecommerce | DALL-E 2, Stable Diffusion, Imagen |
| Speech Models | Models that convert between speech and text, for synthesis or transcription. | Voice assistants, audiobooks, accessibility tools | Whisper, WaveNet |
| Recommenders | Models that suggest content to users based on preferences. | Retail, media, advertising, ecommerce | YouTube RS, Amazon RS, Spotify RS |
| Translation Models | Models that translate text between languages. | Localization, travel, customer support | Google Translate, WMT, M2M-100 |
| Protein Models | Models that predict 3D protein structure from an amino acid sequence. | Drug discovery, materials science, agriculture | AlphaFold, RoseTTAFold |
| Game-Playing Models | Models trained with reinforcement learning to play games. | Game testing, education, robotics | AlphaGo, AlphaStar, OpenAI Five |
| Tabular Data Models | Models that generate synthetic tabular data. | Finance, healthcare, insurance | TableGAN, TVAE |
| Optimization Models | Models that solve complex optimization problems. | Logistics, manufacturing, transportation | DeepMind Gato |

Large Language Models (LLMs)


Large language models are neural networks trained on vast text datasets, enabling them to generate human-like language and power advanced natural language applications, often through few-shot learning. Because they learn to extract meaning from text during pretraining, they can be adapted to many downstream NLP tasks by prompting with a few examples or by fine-tuning on small datasets.
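
As a concrete illustration of few-shot learning, the sketch below places a handful of labeled examples in a prompt and leaves the model to continue the pattern, with no weight updates at all. The task and the `llm.generate` call are hypothetical stand-ins for whichever LLM API you use.

```python
# A minimal sketch of few-shot prompting: a few labeled examples are placed
# in the prompt and the LLM infers the pattern from context alone.
# The task and the `llm.generate` call are hypothetical stand-ins.
prompt = """Classify the sentiment of each review as Positive or Negative.

Review: The battery died after two days.
Sentiment: Negative

Review: Setup took five minutes and it works flawlessly.
Sentiment: Positive

Review: The screen cracked within the first week.
Sentiment:"""

# completion = llm.generate(prompt)  # expected continuation: "Negative"
```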

Some common types of LLMs are listed in the following table:


| Type | Description | Use Cases | Examples |
|---|---|---|---|
| Autoregressive | Predict the next word based on context. Good for fluent generation. | Content creation, summaries, chatbots | GPT-3, GPT-4 |
| Encoder-Decoder | Encode input, decode for tasks like translation. Flexible. | Translation, question answering, text summarization | BART, T5 |
| Bidirectional | Learn context by processing text in both directions. Good for analysis. | Sentiment analysis, entity recognition, search | BERT, RoBERTa |
| Question Answering | Trained to directly answer questions. Specialized for QA. | Conversational search, analytics, dialog systems | ELI5, FARM |
| Dialog | Trained on conversations. Useful for chatbots. | Virtual assistants, customer service, conversational AI | Meena, Blenderbot |
| Multi-Modal | Process text and images for multimodal generation. | Creative applications, contextual generation | DALL-E 2, Imagen |
| Specialized Task | Tailored for specific tasks like code or protein structure. Trade generality for depth. | Drug discovery, software development, theorem proving | AlphaFold, Codex |

Generative adversarial networks (GANs)


Generative adversarial networks (GANs) are a powerful AI technique in which two neural networks, a generator and a discriminator, compete against each other in a minimax game. The generator learns to produce increasingly realistic synthetic data while the discriminator tries to tell real data from fake. This competition drives both models to improve until the generated outputs are indistinguishable from actual training data.
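
The adversarial loop can be sketched in a few lines of PyTorch. The example below uses toy networks and synthetic "real" data purely to show the alternating discriminator and generator updates of the minimax game; every size and hyperparameter is an illustrative assumption.

```python
# A minimal PyTorch sketch of the GAN minimax game: the discriminator learns
# to separate real from fake, the generator learns to fool it. Networks and
# data are toy assumptions for illustration.
import torch
import torch.nn as nn

latent_dim, data_dim = 16, 2
G = nn.Sequential(nn.Linear(latent_dim, 32), nn.ReLU(), nn.Linear(32, data_dim))
D = nn.Sequential(nn.Linear(data_dim, 32), nn.ReLU(), nn.Linear(32, 1))
loss_fn = nn.BCEWithLogitsLoss()
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)

for step in range(1000):
    real = torch.randn(64, data_dim) * 0.5 + 2.0   # stand-in "real" data
    fake = G(torch.randn(64, latent_dim))          # synthetic samples

    # Discriminator step: push real toward label 1, fake toward label 0.
    d_loss = loss_fn(D(real), torch.ones(64, 1)) + \
             loss_fn(D(fake.detach()), torch.zeros(64, 1))
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()

    # Generator step: make the discriminator label fakes as real.
    g_loss = loss_fn(D(fake), torch.ones(64, 1))
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()
```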

| GAN Type | Description | Use Cases | Examples |
|---|---|---|---|
| Vanilla GAN | Original basic architecture with a generator and a discriminator | Concept learning | GAN, DCGAN |
| Conditional GAN | Generator and discriminator conditioned on additional input | Controlled generation | pix2pix, CycleGAN |
| Image-to-Image GAN | Input image transformed into an output image | Image editing, colorization | Pix2Pix |
| CycleGAN | Paired generators/discriminators between domains | Style transfer, domain adaptation | CycleGAN, DiscoGAN |
| StyleGAN | GAN specialized for very realistic image generation | Media, entertainment | StyleGAN, StyleGAN2 |
| BigGAN | Massively scaled-up GAN architecture | High-res image generation | BigGAN |
| Text-to-Image GAN | Generator maps text to images | Multimodal creative apps | DALL-E, Imagen |

Diffusion Models
Diffusion models are generative deep learning models that progressively add structured noise to data and then train a neural network to reverse that process for high-fidelity generation. By modelling the noise schedule, they offer fine-grained conditional control for generating and manipulating images, audio, 3D scenes, and other data.
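
As a sketch of the forward half of this process, the snippet below implements a DDPM-style closed-form noising step under an assumed linear noise schedule; a real diffusion model would then train a network to predict and remove the added noise, reversing the process step by step.

```python
# A minimal sketch of the DDPM-style forward (noising) process: data is
# progressively mixed with Gaussian noise according to a schedule, and a
# network is trained to predict that noise so it can be reversed. The
# linear schedule and tensor shapes are illustrative assumptions.
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)            # noise schedule
alphas_bar = torch.cumprod(1.0 - betas, dim=0)   # cumulative signal retention

def q_sample(x0, t):
    """Sample x_t ~ q(x_t | x_0) in closed form."""
    noise = torch.randn_like(x0)
    signal_scale = alphas_bar[t].sqrt()
    noise_scale = (1.0 - alphas_bar[t]).sqrt()
    return signal_scale * x0 + noise_scale * noise, noise

x0 = torch.randn(8, 3, 32, 32)                   # a toy batch of "images"
x_t, eps = q_sample(x0, t=500)
# Training would minimize ||eps_theta(x_t, t) - eps||^2 for a denoising net.
```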

Various diffusion model types are described in the following table:

| Type | Description | Use Cases | Examples |
|---|---|---|---|
| Denoising Diffusion | Progressive image denoising | Image generation | DDPM |
| Diffusion Probabilistic | Extend to probabilistic modeling | Accurate generation | DPIM |
| Score-Based Diffusion | Leverage the score function for sampling | Efficient generation | SDSM |
| Language-Conditioned | Condition on text for control | Text-based generation | DALL-E |
| Video Diffusion | Apply across time for video | Video generation | V-Diffusion |
| Audio Diffusion | Capture properties of natural sound | Audio generation | WaveGrad |
| 3D Scene Diffusion | Generate 3D spaces | 3D scene modeling | msd-nerf |

Variational autoencoders (VAEs)


Variational autoencoders (VAEs) are deep generative models that learn latent representations of data through probabilistic encoders and decoders. They consist of an encoder network that maps data examples to distributions over a latent space, and a decoder network that reconstructs the data from samples drawn from those distributions. The encoder and decoder are trained jointly to maximize a lower bound on the data likelihood, balancing reconstruction accuracy against keeping the latent space smooth and continuous.
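
The encoder-decoder structure and the reparameterization trick it relies on can be sketched as follows. The Gaussian prior and KL-plus-reconstruction loss are standard for VAEs, but the layer sizes and architecture here are illustrative assumptions.

```python
# A minimal PyTorch sketch of a VAE: the encoder maps data to a distribution
# over latent space, the decoder reconstructs from samples, and the loss
# combines reconstruction with a KL term that keeps the latent space smooth.
# Sizes are toy assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    def __init__(self, data_dim=784, latent_dim=16):
        super().__init__()
        self.enc = nn.Linear(data_dim, 128)
        self.mu = nn.Linear(128, latent_dim)       # mean of q(z|x)
        self.logvar = nn.Linear(128, latent_dim)   # log-variance of q(z|x)
        self.dec = nn.Sequential(nn.Linear(latent_dim, 128), nn.ReLU(),
                                 nn.Linear(128, data_dim))

    def forward(self, x):
        h = F.relu(self.enc(x))
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterize
        return self.dec(z), mu, logvar

def vae_loss(recon, x, mu, logvar):
    # Reconstruction term plus KL divergence to the unit Gaussian prior.
    recon_loss = F.mse_loss(recon, x, reduction="sum")
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon_loss + kl
```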

VAEs are used for:

● Generating new data similar to the data on which the model was trained (text, images, audio, etc.). Example: the discrete VAE that DALL-E uses to represent images
● Anomaly or outlier detection. Example: credit card fraud detection
● Dimensionality reduction for visualizing high-dimensional data

Types of VAEs include:

● Conditional VAEs - Encoder/decoder conditioned on auxiliary inputs to target generation
● Disentangled VAEs - Latent codes isolate explanatory factors of variation
● Hierarchical VAEs - Model hierarchical dependencies with layers of latent variables
● Multimodal VAEs - Jointly model data across modalities like text and images

Autoregressive Models

Autoregressive models are generative deep learning models that factorize the joint probability of a sequence into a product of conditional probabilities. They estimate the probability of each token conditioned on the previous tokens, a process that can generate variable-length outputs.
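
The chain-rule factorization p(x1, ..., xT) = p(x1) · p(x2 | x1) · ... · p(xT | x1, ..., xT-1) translates directly into a sampling loop. In the sketch below, `next_token_probs` is a random toy stand-in for a trained model; it exists only to show how each token is drawn conditioned on everything generated so far.

```python
# A minimal sketch of autoregressive sampling under the chain-rule
# factorization. `next_token_probs` is a hypothetical stand-in for a
# trained model that returns p(next token | all previous tokens).
import torch

def next_token_probs(tokens, vocab_size=100):
    # Toy stand-in model: returns a distribution over the next token.
    torch.manual_seed(sum(tokens))        # deterministic toy behavior
    return torch.softmax(torch.randn(vocab_size), dim=0)

def generate(prompt, max_new_tokens=10):
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        probs = next_token_probs(tokens)  # condition on all prior tokens
        tokens.append(int(torch.multinomial(probs, 1)))  # sample next token
    return tokens

print(generate([1, 5, 9]))  # prompt token IDs followed by 10 sampled tokens
```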

Autoregressive models are commonly used for:

● Natural language generation. Examples: GPT-3, XLNet
● Time series forecasting. Example: retail sales predictions
● Image generation. Example: PixelCNN
● Audio generation. Example: WaveNet

Types of autoregressive models include:

● Transformer-based models - Leverage attention mechanisms over sequences
● Pixel autoregressive models - Model 2D image structures
● Neural additive models - Jointly train both shallow and deep networks
● Temporal autoregressive models - Specialize in forecasting temporal sequences
● Distribution-based models - Model distributions rather than scalar outputs

The modeling flexibility of autoregressive factorization has made this technique effective across
different data types. Fine-tuning on downstream tasks further leverages generative pretraining.
