0% found this document useful (0 votes)

44 views44 pages

Generative AI Applications

Uploaded by

lincypsg

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views44 pages

Generative AI Applications

Uploaded by

lincypsg

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 44

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Customizing generative AI
applications for your business
using your own data
Maira Ladeira Tanke
(she/her)
Sr. Generative AI Data Scientist
AWS

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda

01 Approaches to customizing generative AI solutions

02 Amazon Bedrock overview

03 Retrieval-augmented generation using Amazon Bedrock

04 Fine-tuning and continued pretraining using Amazon Bedrock

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Why customize?

Customize for Adapt to Enhance Improve

specific business domain-specific performance context awareness
needs language for specific tasks in responses

E.g., healthcare – Understand E.g., finance – Teach financial and E.g., customer service – Improve E.g., legal services – Better
medical terminology and provide accounting terms to provide good ability to understand and respond understand case facts and law to
accurate responses related to a analysis for earnings reports to a customer’s inquires and provide useful insights for
patient’s health complaints attorneys

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Customizing foundation models to understand
your use case

FOUNDATION
MODEL

CUSTOMIZATION YOUR DATA

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Augment models without changing pretrained
model weights with knowledge bases

FOUNDATION
MODEL

KNOWLEDGE BASE VECTOR

DATABASES
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Connect your model with dynamic data sources
using agents

FOUNDATION
MODEL

AGENTS DATABASES,
APIS
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Adapt models for your use case with fine-tuning

FOUNDATION
MODEL

FINE-TUNING LABELED
DATA
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Update your models through continued
pretraining

FOUNDATION
MODEL

CONTINUED UNLABELED
PRETRAINING DATA
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Approaches to customizing models with your data

Augment model with

Augment model with Maximizing model
relatively static data Maximizing accuracy
PURPOSE dynamic data without accuracy for your
without changing for specific tasks
changing weights domain
weights

Contextual
DATA Contextual information Small number Large number
information
NEED based on user’s query of labeled examples of unlabeled datasets
based on user’s query

Medium – once new Infrequent – only Infrequent – only

DATA Frequent – every time a
data arrives necessary when task necessary if domain
UPDATE database changes
(e.g., new product) changes context changes

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Customize
External data
Consolidated or
sources
historical info
or up-to-date info Task
information

Relatively static Dynamic

information information Complex Simple
(e.g., docs, FAQs) (e.g., DBs, APIs) or specific or generic
Real time Simple
required? task?

Augment with Augment with Prompt

Knowledge Bases Customize
agents & tools engineering

Amazon Bedrock Amazon Bedrock Amazon Bedrock Amazon Bedrock

Knowledge Bases Agents Custom Models FMs

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Bedrock

Amazon Bedrock Choose Use as is or Send Receive

an FM customize prompt response
Build generative AI
applications using Use the playground Fine-tune FMs as Use the Bedrock API Receive model
foundation models (FMs) to experiment with needed. Bedrock will to send your prompts response in your
through a serverless FMs and select the automatically deploy to the model application
API service one that suits your the FM for inference
needs

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon
Broad choice of models

Jurassic-2 Ultra Titan Text Embeddings Claude 2 Command + Embed Llama 2 Stable Diffusion XL1.0
Jurassic-2 Mid Titan Multimodal Embeddings Claude 2.1 Cohere Command Light Llama 2 13B
Titan Text Lite Claude Instant Cohere Embed English Llama 2 70B
Titan Text Express Cohere Embed Multilingual
Titan Image Generator

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Retrieval-augmented
generation (RAG)

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What is retrieval-augmented generation?

Augmentation Generation
Retrieval
Fetches the relevant Adding the retrieved Response from the
content from the relevant context to the foundation model based
external knowledge base user prompt, which goes on the augmented
or data sources based on as an input to the prompt
a user query foundation model

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
RAG use cases

Improved content Contextual chatbots and Personalized search Real-time data

quality question answering summarization

E.g., helps in reducing E.g., enhances chatbot capabilities E.g., searches based on a user’s E.g., retrieves and summarizes
hallucinations and connecting by integrating with real-time data previous search history and transactional data from databases
with recent knowledge, including persona or API calls
enterprise data

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What are embeddings?
• Numerical representation of
text (vectors) that captures
semantics and relationships
New York 0.027 -0.011 … -0.023
between words.
Paris 0.025 -0.009 … -0.025
• Embedding models capture
features and nuances of the EMBEDDING
MODEL
text. Animal -0.011 0.021 … 0.013

Horse -0.009 0.019 … 0.015

• Rich embeddings can be used
to compare text similarity. Human text Vector embeddings
• Multilingual text embeddings
can identify meaning in
different languages.

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Why are embeddings important for RAG?

Powers text retrieval based Used to augment prompts High-accuracy embeddings

on semantic meaning. with more accurate lead to improved context
context from vector stores and higher-quality LLM-
using the retrieval- generated responses to a
augmented generation. user query.

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Titan text embeddings model

Amazon Titan Text Embeddings Highlights

V2.0

Translates text inputs (words, phrases) into numerical • Titan Text Embeddings offers fast, cost-
representations (embeddings). Comparing effective, high-performance, accurate
embeddings produces more relevant and contextual embeddings in 25 languages.
responses than word matching.
• Optimized for text retrieval tasks, semantic
similarity, and clustering.

• Applications of this model include semantic

Max tokens: 8,000 search and personalization.
Output vectors: 1,536
Language: Multilingual (25 languages)

Model ID: amazon.titan-embed-g1-text-02

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Cohere embeddings model

Cohere Embed – English Cohere Embed – Multilingual

V3.0 V3.0

Embed is Cohere's text representation, or embeddings, Embed is Cohere's text representation, or embeddings,
model. This version supports English only. model. This version supports multiple languages.

Supported use cases: Semantic search, retrieval- Supported use cases: Semantic search, retrieval-
augmented generation (RAG), classification, augmented generation (RAG), classification,
clustering. clustering.

Output vectors: 1,024 Output vectors: 1,024

Language: English Language: Multilingual (100+ supported languages)

Model ID: cohere.embed-english-v3 Model ID: cohere.embed-multilingual-v3

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
How RAG works

User input

Text User
Prompt
augmentation
Large language
model
Response
generation
workflow Embeddings
model
Context

Embeddings 0.89 -0.02 -0.53 0.95 0.17 -0.38

Data Semantic
ingestion search

workflow Vector store Embeddings model Document store New data

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
However, when it comes to implementing RAG,
there are challenges…

Managing multiple Creating vector Incremental updates

data sources embeddings for large to vector store
volumes of data

Coding effort Scaling retrieval Orchestration

mechanism

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Knowledge bases for Amazon Bedrock
NATIVE SUPPORT FOR RETRIEVAL-AUGMENTED GENERATION (RAG)

Securely connect FMs Fully managed RAG Built-in session Automatic citations
to data sources for workflow, including context management with retrievals to
RAG to deliver more ingestion, retrieval, for multi-turn improve transparency
relevant responses and augmentation conversations

USER QUERY AUGMENTED PROMPT MODEL ANSWER

1 4 5 AI21 Labs—Jurassic2 6
A M A Z O N
B E D R O C K

Amazon—Titan Text

2 3
Anthropic—Claude

KNOWLEDGE Meta—Llama2
BASES FOR
AMAZON BEDROCK
Cohere—Command

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data ingestion workflow
KNOWLEDGE BASES FOR AMAZON BEDROCK

Fully
managed
data
ingestion
Data source Embeddings
workflow New data Document chunks
model
Vector store

• Choose your data source Choose your chunking Choose your Choose your vector store
strategy embedding model • Amazon OpenSearch
(Amazon S3)
• Fixed chunks • Amazon Titan
Serverless
• Support for incremental
• No chunking • Cohere Embed
• Amazon Aurora
updates
• Default (200 tokens)
• Pinecone
• Multiple data file formats
• Redis
supported
• MongoDB
(coming soon)

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Retrieval and generate

User input

User Prompt Large language

Response
augmentation model

Text Embeddings
generation model Context

workflow
0.89 -0.02 -0.53 0.95 0.17 -0.38

Embedding
Semantic
search

Knowledge bases for Amazon

Bedrock
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
RetrieveAndGenerate API
KNOWLEDGE BASES FOR AMAZON BEDROCK

User Input RetrieveAndGenerate

API

User Response
Fully User query Generated
managed response
RAG

Generate query Retrieve similar Augment query with Generate response

embeddings documents from retrieved documents from LLM
knowledge bases

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Customize RAG workflows using Retrieve API
KNOWLEDGE BASES FOR AMAZON BEDROCK

User input

User Prompt Large Response

augmentation language
model

Customized
RAG Retrieve API
workflow
User Retrieved
Context
query documents

Generate query Retrieve similar documents

embeddings from knowledge bases

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Vector databases supported by Amazon Bedrock

COMING SOON

Vector Engine for Amazon Redis Enterprise Pinecone MongoDB

Amazon Aurora Cloud
OpenSearch
Serverless

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agents

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agents for Amazon Bedrock
ENABLE GENERATIVE AI APPLICATIONS TO EXECUTE MULTISTEP TASKS USING COMPANY SYSTEMS AND
DATA SOURCES

1 2 3 4

SELECT YOUR PROVIDE BASIC SELECT RELEVANT SPECIFY AVAILABLE

FOUNDATION MODEL INSTRUCTIONS DATA SOURCES ACTIONS

| Breaks down and orchestrates tasks |

| Securely accesses and retrieves company data for RAG |
| Takes action by invoking API calls on your behalf |
| Chain-of-thought trace and ability to modify agent prompts |
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Benefits

Automates Simplifies building Provides secure Lets you choose Provides fully
orchestration of and deploying access to enterprise implementation managed
multistep tasks AI assistants data and APIs languages infrastructure

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agents build on existing enterprise resources
HR knowledge base Existing resources

HR policy docs

Vacation actions
get-Vacation-Balance

HR time-off
Vacation Vacation
agent microservice database
Leave of absence actions

Leave of absence (LOA)

database

Knowledge bases
Query

Retrieval

Agent

Query + Retrieval
Large language
model
Response generation

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Bedrock custom models
CREATE CUSTOM MODELS USING THE CONSOLE OR API’S

• Maximize accuracy of FMs by providing

labeled or raw unlabeled data

• Once deployed, custom models are invoked

the same way as base models
(playground or API)

• Customizations now supported for Amazon

Titan and some third-party FMs

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Fine-tuning and continued pretraining
Domain
adaptation
(e.g., extend
knowledge)
Fine-tuning
Continued
• Instruction training dataset is available? pretraining
• Specific style, behavior required? Continued +
Pretraining Fine-tuning

Continued pretraining
• Raw dataset (e.g., PDFs)
• Additional knowledge through domain adaptation
Fine-tuning

Task specialization
(e.g., behavior, style)
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Datasets for fine-tuning and continued pretraining
Instruction dataset Raw data
(e.g., question-answer) (e.g. PDFs)
Dataset

Fine-tuning Continued
pretraining

Amazon Bedrock Amazon Bedrock

Custom Models Custom Models

{”prompt": "<prompt text>", ”completion": "<expected generated text>"} {"input": "<raw text>"}
{”prompt": "<prompt text>", ”completion": "<expected generated text>"} {"input": "<raw text>"}
{”prompt": "<prompt text>", ”completion": "<expected generated text>"} {"input": "<raw text>"}

Inputs Outputs Storage Inferencing

Base FM Metrics and Custom models Playground

logs stored securely
by Amazon
Hyper Bedrock API
parameters Output
model
Input
data

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Customization architecture overview
Amazon Bedrock service account Model deployment account
(AWS owned and operated)
via the console, SDKs, and API
All incoming network traffic

Training orchestration

Amazon Bedrock
Base model S3
bucket Custom
Provisioned Job
Runtime inference capacity
API compute
endpoint

Fine-tuned model
S3 bucket

Customer account

Identity and access,

Virtual private cloud
monitoring and logging

Training data
AWS Amazon AWS S3 bucket
CloudTrail CloudWatch IAM
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Security and privacy
You are always in control of your data

✓ Data not used to improve models, and not shared with model providers

✓ Customer data remain in Region

✓ Support for AWS PrivateLink and VPC configurations

✓ Integration with AWS IAM

✓ API monitoring in AWS CloudTrail, logging and metrics in Amazon CloudWatch

✓ Custom models encrypted and stored with service or customer managed keys
(CMK) – Only you have access to your models

Customizing an FM is Amazon Bedrock You can combine

a cost-effective way to supports multiple multiple customization
improve model ways to customize techniques for better
accuracy FMs performance

Gen AI in Action From POC To Business Value
No ratings yet
Gen AI in Action From POC To Business Value
42 pages
Session 7 LLMs Fine Tuning and RAG
No ratings yet
Session 7 LLMs Fine Tuning and RAG
21 pages
MP - English (R - 23)
No ratings yet
MP - English (R - 23)
192 pages
Building AI Agents With Autogen - Workshop
No ratings yet
Building AI Agents With Autogen - Workshop
49 pages
Generative AI Executive Deck
No ratings yet
Generative AI Executive Deck
63 pages
5 Junior P.E and Arts
No ratings yet
5 Junior P.E and Arts
83 pages
Generative AI Keynote
No ratings yet
Generative AI Keynote
59 pages
A CR CCP 702 PF 001 Red Star IG
No ratings yet
A CR CCP 702 PF 001 Red Star IG
730 pages
02 - Manufacturing Symposium - GenAI For Manufacturing
No ratings yet
02 - Manufacturing Symposium - GenAI For Manufacturing
22 pages
AWS SageMaker Built-In Algorithms Cheat Sheet
No ratings yet
AWS SageMaker Built-In Algorithms Cheat Sheet
20 pages
Analytics Platform Ai Extension Guide
No ratings yet
Analytics Platform Ai Extension Guide
69 pages
AWS AI Services
No ratings yet
AWS AI Services
30 pages
CH 2 Emerging Trends 1
No ratings yet
CH 2 Emerging Trends 1
43 pages
Build Your Generative AI Application With Amazon Bedrock
No ratings yet
Build Your Generative AI Application With Amazon Bedrock
23 pages
GAI Workshop L200 Budiling With GenAI On AWS ASEAN
No ratings yet
GAI Workshop L200 Budiling With GenAI On AWS ASEAN
87 pages
Embeddings
No ratings yet
Embeddings
83 pages
Lecture # 14-1 Introduction To RAG
No ratings yet
Lecture # 14-1 Introduction To RAG
56 pages
Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai
No ratings yet
Github - Blog - Ai and ML - Generative Ai - What Is Retrieval Augmented Generation and What Does It Do For Generative Ai
14 pages
File 21
No ratings yet
File 21
27 pages
Intro To Intelligent Apps Workshop
100% (1)
Intro To Intelligent Apps Workshop
106 pages
Azure OpenAI Workshop
No ratings yet
Azure OpenAI Workshop
30 pages
CBSE Class 6 Social Science Sample Paper SA 2 SET 1
No ratings yet
CBSE Class 6 Social Science Sample Paper SA 2 SET 1
2 pages
Aws Ai Official Test Q
No ratings yet
Aws Ai Official Test Q
9 pages
Untitled 2
No ratings yet
Untitled 2
40 pages
Amazon Bedrock Limited Preview - 20230630 Release Features For The Field 20230717 VF
No ratings yet
Amazon Bedrock Limited Preview - 20230630 Release Features For The Field 20230717 VF
38 pages
Recruitment Selection Training
No ratings yet
Recruitment Selection Training
29 pages
Deep Learning For Industries
No ratings yet
Deep Learning For Industries
45 pages
AES401 Use Gen AI To Query Space Imagery APIs With Natural Language Prompts
No ratings yet
AES401 Use Gen AI To Query Space Imagery APIs With Natural Language Prompts
22 pages
Embeddings 1686516367
No ratings yet
Embeddings 1686516367
82 pages
Kowsi Final Project
No ratings yet
Kowsi Final Project
50 pages
Architecture Patterns For Building Generative AI Applications
No ratings yet
Architecture Patterns For Building Generative AI Applications
29 pages
Embedding S
No ratings yet
Embedding S
83 pages
Prashant Generative AI On AWS
No ratings yet
Prashant Generative AI On AWS
32 pages
AIM001 Introduction To AI Services
No ratings yet
AIM001 Introduction To AI Services
28 pages
Pospiszyl 2023 The Fifth Element The Enlightenment and The Draining of Eastern Europe
No ratings yet
Pospiszyl 2023 The Fifth Element The Enlightenment and The Draining of Eastern Europe
28 pages
4.1 Guest Lecture - Intro To AI - Melissa Van Schaik
No ratings yet
4.1 Guest Lecture - Intro To AI - Melissa Van Schaik
38 pages
Building Blocks of Rag Ebook Final
100% (2)
Building Blocks of Rag Ebook Final
9 pages
Embeddings
No ratings yet
Embeddings
82 pages
AIM307 - Retrieval Augmented Generation With Amazon Bedrock
No ratings yet
AIM307 - Retrieval Augmented Generation With Amazon Bedrock
15 pages
NeurIPS 2023 Openagi When LLM Meets Domain Experts Paper Datasets - and - Benchmarks
No ratings yet
NeurIPS 2023 Openagi When LLM Meets Domain Experts Paper Datasets - and - Benchmarks
30 pages
Retrieval Augmented Generation Options Good 5 38
No ratings yet
Retrieval Augmented Generation Options Good 5 38
34 pages
CrateDB and LangChain
No ratings yet
CrateDB and LangChain
14 pages
AI With ICA 18092024 074806pm
No ratings yet
AI With ICA 18092024 074806pm
36 pages
File 38
No ratings yet
File 38
9 pages
Model Training and Fine Tuning
No ratings yet
Model Training and Fine Tuning
11 pages
Harness Proprietary Data With Foundational Models and RAG: by Marian Veteanu
No ratings yet
Harness Proprietary Data With Foundational Models and RAG: by Marian Veteanu
20 pages
GENAI1
No ratings yet
GENAI1
25 pages
MMBT3S4 Slides
No ratings yet
MMBT3S4 Slides
29 pages
Activity Based Costing
No ratings yet
Activity Based Costing
34 pages
Exploring HuggingFace
No ratings yet
Exploring HuggingFace
16 pages
Sheet Metal Shop Exp 1.3
No ratings yet
Sheet Metal Shop Exp 1.3
30 pages
Steps Involved in RAG
No ratings yet
Steps Involved in RAG
4 pages
Mathworks - Yann Debray - GPT-4o
No ratings yet
Mathworks - Yann Debray - GPT-4o
17 pages
The Energy Transition Conference 2023 - Delegates Brochure
No ratings yet
The Energy Transition Conference 2023 - Delegates Brochure
25 pages
Ai 1
No ratings yet
Ai 1
22 pages
Transformers 2023
No ratings yet
Transformers 2023
36 pages
AWS ML Ready Services
No ratings yet
AWS ML Ready Services
5 pages
CFE Final Course Output 2024 2025 1
No ratings yet
CFE Final Course Output 2024 2025 1
8 pages
Log
No ratings yet
Log
8 pages
PPE Lab Manual
No ratings yet
PPE Lab Manual
52 pages
DRAGO COSIC-prezentacija HIDROGEN
No ratings yet
DRAGO COSIC-prezentacija HIDROGEN
12 pages
AWSEducate Introduction To-Generative AI Transcript v1
No ratings yet
AWSEducate Introduction To-Generative AI Transcript v1
13 pages
Ucc2817, Ucc2818, Ucc3817 and Ucc3818 Bicmos Power Factor Pregulator
No ratings yet
Ucc2817, Ucc2818, Ucc3817 and Ucc3818 Bicmos Power Factor Pregulator
45 pages
Annisa Reiny HF - UKSW - Summary 2
No ratings yet
Annisa Reiny HF - UKSW - Summary 2
3 pages
Scipy - Stats.norm - SciPy v1.11.2 Manual
No ratings yet
Scipy - Stats.norm - SciPy v1.11.2 Manual
3 pages
Rag
No ratings yet
Rag
10 pages
Red Zone Equipment Checklist
No ratings yet
Red Zone Equipment Checklist
4 pages
R1 Nokia
No ratings yet
R1 Nokia
6 pages
159.52 101870341003 101870349999 Heating Climatic Unit
No ratings yet
159.52 101870341003 101870349999 Heating Climatic Unit
5 pages
A Taxonomy of Retrieval Augmented Generation
100% (2)
A Taxonomy of Retrieval Augmented Generation
56 pages
Generativeaiconamazonbedrock 231229150142 844d444e
No ratings yet
Generativeaiconamazonbedrock 231229150142 844d444e
48 pages
Item Analysis Procedures 1
No ratings yet
Item Analysis Procedures 1
2 pages
Fluostar 2L
No ratings yet
Fluostar 2L
1 page
5th and 6th Topic
No ratings yet
5th and 6th Topic
8 pages
NVIDIA RAG Whitepaper
No ratings yet
NVIDIA RAG Whitepaper
7 pages
Ways To Use LLM in Finance Organisation
No ratings yet
Ways To Use LLM in Finance Organisation
5 pages
How It Service Providers Can Innovate Ai ML Solutions With Minimal Data
No ratings yet
How It Service Providers Can Innovate Ai ML Solutions With Minimal Data
7 pages
SAP PM Configuration 3
100% (1)
SAP PM Configuration 3
30 pages
Getting Started With Generative Ai and Foundation Models
No ratings yet
Getting Started With Generative Ai and Foundation Models
16 pages
Darrel Todd Woodruff 261 WEST 600 NORTH #1, Logan, UT 84321 435-232-4326 Email Website
No ratings yet
Darrel Todd Woodruff 261 WEST 600 NORTH #1, Logan, UT 84321 435-232-4326 Email Website
2 pages
Data Quality DMB Ok Dam A Brasil
100% (1)
Data Quality DMB Ok Dam A Brasil
46 pages
Cyclotron
72% (61)
Cyclotron
20 pages
Aa BPG 375001
No ratings yet
Aa BPG 375001
36 pages
CNP Bill
No ratings yet
CNP Bill
1 page
Molo District Health Center: AP (Pre-Natal)
No ratings yet
Molo District Health Center: AP (Pre-Natal)
2 pages
Umakant B
No ratings yet
Umakant B
3 pages
Pollution Emitting From Guernsey Power Plant/PEH Incinerator and Proposed EtW
No ratings yet
Pollution Emitting From Guernsey Power Plant/PEH Incinerator and Proposed EtW
6 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Generative AI Applications

Uploaded by

Generative AI Applications

Uploaded by

© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.

01 Approaches to customizing generative AI solutions

02 Amazon Bedrock overview

03 Retrieval-augmented generation using Amazon Bedrock

04 Fine-tuning and continued pretraining using Amazon Bedrock

Customize for Adapt to Enhance Improve

CUSTOMIZATION YOUR DATA

KNOWLEDGE BASE VECTOR

Augment model with

Medium – once new Infrequent – only Infrequent – only

Relatively static Dynamic

Augment with Augment with Prompt

Amazon Bedrock Amazon Bedrock Amazon Bedrock Amazon Bedrock

Amazon Bedrock Choose Use as is or Send Receive

Improved content Contextual chatbots and Personalized search Real-time data

Horse -0.009 0.019 … 0.015

Powers text retrieval based Used to augment prompts High-accuracy embeddings

Amazon Titan Text Embeddings Highlights

• Applications of this model include semantic

Model ID: amazon.titan-embed-g1-text-02

Cohere Embed – English Cohere Embed – Multilingual

Output vectors: 1,024 Output vectors: 1,024

Model ID: cohere.embed-english-v3 Model ID: cohere.embed-multilingual-v3

Embeddings 0.89 -0.02 -0.53 0.95 0.17 -0.38

workflow Vector store Embeddings model Document store New data

Managing multiple Creating vector Incremental updates

Coding effort Scaling retrieval Orchestration

USER QUERY AUGMENTED PROMPT MODEL ANSWER

User Prompt Large language

Knowledge bases for Amazon

User Input RetrieveAndGenerate

Generate query Retrieve similar Augment query with Generate response

User Prompt Large Response

Generate query Retrieve similar documents

Vector Engine for Amazon Redis Enterprise Pinecone MongoDB

SELECT YOUR PROVIDE BASIC SELECT RELEVANT SPECIFY AVAILABLE

| Breaks down and orchestrates tasks |

Leave of absence (LOA)

• Maximize accuracy of FMs by providing

• Once deployed, custom models are invoked

• Customizations now supported for Amazon

Amazon Bedrock Amazon Bedrock

Inputs Outputs Storage Inferencing

Base FM Metrics and Custom models Playground

Identity and access,

✓ Customer data remain in Region

✓ Support for AWS PrivateLink and VPC configurations

✓ Integration with AWS IAM

✓ API monitoring in AWS CloudTrail, logging and metrics in Amazon CloudWatch

Customizing an FM is Amazon Bedrock You can combine

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.