F5 AI Reference Architecture
Questions? businessdevelopment@f5.com

AI / ML Reference Architecture Overview

Mike Rau, SVP, Enterprise Technical Strategy
Alysia Groves, Sr. Business Manager, Business Development
Ian Lauth, Senior Manager, Product Marketing for AI
Generative AI threatens to make this scary complexity even more acute, with new GPU-centric clouds and new foundational model providers entering the picture.
[Diagram: an application estate spanning AWS, SaaS, edge, and AI apps]
Are you building an AI Product or
delivering Operational Efficiency?
Four Deployment Models

SaaS AI
The AI solution is provided as a fully managed service by a third-party provider. Customers can access and use the AI capabilities over the internet without worrying about the underlying infrastructure, maintenance, or updates, making it a convenient and scalable option.

Cloud-Hosted AI
The AI solution runs on cloud infrastructure provided by cloud service providers such as AWS, Google Cloud, or Azure. It offers flexibility, scalability, and ease of integration with other cloud services, while the customer maintains control over the configuration and management of their AI systems.

Self-Hosted AI
The AI solution is deployed on the customer's own infrastructure, such as on-premises servers or private data centers. This provides maximum control and customization options but requires significant resources for setup, maintenance, and management of the hardware and software components.

Edge-Hosted AI
The AI solution runs in an edge environment, outside traditional cloud or data center infrastructure; an example is a machine learning solution operating on a device like a kiosk in a retail storefront. This model reduces latency, enhances privacy, and ensures real-time processing by bringing the computation closer to the data source or end user.
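To make the contrast concrete, here is a minimal sketch (our illustration, not part of the deck) assuming an OpenAI-compatible chat completions API, which many SaaS providers and self-hosted inference servers expose. The URLs, model name, and environment variable are placeholders; the point is that the application code can stay the same across deployment models, while the base URL, credentials, and operational responsibility change.

```python
import os
import requests

# Illustrative endpoints; swap one in depending on the deployment model.
SAAS_BASE_URL = "https://api.example-ai-provider.com/v1"      # SaaS: provider-managed
SELF_HOSTED_BASE_URL = "http://inference.internal:8000/v1"    # Self-hosted: your own cluster

def chat(base_url: str, prompt: str, model: str = "example-model") -> str:
    """Send one chat-completion request to an OpenAI-compatible endpoint."""
    resp = requests.post(
        f"{base_url}/chat/completions",
        headers={"Authorization": f"Bearer {os.environ.get('AI_API_KEY', '')}"},
        json={"model": model, "messages": [{"role": "user", "content": prompt}]},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]

# Same call, different deployment model behind it.
# print(chat(SAAS_BASE_URL, "Summarize our return policy."))
# print(chat(SELF_HOSTED_BASE_URL, "Summarize our return policy."))
```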
OWASP LLM Top Ten
Educate developers, designers, architects, managers, and organizations about the potential
security risks when deploying and managing LLM and Generative AI applications.
Design Requirements
Define the essential capabilities, technologies, and principles needed to address technical
challenges and ensure effective solution implementation.
Seven AI Building Blocks

[Diagram: the seven AI building blocks in the context of web apps & APIs, RAG corpus management, fine-tuning, training, and app development]

Focus Area: in this deck we will be showing two of the seven building blocks. For access to the full deck, please reach out to your F5 account team or email businessdevelopment@f5.com.
AI Component Architecture

[Diagram: Development Services, Fine-Tuning Services with fine-tuning data, Training Services with training data, source/config control, CI/CD, and end users]
Seven AI Building Blocks
Inference
This building block involves the process of making predictions or generating outputs
based on input data using pre-trained AI models. It's the core function where the AI
system applies its learned knowledge to new, unseen data.
[Diagram: the AI component architecture (Development, Fine-Tuning, and Training Services; fine-tuning and training data; source/config control; CI/CD; end users), with knowledge corpus data drawn from databases, websites, and queues]
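As a concrete illustration (ours, not the deck's), the sketch below runs inference with a small pre-trained Hugging Face model; "distilgpt2" is just an example, and any pre-trained model served behind the inference cluster plays the same role.

```python
# Minimal inference sketch: apply a pre-trained model to new, unseen input.
# Requires the `transformers` package.
from transformers import pipeline

generator = pipeline("text-generation", model="distilgpt2")

prompt = "An AI reference architecture should"
outputs = generator(prompt, max_new_tokens=40, num_return_sequences=1)
print(outputs[0]["generated_text"])
```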
Inference with Retrieval Augmented Generation (RAG)
RAG combines the capabilities of retrieval and generation models to produce more informed and
accurate responses. It retrieves relevant information from a predefined corpus and uses it to enhance
the generation process, resulting in more contextually appropriate outputs.
[Diagram: the AI component architecture, as above]
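A minimal RAG sketch follows, assuming a stand-in embed() helper (any real embedding model would do) and a small in-memory corpus: the query is embedded, the closest documents are retrieved by cosine similarity, and the retrieved text is prepended to the prompt sent to the generation model.

```python
import math

def embed(text: str) -> list[float]:
    # Stand-in for a real embedding model: a tiny normalized character-frequency vector.
    vec = [0.0] * 26
    for ch in text.lower():
        if "a" <= ch <= "z":
            vec[ord(ch) - ord("a")] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a: list[float], b: list[float]) -> float:
    # Vectors are already normalized, so the dot product is the cosine similarity.
    return sum(x * y for x, y in zip(a, b))

# Predefined corpus the retrieval step draws from.
corpus = [
    "Refunds are issued within 14 days of purchase.",
    "Support is available 24/7 via chat.",
    "Enterprise plans include a dedicated account team.",
]
index = [(doc, embed(doc)) for doc in corpus]

def retrieve(query: str, k: int = 2) -> list[str]:
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]

question = "How long do refunds take?"
context = "\n".join(retrieve(question))
augmented_prompt = f"Use this context to answer:\n{context}\n\nQuestion: {question}"
# augmented_prompt is what gets sent to the inference service from the previous block.
print(augmented_prompt)
```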
RAG Corpus Management
This focuses on maintaining and curating the database or corpus of information that the AI system
uses for Retrieval-Augmented Generation. It includes updating, organizing, and ensuring the quality
of the data to support accurate and relevant retrieval.
[Diagram: the AI component architecture, as above]
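A sketch of the curation side, reusing the toy embed() helper from the RAG sketch above: documents are chunked, embedded, and upserted into a vector store keyed by document ID, and stale entries are dropped when a source document changes or is retired. A real deployment would use an actual vector database and embedding model; the structure here is only illustrative.

```python
from datetime import datetime, timezone

# Toy vector store: chunk_id -> {"text", "vector", "updated_at"}.
vector_store: dict[str, dict] = {}

def chunk(text: str, size: int = 200) -> list[str]:
    """Split a document into fixed-size character chunks."""
    return [text[i : i + size] for i in range(0, len(text), size)]

def delete_document(doc_id: str) -> None:
    """Remove every chunk belonging to a retired or changed source document."""
    for key in [k for k in vector_store if k.split("#")[0] == doc_id]:
        del vector_store[key]

def upsert_document(doc_id: str, text: str) -> None:
    """Re-chunk, re-embed, and replace all entries for a source document."""
    delete_document(doc_id)
    for n, piece in enumerate(chunk(text)):
        vector_store[f"{doc_id}#{n}"] = {
            "text": piece,
            "vector": embed(piece),  # embed() as defined in the RAG sketch above
            "updated_at": datetime.now(timezone.utc),
        }

upsert_document("returns-policy", "Refunds are issued within 14 days of purchase.")
upsert_document("returns-policy", "Refunds are issued within 30 days of purchase.")  # update in place
print(len(vector_store), "chunk(s) indexed")
```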
External Services Integration
This involves connecting the AI system with external services and APIs, enabling it to interact, retrieve data, or
perform actions based on user requests or model inference. It allows the AI to leverage external tools and
databases to extend its functionality and autonomously make decisions or take actions as necessary.
[Diagram: the AI component architecture, as above]
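A minimal sketch of the pattern, with a hypothetical tool registry and a hard-coded "model decision" standing in for real function-calling output from an LLM: the orchestration layer looks up the requested tool, passes the arguments, and executes the call on the model's behalf.

```python
import json
from datetime import date

# Hypothetical external services exposed to the model as tools.
def get_order_status(order_id: str) -> dict:
    return {"order_id": order_id, "status": "shipped", "eta": str(date.today())}

def create_ticket(subject: str) -> dict:
    return {"ticket_id": "T-1001", "subject": subject}

TOOLS = {"get_order_status": get_order_status, "create_ticket": create_ticket}

def dispatch(tool_call_json: str) -> dict:
    """Route a model-produced tool call to the matching external service."""
    call = json.loads(tool_call_json)
    fn = TOOLS.get(call["name"])
    if fn is None:
        raise ValueError(f"Unknown tool: {call['name']}")
    return fn(**call["arguments"])

# In practice this JSON would come from the model's function-calling output.
model_output = '{"name": "get_order_status", "arguments": {"order_id": "A123"}}'
print(dispatch(model_output))
```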
Fine-Tuning
This process involves adjusting a pre-trained AI model on specific datasets to improve its
performance for a particular task or domain. Fine-tuning helps tailor the model's capabilities to
better meet the unique needs of specific applications or industries.
[Diagram: the AI component architecture, as above]
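A minimal PyTorch sketch of the idea, using a small stand-in network instead of a real foundation model: the pre-trained backbone is frozen and only a new task-specific head is adjusted on (toy) domain data.

```python
import torch
import torch.nn as nn

# Stand-in for a pre-trained backbone (assume its weights were learned elsewhere).
backbone = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 32))
head = nn.Linear(32, 2)  # new task-specific head for the target domain

# Freeze the backbone so only the head is tuned on the domain-specific data.
for p in backbone.parameters():
    p.requires_grad = False

opt = torch.optim.Adam(head.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(64, 16)          # toy domain-specific features
y = torch.randint(0, 2, (64,))   # toy labels

for epoch in range(5):
    logits = head(backbone(x))
    loss = loss_fn(logits, y)
    opt.zero_grad()
    loss.backward()
    opt.step()
```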
Training
This is the process of teaching an AI model by exposing it to large amounts of data and allowing it
to learn patterns and features. Training involves multiple iterations and optimizations to develop
a model that can generalize well to new, unseen data.
[Diagram: the AI component architecture, as above]
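A toy training loop under the same assumptions, showing the pieces the paragraph names: repeated passes over the data, optimization of the loss, and a held-out validation set to check that the model generalizes to unseen examples.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy dataset with a train/validation split to measure generalization.
X, y = torch.randn(256, 16), torch.randint(0, 2, (256,))
X_train, y_train, X_val, y_val = X[:200], y[:200], X[200:], y[200:]

model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 2))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(10):                      # multiple iterations over the data
    model.train()
    loss = loss_fn(model(X_train), y_train)
    opt.zero_grad()
    loss.backward()
    opt.step()

    model.eval()
    with torch.no_grad():                    # validation on unseen data
        val_acc = (model(X_val).argmax(dim=1) == y_val).float().mean()
    print(f"epoch {epoch}: train_loss={loss.item():.3f} val_acc={val_acc:.2f}")
```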
Development
This encompasses the overall creation, testing, and deployment of AI solutions.
It involves coding, integrating various AI components, and ensuring that the
system is robust, scalable, and ready for production use.
[Diagram: the AI component architecture, as above]
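As one small illustration of the testing side (an assumption on our part, not an F5 recommendation), here is a pytest-style check that a hypothetical inference wrapper behaves robustly before it ships: it returns text for normal input and rejects empty input instead of failing silently.

```python
import pytest

def answer(prompt: str) -> str:
    """Hypothetical wrapper around the deployed inference service."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return f"(model response to: {prompt})"  # stand-in for a real model call

def test_returns_text_for_valid_prompt():
    assert isinstance(answer("What is RAG?"), str)

def test_rejects_empty_prompt():
    with pytest.raises(ValueError):
        answer("   ")
```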
Inference with Retrieval-Augmented Generation (RAG)
[Diagram: the AI component architecture, showing where inference with RAG fits]
[Diagram: Inference Services, with end users reaching front-end applications, orchestration, the LLM inference cluster, and the model repository]
OWASP LLM Top Ten
Risks called out on the inference path include LLM01 Prompt Injection, LLM09 Misinformation, and LLM10 Unbounded Consumption, with additional callouts for LLM02, LLM05, and LLM08.
[Diagram: the Inference Services components annotated with these OWASP risks]
F5 Application Delivery Top Ten
Alongside the OWASP risks above, the F5 Application Delivery Top Ten maps onto the same path:
ADC01 Weak DNS Practices
ADC02 Lack of Fault Tolerance & Resilience
ADC03 Incomplete Observability
ADC04 Insufficient Traffic Controls
ADC05 Unoptimized Traffic Steering
ADC06 Inability to Handle Latency
[Diagram: Inference Services and Retrieval Augmentation Services (retrieval engine, embedding LLM) annotated with the OWASP and ADC risks and numbered callouts]
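As a rough sketch of how a gateway in front of the inference services might address two of these risks, LLM01 Prompt Injection and LLM10 Unbounded Consumption / ADC04 Insufficient Traffic Controls, here is illustrative pre-processing logic; the patterns and limits are placeholder assumptions, not F5 product behavior.

```python
import re
from collections import defaultdict

# Placeholder heuristics for obviously suspicious instructions (LLM01 Prompt Injection).
INJECTION_PATTERNS = [
    re.compile(r"ignore (all|previous) instructions", re.IGNORECASE),
    re.compile(r"reveal (the )?system prompt", re.IGNORECASE),
]

# Per-user request budget to bound consumption (LLM10 / ADC04).
REQUESTS_PER_MINUTE = 30
usage: dict[str, int] = defaultdict(int)

def admit(user_id: str, prompt: str) -> bool:
    """Return True if the request may be forwarded to the inference cluster."""
    if any(p.search(prompt) for p in INJECTION_PATTERNS):
        return False                      # block likely injection attempts
    if usage[user_id] >= REQUESTS_PER_MINUTE:
        return False                      # enforce traffic controls
    usage[user_id] += 1                   # a real gateway would reset this each minute
    return True

print(admit("alice", "Summarize this document."))           # True
print(admit("alice", "Ignore previous instructions ..."))   # False
```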
Design Requirements
1 Distributed Compute Services
2 AI Compute Resources
5 Centralized Security Policy Management
6 AI/ML Data Loss Prevention
7 AI/ML Security
[Diagram: these design requirements mapped by number onto the inference-with-RAG components]
SaaS Deployment
[Diagram: SaaS deployment with global site management, an AI Gateway in front of the Inference Services (front-end applications, orchestration, LLM inference cluster, model repository), and an app site for end users, annotated with the numbered design requirements]
Cloud-Hosted Deployment
[Diagram: the same topology hosted on cloud infrastructure: site management, an AI Gateway, the Inference Services, and an app site, annotated with the numbered design requirements]
Self-Hosted Deployment
[Diagram: the Inference Services running within the customer's own site, with global and site tiers and the numbered design requirements annotated]
RAG Corpus Management
[Diagram: the AI component architecture, showing where RAG corpus management fits]
[Diagram: RAG corpus management pipeline with document pre-processing & embedding, a retrieval engine, an embedding LLM, object storage, and a vector DB]
OWASP LLM Top Ten and F5 Application Delivery Top Ten
Risks called out on the corpus management path include LLM01 Prompt Injection, LLM09 Misinformation, and LLM10 Unbounded Consumption, along with ADC01 Weak DNS Practices, ADC02 Lack of Fault Tolerance & Resilience, and ADC03 Incomplete Observability.
[Diagram: the corpus management components annotated with these OWASP and ADC risks]
Design Requirements
1 Distributed Compute Services
2 AI Compute Resources
5 Centralized Security Policy Management
6 AI/ML Data Loss Prevention
7 AI/ML Security
8 AI/ML Observability
[Diagram: these design requirements mapped by number onto the corpus management components]
Cloud Deployment
[Diagram: cloud-hosted deployment of the corpus management pipeline (document pre-processing & embedding, retrieval engine, embedding LLM, object storage, vector DB) with site management, annotated with the numbered design requirements]
Self-Hosted Deployment
[Diagram: self-hosted deployment of the corpus management pipeline within the customer's own sites, annotated with the numbered design requirements]