0% found this document useful (0 votes)

55 views17 pages

Prompt Engineering For Vision Models Slides 1720084286

Uploaded by

Ubaid Mujahid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views17 pages

Prompt Engineering For Vision Models Slides 1720084286

Uploaded by

Ubaid Mujahid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Prompt Engineering for

Vision Models
What is a Prompt?
“A photorealistic image
of an astronaut riding a
horse on the moon.”

[0.24, -0.18, 0.14, 0.07, -0.03, …, 0.23]

What is Visual Prompting?

Visual prompting is a method of interacting with

a pre-trained model to accomplish a specific
task that it might not necessarily have been
explicitly trained to do.

This often involves passing a set of instructions to

the model, describing what you’d like it to do.

“Highlight the dog

on the left.”
Prompt vs. Input

Input (Data)

Prompt (Instructions)

“Segment the dog

on the left.”
Traditional ML Workflows

Update data and Test

hyperparameters

Train Update model

weights
Image segmentation
Image segmentation

Source: Jeremy Jordan

"An overview of semantic image segmentation"
https://www.jeremyjordan.me/semantic-segmentation/
Segment Anything Model
valid masks
(top 3)

image
encoder + IoU score

mask
decoder

+ IoU score
bounding box
prompt
encoder
coordinates

+ IoU score
FastSAM

Source: "Fast Segment Anything"

Xu Zhao, Wenchao Ding, Yongqi An, Yinglong Du, Tao Yu, Min Li, Ming Tang,
Jinqiao Wang
Example image
Prompting with coordinates
Prompting with bounding
boxes
Embeddings
“Ships at a distance
have every man’s wish [0.12, -0.31, 0.79, 0.05, …, -0.41]
on board.”

"Too much sanity may be

madness — and maddest [0.92, 0.31, -0.22, -0.39, …, 0.03]
of all: to see life as it is,
and not as it should be!"

[-0.72, -0.05, 0.82, 0.74, …, 0.06]

[0.75, -0.93, -0.27, 0.40, …, 0.08]

Intersection Over Union

ground truth

prediction

intersection prediction

IoU =
union ground truth

prediction

prediction
bounding boxes
[[[x1, y1], [x2, y2]]]

[[[xmin, ymin, xmax, ymax]]]

OWL-ViT
Text prompt Bounding Boxes

"Simple Open-Vocabulary Object Detection with Vision Transformers"

by Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey
Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang,
Xiaohua Zhai, Thomas Kipf, and Neil Houlsby
MobileSAM

Model distillation is the process of transferring

knowledge from a large model to a smaller one.
Model distillation is different from other model
compression techniques in that it doesn’t actually
change the model format, but trains an entirely new
(and smaller) model.

Source: "MobileSAMv2: Faster Segment Anything to Everything"

Chaoning Zhang, Dongshen Han, Sheng Zheng, Jinwoo Choi, Tae-Ho Kim,
Choong Seon Hong

L4C1 Examiner Report March 2022
No ratings yet
L4C1 Examiner Report March 2022
7 pages
Skip Gram
100% (1)
Skip Gram
37 pages
AI.02a - Solving Problems by Searching - T
No ratings yet
AI.02a - Solving Problems by Searching - T
118 pages
Fraser Parker - Occlus
100% (4)
Fraser Parker - Occlus
24 pages
CSE860 - 08 - Searching For Solutions
No ratings yet
CSE860 - 08 - Searching For Solutions
11 pages
Informed Search
No ratings yet
Informed Search
36 pages
Neural Networks PDF
No ratings yet
Neural Networks PDF
89 pages
Artificial Intelligence Unit IV
No ratings yet
Artificial Intelligence Unit IV
105 pages
4 - C Problem Solving Agents
No ratings yet
4 - C Problem Solving Agents
17 pages
CSC445: Neural Networks
No ratings yet
CSC445: Neural Networks
51 pages
Unit 2 AI
No ratings yet
Unit 2 AI
107 pages
EDA - The Right Way
No ratings yet
EDA - The Right Way
111 pages
Lesson 4 Logic and Knowledge Representation
No ratings yet
Lesson 4 Logic and Knowledge Representation
100 pages
2011 Design House Catalog
No ratings yet
2011 Design House Catalog
220 pages
4 - Vector Data Model
No ratings yet
4 - Vector Data Model
31 pages
Chapters 8 & 9 First-Order Logic: Dr. Daisy Tang
No ratings yet
Chapters 8 & 9 First-Order Logic: Dr. Daisy Tang
76 pages
PPT03-First Order Logic & Inference in FOL
No ratings yet
PPT03-First Order Logic & Inference in FOL
59 pages
Mitosis
No ratings yet
Mitosis
15 pages
Knowledge Representation First Order Logic
No ratings yet
Knowledge Representation First Order Logic
49 pages
Artificial Intelligence For R-2017 by Krishna Sankar P., Shangaranarayanee N. P., Nithyananthan S.
0% (1)
Artificial Intelligence For R-2017 by Krishna Sankar P., Shangaranarayanee N. P., Nithyananthan S.
8 pages
Tf-Idf: David Kauchak cs160 Fall 2009
No ratings yet
Tf-Idf: David Kauchak cs160 Fall 2009
51 pages
Figurative Speech
No ratings yet
Figurative Speech
9 pages
Lecture 05 - Part A First Order Logic (FOL) : Dr. Shazzad Hosain
No ratings yet
Lecture 05 - Part A First Order Logic (FOL) : Dr. Shazzad Hosain
80 pages
Decision Tree & Random Forest
No ratings yet
Decision Tree & Random Forest
28 pages
Lab I TENSOR FLOW AND KERAS
No ratings yet
Lab I TENSOR FLOW AND KERAS
3 pages
Knowledge Based Systems (Sistem Berbasis Pengetahuan) : Ir. Wahidin Wahab M.SC PH.D
No ratings yet
Knowledge Based Systems (Sistem Berbasis Pengetahuan) : Ir. Wahidin Wahab M.SC PH.D
33 pages
Data Science Introduction
No ratings yet
Data Science Introduction
82 pages
AutoGen - The Automated Program Generator
No ratings yet
AutoGen - The Automated Program Generator
196 pages
Topic For The Class:: Knowledge and Reasoning
No ratings yet
Topic For The Class:: Knowledge and Reasoning
41 pages
Model With One-Word Context: 2vec 2vec 2vec 2vec
100% (1)
Model With One-Word Context: 2vec 2vec 2vec 2vec
17 pages
Sree Kaala Hastiswara Satakam in Telugu PDF
No ratings yet
Sree Kaala Hastiswara Satakam in Telugu PDF
21 pages
ch9 Ensemble Learning
No ratings yet
ch9 Ensemble Learning
19 pages
m8 Fol
No ratings yet
m8 Fol
27 pages
SUpport Vector Machine
No ratings yet
SUpport Vector Machine
28 pages
Data Science Project
No ratings yet
Data Science Project
3 pages
Chapter 7 Suffrage
No ratings yet
Chapter 7 Suffrage
7 pages
Metric Screw Thread Chart: Metric Tap Size Tap Drill (Inches) Clearance Drill (Inches)
No ratings yet
Metric Screw Thread Chart: Metric Tap Size Tap Drill (Inches) Clearance Drill (Inches)
2 pages
2023 Intro To Generative Ai
No ratings yet
2023 Intro To Generative Ai
15 pages
542 315 Word2vec
No ratings yet
542 315 Word2vec
20 pages
Frida Kahlo: By: Maria Jose Castillo, Camila Amaya, Danna Valencia
No ratings yet
Frida Kahlo: By: Maria Jose Castillo, Camila Amaya, Danna Valencia
9 pages
Technical Seminar: Sapthagiri College of Engineering
No ratings yet
Technical Seminar: Sapthagiri College of Engineering
18 pages
Statistics Presentation
No ratings yet
Statistics Presentation
21 pages
Session 11-12 - Text Analytics
No ratings yet
Session 11-12 - Text Analytics
38 pages
UNIT 3 KR Predicate Logic
No ratings yet
UNIT 3 KR Predicate Logic
53 pages
CS 8520: Artificial Intelligence: Knowledge Representation
No ratings yet
CS 8520: Artificial Intelligence: Knowledge Representation
30 pages
Inference in First Order Logic
No ratings yet
Inference in First Order Logic
26 pages
60N3LH5 STMicroelectronics
No ratings yet
60N3LH5 STMicroelectronics
16 pages
6 - Train - Test - Split - Ipynb - Colaboratory
No ratings yet
6 - Train - Test - Split - Ipynb - Colaboratory
5 pages
Punctuation Worksheet
No ratings yet
Punctuation Worksheet
4 pages
Data Science Intervieew Questions
100% (1)
Data Science Intervieew Questions
16 pages
Goals of Machine Learning in Artificial Intelligence
No ratings yet
Goals of Machine Learning in Artificial Intelligence
3 pages
Automatic Music Generation
No ratings yet
Automatic Music Generation
16 pages
Knowledge Representation Additional Reading
No ratings yet
Knowledge Representation Additional Reading
26 pages
All Pairs Shortest Path
No ratings yet
All Pairs Shortest Path
28 pages
Lecture Notes - Logistic Regression
100% (1)
Lecture Notes - Logistic Regression
11 pages
The Reign of Terror
No ratings yet
The Reign of Terror
11 pages
Application of First-Order Logic in Knowledge Based Systems PDF
No ratings yet
Application of First-Order Logic in Knowledge Based Systems PDF
7 pages
Agents & Environment
No ratings yet
Agents & Environment
24 pages
Generative Ai Guide
No ratings yet
Generative Ai Guide
2 pages
Annual Report On CSR Activities 2021-22
No ratings yet
Annual Report On CSR Activities 2021-22
16 pages
Monstrous - Map Crow - Cloud Curigwxfuo - Preview - 10272022
No ratings yet
Monstrous - Map Crow - Cloud Curigwxfuo - Preview - 10272022
8 pages
Dropout Vs Pruning
No ratings yet
Dropout Vs Pruning
2 pages
Module-5:: Network Analysis
No ratings yet
Module-5:: Network Analysis
22 pages
Generative AI For Media Analysis - Partner Use Case Package
No ratings yet
Generative AI For Media Analysis - Partner Use Case Package
45 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
21 pages
Case Daka
No ratings yet
Case Daka
7 pages
MAPEH 7 Badminton
No ratings yet
MAPEH 7 Badminton
3 pages
Mining The Web Graph: Technical Seminar Presentation On
No ratings yet
Mining The Web Graph: Technical Seminar Presentation On
15 pages
Infinitiv Ili - Ing
0% (1)
Infinitiv Ili - Ing
4 pages
Mineral Resource Conflict Jharkhand
No ratings yet
Mineral Resource Conflict Jharkhand
20 pages
Hill Climbing Vs Simulated Annealing
100% (1)
Hill Climbing Vs Simulated Annealing
14 pages
LLM Agents - Prompt Engineering Guide
No ratings yet
LLM Agents - Prompt Engineering Guide
16 pages
6.1comprehensive Interviews
No ratings yet
6.1comprehensive Interviews
2 pages
Zamoras Vs Su Case Digest
No ratings yet
Zamoras Vs Su Case Digest
1 page
On Ai
No ratings yet
On Ai
24 pages
All Ges101 Past Questions-1-1
No ratings yet
All Ges101 Past Questions-1-1
55 pages
Holiday Homework 8th-1
No ratings yet
Holiday Homework 8th-1
4 pages
Mihir (J) Bhatt - LinkedIn
No ratings yet
Mihir (J) Bhatt - LinkedIn
7 pages
Test Accessories Main Catalog: Test & Measureline - Test & Measurement
No ratings yet
Test Accessories Main Catalog: Test & Measureline - Test & Measurement
188 pages
Summer Vacation Assignment of Class Xii (2023-2024)
No ratings yet
Summer Vacation Assignment of Class Xii (2023-2024)
3 pages
Generative Adversial Network
No ratings yet
Generative Adversial Network
21 pages
Evaluating and Choosing An Iot Platform
No ratings yet
Evaluating and Choosing An Iot Platform
26 pages
Autogen Core Concepts
No ratings yet
Autogen Core Concepts
9 pages
Translation For University Students - College of Artsdocx
No ratings yet
Translation For University Students - College of Artsdocx
28 pages
MCA 301 Data Mining Notes
No ratings yet
MCA 301 Data Mining Notes
6 pages
Deep CNN Based Brain Tumor Detection in - 2024 - International Journal of Intel
No ratings yet
Deep CNN Based Brain Tumor Detection in - 2024 - International Journal of Intel
8 pages
English ss1 2nd Term
No ratings yet
English ss1 2nd Term
17 pages
Growth Unhinged Carousel
No ratings yet
Growth Unhinged Carousel
10 pages
Hebbian Learning: Fundamentals and Applications for Uniting Memory and Learning
From Everand
Hebbian Learning: Fundamentals and Applications for Uniting Memory and Learning
Fouad Sabry
No ratings yet
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Prompt Engineering For Vision Models Slides 1720084286

Uploaded by

Prompt Engineering For Vision Models Slides 1720084286

Uploaded by

Prompt Engineering for

[0.24, -0.18, 0.14, 0.07, -0.03, …, 0.23]

Visual prompting is a method of interacting with

This often involves passing a set of instructions to

“Highlight the dog

“Segment the dog

Update data and Test

Train Update model

Source: Jeremy Jordan

Source: "Fast Segment Anything"

"Too much sanity may be

[-0.72, -0.05, 0.82, 0.74, …, 0.06]

[0.75, -0.93, -0.27, 0.40, …, 0.08]

[[[xmin, ymin, xmax, ymax]]]

"Simple Open-Vocabulary Object Detection with Vision Transformers"

Model distillation is the process of transferring

Source: "MobileSAMv2: Faster Segment Anything to Everything"

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.