Prompt Engineering For Vision Models Slides 1720084286
Vision Models
What is a Prompt?
“A photorealistic image of an astronaut riding a horse on the moon.”
Input (Data)
Prompt (Instructions)
[Diagram: segmentation pipeline. An image encoder processes the input image; a prompt encoder processes the prompt (bounding box or coordinates); a mask decoder outputs masks, each with an IoU score.]
FastSAM
IoU = intersection(ground truth, prediction) / union(ground truth, prediction)
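The IoU ratio can be sketched in a few lines of plain Python. This is only an illustration, not the scoring code of any particular model: the function name `mask_iou`, the nested-list mask representation, and the choice to score two empty masks as 1.0 are all assumptions made for the example.

```python
def mask_iou(ground_truth, prediction):
    """IoU of two binary masks given as equally sized nested lists of 0/1."""
    if len(ground_truth) != len(prediction):
        raise ValueError("masks must have the same number of rows")
    intersection = 0
    union = 0
    for gt_row, pred_row in zip(ground_truth, prediction):
        if len(gt_row) != len(pred_row):
            raise ValueError("masks must have the same number of columns")
        for gt, pred in zip(gt_row, pred_row):
            # A pixel is in the intersection if both masks mark it,
            # and in the union if at least one mask marks it.
            intersection += 1 if (gt and pred) else 0
            union += 1 if (gt or pred) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Hypothetical 3x4 masks: 2 pixels overlap, 6 pixels are covered in total.
ground_truth = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
prediction = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
print(mask_iou(ground_truth, prediction))
```

Here the two masks share 2 pixels out of 6 covered overall, so the score is one third. A score of 1.0 means the prediction matches the ground truth exactly, and 0.0 means no overlap at all.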
bounding boxes
[[[x1, y1], [x2, y2]]]
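To make the nested format concrete, the sketch below reads it as a list of boxes in which each box is a pair of corner points. That reading is an assumption based on the notation alone; the helper names `to_xyxy` and `box_iou` are hypothetical and do not belong to any library. The example flattens each box to `[x_min, y_min, x_max, y_max]` and computes IoU between a ground-truth box and a predicted box.

```python
def to_xyxy(boxes):
    """Flatten [[[x1, y1], [x2, y2]], ...] to [[x_min, y_min, x_max, y_max], ...]."""
    flat = []
    for (xa, ya), (xb, yb) in boxes:
        # Sort the corners so the result is valid even if they were swapped.
        flat.append([min(xa, xb), min(ya, yb), max(xa, xb), max(ya, yb)])
    return flat


def box_iou(a, b):
    """IoU of two axis-aligned boxes in [x_min, y_min, x_max, y_max] form."""
    # Width and height of the overlap rectangle, clamped at zero.
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    intersection = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - intersection
    # Assumed convention: degenerate (zero-area) boxes score 0.0.
    return intersection / union if union else 0.0


# Hypothetical boxes in the nested corner-pair format.
ground_truth_boxes = [[[0, 0], [10, 10]]]
predicted_boxes = [[[5, 5], [15, 15]]]

gt = to_xyxy(ground_truth_boxes)[0]
pred = to_xyxy(predicted_boxes)[0]
print(gt, pred, box_iou(gt, pred))
```

The overlap is a 5 by 5 square (area 25) and the union is 100 + 100 - 25 = 175, so the IoU is 25/175. Models and libraries differ in the exact nesting and coordinate order they accept for box prompts, so the expected shape should be checked against the relevant documentation.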