0% found this document useful (0 votes)
63 views7 pages

What Is The Goal Stereo Vision?

Stereo vision uses two or more images of a 3D scene captured from different viewpoints to recover depth information. It determines the 3D position of points by finding the intersection of the lines passing through the point projections in each image. There are two main problems in stereo vision: the correspondence problem of identifying matching points, and the reconstruction problem of using disparities between matches to compute depth. Key concepts include the baseline between cameras, epipolar geometry, disparity maps, and triangulation to reconstruct 3D structure from 2D image correspondences.

Uploaded by

Sentinel Berg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
63 views7 pages

What Is The Goal Stereo Vision?

Stereo vision uses two or more images of a 3D scene captured from different viewpoints to recover depth information. It determines the 3D position of points by finding the intersection of the lines passing through the point projections in each image. There are two main problems in stereo vision: the correspondence problem of identifying matching points, and the reconstruction problem of using disparities between matches to compute depth. Key concepts include the baseline between cameras, epipolar geometry, disparity maps, and triangulation to reconstruct 3D structure from 2D image correspondences.

Uploaded by

Sentinel Berg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

http://www.cse.unr.edu/~bebis/CS791E/Notes/StereoCamera.

pdf

Stereo Vision
What is the goal stereo vision?
- The recovery of the 3D structure of a scene using two or more images of the 3D
scene, each acquired from a different viewpoint in space.
- The images can be obtained using muliple cameras or one moving camera.
- The term binocular vision is used when two cameras are employed.

-2-

Stereo setup and terminology


Fixation point: the point of intersection of the optical axis.
Baseline: the distance between the centers of projection.
Epipolar plane: the plane passing through the centers of projection and the point in
the scene.
Epipolar line: the intersection of the epipolar plane with the image plane.
Conjugate pair: any point in the scene that is visible in both cameras will be projected to a pair of image points in the two images.
Disparity: the distance between corresponding points when the two images are superimposed.
Disparity map: the disparities of all points form the disparity map (can be displayed
as an image).

-3-

(two cameras in arbitrary position and orientation)

Triangulation - the principle underlying stereo vision


- The 3D location of any visible object point in space is restricted to the straight line
that passes through the center of projection and the projection of the object point.
- Binocular stereo vision determines the position of a point in space by finding the
intersection of the two lines passing through the center of projection and the projection of the point in each image.

-4-

The two problems of stereo


(1) The correspondence problem.
(2) The reconstruction problem.

The correspondence problem


- Finding pairs of matched points such that each point in the pair is the projection of
the same 3D point.
- Triangulation depends crucially on the solution of the correspondence problem.

- Ambiguous correspondence between points in the two images may lead to several
different consistent interpretations of the scene.

-5-

The reconstruction problem


- Given the corresponding points, we can compute the disparity map.
- The disparity map can be converted to a 3D map of the scene (i.e., recover the 3D
structure) if the stereo geometry is known.

-6-

Recovering depth (reconstruction)


- Consider recovering the position of P from its projections pl and pr .
xl = f

Xl
xZ
X
x Z
or X l = l l and x r = f r or X r = r r
Zl
f
Zr
f

- In general, the two cameras are related by the following transformation:


P r = R(P l T )
- Using Z r = Z l = Z and X r = X l T we have:
xl Z
x Z
T = r
f
f

or Z =

Tf
d

where d = x l x r is the disparity (i.e., the difference in the position between the corresponding points in the two images)

-7-

Stereo camera parameters


Intrinsic parameters: characterize the transformation from image plane coordinates
to pixel coordinates, in each camera.
Extrinsic parameters (R, T ): describe the relative position and orientation of the
two cameras
P r = R(P l T ) (aligns right camera with left camera)
- Can be determined from the extrinsic parameters of each camera:
R = R r RTl
T = T l RT T r

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy