12 CS1AC16 Detection and Tracking
The aim of object detection is for automated systems to discover objects of interest
in the scene.
Detected objects can then be passed to other subsystems for further processing.
What counts as interesting depends on the application of the system.
Detectors take images as inputs and generate target positions: a bounding box containing each
object.
Change detection
Instead of using the first frame as the reference, we can use the previous frame: subtracting it
from the current frame shows what has changed in the scene since that frame.
In this way we get continuous updates of what has changed in the picture, and we do not need to
worry about whether the first frame was a good reference.
It is important to threshold the result: this suppresses small changes such as trees swaying in the wind.
A badly chosen threshold can discard objects of interest (too high) or let such clutter through (too low).
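The change detection idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: frames are small greyscale images stored as nested lists, the function names change_mask and bounding_box are made up for this example, and the threshold of 30 is an arbitrary value, not one given in these notes.

```python
def change_mask(previous, current, threshold):
    """Return a binary mask: 1 where the absolute pixel difference
    between the two frames exceeds the threshold, 0 elsewhere."""
    return [
        [1 if abs(c - p) > threshold else 0 for p, c in zip(prev_row, cur_row)]
        for prev_row, cur_row in zip(previous, current)
    ]


def bounding_box(mask):
    """Return (top, left, bottom, right) enclosing all changed pixels,
    or None if nothing changed."""
    coords = [(y, x) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    if not coords:
        return None
    ys = [y for y, _ in coords]
    xs = [x for _, x in coords]
    return (min(ys), min(xs), max(ys), max(xs))


# Hypothetical 4x5 greyscale frames: a bright object appears in the
# current frame, alongside two tiny fluctuations (values 12 and 9).
previous = [[10] * 5 for _ in range(4)]
current = [
    [10, 12, 10, 10, 10],
    [10, 10, 200, 210, 10],
    [10, 10, 205, 10, 10],
    [10, 10, 10, 10, 9],
]

mask = change_mask(previous, current, threshold=30)
print(bounding_box(mask))  # (1, 2, 2, 3)
```

With a threshold of 0 the two small fluctuations would also be marked as changes and the box would grow to cover almost the whole frame, which is the kind of clutter thresholding is meant to remove.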
Trackers take positional data and generate a trajectory – a description of movement through the scene
There are 2 types of trackers:
Single target – following a single object
Multi target – following many objects without mixing them up
One of the simplest trackers is the distance minimization tracker: each target in the current frame
is matched to a target in the previous frame so that the total distance moved is minimized.
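As a rough illustration of distance minimization, the sketch below tries every possible pairing and keeps the one with the smallest total distance. It assumes both frames contain the same number of targets and that positions are (x, y) points; the function name match_targets and the sample coordinates are invented for this example. Trying every pairing is only practical for a handful of targets.

```python
from itertools import permutations
from math import dist


def match_targets(previous, current):
    """Pair each current target with a previous target so that the
    total Euclidean distance is minimal (brute force over all pairings).

    Returns (matches, total) where matches[i] is the index of the
    previous target assigned to current target i."""
    best_total, best_perm = None, None
    for perm in permutations(range(len(previous))):
        total = sum(dist(current[i], previous[j]) for i, j in enumerate(perm))
        if best_total is None or total < best_total:
            best_total, best_perm = total, perm
    return list(best_perm), best_total


# Hypothetical positions: the detector reports the two targets in a
# different order in the current frame.
previous = [(0, 0), (10, 0)]
current = [(9, 1), (1, 1)]

matches, total = match_targets(previous, current)
print(matches)  # [1, 0]
```

Here current target 0 is matched to previous target 1 and vice versa, so the two trajectories are not mixed up even though the detection order changed.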
Hungarian method – an algorithm for finding the pairing with the minimum total distance
Set up: make a table of the distances between all possible pairs.
Step 1: For each row, find the minimum value and subtract it from every element in that row (if
each column contains exactly one 0, finish at this step).
Step 2: For each column, find the minimum value and subtract it from every element in that column
(if each row contains exactly one 0, finish).
Step 3: Cover the grid with the minimum number of lines so that all 0s are covered (if the
number of lines equals the number of rows/columns, stop).
Step 4: Find the lowest value among all uncovered elements. Subtract that value from all
uncovered elements and add it to every element covered by 2 lines. Then return to Step 3.
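The steps above can be sketched in Python as follows. This is a minimal illustration, not a reference implementation. All function names are hypothetical. The minimum line cover in Step 3 is found with a standard maximum-matching construction (König's theorem), which these notes do not specify. Integer distances and a square table are assumed so that the tests for 0 are exact.

```python
def reduce_rows(m):
    # Step 1: subtract each row's minimum from that row.
    return [[v - min(row) for v in row] for row in m]


def reduce_cols(m):
    # Step 2: subtract each column's minimum from that column.
    mins = [min(col) for col in zip(*m)]
    return [[v - mins[j] for j, v in enumerate(row)] for row in m]


def max_zero_matching(m):
    # Largest set of 0s with no two in the same row or column.
    # match_col[j] is the row matched to column j, or -1.
    n = len(m)
    match_col = [-1] * n

    def try_row(i, seen):
        for j in range(n):
            if m[i][j] == 0 and j not in seen:
                seen.add(j)
                if match_col[j] == -1 or try_row(match_col[j], seen):
                    match_col[j] = i
                    return True
        return False

    for i in range(n):
        try_row(i, set())
    return match_col


def min_line_cover(m, match_col):
    # Step 3: fewest row/column lines covering every 0 (assumed
    # construction: alternating search from the unmatched rows).
    n = len(m)
    matched_rows = {r for r in match_col if r != -1}
    seen_rows = {i for i in range(n) if i not in matched_rows}
    seen_cols = set()
    stack = list(seen_rows)
    while stack:
        i = stack.pop()
        for j in range(n):
            if m[i][j] == 0 and j not in seen_cols:
                seen_cols.add(j)
                r = match_col[j]
                if r != -1 and r not in seen_rows:
                    seen_rows.add(r)
                    stack.append(r)
    return set(range(n)) - seen_rows, seen_cols


def adjust(m, rows, cols):
    # Step 4: subtract the smallest uncovered value from uncovered
    # elements and add it to elements covered by two lines.
    n = len(m)
    k = min(m[i][j] for i in range(n) for j in range(n)
            if i not in rows and j not in cols)
    out = [row[:] for row in m]
    for i in range(n):
        for j in range(n):
            if i not in rows and j not in cols:
                out[i][j] -= k
            elif i in rows and j in cols:
                out[i][j] += k
    return out


def hungarian(cost):
    # Returns a list of (row, column) pairs with minimum total cost.
    n = len(cost)
    m = reduce_cols(reduce_rows(cost))
    while True:
        match_col = max_zero_matching(m)
        rows, cols = min_line_cover(m, match_col)
        if len(rows) + len(cols) == n:  # stop condition of Step 3
            return sorted((r, j) for j, r in enumerate(match_col))
        m = adjust(m, rows, cols)


# Hypothetical 3x3 distance table (rows and columns are the two sets
# of targets being paired).
cost = [[4, 1, 3],
        [2, 0, 5],
        [3, 2, 2]]
pairs = hungarian(cost)
print(pairs)                              # [(0, 1), (1, 0), (2, 2)]
print(sum(cost[r][c] for r, c in pairs))  # 5
```

In this example the first pass through Step 3 needs only 2 lines, so Step 4 runs once before the 0s can be covered with 3 lines and the pairing is read off. The sketch does not use the early-finish checks of Steps 1 and 2; it always applies the Step 3 test, which covers those cases as well.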