A62 Vocabulary Tree
• Recognition can scale to very large databases using the Vocabulary Tree indexing approach [Nistér and Stewénius, CVPR 2006]. The Vocabulary Tree performs instance-level object recognition; it does not perform recognition at the category level.
• The Vocabulary Tree method follows three steps:
1. Organize the local descriptors of the database images in a tree using hierarchical k-means clustering; inverted files with scores are stored at each node (offline).
2. Generate a score for a given query image based on Term Frequency–Inverse Document Frequency (TF-IDF).
3. Find the images in the database that best match that score.
• The vocabulary tree supports very efficient retrieval: it only cares about the distance between a query feature and each node.
Building the Vocabulary Tree
• The vocabulary tree is a hierarchical set of cluster centers and their corresponding Voronoi regions:
− For each image in the database, extract MSER regions and calculate a set of feature point descriptors (e.g. 128-dimensional SIFT).
− Build the vocabulary tree using hierarchical k-means clustering (see the sketch after this list):
• Run k-means recursively on each of the resulting quantization cells, up to a maximum number of levels L (L = 6 suggested as a maximum).
• Nodes are the centroids; leaves are the visual words.
• k defines the branch factor of the tree, i.e. how fast the tree branches (k = 10 suggested as a maximum).
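A minimal sketch of the tree-building step in Python, assuming descriptors are stacked in a NumPy array and using scikit-learn's KMeans for each split; the Node class and function names are illustrative, not from the paper's implementation:

```python
import numpy as np
from sklearn.cluster import KMeans

class Node:
    def __init__(self, centroid=None):
        self.centroid = centroid    # cluster center of this Voronoi cell
        self.children = []          # empty list => leaf (visual word)
        self.inverted_file = {}     # image_id -> term frequency (filled offline)

def build_tree(descriptors, k=10, L=6, level=0, centroid=None):
    """Recursively split the descriptors into k cells, up to L levels."""
    node = Node(centroid)
    # Stop at the maximum depth or when the cell is too small to split.
    if level == L or len(descriptors) < k:
        return node
    km = KMeans(n_clusters=k, n_init=4).fit(descriptors)
    for j in range(k):
        cell = descriptors[km.labels_ == j]   # points of the j-th child cell
        node.children.append(
            build_tree(cell, k, L, level + 1, km.cluster_centers_[j]))
    return node

# Example: a small tree over random 128-D descriptors (k=3, L=2 for speed)
descs = np.random.rand(5000, 128).astype(np.float32)
root = build_tree(descs, k=3, L=2)
```

With k = 10 and L = 6 this yields up to one million leaves, i.e. one million visual words.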
• A large number of elliptical regions are extracted from the image and warped to canonical positions. A descriptor vector is computed for each region. The descriptor vector is then hierarchically quantized by the vocabulary tree.
• With each node in the vocabulary tree there is an associated inverted file with references to the images containing an instance of that node.

[Embedded paper: "Scalable Recognition with a Vocabulary Tree", David Nistér and Henrik Stewénius, Center for Visualization and Virtual Environments, Department of Computer Science, University of Kentucky. http://www.vis.uky.edu/∼dnister/ , http://www.vis.uky.edu/∼stewe/]
Hierarchical k-means clustering

[Figure: hierarchical k-means clustering performed with branch factor k = 3, shown at levels L = 1 through L = 4. Slides from D. Nistér.]
• Adding an image to the database requires the following steps:
‒ Image feature descriptors are computed.
‒ Each descriptor vector is dropped down from the root of the tree and quantized into a path down the tree.
• In the online phase, each descriptor vector is propagated down the tree by comparing, at each level, the descriptor vector to the k candidate cluster centers (represented by the k children in the tree) and choosing the closest one.
• k dot products are performed at each level, resulting in a total of kL dot products, which is very efficient if k is not too large. The path down the tree can be encoded by a single integer and is then available for use in scoring (see the sketch below).
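A minimal sketch of the quantization step, reusing the Node class from the tree-building sketch above; Euclidean distance to the k children stands in for the paper's dot products against normalized centroids (equivalent up to normalization), and the path is packed into a single base-k integer:

```python
import numpy as np

def quantize(root, descriptor, k=10):
    """Propagate one descriptor down the tree; return visited nodes + path code."""
    path_nodes, code = [], 0
    node = root
    while node.children:
        # k comparisons per level, kL in total for a tree of depth L.
        dists = [np.linalg.norm(descriptor - child.centroid)
                 for child in node.children]
        j = int(np.argmin(dists))
        code = code * k + j          # append one base-k digit per level
        node = node.children[j]
        path_nodes.append(node)
    return path_nodes, code
```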
• The relevance of a database image to the query image is determined by how similar the paths down the vocabulary tree are for the descriptors from the database image and the query image. The scheme assigns weights to the tree nodes and defines relevance scores associated with images.
From the paper: the cost of quantizing a descriptor is logarithmic in the number of leaf nodes, while memory usage is linear in the number of leaf nodes $k^L$. The total number of descriptor vectors that must be represented is $\sum_{i=1}^{L} k^i = \frac{k^{L+1}-k}{k-1} \approx k^L$. For $D$-dimensional descriptors represented as char, the size of the tree is approximately $D k^L$ bytes. A tree with $D = 128$, $L = 6$ and $k = 10$, resulting in 1M leaf nodes, uses 143 MB of memory.

[Figure: paths down the tree for one image with 400 features. Figure 3 of the paper shows three levels of a vocabulary tree with branch factor 10.]
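A quick check of the memory estimate, in plain Python arithmetic, reproduces the figure quoted above:

```python
# D*k^L bytes for char-valued descriptors, counting one D-byte
# centroid per node of the tree.
D, k, L = 128, 10, 6                    # descriptor dim, branch factor, depth
n_nodes = (k**(L + 1) - k) // (k - 1)   # sum_{i=1..L} k^i
print(n_nodes)                          # 1111110, i.e. ~1M leaves plus inner nodes
print(D * n_nodes / 1e6)                # ~142.2 MB, matching the ~143 MB quoted
```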
Scoring

• At each node $i$ a weight $w_i$ is assigned, which can be defined according to one of several schemes:
− a constant weighting scheme: $w_i = k$
− an entropy weighting scheme (inverse document frequency): $w_i = \log\left(\frac{N}{N_i}\right)$,
where $N$ is the number of database images and $N_i$ is the number of images with at least one descriptor vector path through node $i$.
• It is possible to use stop lists, where $w_i$ is set to zero for the most frequent and/or infrequent symbols (see the weighting sketch below).
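A minimal sketch of the entropy (IDF) weighting with an optional stop list, assuming the tree and inverted files from the earlier sketches; the all_nodes helper and stop_fraction parameter are illustrative additions, not from the paper:

```python
import math

def all_nodes(root):
    """Collect every node of the tree by iterative traversal."""
    stack, out = [root], []
    while stack:
        node = stack.pop()
        out.append(node)
        stack.extend(node.children)
    return out

def assign_weights(root, N, stop_fraction=0.0):
    """N is the number of database images; assigns w_i = log(N / N_i)."""
    nodes = all_nodes(root)
    for node in nodes:
        Ni = len(node.inverted_file)    # images with a path through this node
        node.weight = math.log(N / Ni) if Ni else 0.0
    # Optional stop list: zero out the weights of the most frequent nodes.
    if stop_fraction > 0:
        by_freq = sorted(nodes, key=lambda n: len(n.inverted_file), reverse=True)
        for node in by_freq[:int(stop_fraction * len(nodes))]:
            node.weight = 0.0
```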
Node score
• Each database image is given a relevance score based on the L1-normalized difference between the query and database TF-IDF vectors: the query and database vectors $q$ and $d$ have components $q_i = n_i w_i$ and $d_i = m_i w_i$, where $n_i$ and $m_i$ are the number of descriptor vectors of the query and the database image, respectively, with a path through node $i$.
• Scores for the images in the database are accumulated; the winner is the database image that shares the most information with the query image (see the scoring sketch below).
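A minimal sketch of the relevance score under L1 normalization, with the TF-IDF vectors stored as sparse dicts mapping node id to $w_i$-weighted counts. For normalized vectors, only nodes present in both images move the distance away from its maximum of 2, which is what makes the inverted files effective here:

```python
def l1_normalize(v):
    """Normalize a non-empty sparse vector {node_id: value} to unit L1 norm."""
    s = sum(abs(x) for x in v.values())
    return {i: x / s for i, x in v.items()}

def relevance_score(q, d):
    """L1 distance between normalized sparse vectors; 0 = identical, 2 = disjoint."""
    q, d = l1_normalize(q), l1_normalize(d)
    s = 2.0
    for i in q:
        if i in d:   # only nodes shared by query and database image contribute
            s += abs(q[i] - d[i]) - abs(q[i]) - abs(d[i])
    return s
```

The database image with the lowest score is the best match.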
Inverted file index
• To implement scoring efficiently, an inverted file index is associated with each node of the vocabulary tree (the inverted file of an inner node is the concatenation of its children's inverted files).
• Inverted files at each node store the id-numbers of the images in which that particular node occurs, together with the term frequency for each image. When a new image is added, indexes back to it are appended to the relevant inverted files (see the sketch below).
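A minimal sketch of updating the inverted files when an image is added, reusing the quantize helper from the earlier sketch; each visited node records the image id and increments its term frequency:

```python
def add_image(root, image_id, descriptors, k=10):
    """Index a new database image: one tree path per descriptor."""
    for desc in descriptors:
        path_nodes, _ = quantize(root, desc, k)
        for node in path_nodes:
            # id-number of the new image and its term frequency at this node
            node.inverted_file[image_id] = node.inverted_file.get(image_id, 0) + 1
```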
• The performance of the vocabulary tree depends largely on its structure. The most important factors in making the method effective are:
− A large vocabulary tree (16M words, against the 10K of Video Google).
− Using informative features rather than uniform ones: compute the information gain of features and select the most informative ones to build the tree (i.e. features found in all images of a location, and features not found in any image of another location).
Performance figures on 6376 images

• Performance increases with the branch factor k.
• Performance increases when the amount of training data grows.
From Tommasi