Abstract
Interest point detection is an active area in computer vision due to its importance in many applications. Measuring the pixel-wise difference between image pixel intensities is the mechanism of most detectors that have been proposed in literature. Recently, interest point detectors were proposed that incorporated the histogram representation instead of image pixel intensity. In this paper, research that extends histogram-based interest point detectors is introduced. Four colour-space representations were used to construct new detectors: HSV, Opponent, Transformed and Ohta colour spaces. Several experiments were performed to evaluate the new colour histogram-based detectors and compare them with previous detectors. First, the proposed detectors were evaluated in an image-matching task. Then, we studied and evaluated the performance of some of the local image descriptors that were extracted from the interest points and regions detected by the proposed detectors. Finally, the four top-ranked descriptors in the descriptor evaluation experiments were used to evaluate the new colour histogram-based detectors in an image-classification task using different object and scene image datasets. The experimental results demonstrate that our new detectors possess an increased ability to distinguish and more robust in regards to image matching, particularly with respect to textured scene images that involve transformations, such as illumination, viewpoint and blur changes. Furthermore, the descriptor performance may change depending on the detector and data set type. The image-classification results demonstrate that the proposed detectors exhibit higher classification accuracy for certain descriptors and data sets than the other detectors.




















Similar content being viewed by others
References
Abdel-Hakim A, Farag A (2006) Csift: A sift descriptor with color invariant characteristics, In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Ashbrook AP, Thacker NA, Rockett PI, Brown CI (1995) Robust recognition of scaled shapes using pairwise geometric histograms. In: Proceedings of the British conference on Machine vision, pp 503–512
Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509 –522
Benmokhtar R (2012) Robust human action recognition scheme based on high-level feature fusion. Multimed Tools Appl:1–23
Bosch A, Zisserman A, Muoz X (2007a) Image classification using random forests and ferns. In: Proceedings of the International Conference on Computer Vision, pp 1–8
Boschm A, Zisserman A, Muoz X (2007b) Representing shape with a spatial pyramid kernel. In: Proceedings of the ACM international conference on Image and video retrieval, pp 401–408
Calonder M, Lepetit V, Strecha C, Fua P (2010) BRIEF: Binary robust independent elementary features. In: Proceedings of the European Conference on Computer Vision, Springer, pp 778–792
Comaniciu D, Ramesh V, Meer P (2000) Real-time tracking of non-rigid objects using mean shift. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 142–149
Duan X, Lin L, Chao H (2013) Discovering video shot categories by unsupervised stochastic graph partition. IEEE Trans Multimed 15(1):167–180
Freeman W, Adelson E (1991) The design and use of steerable filters. IEEE Trans Pattern Anal Mach Intell 13(9):891–906
Gabor D (1946) Theory of communication. J IEE 93:429–457
Gijsenij A, Gevers T, Van De Weijer J (2012) Improving color constancy by photometric edge weighting. IEEE Trans on Pattern Anal Mach Intell 34(5):918–929
Gool LJV, Moons T, Ungureanu D (1996) Affine photometric invariants for planar intensity patterns. In: Proceedings of the European Conference on Computer Vision, pp 642–651
Han J, Xu M, Li X, Guo L, Liu T (2013) Interactive object-based image retrieval and annotation on ipad. Multimed Tools Appl pp 1–23
Harris C, Stephens M (1988) A combined corner and edge detection. In: Proceedings of the Fourth Alvey Vision Conference, pp 147–151
Heinly J, Dunn E, Frahm JM (2012) Comparative evaluation of binary features. In: Proceedings of the European Conference on Computer Vision, Springer, pp 759–773
Kadir T, Zisserman A, Brady M (2004) An affine invariant salient region detector. In: Proceedings of the European Conference on Computer Vision, pp 228–241
Ke Y, Sukthankar R (2004) PCA-SIFT: A more distinctive representation for local image descriptors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 506–513
Kinnunen T, Laine-Hernandez M, Oittinen P (2013) Evaluating local feature detectors in salient region detection. In: Image Analysis, Lecture Notes in Computer Science, Vol 7944, Springer Berlin Heidelberg, pp 85–94
Koenderink JJ, van Doom AJ (1987) Representation of local geometry in the visual system. Biol Cybern 55(6):367–375
Lazebnik S, Schmid C, Ponce J (2003) A sparse texture representation using affine-invariant regions. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 319–324
Lazebnik S, Schmid C, Ponce J (2004) Semi-local affine parts for object recognition. In: Proceedings of the British Machine Vision Conference, pp 779–788
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Lee WT, tzong Chen H (2009) Histogram-based interest point detectors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 1590–1596
Leutenegger S, Chli M, Siegwart RY (2011) BRISK: Binary robust invariant scalable keypoints. In: Proceedings of the, Vol 2011 IEEE International Conference on Computer Vision, IEEE, pp 2548–2555
Lin L, Liu X, Zhu SC (2010) Layered graph matching with composite cluster sampling. IEEE Trans on Pattern Anal Mach Intell 32(8):1426–1442
Lin L, Luo P, Chen X, Zeng K (2012) Representing and recognizing objects with massive local image patches. Pattern Recognit 45(1):231–240
Lindeberg T (1998) Feature detection with automatic scale selection. Int J Comput Vis 30:79–116
Lindeberg T, Garding J (1997) Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure. Image Vis Comput 15(6):415–434
Lowe D (2004) Distinctive image features from scale-invariant key-points. Int J Comput Vis 60:91–110
Lowe DG (1999) Object recognition from local scale-invariant features. In: International Conference on Computer Vision, pp 1150–1157
Lukac R, Plataniotis KN (2010) Color image processing: methods and applications. CRC Press
Luo HL, Wei H, Hu FX (2011) Improvements in image categorization using codebook ensembles. Image Vis Comput 29(11):759–773
Ma Z (2008) Artificial intelligence for maximizing content based image retrieval. Information Science Reference - Imprint of: IGI Publishing, Hershey, PA
Matas J, Burianek J, Kittler J (2000) Object recognition using the invariant pixel-set signature. In: Proceedings of the British Machine Vision Conference, pp 606–615
Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767
Mikolajczyk K, Schmid C (2001) Indexing based on scale invariant interest points. In: Proceedings of the IEEE International Conference on Computer Vision, Vol 1, pp 525–531
Mikolajczyk K, Schmid C (2004) Scale & affine invariant interest point detectors. Int J Comput Vis 60:63–86
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Mikolajczyk K, Tuytelaars T, Schmid C, Zisserman A, Matas J, Schaffalitzky F, Kadir T, Gool LV (2005) A comparison of affine region detectors. Int J Comput Vis 65:43–72
Miksik O, Mikolajczyk K (2012) Evaluation of local detectors and descriptors for fast feature matching. In: Proceedings of the International Conference on Pattern Recognition, IEEE, pp 2681–2684
Mokhtarian F, Mackworth AK (1992) A theory of multiscale, curvature-based shape representation for planar curves. IEEE Trans Pattern Anal Mach Intell 14(8):789–805
Ohta YI, Kanade T, Sakai T (1980) Color information for region segmentation. Comput Graph Image Process 13(3):222–241
Oliva A, Torralba A (2001) Modeling the shape of the scene: A holistic representation of the spatial envelope. Int J Comput Vis 42:145–175
Qu Y, Wu S, Liu H, Xie Y, Wang H (2012) Evaluation of local features and classifiers in bow model for image classification. Multimed Tools Appl pp 1–20
Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: an efficient alternative to sift or surf. In: Proceedings of the, Vol 2011 IEEE International Conference on Computer Vision, IEEE, pp 2564–2571
Van de Sande K, Gevers T, Snoek C (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596
Schaffalitzky F, Zisserman A (2002) Multi-view matching for unordered image sets, or ”how do i organize my holiday snaps?”. In: Proceedings of the European Conference on Computer Vision, pp 414–431
Schmid C, Mohr R (1997) Local gray value invariants for image retrieval. IEEE Trans Pattern Anal Mach Intell 19:530–535
Schmid C, Mohr R, Bauckhage C (2000) Evaluation of interest point detectors. Int J Comput Vis 37:151–172
Shen X, Hua G, Williams L, Wu Y (2012) Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields. Image Vis Comput 30(3):227–235
Snoek CG, Van de Sande KE, Li X, Mazloom M, Jiang YG, Koelma DC, Smeulders AW (2011) The MediaMill TRECVID 2011 semantic video search engine. In: Proceedings of the 9th TRECVID Workshop
Szeliski R (2010) Computer Vision: Algorithms and applications, 1st edn. Springer
Tola E, Lepetit V, Fua P (2008) A fast local descriptor for dense matching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 1–8
Tola E, Lepetit V, Fua P (2010) DAISY: An efficient dense descriptor applied to wide baseline stereo. IEEE Trans Pattern Anal Mach Intell 32(5):815–830
Tsagaris V, Anastassopoulos V (2004) Multispectral image fusion method using perceptual attributes. In: Remote Sensing, International Society for Optics and Photonics, pp 357–367
Tuytelaars T (2010) Dense interest points. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2281 –2288
Tuytelaars T, Gool LJV (2000) Wide baseline stereo matching based on local, affinely invariant regions. In: British Machine Vision Conference
Tuytelaars T, Mikolajczyk K (2008) Local invariant feature detectors: A survey. Now Publishers Inc. Hanover, MA, USA
Tuytelaars T, Gool LJV, D’haene L, Koch R (1999) Matching of affinely invariant regions for visual servoing. In: Proceedings of the International Conference on Robotics and Automation, pp 1601–1606
Van De Weijer J, Gevers T, Bagdanov AD (2006) Boosting color saliency in image feature detection. IEEE Trans on Pattern Anal Mach Intell 28(1):150–156
Vetterli M, Kovacevic J (1995) Wavelets and subband coding, 1st edn. Prentice Hall Signal Processing Series, Prentice Hall PTR
Zhang W (2009) Image features and learning algorithms for biological, generic and social object recognition, PhD thesis, School of Electrical Engineering and Computer Science, Oregon State University
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Rassem, T.H., Khoo, B.E. Performance evaluation of new colour histogram-based interest point detectors. Multimed Tools Appl 74, 11357–11398 (2015). https://doi.org/10.1007/s11042-014-2235-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-2235-4