Skip to main content
Log in

Performance evaluation of new colour histogram-based interest point detectors

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Interest point detection is an active area in computer vision due to its importance in many applications. Measuring the pixel-wise difference between image pixel intensities is the mechanism of most detectors that have been proposed in literature. Recently, interest point detectors were proposed that incorporated the histogram representation instead of image pixel intensity. In this paper, research that extends histogram-based interest point detectors is introduced. Four colour-space representations were used to construct new detectors: HSV, Opponent, Transformed and Ohta colour spaces. Several experiments were performed to evaluate the new colour histogram-based detectors and compare them with previous detectors. First, the proposed detectors were evaluated in an image-matching task. Then, we studied and evaluated the performance of some of the local image descriptors that were extracted from the interest points and regions detected by the proposed detectors. Finally, the four top-ranked descriptors in the descriptor evaluation experiments were used to evaluate the new colour histogram-based detectors in an image-classification task using different object and scene image datasets. The experimental results demonstrate that our new detectors possess an increased ability to distinguish and more robust in regards to image matching, particularly with respect to textured scene images that involve transformations, such as illumination, viewpoint and blur changes. Furthermore, the descriptor performance may change depending on the detector and data set type. The image-classification results demonstrate that the proposed detectors exhibit higher classification accuracy for certain descriptors and data sets than the other detectors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20

Similar content being viewed by others

Notes

  1. http://www.robots.ox.ac.uk/~vgg/research/affine/

References

  1. Abdel-Hakim A, Farag A (2006) Csift: A sift descriptor with color invariant characteristics, In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

  2. Ashbrook AP, Thacker NA, Rockett PI, Brown CI (1995) Robust recognition of scaled shapes using pairwise geometric histograms. In: Proceedings of the British conference on Machine vision, pp 503–512

  3. Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509 –522

    Article  Google Scholar 

  4. Benmokhtar R (2012) Robust human action recognition scheme based on high-level feature fusion. Multimed Tools Appl:1–23

  5. Bosch A, Zisserman A, Muoz X (2007a) Image classification using random forests and ferns. In: Proceedings of the International Conference on Computer Vision, pp 1–8

  6. Boschm A, Zisserman A, Muoz X (2007b) Representing shape with a spatial pyramid kernel. In: Proceedings of the ACM international conference on Image and video retrieval, pp 401–408

  7. Calonder M, Lepetit V, Strecha C, Fua P (2010) BRIEF: Binary robust independent elementary features. In: Proceedings of the European Conference on Computer Vision, Springer, pp 778–792

  8. Comaniciu D, Ramesh V, Meer P (2000) Real-time tracking of non-rigid objects using mean shift. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 142–149

  9. Duan X, Lin L, Chao H (2013) Discovering video shot categories by unsupervised stochastic graph partition. IEEE Trans Multimed 15(1):167–180

    Article  Google Scholar 

  10. Freeman W, Adelson E (1991) The design and use of steerable filters. IEEE Trans Pattern Anal Mach Intell 13(9):891–906

    Article  Google Scholar 

  11. Gabor D (1946) Theory of communication. J IEE 93:429–457

    Google Scholar 

  12. Gijsenij A, Gevers T, Van De Weijer J (2012) Improving color constancy by photometric edge weighting. IEEE Trans on Pattern Anal Mach Intell 34(5):918–929

    Article  Google Scholar 

  13. Gool LJV, Moons T, Ungureanu D (1996) Affine photometric invariants for planar intensity patterns. In: Proceedings of the European Conference on Computer Vision, pp 642–651

  14. Han J, Xu M, Li X, Guo L, Liu T (2013) Interactive object-based image retrieval and annotation on ipad. Multimed Tools Appl pp 1–23

  15. Harris C, Stephens M (1988) A combined corner and edge detection. In: Proceedings of the Fourth Alvey Vision Conference, pp 147–151

  16. Heinly J, Dunn E, Frahm JM (2012) Comparative evaluation of binary features. In: Proceedings of the European Conference on Computer Vision, Springer, pp 759–773

  17. Kadir T, Zisserman A, Brady M (2004) An affine invariant salient region detector. In: Proceedings of the European Conference on Computer Vision, pp 228–241

  18. Ke Y, Sukthankar R (2004) PCA-SIFT: A more distinctive representation for local image descriptors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 506–513

  19. Kinnunen T, Laine-Hernandez M, Oittinen P (2013) Evaluating local feature detectors in salient region detection. In: Image Analysis, Lecture Notes in Computer Science, Vol 7944, Springer Berlin Heidelberg, pp 85–94

  20. Koenderink JJ, van Doom AJ (1987) Representation of local geometry in the visual system. Biol Cybern 55(6):367–375

    Article  MATH  Google Scholar 

  21. Lazebnik S, Schmid C, Ponce J (2003) A sparse texture representation using affine-invariant regions. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 319–324

  22. Lazebnik S, Schmid C, Ponce J (2004) Semi-local affine parts for object recognition. In: Proceedings of the British Machine Vision Conference, pp 779–788

  23. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

  24. Lee WT, tzong Chen H (2009) Histogram-based interest point detectors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 1590–1596

  25. Leutenegger S, Chli M, Siegwart RY (2011) BRISK: Binary robust invariant scalable keypoints. In: Proceedings of the, Vol 2011 IEEE International Conference on Computer Vision, IEEE, pp 2548–2555

  26. Lin L, Liu X, Zhu SC (2010) Layered graph matching with composite cluster sampling. IEEE Trans on Pattern Anal Mach Intell 32(8):1426–1442

    Article  Google Scholar 

  27. Lin L, Luo P, Chen X, Zeng K (2012) Representing and recognizing objects with massive local image patches. Pattern Recognit 45(1):231–240

    Article  MATH  Google Scholar 

  28. Lindeberg T (1998) Feature detection with automatic scale selection. Int J Comput Vis 30:79–116

    Article  Google Scholar 

  29. Lindeberg T, Garding J (1997) Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure. Image Vis Comput 15(6):415–434

    Article  Google Scholar 

  30. Lowe D (2004) Distinctive image features from scale-invariant key-points. Int J Comput Vis 60:91–110

    Article  Google Scholar 

  31. Lowe DG (1999) Object recognition from local scale-invariant features. In: International Conference on Computer Vision, pp 1150–1157

  32. Lukac R, Plataniotis KN (2010) Color image processing: methods and applications. CRC Press

  33. Luo HL, Wei H, Hu FX (2011) Improvements in image categorization using codebook ensembles. Image Vis Comput 29(11):759–773

    Article  Google Scholar 

  34. Ma Z (2008) Artificial intelligence for maximizing content based image retrieval. Information Science Reference - Imprint of: IGI Publishing, Hershey, PA

    Google Scholar 

  35. Matas J, Burianek J, Kittler J (2000) Object recognition using the invariant pixel-set signature. In: Proceedings of the British Machine Vision Conference, pp 606–615

  36. Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767

    Article  Google Scholar 

  37. Mikolajczyk K, Schmid C (2001) Indexing based on scale invariant interest points. In: Proceedings of the IEEE International Conference on Computer Vision, Vol 1, pp 525–531

  38. Mikolajczyk K, Schmid C (2004) Scale & affine invariant interest point detectors. Int J Comput Vis 60:63–86

    Article  Google Scholar 

  39. Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630

    Article  Google Scholar 

  40. Mikolajczyk K, Tuytelaars T, Schmid C, Zisserman A, Matas J, Schaffalitzky F, Kadir T, Gool LV (2005) A comparison of affine region detectors. Int J Comput Vis 65:43–72

    Article  Google Scholar 

  41. Miksik O, Mikolajczyk K (2012) Evaluation of local detectors and descriptors for fast feature matching. In: Proceedings of the International Conference on Pattern Recognition, IEEE, pp 2681–2684

  42. Mokhtarian F, Mackworth AK (1992) A theory of multiscale, curvature-based shape representation for planar curves. IEEE Trans Pattern Anal Mach Intell 14(8):789–805

    Article  Google Scholar 

  43. Ohta YI, Kanade T, Sakai T (1980) Color information for region segmentation. Comput Graph Image Process 13(3):222–241

    Article  Google Scholar 

  44. Oliva A, Torralba A (2001) Modeling the shape of the scene: A holistic representation of the spatial envelope. Int J Comput Vis 42:145–175

    Article  MATH  Google Scholar 

  45. Qu Y, Wu S, Liu H, Xie Y, Wang H (2012) Evaluation of local features and classifiers in bow model for image classification. Multimed Tools Appl pp 1–20

  46. Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: an efficient alternative to sift or surf. In: Proceedings of the, Vol 2011 IEEE International Conference on Computer Vision, IEEE, pp 2564–2571

  47. Van de Sande K, Gevers T, Snoek C (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596

    Article  Google Scholar 

  48. Schaffalitzky F, Zisserman A (2002) Multi-view matching for unordered image sets, or ”how do i organize my holiday snaps?”. In: Proceedings of the European Conference on Computer Vision, pp 414–431

  49. Schmid C, Mohr R (1997) Local gray value invariants for image retrieval. IEEE Trans Pattern Anal Mach Intell 19:530–535

    Article  Google Scholar 

  50. Schmid C, Mohr R, Bauckhage C (2000) Evaluation of interest point detectors. Int J Comput Vis 37:151–172

    Article  MATH  Google Scholar 

  51. Shen X, Hua G, Williams L, Wu Y (2012) Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields. Image Vis Comput 30(3):227–235

    Article  Google Scholar 

  52. Snoek CG, Van de Sande KE, Li X, Mazloom M, Jiang YG, Koelma DC, Smeulders AW (2011) The MediaMill TRECVID 2011 semantic video search engine. In: Proceedings of the 9th TRECVID Workshop

  53. Szeliski R (2010) Computer Vision: Algorithms and applications, 1st edn. Springer

  54. Tola E, Lepetit V, Fua P (2008) A fast local descriptor for dense matching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 1–8

  55. Tola E, Lepetit V, Fua P (2010) DAISY: An efficient dense descriptor applied to wide baseline stereo. IEEE Trans Pattern Anal Mach Intell 32(5):815–830

    Article  Google Scholar 

  56. Tsagaris V, Anastassopoulos V (2004) Multispectral image fusion method using perceptual attributes. In: Remote Sensing, International Society for Optics and Photonics, pp 357–367

  57. Tuytelaars T (2010) Dense interest points. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2281 –2288

  58. Tuytelaars T, Gool LJV (2000) Wide baseline stereo matching based on local, affinely invariant regions. In: British Machine Vision Conference

  59. Tuytelaars T, Mikolajczyk K (2008) Local invariant feature detectors: A survey. Now Publishers Inc. Hanover, MA, USA

    Google Scholar 

  60. Tuytelaars T, Gool LJV, D’haene L, Koch R (1999) Matching of affinely invariant regions for visual servoing. In: Proceedings of the International Conference on Robotics and Automation, pp 1601–1606

  61. Van De Weijer J, Gevers T, Bagdanov AD (2006) Boosting color saliency in image feature detection. IEEE Trans on Pattern Anal Mach Intell 28(1):150–156

    Article  Google Scholar 

  62. Vetterli M, Kovacevic J (1995) Wavelets and subband coding, 1st edn. Prentice Hall Signal Processing Series, Prentice Hall PTR

  63. Zhang W (2009) Image features and learning algorithms for biological, generic and social object recognition, PhD thesis, School of Electrical Engineering and Computer Science, Oregon State University

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bee Ee Khoo.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rassem, T.H., Khoo, B.E. Performance evaluation of new colour histogram-based interest point detectors. Multimed Tools Appl 74, 11357–11398 (2015). https://doi.org/10.1007/s11042-014-2235-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-2235-4

Keywords

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy