Skip to main content

Fast shape recognition via a bi-level restraint reduction of contour coding

  • Original article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

Shape recognition is an active research topic in the field of computer vision and graphic computing. Nevertheless, existing methods are still poor in accuracy and efficiency in some extent, which greatly limits their application in computer vision system. This paper investigates the restraint of feature structure that intrinsically deteriorates recognition performance. Furthermore, we propose a fast shape recognition method based on a bi-level restraint reduction of contour coding (CC2RR), which provides more effective theoretical support for the practical application of the visual algorithm. CC2RR reduces restraint performed from contour feature extraction and expression, respectively. First, for shape contour, the restraint of contour feature extraction is reduced by transforming the direction of contour points to contour segments; second, for the encoded contour segment, the restraint of the contour feature expression is reduced; in other words, the current direction is reduced to the previous and the next direction. Guided by these insights, Hamming code distance is used to match the coding features after the twofold restraint reduction, and the results are obtained. Experimental results verify that the method significantly improves the performance, which runs up to 500 times faster than the existing description methods based on shape contours while increasing robustness. This makes the method useful in practical software system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

Data available on request from the authors. The data that support the findings of this study are available from the corresponding author, upon reasonable request.

References

  1. Kera, S.B., Tadepalli, A., Ranjani, J.J.: A paced multi-stage block-wise approach for object detection in thermal images. Vis. Comput. 1–17 (2022)

  2. Liu, Z., Xiang, Q., Tang, J., Wang, Y., Zhao, P.: Robust salient object detection for RGB images. Vis. Comput. 36, 1823–1835 (2020)

    Google Scholar 

  3. An, F.-P., Liu, J.-E., Bai, L.: Object recognition algorithm based on optimized nonlinear activation function-global convolutional neural network. Vis. Comput. 1–13 (2022)

  4. Kumar, A.K., Ngoc Mai, N., Guo, S., Han, L.: Entanglement inspired approach for determining the preeminent arrangement of static cameras in a multi-view computer vision system. Vis. Comput. 1–17 (2022)

  5. Yan, W., Yang, J.: Multi-part shape matching by simultaneous partial functional correspondence. Vis. Comput. 39(1), 393–412 (2023)

    Google Scholar 

  6. Ling, H., Yang, X., Latecki, L.J.: Balancing deformability and discriminability for shape matching. In: European Conference on Computer Vision, pp. 411–424. Springer (2010)

  7. Bandyopadhyay, S.K., Kondi, L.P.: Optimal bit allocation for joint texture-aware contour-based shape coding and shape-adaptive texture coding. IEEE Trans. Circuits Syst. Video Technol. 18(6), 840–844 (2008)

    Google Scholar 

  8. Katsaggelos, A.K., Kondi, L.P., Meier, F.W., Ostermann, J., Schuster, G.M.: Mpeg-4 and rate-distortion-based shape-coding techniques. Proc. IEEE 86(6), 1126–1154 (1998)

    Google Scholar 

  9. T ITU-T. Information technology-coded representation of picture and audio information-progressive bi-level image compression. Recommendation (1993)

  10. ITUT Recommendation. Information technology–coded representation of picture and audio information–progressive bi-level image compression. T82 (JBIG)

  11. Organisation Internationale de Normalisation. Information technology–generic coding of audio-visual objects–part 2: visual (1999)

  12. Martin, K., Lukac, R., Plataniotis, K.N.: Spiht-based coding of the shape and texture of arbitrarily shaped visual objects. IEEE Trans. Circuits Syst. Video Technol. 16(10), 1196–1208 (2006)

    Google Scholar 

  13. Schuster, G.M., Katsaggelos, A.K.: An optimal polygonal boundary encoding scheme in the rate distortion sense. IEEE Trans. Image Process. 7(1), 13–26 (1998)

    MathSciNet  Google Scholar 

  14. Kim, K.J., Suh, J.-Y., Kang, M.G.: Generalized interframe vertex-based shape encoding scheme for video sequences. IEEE Trans. Image Process. 9(10), 1667–1676 (2000)

    Google Scholar 

  15. Nunes, P., Marqués, F., Pereira, F., Gasull, A.: A contour-based approach to binary shape coding using a multiple grid chain code. Signal Process. Image Commun. 15(7–8), 585–599 (2000)

    Google Scholar 

  16. Liu, Q., Ngan, K.N.: Arbitrarily shaped object coding based on h. 264/avc. In: 2009 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), pp. 343–346. IEEE (2009)

  17. Wulandhari, L.A., Haron, H.: Characteristic of rectangular vertex chain code for shapes with hole. In: 2009 International Conference on Information Management and Engineering, pp. 648–650. IEEE (2009)

  18. Lai, H., Zuo, Z., Wang, Z., Yao, Z., Liu, W.: Accurate distortion measurement for b-spline-based shape coding. In: 2011 18th IEEE International Conference on Image Processing, pp. 225–228. IEEE (2011)

  19. Aghito, S.M., Forchhammer, S.: Context-based coding of bilevel images enhanced by digital straight line analysis. IEEE Trans. Image Process. 15(8), 2120–2130 (2006)

    Google Scholar 

  20. Luo, H.: Image-dependent shape coding and representation. IEEE Trans. Circuits Syst. Video Technol. 15(3), 345–354 (2005)

    MathSciNet  Google Scholar 

  21. Bai, X., Wang, B., Yao, C., Liu, W., Zhuowen, T.: Co-transduction for shape retrieval. IEEE Trans. Image Process. 21(5), 2747–2757 (2011)

    MathSciNet  Google Scholar 

  22. Bai, X., Yang, X., Latecki, L.J., Liu, W., Zhuowen, T.: Learning context-sensitive shape similarity by graph transduction. IEEE Trans. Pattern Anal. Mach. Intell. 32(5), 861–874 (2009)

    Google Scholar 

  23. Yang, X., Koknar-Tezel, S., Latecki, L.J.: Locally constrained diffusion process on locally densified distance spaces with applications to shape retrieval. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 357–364. IEEE (2009)

  24. Bai, X., Liu, W., Tu, Z.: Integrating contour and skeleton for shape classification. In: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, pp. 360–367. IEEE (2009)

  25. Bai, S., Bai, X.: Sparse contextual activation for efficient visual re-ranking. IEEE Trans. Image Process. 25(3), 1056–1069 (2016)

    MathSciNet  Google Scholar 

  26. Matousek, J: Lectures on Discrete Geometry, vol. 212. Springer (2013)

  27. Eden, M., Kocher, M., Eurasip, M.: On the performance of a contour coding algorithm in the context of image coding part I: contour segment coding. Signal Process. 8(4), 381–386 (1985)

    Google Scholar 

  28. Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)

    Google Scholar 

  29. Ling, H., Jacobs, D.W.: Shape classification using the inner-distance. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 286–299 (2007)

    Google Scholar 

  30. Biswas, S., Aggarwal, G., Chellappa, R.: Efficient indexing for articulation invariant shape matching and retrieval. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2007)

  31. Lai, Z., Zhu, J., Luo, J.: Operationally optimal vertex-based shape coding with arbitrary direction edge encoding structures. J. Electron. Imaging 23(4), 043009–043009 (2014)

    Google Scholar 

  32. Chang, C.-C., Hwang, S.M., Buehrer, D.J.: A shape recognition scheme based on relative distances of feature points from the centroid. Pattern Recognit. 24(11), 1053–1063 (1991)

    Google Scholar 

  33. Shu, X., Xiao-Jun, W.: A novel contour descriptor for 2d shape matching and its application to image retrieval. Image Vis. Comput. 29(4), 286–294 (2011)

    Google Scholar 

  34. Wang, B., Gao, Y., Sun, C., Blumenstein, M., La Salle, J.: Chord bunch walks for recognizing naturally self-overlapped and compound leaves. IEEE Trans. Image Process. 28(12), 5963–5976 (2019)

    MathSciNet  Google Scholar 

  35. Wang, B., Gao, Y., Sun, C., Blumenstein, M., La Salle, J.:. Can walking and measuring along chord bunches better describe leaf shapes? In: CVPR, pp. 2047–2056. IEEE Computer Society (2017)

  36. Kaothanthong, N., Chun, J., Tokuyama, T.: Distance interior ratio: a new shape signature for 2d shape retrieval. Pattern Recognit. Lett. 78, 14–21 (2016)

    Google Scholar 

  37. Zheng, Y., Guo, B., Chen, Z., Li, C.: A Fourier descriptor of 2d shapes based on multiscale centroid contour distances used in object recognition in remote sensing images. Sensors 19(3), 486 (2019)

    Google Scholar 

  38. Bicego, M., Lovato, P.: A bioinformatics approach to 2d shape classification. Comput. Vis. Image Underst. 145, 59–69 (2016)

    Google Scholar 

  39. Yang, C., Wei, H., Qian, Yu.: A novel method for 2d nonrigid partial shape matching. Neurocomputing 275, 1160–1176 (2018)

    Google Scholar 

  40. Rongxiang, H., Jia, W., Ling, H., Huang, D.: Multiscale distance matrix for fast plant leaf recognition. IEEE Trans. Image Process. 21(11), 4667–4672 (2012)

    MathSciNet  Google Scholar 

  41. Li, Z., Guo, B., Meng, F.: Fast shape recognition method using feature richness based on the walking minimum bounding rectangle over an occluded remote sensing target. Remote Sens. 14(22), 5845 (2022)

    Google Scholar 

  42. Wang, B., Gao, Y., Sun, C., Blumenstein, M., La Salle, J.: Chord bunch walks for recognizing naturally self-overlapped and compound leaves. IEEE Trans. Image Process. 28(12), 5963–5976 (2019)

    MathSciNet  Google Scholar 

  43. Yang, C., Wei, H., Qian, Yu.: A novel method for 2d nonrigid partial shape matching. Neurocomputing 275, 1160–1176 (2018)

    Google Scholar 

  44. Zhang, J., Wenyin, L.: A pixel-level statistical structural descriptor for shape measure and recognition. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 386–390. IEEE (2009)

  45. Daliri, M.R., Torre, V.: Robust symbolic representation for shape recognition and retrieval. Pattern Recognit. 41(5), 1782–1798 (2008)

    Google Scholar 

  46. Hoyer, P.O., Hyvärinen, A.: A multi-layer sparse coding network learns contour coding from natural images. Vis. Res. 42(12), 1593–1605 (2002)

    Google Scholar 

  47. Graham, D.N.: Image transmission by two-dimensional contour coding. Proc. IEEE 55(3), 336–346 (1967)

    Google Scholar 

  48. Grattoni, P., Guiducci, A.: Contour coding for image description. Pattern Recognit. Lett. 11(2), 95–105 (1990)

    Google Scholar 

  49. Zheng, A., Cheung, G., Florencio, D.: Context tree-based image contour coding using a geometric prior. IEEE Trans. Image Process. 26(2), 574–589 (2016)

    MathSciNet  Google Scholar 

  50. Mokhtarian, F., Mackworth, A.K.: A theory of multiscale, curvature-based shape representation for planar curves. IEEE Trans. Pattern Anal. Mach. Intell. 14(8), 789–805 (1992)

    Google Scholar 

  51. Zhang, S., Yang, M., Cour, T., Kai, Yu., Metaxas, D.N.: Query specific rank fusion for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 37(4), 803–815 (2014)

    Google Scholar 

  52. Felzenszwalb, P.F., Schwartz, J.D.: Hierarchical matching of deformable shapes. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2007)

  53. Luo, S., Mou, W., Althoefer, K., Liu, H.: Novel tactile-sift descriptor for object shape recognition. IEEE Sens. J. 15(9), 5001–5009 (2015)

    Google Scholar 

  54. Peng, H.-L., Chen, S.-Y.: Trademark shape recognition using closed contours. Pattern Recognit. Lett. 18(8), 791–803 (1997)

    Google Scholar 

  55. Grigorescu, C., Petkov, N.: Distance sets for shape filters and shape recognition. IEEE Trans. Image Process. 12(10), 1274–1286 (2003)

    MathSciNet  Google Scholar 

  56. Ross, M., Wade, T.D.: Shape and weight concern and self-esteem as mediators of externalized self-perception, dietary restraint and uncontrolled eating. Eur. Eat.Disorders Rev. Prof. J. Eat. Disord. Assoc. 12(2), 129–136 (2004)

    Google Scholar 

  57. Tarr, M.J., Pinker, S.: Mental rotation and orientation-dependence in shape recognition. Cogn. Psychol. 21(2), 233–282 (1989)

    Google Scholar 

  58. Deutsch, J.A.: A theory of shape recognition. Br. J. Psychol. 46(1), 30–37 (1955)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Grant 62171341 (Corresponding author: Baolong Guo) and Natural Science Basic Research Program of Shaanxi Province of China under Grant 2020JM-196 (Corresponding author: Fanjie Meng).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Baolong Guo.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Z., Guo, B., Meng, F. et al. Fast shape recognition via a bi-level restraint reduction of contour coding. Vis Comput 40, 2599–2614 (2024). https://doi.org/10.1007/s00371-023-02940-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-023-02940-9

Keywords

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy