Abstract
Shape recognition is an active research topic in the field of computer vision and graphic computing. Nevertheless, existing methods are still poor in accuracy and efficiency in some extent, which greatly limits their application in computer vision system. This paper investigates the restraint of feature structure that intrinsically deteriorates recognition performance. Furthermore, we propose a fast shape recognition method based on a bi-level restraint reduction of contour coding (CC2RR), which provides more effective theoretical support for the practical application of the visual algorithm. CC2RR reduces restraint performed from contour feature extraction and expression, respectively. First, for shape contour, the restraint of contour feature extraction is reduced by transforming the direction of contour points to contour segments; second, for the encoded contour segment, the restraint of the contour feature expression is reduced; in other words, the current direction is reduced to the previous and the next direction. Guided by these insights, Hamming code distance is used to match the coding features after the twofold restraint reduction, and the results are obtained. Experimental results verify that the method significantly improves the performance, which runs up to 500 times faster than the existing description methods based on shape contours while increasing robustness. This makes the method useful in practical software system.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Data available on request from the authors. The data that support the findings of this study are available from the corresponding author, upon reasonable request.
References
Kera, S.B., Tadepalli, A., Ranjani, J.J.: A paced multi-stage block-wise approach for object detection in thermal images. Vis. Comput. 1–17 (2022)
Liu, Z., Xiang, Q., Tang, J., Wang, Y., Zhao, P.: Robust salient object detection for RGB images. Vis. Comput. 36, 1823–1835 (2020)
An, F.-P., Liu, J.-E., Bai, L.: Object recognition algorithm based on optimized nonlinear activation function-global convolutional neural network. Vis. Comput. 1–13 (2022)
Kumar, A.K., Ngoc Mai, N., Guo, S., Han, L.: Entanglement inspired approach for determining the preeminent arrangement of static cameras in a multi-view computer vision system. Vis. Comput. 1–17 (2022)
Yan, W., Yang, J.: Multi-part shape matching by simultaneous partial functional correspondence. Vis. Comput. 39(1), 393–412 (2023)
Ling, H., Yang, X., Latecki, L.J.: Balancing deformability and discriminability for shape matching. In: European Conference on Computer Vision, pp. 411–424. Springer (2010)
Bandyopadhyay, S.K., Kondi, L.P.: Optimal bit allocation for joint texture-aware contour-based shape coding and shape-adaptive texture coding. IEEE Trans. Circuits Syst. Video Technol. 18(6), 840–844 (2008)
Katsaggelos, A.K., Kondi, L.P., Meier, F.W., Ostermann, J., Schuster, G.M.: Mpeg-4 and rate-distortion-based shape-coding techniques. Proc. IEEE 86(6), 1126–1154 (1998)
T ITU-T. Information technology-coded representation of picture and audio information-progressive bi-level image compression. Recommendation (1993)
ITUT Recommendation. Information technology–coded representation of picture and audio information–progressive bi-level image compression. T82 (JBIG)
Organisation Internationale de Normalisation. Information technology–generic coding of audio-visual objects–part 2: visual (1999)
Martin, K., Lukac, R., Plataniotis, K.N.: Spiht-based coding of the shape and texture of arbitrarily shaped visual objects. IEEE Trans. Circuits Syst. Video Technol. 16(10), 1196–1208 (2006)
Schuster, G.M., Katsaggelos, A.K.: An optimal polygonal boundary encoding scheme in the rate distortion sense. IEEE Trans. Image Process. 7(1), 13–26 (1998)
Kim, K.J., Suh, J.-Y., Kang, M.G.: Generalized interframe vertex-based shape encoding scheme for video sequences. IEEE Trans. Image Process. 9(10), 1667–1676 (2000)
Nunes, P., Marqués, F., Pereira, F., Gasull, A.: A contour-based approach to binary shape coding using a multiple grid chain code. Signal Process. Image Commun. 15(7–8), 585–599 (2000)
Liu, Q., Ngan, K.N.: Arbitrarily shaped object coding based on h. 264/avc. In: 2009 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), pp. 343–346. IEEE (2009)
Wulandhari, L.A., Haron, H.: Characteristic of rectangular vertex chain code for shapes with hole. In: 2009 International Conference on Information Management and Engineering, pp. 648–650. IEEE (2009)
Lai, H., Zuo, Z., Wang, Z., Yao, Z., Liu, W.: Accurate distortion measurement for b-spline-based shape coding. In: 2011 18th IEEE International Conference on Image Processing, pp. 225–228. IEEE (2011)
Aghito, S.M., Forchhammer, S.: Context-based coding of bilevel images enhanced by digital straight line analysis. IEEE Trans. Image Process. 15(8), 2120–2130 (2006)
Luo, H.: Image-dependent shape coding and representation. IEEE Trans. Circuits Syst. Video Technol. 15(3), 345–354 (2005)
Bai, X., Wang, B., Yao, C., Liu, W., Zhuowen, T.: Co-transduction for shape retrieval. IEEE Trans. Image Process. 21(5), 2747–2757 (2011)
Bai, X., Yang, X., Latecki, L.J., Liu, W., Zhuowen, T.: Learning context-sensitive shape similarity by graph transduction. IEEE Trans. Pattern Anal. Mach. Intell. 32(5), 861–874 (2009)
Yang, X., Koknar-Tezel, S., Latecki, L.J.: Locally constrained diffusion process on locally densified distance spaces with applications to shape retrieval. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 357–364. IEEE (2009)
Bai, X., Liu, W., Tu, Z.: Integrating contour and skeleton for shape classification. In: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, pp. 360–367. IEEE (2009)
Bai, S., Bai, X.: Sparse contextual activation for efficient visual re-ranking. IEEE Trans. Image Process. 25(3), 1056–1069 (2016)
Matousek, J: Lectures on Discrete Geometry, vol. 212. Springer (2013)
Eden, M., Kocher, M., Eurasip, M.: On the performance of a contour coding algorithm in the context of image coding part I: contour segment coding. Signal Process. 8(4), 381–386 (1985)
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)
Ling, H., Jacobs, D.W.: Shape classification using the inner-distance. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 286–299 (2007)
Biswas, S., Aggarwal, G., Chellappa, R.: Efficient indexing for articulation invariant shape matching and retrieval. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2007)
Lai, Z., Zhu, J., Luo, J.: Operationally optimal vertex-based shape coding with arbitrary direction edge encoding structures. J. Electron. Imaging 23(4), 043009–043009 (2014)
Chang, C.-C., Hwang, S.M., Buehrer, D.J.: A shape recognition scheme based on relative distances of feature points from the centroid. Pattern Recognit. 24(11), 1053–1063 (1991)
Shu, X., Xiao-Jun, W.: A novel contour descriptor for 2d shape matching and its application to image retrieval. Image Vis. Comput. 29(4), 286–294 (2011)
Wang, B., Gao, Y., Sun, C., Blumenstein, M., La Salle, J.: Chord bunch walks for recognizing naturally self-overlapped and compound leaves. IEEE Trans. Image Process. 28(12), 5963–5976 (2019)
Wang, B., Gao, Y., Sun, C., Blumenstein, M., La Salle, J.:. Can walking and measuring along chord bunches better describe leaf shapes? In: CVPR, pp. 2047–2056. IEEE Computer Society (2017)
Kaothanthong, N., Chun, J., Tokuyama, T.: Distance interior ratio: a new shape signature for 2d shape retrieval. Pattern Recognit. Lett. 78, 14–21 (2016)
Zheng, Y., Guo, B., Chen, Z., Li, C.: A Fourier descriptor of 2d shapes based on multiscale centroid contour distances used in object recognition in remote sensing images. Sensors 19(3), 486 (2019)
Bicego, M., Lovato, P.: A bioinformatics approach to 2d shape classification. Comput. Vis. Image Underst. 145, 59–69 (2016)
Yang, C., Wei, H., Qian, Yu.: A novel method for 2d nonrigid partial shape matching. Neurocomputing 275, 1160–1176 (2018)
Rongxiang, H., Jia, W., Ling, H., Huang, D.: Multiscale distance matrix for fast plant leaf recognition. IEEE Trans. Image Process. 21(11), 4667–4672 (2012)
Li, Z., Guo, B., Meng, F.: Fast shape recognition method using feature richness based on the walking minimum bounding rectangle over an occluded remote sensing target. Remote Sens. 14(22), 5845 (2022)
Wang, B., Gao, Y., Sun, C., Blumenstein, M., La Salle, J.: Chord bunch walks for recognizing naturally self-overlapped and compound leaves. IEEE Trans. Image Process. 28(12), 5963–5976 (2019)
Yang, C., Wei, H., Qian, Yu.: A novel method for 2d nonrigid partial shape matching. Neurocomputing 275, 1160–1176 (2018)
Zhang, J., Wenyin, L.: A pixel-level statistical structural descriptor for shape measure and recognition. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 386–390. IEEE (2009)
Daliri, M.R., Torre, V.: Robust symbolic representation for shape recognition and retrieval. Pattern Recognit. 41(5), 1782–1798 (2008)
Hoyer, P.O., Hyvärinen, A.: A multi-layer sparse coding network learns contour coding from natural images. Vis. Res. 42(12), 1593–1605 (2002)
Graham, D.N.: Image transmission by two-dimensional contour coding. Proc. IEEE 55(3), 336–346 (1967)
Grattoni, P., Guiducci, A.: Contour coding for image description. Pattern Recognit. Lett. 11(2), 95–105 (1990)
Zheng, A., Cheung, G., Florencio, D.: Context tree-based image contour coding using a geometric prior. IEEE Trans. Image Process. 26(2), 574–589 (2016)
Mokhtarian, F., Mackworth, A.K.: A theory of multiscale, curvature-based shape representation for planar curves. IEEE Trans. Pattern Anal. Mach. Intell. 14(8), 789–805 (1992)
Zhang, S., Yang, M., Cour, T., Kai, Yu., Metaxas, D.N.: Query specific rank fusion for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 37(4), 803–815 (2014)
Felzenszwalb, P.F., Schwartz, J.D.: Hierarchical matching of deformable shapes. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2007)
Luo, S., Mou, W., Althoefer, K., Liu, H.: Novel tactile-sift descriptor for object shape recognition. IEEE Sens. J. 15(9), 5001–5009 (2015)
Peng, H.-L., Chen, S.-Y.: Trademark shape recognition using closed contours. Pattern Recognit. Lett. 18(8), 791–803 (1997)
Grigorescu, C., Petkov, N.: Distance sets for shape filters and shape recognition. IEEE Trans. Image Process. 12(10), 1274–1286 (2003)
Ross, M., Wade, T.D.: Shape and weight concern and self-esteem as mediators of externalized self-perception, dietary restraint and uncontrolled eating. Eur. Eat.Disorders Rev. Prof. J. Eat. Disord. Assoc. 12(2), 129–136 (2004)
Tarr, M.J., Pinker, S.: Mental rotation and orientation-dependence in shape recognition. Cogn. Psychol. 21(2), 233–282 (1989)
Deutsch, J.A.: A theory of shape recognition. Br. J. Psychol. 46(1), 30–37 (1955)
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grant 62171341 (Corresponding author: Baolong Guo) and Natural Science Basic Research Program of Shaanxi Province of China under Grant 2020JM-196 (Corresponding author: Fanjie Meng).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, Z., Guo, B., Meng, F. et al. Fast shape recognition via a bi-level restraint reduction of contour coding. Vis Comput 40, 2599–2614 (2024). https://doi.org/10.1007/s00371-023-02940-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-023-02940-9