Abstract
Smart automated management and access to ever increasing number of scene text images is a pressing need to enable individuals and organizations save time and energy. Text embedded in such an image is an important descriptor of the image itself. In this paper, a novel scheme for automatic generation of annotations by OCRing scene text images is presented and the performance is demonstrated on a smart infobase, a knowledge base designed with trie data structure. A neuro-fuzzy approach is used for text detection and multi-layer perceptron is incorporated for character recognition. Appropriate post-processing has increased the classification performance from 90.73% to 96.86% (i.e. higher than Tesseract 3.01 that yields 93.51%). Q-gram based index keys are generated from the OCR’d text and indexed in the infobase enabling appropriate relevance scoring. Besides ‘query text’, the system also supports ‘query image’. The retrieval engine returns scene images in order of relevance i.e. in decreasing order. The performance is successfully demonstrated on a set of 100 camera captured scene text images. The system works satisfactorily within the present scope of applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhang, D., Islam, M.M., Lu, G.: A review on automatic image annotation techniques. Pattern Recogn. 45(1), 346–362 (2012)
Feng, Y., Lapata, M.: Automatic caption generation for news images. IEEE Trans. Pattern Anal. Mach. Intell. 35(4), 797–812 (2013)
Tahir, H., Tahir, R., McDonald-Maier, K.: A novel private cloud document archival system architecture based on ICmetrics. In: Fourth International Conference on Emerging Security Technologies (EST), pp. 102–106 (2013)
Baechler, M., Bloechle, J.L., Ingold, R.: Semi-automatic annotation tool for medieval manuscripts. In: International Conference on Frontiers in Handwriting Recognition, pp. 182–187 (2010)
Messaoud, I.B., Abed, H.E.: Automatic annotation for handwritten historical documents using markov models. In: International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 381–386. IEEE (2010)
Lu, S., King, I., Lyu, M.R.: A novel video summarization framework for document preparation and archival applications. In: IEEE Aerospace Conference, pp. 1–10 (2005)
Mishra, A., Karteek, A., Jawahar. C.V.: Image retrieval using textual cues. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 3040–3047 (2013)
Mollah, A.F., Basu, S., Nasipuri, M.: Text detection from camera captured images using a novel fuzzy-based technique. In: Proceedings of the Third International Conference on Emerging Applications of Information Technology (EAIT), pp. 291–294. IEEE (2012)
Mollah, A.F., Basu, S., Nasipuri, M.: Handheld device-based character recognition system for camera captured images. Int. J. Image Graph. 13(4), 1350016 (2013)
Mollah, A.F., Basu, S., Nasipuri, M.: Handheld mobile device based text region extraction and binarization of image embedded text documents. J. Intell. Syst. 22(1), 25–47 (2013)
Mollah, A.F., Basu, S., Nasipuri, M.: Segmentation of camera captured business card images for mobile devices. Int. J. Comput. Sci. Appl. 1(1), 33–37 (2010)
Mollah, A.F., Majumder, N., Basu, S., Nasipuri, M.: Design of an optical character recognition system for camera-based handheld devices. Int. J. Comput. Sci. Issues 8(4), 283–289 (2011)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: An MLP based approach for recognition of handwritten Bangla numerals. In: Second Indian International Conference on Artificial Intelligence (IICAI), pp. 407–417 (2005)
Das, N., Mollah, A. F., Saha, S., Haque, S.S.: Handwritten arabic numeral recognition using a multi-layer perceptron. In: National Conference on Recent Trends in Information Systems (ReTIS-06), pp. 200–203 (2006)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: Handwritten Bangla alphabet recognition using an MLP based classifier. In: 2nd National Conference on Computer Processing of Bangla, pp. 285–291 (2005)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recogn. 42(7), 1467–1484 (2009)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recogn. 43(10), 3507–3521 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mollah, A.F., Basu, S., Nasipuri, M. (2018). An Automatic Annotation Scheme for Scene Text Archival Applications. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ören, T. (eds) Advances in Computing and Data Sciences. ICACDS 2018. Communications in Computer and Information Science, vol 906. Springer, Singapore. https://doi.org/10.1007/978-981-13-1813-9_7
Download citation
DOI: https://doi.org/10.1007/978-981-13-1813-9_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1812-2
Online ISBN: 978-981-13-1813-9
eBook Packages: Computer ScienceComputer Science (R0)