Skip to main content

An Automatic Annotation Scheme for Scene Text Archival Applications

  • Conference paper
  • First Online:
Advances in Computing and Data Sciences (ICACDS 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 906))

Included in the following conference series:

  • 953 Accesses

Abstract

Smart automated management and access to ever increasing number of scene text images is a pressing need to enable individuals and organizations save time and energy. Text embedded in such an image is an important descriptor of the image itself. In this paper, a novel scheme for automatic generation of annotations by OCRing scene text images is presented and the performance is demonstrated on a smart infobase, a knowledge base designed with trie data structure. A neuro-fuzzy approach is used for text detection and multi-layer perceptron is incorporated for character recognition. Appropriate post-processing has increased the classification performance from 90.73% to 96.86% (i.e. higher than Tesseract 3.01 that yields 93.51%). Q-gram based index keys are generated from the OCR’d text and indexed in the infobase enabling appropriate relevance scoring. Besides ‘query text’, the system also supports ‘query image’. The retrieval engine returns scene images in order of relevance i.e. in decreasing order. The performance is successfully demonstrated on a set of 100 camera captured scene text images. The system works satisfactorily within the present scope of applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zhang, D., Islam, M.M., Lu, G.: A review on automatic image annotation techniques. Pattern Recogn. 45(1), 346–362 (2012)

    Article  Google Scholar 

  2. Feng, Y., Lapata, M.: Automatic caption generation for news images. IEEE Trans. Pattern Anal. Mach. Intell. 35(4), 797–812 (2013)

    Article  Google Scholar 

  3. Tahir, H., Tahir, R., McDonald-Maier, K.: A novel private cloud document archival system architecture based on ICmetrics. In: Fourth International Conference on Emerging Security Technologies (EST), pp. 102–106 (2013)

    Google Scholar 

  4. Baechler, M., Bloechle, J.L., Ingold, R.: Semi-automatic annotation tool for medieval manuscripts. In: International Conference on Frontiers in Handwriting Recognition, pp. 182–187 (2010)

    Google Scholar 

  5. Messaoud, I.B., Abed, H.E.: Automatic annotation for handwritten historical documents using markov models. In: International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 381–386. IEEE (2010)

    Google Scholar 

  6. Lu, S., King, I., Lyu, M.R.: A novel video summarization framework for document preparation and archival applications. In: IEEE Aerospace Conference, pp. 1–10 (2005)

    Google Scholar 

  7. Mishra, A., Karteek, A., Jawahar. C.V.: Image retrieval using textual cues. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 3040–3047 (2013)

    Google Scholar 

  8. Mollah, A.F., Basu, S., Nasipuri, M.: Text detection from camera captured images using a novel fuzzy-based technique. In: Proceedings of the Third International Conference on Emerging Applications of Information Technology (EAIT), pp. 291–294. IEEE (2012)

    Google Scholar 

  9. Mollah, A.F., Basu, S., Nasipuri, M.: Handheld device-based character recognition system for camera captured images. Int. J. Image Graph. 13(4), 1350016 (2013)

    Article  Google Scholar 

  10. Mollah, A.F., Basu, S., Nasipuri, M.: Handheld mobile device based text region extraction and binarization of image embedded text documents. J. Intell. Syst. 22(1), 25–47 (2013)

    Google Scholar 

  11. Mollah, A.F., Basu, S., Nasipuri, M.: Segmentation of camera captured business card images for mobile devices. Int. J. Comput. Sci. Appl. 1(1), 33–37 (2010)

    Google Scholar 

  12. Mollah, A.F., Majumder, N., Basu, S., Nasipuri, M.: Design of an optical character recognition system for camera-based handheld devices. Int. J. Comput. Sci. Issues 8(4), 283–289 (2011)

    Google Scholar 

  13. Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: An MLP based approach for recognition of handwritten Bangla numerals. In: Second Indian International Conference on Artificial Intelligence (IICAI), pp. 407–417 (2005)

    Google Scholar 

  14. Das, N., Mollah, A. F., Saha, S., Haque, S.S.: Handwritten arabic numeral recognition using a multi-layer perceptron. In: National Conference on Recent Trends in Information Systems (ReTIS-06), pp. 200–203 (2006)

    Google Scholar 

  15. Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: Handwritten Bangla alphabet recognition using an MLP based classifier. In: 2nd National Conference on Computer Processing of Bangla, pp. 285–291 (2005)

    Google Scholar 

  16. Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recogn. 42(7), 1467–1484 (2009)

    Article  Google Scholar 

  17. Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recogn. 43(10), 3507–3521 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ayatullah Faruk Mollah .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mollah, A.F., Basu, S., Nasipuri, M. (2018). An Automatic Annotation Scheme for Scene Text Archival Applications. In: Singh, M., Gupta, P., Tyagi, V., Flusser, J., Ören, T. (eds) Advances in Computing and Data Sciences. ICACDS 2018. Communications in Computer and Information Science, vol 906. Springer, Singapore. https://doi.org/10.1007/978-981-13-1813-9_7

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-1813-9_7

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-1812-2

  • Online ISBN: 978-981-13-1813-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy