Robust handwriting extraction and lecture video summarization

Lee, Greg C.; Yeh, Fu-Hao; Chen, Ying-Ju; Chang, Tao-Ku

doi:10.1007/s11042-016-3353-y

Published: 25 February 2016

Multimedia Tools and Applications, Volume 76, pages 7067–7085 (2017)

Abstract

In e-Learning, teachers can record lecture videos in an e-class and upload them to an e-Learning system themselves. If lecture videos and handouts could instead be generated automatically in a traditional classroom, students would be helped with self-learning and teachers with developing lecture content for e-Learning services. This paper proposes a computer-vision-based teaching assistant system that supports content development for e-Learning services. Lectures are recorded with two cameras, and the two views are merged so that students see clear and complete teaching content. K-means segmentation extracts the board area, and a connected-component technique then refills the parts of the board area that are covered by the lecturer's body. An adaptive threshold extracts handwriting under various lighting conditions, and a time-series denoising technique is designed to reduce noise. Based on the extracted handwriting, the lecture videos can be automatically structured with a high level of semantics: they are segmented into video clips, and all key-frames are integrated into handouts for the educational videos.
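The abstract names an adaptive threshold and a time-series denoising step but gives no parameters or formulas. The following is only a minimal sketch of the general idea, not the authors' method. It assumes bright chalk strokes on a dark board, uses a local-mean threshold, and treats denoising as a per-pixel vote across consecutive frames. The function names, `radius`, `offset`, and `min_votes` are all hypothetical choices made for illustration.

```python
def adaptive_threshold(gray, radius=1, offset=10):
    """Mark pixels brighter than their local mean by more than `offset`.

    Assumption: strokes are brighter than the board (chalk on blackboard).
    """
    h, w = len(gray), len(gray[0])
    mask = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Clip the averaging window at the image border.
            y0, y1 = max(0, y - radius), min(h, y + radius + 1)
            x0, x1 = max(0, x - radius), min(w, x + radius + 1)
            window = [gray[j][i] for j in range(y0, y1) for i in range(x0, x1)]
            local_mean = sum(window) / len(window)
            if gray[y][x] > local_mean + offset:
                mask[y][x] = 1
    return mask


def temporal_denoise(masks, min_votes):
    """Keep a pixel only if it is set in at least `min_votes` of the masks."""
    h, w = len(masks[0]), len(masks[0][0])
    return [
        [1 if sum(m[y][x] for m in masks) >= min_votes else 0 for x in range(w)]
        for y in range(h)
    ]


def make_frame(extra_bright=None):
    """Synthetic 5x5 dark board with a bright vertical stroke in column 2."""
    frame = [[20] * 5 for _ in range(5)]
    for y in range(5):
        frame[y][2] = 200
    if extra_bright is not None:
        ny, nx = extra_bright
        frame[ny][nx] = 200  # a one-frame bright speck, standing in for noise
    return frame


# Three consecutive frames; only the middle one contains the noise speck.
frames = [make_frame(), make_frame(extra_bright=(0, 0)), make_frame()]
masks = [adaptive_threshold(f) for f in frames]
clean = temporal_denoise(masks, min_votes=2)
```

In this toy input the speck at the top-left corner passes the threshold in one frame only, so the vote discards it, while the stroke in column 2 appears in all three frames and is kept. Because the threshold is relative to each pixel's neighbourhood rather than a global value, the same `offset` can work across unevenly lit regions of the board.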

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Taiwan Normal University, No.88, Sec. 4, Tingzhou Rd., Wenshan Dist., Taipei City, Taiwan (116), China
Greg C. Lee & Ying-Ju Chen
Program of Information Technology, Fooyin University, No. 151, Chinhsueh Rd., Ta-liao, Kaohsiung, Taiwan, China
Fu-Hao Yeh
Department of Computer Science and Information Engineering, National Dong Hwa University, Hualien, Taiwan, China
Tao-Ku Chang

Corresponding author

Correspondence to Fu-Hao Yeh.

About this article

Cite this article

Lee, G.C., Yeh, FH., Chen, YJ. et al. Robust handwriting extraction and lecture video summarization. Multimed Tools Appl 76, 7067–7085 (2017). https://doi.org/10.1007/s11042-016-3353-y

Received: 03 August 2015
Revised: 18 December 2015
Accepted: 09 February 2016
Published: 25 February 2016
Issue Date: March 2017
DOI: https://doi.org/10.1007/s11042-016-3353-y
