Definition
Latest breakthroughs in machine learning methodologies have made it feasible to accurately detect objects and to model complex events with interrelated objects.
Introduction
The explosive growth in digital videos has sparked an urgent need for new technologies able to access and retrieve desired videos from large video archives with both efficiency and accuracy. Content-based video retrieval (CBVR) techniques developed in the past decide strive to accomplish this goal by using low level image features, such as colors, textures, shapes, motions, etc. However, as there is a huge semantic gap between data representations and real video contents, CBVR techniques generally suffer from poor video retrieval performances.
The key to strengthening video retrieval capabilities lies in higher level understanding and representation of video contents. Video content understanding based on machine learning techniques is one of the promising research directions to accomplish this goal....
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
T. Mitchell, “Machine Learning,” McGraw Hill, New York, 1997.
R.A. Redner and H.F. Walker, “Mixture Densities, Maximum Likelihood and the EM Algorithm,” SIAM Review, Vol. 26, No. 2, 1984, pp. 195–239.
S. Haykin, “Neural Networks,” 2nd Edn, Prentice-Hall, Englewood, Cliffs, NJ, 1999.
N. Cristianini and J. Shawe-Taylor, “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods,” Cambridge University Press, 2000.
L.R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition,” Proceedings of the IEEE, Vol. 77, No. 2, 1989, pp. 257–286.
A.L. Berger, S.A. Della Pietra, and V.J. Della Pietra, “A Maximum Entropy Approach to Natural Language Processing,” Computational Linguistics, Vol. 22, No. 1, March 1996.
S. Geman and D. Geman, “Stochastic Relaxation, Gibbs Distributions and the Bayesian Restoration of Images,” IEEE Transactions on PAMI, Vol. 6, No. 6, 1984, pp. 721–741.
J. Lafferty, A. McCallum, and F. Pereira, “Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data,” Proceedings of ICML, 2001, pp. 282–289.
S.A. Della Pietra, V.J. Della Pietra, and J. Lafferty, “Inducing Features of Random Fields,” IEEE Transactions on PAMI, 19, 1997, pp. 380–393.
Y. Gong, M. Han, W. Hua, and W. Xu, “Maximum Entropy Model-based Baseball Highlight Detection and Classification,” Journal of CVIU, 96, 2004, pp. 181–199.
Y.T. Tse and R.L. Baker, “Camera Zoom/Pan Estimation and Compensation for Video Compression,” Image Processing Algorithms Technology, 1452, 1991, pp. 468–479.
L. Rabiner, and B.H. Juang, “Fundamentals of Speech Recognition,” Prentice-Hall, Englewood, Cliffs, NJ, 1993.
X. Liu, Y. Gong, and S.H. Zhou, “Document Clustering with Cluster Refinement and Model Selection Capabilities,” Proceedings of the ACM SIGIR'02, Tampere, Finland, August 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag
About this entry
Cite this entry
Gong, Y. (2008). Video Content Analysis Using Machine Learning Tools. In: Furht, B. (eds) Encyclopedia of Multimedia. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78414-4_242
Download citation
DOI: https://doi.org/10.1007/978-0-387-78414-4_242
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-74724-8
Online ISBN: 978-0-387-78414-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering