Abstract
We study estimation of mixture models for problems in which multiple views of the instances are available. Examples of this setting include clustering web pages or research papers that have intrinsic (text) and extrinsic (references) attributes. Our optimization criterion quantifies the likelihood and the consensus among models in the individual views; maximizing this consensus minimizes a bound on the risk of assigning an instance to an incorrect mixture component. We derive an algorithm that maximizes this criterion. Empirically, we observe that the resulting clustering method incurs a lower cluster entropy than regular EM for web pages, research papers, and many text collections.
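The abstract describes alternating estimation across views: responsibilities computed under one view's mixture model drive the parameter update in the other view, which pushes the per-view models toward consensus. The following is a minimal sketch of that co-EM scheme for two views with spherical Gaussian mixture components; it is an illustrative simplification, not the paper's exact criterion, and all function and variable names (`coem_gmm`, `e_step`, `m_step`) are my own.

```python
import numpy as np

def coem_gmm(X1, X2, k, n_iter=50, seed=0):
    """Sketch of co-EM for two-view spherical Gaussian mixtures.

    Each round, the M-step in one view consumes the responsibilities
    produced by the E-step in the other view, so the two per-view
    models are pulled toward a shared (consensus) clustering.
    """
    rng = np.random.default_rng(seed)
    views = [X1, X2]
    n, k = X1.shape[0], k
    # random soft initialization of responsibilities
    R = rng.dirichlet(np.ones(k), size=n)

    def m_step(X, R):
        Nk = R.sum(axis=0) + 1e-9          # effective counts per component
        pi = Nk / n                        # mixing weights
        mu = (R.T @ X) / Nk[:, None]       # component means
        var = np.array([                   # spherical variances
            (R[:, j] * ((X - mu[j]) ** 2).sum(axis=1)).sum()
            / (Nk[j] * X.shape[1])
            for j in range(k)
        ]) + 1e-6
        return pi, mu, var

    def e_step(X, pi, mu, var):
        d = X.shape[1]
        # log pi_j + log N(x | mu_j, var_j * I), stacked over components
        log_p = np.stack([
            -0.5 * (((X - mu[j]) ** 2).sum(axis=1) / var[j]
                    + d * np.log(2 * np.pi * var[j]))
            for j in range(k)
        ], axis=1) + np.log(pi)
        log_p -= log_p.max(axis=1, keepdims=True)   # numerical stability
        R = np.exp(log_p)
        return R / R.sum(axis=1, keepdims=True)

    for t in range(n_iter):
        a = t % 2                          # view updated this round
        params = m_step(views[a], R)       # uses the other view's responsibilities
        R = e_step(views[a], *params)      # handed to the other view next round
    return R.argmax(axis=1)
```

On well-separated two-view data with a shared cluster structure, the alternating updates recover a clustering on which both views agree; the real algorithm additionally weights the likelihood against an explicit consensus term, which this sketch omits.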
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
Cite this paper
Bickel, S., Scheffer, T. (2005). Estimation of Mixture Models Using Co-EM. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds) Machine Learning: ECML 2005. ECML 2005. Lecture Notes in Computer Science(), vol 3720. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564096_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29243-2
Online ISBN: 978-3-540-31692-3