Summary
Summary
1561/0600000035
Decision Forests
A Unified Framework for
Classification, Regression, Density
Estimation, Manifold Learning
and Semi-Supervised Learning
Full text available at: http://dx.doi.org/10.1561/0600000035
Decision Forests
A Unified Framework for
Classification, Regression, Density
Estimation, Manifold Learning
and Semi-Supervised Learning
Antonio Criminisi
Microsoft Research Ltd.
Cambridge, UK
antcrim@microsoft.com
Jamie Shotton
Microsoft Research Ltd.
Cambridge, UK
jamiesho@microsoft.com
Ender Konukoglu
Microsoft Research Ltd.
Cambridge, UK
enderk@microsoft.com
Boston – Delft
Full text available at: http://dx.doi.org/10.1561/0600000035
ISBN: 978-1-60198-540-8
c 2012 A. Criminisi, J. Shotton and E. Konukoglu
All rights reserved. No part of this publication may be reproduced, stored in a retrieval
system, or transmitted in any form or by any means, mechanical, photocopying, recording
or otherwise, without prior written permission of the publishers.
Photocopying. In the USA: This journal is registered at the Copyright Clearance Cen-
ter, Inc., 222 Rosewood Drive, Danvers, MA 01923. Authorization to photocopy items for
internal or personal use, or the internal or personal use of specific clients, is granted by
now Publishers Inc for users registered with the Copyright Clearance Center (CCC). The
‘services’ for users can be found on the internet at: www.copyright.com
For those organizations that have been granted a photocopy license, a separate system
of payment has been arranged. Authorization does not extend to other kinds of copy-
ing, such as that for general distribution, for advertising or promotional purposes, for
creating new collective works, or for resale. In the rest of the world: Permission to pho-
tocopy must be obtained from the copyright owner. Please apply to now Publishers Inc.,
PO Box 1024, Hanover, MA 02339, USA; Tel. +1-781-871-0245; www.nowpublishers.com;
sales@nowpublishers.com
now Publishers Inc. has an exclusive license to publish this material worldwide. Permission
to use this content must be obtained from the copyright license holder. Please apply to now
Publishers, PO Box 179, 2600 AD Delft, The Netherlands, www.nowpublishers.com; e-mail:
sales@nowpublishers.com
Full text available at: http://dx.doi.org/10.1561/0600000035
Editor-in-Chief:
Brian Curless
University of Washington
Luc Van Gool
KU Leuven/ETH Zurich
Richard Szeliski
Microsoft Research
Editors
Editorial Scope
1
Microsoft Research Ltd., 7 J J Thomson Ave, Cambridge, CB3 0FB, UK,
antcrim@microsoft.com
2
Microsoft Research Ltd., 7 J J Thomson Ave, Cambridge, CB3 0FB, UK,
jamiesho@microsoft.com
3
Microsoft Research Ltd., 7 J J Thomson Ave, Cambridge, CB3 0FB, UK,
enderk@microsoft.com
Abstract
This review presents a unified, efficient model of random decision forests
which can be applied to a number of machine learning, computer vision,
and medical image analysis tasks.
Our model extends existing forest-based techniques as it unifies
classification, regression, density estimation, manifold learning, semi-
supervised learning, and active learning under the same decision forest
framework. This gives us the opportunity to write and optimize the core
implementation only once, with application to many diverse tasks.
Full text available at: http://dx.doi.org/10.1561/0600000035
Contents
3 Classification Forests 25
4 Regression Forests 51
ix
Full text available at: http://dx.doi.org/10.1561/0600000035
5 Density Forests 69
5.1 Literature on Density Estimation 70
5.2 Specializing the Forest Model for Density Estimation 70
5.3 Effect of Model Parameters 76
5.4 Comparison with Alternative Algorithms 79
5.5 Sampling from the Generative Model 83
5.6 Dealing with Non-function Relations 85
5.7 Quantitative Analysis 91
6 Manifold Forests 95
6.1 Literature on Manifold Learning 96
6.2 Specializing the Forest Model for Manifold Learning 97
6.3 Experiments and the Effect of Model Parameters 104
6.4 Learning Manifold of Object Shapes 110
6.5 Learning Manifold of Text Documents 112
6.6 Discussion 113
9 Conclusions 135
Acknowledgements 141
References 143
Full text available at: http://dx.doi.org/10.1561/0600000035
1
Overview and Scope
1
Full text available at: http://dx.doi.org/10.1561/0600000035
also describes techniques for predicting the forest test error based on
measures of tree strength and correlation.
In computer vision, ensemble methods became popular with
the seminal face and pedestrian detection papers of Viola and
Jones [107, 108]. Random decision forests where used in [63] for image
classification and in [60] for keypoint tracking in videos. Recent years
have seen an explosion of forest-based techniques in the machine learn-
ing, vision and medical imaging literature [9, 15, 25, 31, 35, 37, 58, 59,
65, 67, 68, 69, 74, 79, 89, 92, 97, 110]. Decision forests compare favor-
ably with respect to other techniques [15] and have lead to one of the
biggest success stories of computer vision in recent years: the Microsoft
Kinect for XBox 360 [39, 91, 66].
Full text available at: http://dx.doi.org/10.1561/0600000035
References
143
Full text available at: http://dx.doi.org/10.1561/0600000035
144 References
References 145
146 References
References 147
148 References
References 149
[102] M. E. Tipping, “Sparse bayesian learning and the relevance vector machine,”
Journal of Machine Learning Research, vol. 1, pp. 211–244, 2001.
[103] A. Torralba, K. P. Murphy, and W. T. Freeman, “Sharing visual features
for multiclass and multiview object detection,” Institute of Electrical and
Electronics Engineers Transactions on Pattern Analysis and Machine Intelli-
gence, 2007.
[104] Z. Tu, “Probabilistic boosting-tree: Learning discriminative models for classi-
fication, recognition, and clustering,” in Proceedings of the Institute of Electri-
cal and Electronics Engineers International Conference on Computer Vision
(International Conference on Computer Vision), pp. 1589–1596, 2005.
[105] Z. Tu, “Auto-context and its application to high-level vision tasks,” in IEEE
Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
[106] V. Vapnik, The nature of statistical learning theory. Springer Verlag, 2000.
[107] P. Viola and M. J. Jones, “Robust real-time face detection,” International
Journal of Conference Vision, 2004.
[108] P. Viola, M. J. Jones, and D. Snow, “Detecting pedestrians using patterns of
motion and appearance,” in International Conference on Computer Vision,
2003.
[109] J. Wang, “On transductive support vector machines,” in Prediction and Dis-
covery, American Mathematical Society, 2007.
[110] P. Yin, A. Criminisi, I. Essa, and J. Winn, “Tree-based classifiers for bilayer
video segmentation,” in Computer Vision and Pattern Recognition, 2007.
[111] Q. Zhang, R. Souvenir, and R. Pless, “On manifold structure of cardiac MRI
data: Application to segmentation,” in Institute of Electrical and Electronics
Engineers Computer Vision and Pattern Recognition, Los Alamitos, CA, USA,
2006.
[112] X. Zhu and A. Goldberg, “Introduction to semi-supervised learning,”
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan
and Claypool Publishers, 2009.
[113] A. Zien and C. S. Ong, “Multiclass multiple kernel learning,” in International
Conference on Machine Learning, 2007.