
Contents

3.3.1 Crude Monte Carlo
3.3.2 Bootstrap Method
3.3.3 Variance Reduction
3.4 Monte Carlo for Optimization
3.4.1 Simulated Annealing
3.4.2 Cross-Entropy Method
3.4.3 Splitting for Optimization
3.4.4 Noisy Optimization
Exercises

4 Unsupervised Learning
4.1 Introduction
4.2 Risk and Loss in Unsupervised Learning
4.3 Expectation–Maximization (EM) Algorithm
4.4 Empirical Distribution and Density Estimation
4.5 Clustering via Mixture Models
4.5.1 Mixture Models
4.5.2 EM Algorithm for Mixture Models
4.6 Clustering via Vector Quantization
4.6.1 K-Means
4.6.2 Clustering via Continuous Multiextremal Optimization
4.7 Hierarchical Clustering
4.8 Principal Component Analysis (PCA)
4.8.1 Motivation: Principal Axes of an Ellipsoid
4.8.2 PCA and Singular Value Decomposition (SVD)
Exercises

5 Regression
5.1 Introduction
5.2 Linear Regression
5.3 Analysis via Linear Models
5.3.1 Parameter Estimation
5.3.2 Model Selection and Prediction
5.3.3 Cross-Validation and Predictive Residual Sum of Squares
5.3.4 In-Sample Risk and Akaike Information Criterion
5.3.5 Categorical Features
5.3.6 Nested Models
5.3.7 Coefficient of Determination
5.4 Inference for Normal Linear Models
5.4.1 Comparing Two Normal Linear Models
5.4.2 Confidence and Prediction Intervals
5.5 Nonlinear Regression Models
5.6 Linear Models in Python
5.6.1 Modeling
5.6.2 Analysis
5.6.3 Analysis of Variance (ANOVA)
5.6.4 Confidence and Prediction Intervals
5.6.5 Model Validation
5.6.6 Variable Selection
5.7 Generalized Linear Models
Exercises

6 Regularization and Kernel Methods
6.1 Introduction
6.2 Regularization
6.3 Reproducing Kernel Hilbert Spaces
6.4 Construction of Reproducing Kernels
6.4.1 Reproducing Kernels via Feature Mapping
6.4.2 Kernels from Characteristic Functions
6.4.3 Reproducing Kernels Using Orthonormal Features
6.4.4 Kernels from Kernels
6.5 Representer Theorem
6.6 Smoothing Cubic Splines
6.7 Gaussian Process Regression
6.8 Kernel PCA
Exercises

7 Classification
7.1 Introduction
7.2 Classification Metrics
7.3 Classification via Bayes’ Rule
7.4 Linear and Quadratic Discriminant Analysis
7.5 Logistic Regression and Softmax Classification
7.6 K-Nearest Neighbors Classification
7.7 Support Vector Machine
7.8 Classification with Scikit-Learn
Exercises

8 Decision Trees and Ensemble Methods
8.1 Introduction
8.2 Top-Down Construction of Decision Trees
8.2.1 Regional Prediction Functions
8.2.2 Splitting Rules
8.2.3 Termination Criterion
8.2.4 Basic Implementation
8.3 Additional Considerations
8.3.1 Binary Versus Non-Binary Trees
8.3.2 Data Preprocessing
8.3.3 Alternative Splitting Rules
8.3.4 Categorical Variables
8.3.5 Missing Values
8.4 Controlling the Tree Shape
8.4.1 Cost-Complexity Pruning

D.12 Pandas
D.12.1 Series and DataFrame
D.12.2 Manipulating Data Frames
D.12.3 Extracting Information
D.12.4 Plotting
D.13 Scikit-learn
D.13.1 Partitioning the Data
D.13.2 Standardization
D.13.3 Fitting and Prediction
D.13.4 Testing the Model
D.14 System Calls, URL Access, and Speed-Up

Bibliography

Index
Preface

In our present world of automation, cloud computing, algorithms, artificial intelligence, and big data, few topics are as relevant as data science and machine learning. Their recent popularity lies not only in their applicability to real-life questions, but also in their natural blending of many different disciplines, including mathematics, statistics, computer science, engineering, science, and finance.

To someone starting to learn these topics, the multitude of computational techniques and mathematical ideas may seem overwhelming. Some may be satisfied with learning only how to apply off-the-shelf recipes to practical situations. But what if the assumptions of the black-box recipe are violated? Can we still trust the results? How should the algorithm be adapted? To truly understand data science and machine learning, it is important to appreciate the underlying mathematics and statistics, as well as the resulting algorithms.
The purpose of this book is to provide an accessible, yet comprehensive, account of
data science and machine learning. It is intended for anyone interested in gaining a better
understanding of the mathematics and statistics that underpin the rich variety of ideas and
machine learning algorithms in data science. Our viewpoint is that computer languages
come and go, but the underlying key ideas and algorithms will remain forever and will
form the basis for future developments.
Before we turn to a description of the topics in this book, we would like to say a
few words about its philosophy. This book resulted from various courses in data science
and machine learning at the Universities of Queensland and New South Wales, Australia.
When we taught these courses, we noticed that students were eager not only to learn how
to apply algorithms but also to understand how these algorithms actually work. However,
many existing textbooks assumed either too much background knowledge (e.g., measure
theory and functional analysis) or too little (everything is a black box), and the information
overload from often disjointed and contradictory internet sources made it more difficult for
students to gradually build up their knowledge and understanding. We therefore wanted to
write a book about data science and machine learning that can be read as a linear story,
with a substantial “backstory” in the appendices. The main narrative starts very simply and
builds up gradually to quite an advanced level. The backstory contains all the necessary
