Study Guide ML Math PDF
Study Guide ML Math PDF
Version 1
5/9/2020
This is a map of subjects and the corresponding textbooks that one should study in order to
have a very solid mathematical foundation for doing machine learning work. This is a guide for
learning the math that you will use, not for learning the machine learning algorithms themselves.
Obviously not all of this is necessary, and you can find work in the machine learning field without
knowing all of this. However, if your goal is to have a deep understanding, on both an applied
and theoretical level, of algorithms that are typically used in this field, then following this guide
will enable you to do so. None of the topics here cover the software aspect of machine learning.
There are many applied machine learning courses available online that cover that aspect of the
field. Websites like Coursera or Udemy are a good place to start.
The only prerequisite knowledge this guide will assume is a year of calculus. There are
many resources for learning calculus so they will not be covered here. The book by Stewart or
Khan Academy are a perfectly fine way to learn the subject. My goal in writing this guide is to
provide someone who wants to read ESL and the Deep Learning book with enough mathematical
maturity to do so. Covering the required material will put you in good shape for that. These
subjects are difficult and require a serious level of dedication. I’ve done my best to provide
textbooks that have solutions available. I will not link to the solution manuals, they can be found
with a little bit of searching. In fact, all required texts have solution manuals available aside from
the linear models text by Christensen. Also, my goal isn’t to provide resources that are free. I’ve
picked what I believe to be are the best quality texts for the subject that will give you the deepest
understanding. With that said, it shouldn’t be hard to find electronic copies of them.
You will need to be very comfortable with proving things, this is non-negotiable. So step
0 would be to go through a book like How to Prove It by Velleman, or Discrete Mathematics
by Rosen. From there we start with linear algebra and introductory analysis. You need a
good grasp of these in order to understand statistics at the level that is required for machine
learning. It is impossible to understand things like linear models and the central limit theorem
without having a good grasp of linear algebra and analysis. For these topics I recommend Linear
Algebra by Friedberg, Insel, and Spence and Understanding Analysis by Abbott. These are
both great textbooks. The linear algebra one is great because it blends applied and theoretical
understanding, and Understanding Analysis helps build intuition for doing further work in
analysis. From here we move on to statistics, and then things branch out. As a machine learning
practitioner, having a working knowledge of probability can be very helpful, but a rigorous
understanding of probability cannot be had unless we first learn some measure theory. Thus
I’ve included that in our path as well. Below is the flowchart of subjects to study and their
corresponding texts. A blue node is a required subject/text and an orange node is an optional
subject/text.
Proofs
How to Prove It
Velleman