Lecture5
Lecture5
and examples
Terminology
There are other various concepts of distance, which one will work best is application dependent
Propose a
classification of the
foods shown on the
plot into a few
categories
[Wikipedia]
- pk is the value of the k-th feature for the first data and qk is the value of
the k-th feature for the second data
Common practices: start with K equal to the square root of the training set size, use a larger K with a
weighted voting process based on the distances of the neighbors, and/or use cross-validation to evaluate
the model’s performance (this will be discussed in a later lecture).
- If a value is exactly equal to the mean of all the feature values, it will be
normalized to 0. If it is below the mean, it will result in a negative value, and if
it is above the mean it will result in a positive number (see examples here).
Amount of data
✓ Neural networks excel at utilizing large amounts of data, while traditional machine learning techniques
tend to plateau after reaching a certain data threshold.
✓ If a traditional ML algorithm is performing well, and there is a significant amount of data, it is highly
likely that using a neural network could yield even better results.
Introduction to Machine Learning 23