DS Module2 L3 L13
DS Module2 L3 L13
Module -2
Lecture-3
Presented By
Dr. Rini Saxena
Professor (Computer Science & Engineering)
CEC Jhanjeri Mohali
rini.cgctc@gmail.com
Preview of last Lecture
Quantitative data collection
Descriptive,
Correlational,
Experimental, and
Quasi-experimental.
If our dataset and model are such that a lot of training
is required, then we use a larger chunk of the data just
for training purposes (usually the case) — For
instance, training on textual data, image data, or video
data usually involves thousands of features!
Data Pre-processing
If the model has a lot of hyperparameters that can be
tuned, then keeping a higher percentage of data for
the validation set is advisable.
Models with less number of hyperparameters are easy
to tune and update, and so we can keep a smaller
validation set.