Machine Learning Theory Updated
Machine Learning Theory Updated
Support Vector Machine (SVM) is a Supervised Learning Algorithm mainly used for classification tasks. It
works by finding the best boundary (hyperplane) that separates the data into classes. It tries to maximize the
margin between the classes, and the closest points to the margin are called Support Vectors.
Key Points:
Advantages:
Disadvantages:
CART stands for Classification And Regression Tree. It is a Supervised Learning algorithm that creates a
decision tree based on features. It works for both classification and regression tasks.
Key Points:
Advantages:
Disadvantages:
Naive Bayes is a Supervised Learning algorithm based on Bayes' Theorem. It assumes that features are
Key Points:
Types:
Advantages:
Disadvantages:
PCA is an Unsupervised Learning technique for dimensionality reduction. It reduces the number of features
while keeping the most important information. It does this by creating new variables (principal components)
Key Points:
Advantages:
- Reduces overfitting.
- Faster computation.
Disadvantages:
LDA is a Supervised Learning technique for dimensionality reduction that also tries to separate classes as
much as possible. It projects data to a new space where class separation is maximized.
Key Points:
Advantages:
- Improves classification.
- Reduces complexity.
Disadvantages:
|----------------------|-------------------------------|------------------------------|
CNN (Convolutional Neural Network) is a type of deep learning model mainly used for image-related tasks. It
automatically detects patterns and features like edges, textures, or shapes from images.
Key Points:
Use Cases:
Advantages:
Disadvantages:
RNN (Recurrent Neural Network) is used for sequential data where order matters, like time series or
Key Points:
Use Cases:
Advantages:
Disadvantages:
Machine Learning - 10 Marks Theoretical Questions
- Hard to train.
K-Means Clustering
K-Means is an unsupervised algorithm that groups data into K clusters based on the mean (average). Each
Steps:
1. Choose K clusters.
Used For:
Pros:
- Simple, fast.
Cons:
K-Modes Clustering
K-Modes is like K-Means but for categorical data. It uses the mode (most common value) instead of mean.
Steps:
1. Choose K clusters.
Used For:
Pros:
Cons:
K-Medoids Clustering
K-Medoids is similar to K-Means but uses actual data points (medoids) as centers. It is more robust to
outliers.
Steps:
1. Choose K medoids.
Used For:
Pros:
Cons:
|--------------------|------------------|---------------------|--------------------------------|