AAM 1st Unit QB
Define supervised learning and list types of supervised learning algorithms.
Supervised learning is a type of machine learning where the model is trained on a labelled dataset,
meaning that each input data point is associated with a corresponding target label. The goal is to
learn a mapping from inputs to outputs, based on the labelled examples provided during training, so
that the model can make predictions on unseen data.
1. Linear Regression
2. Logistic Regression
3. Decision Trees
4. Random Forest
5. Naive Bayes
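As a quick illustration of the supervised setup described above, here is a minimal sketch, assuming scikit-learn is installed. It trains one of the listed algorithms (logistic regression) on a labelled dataset and predicts labels for unseen inputs.

```python
# Minimal supervised-learning sketch (assumes scikit-learn is installed):
# fit a classifier on labelled data, then predict on unseen inputs.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)          # labelled examples: features X, targets y
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = LogisticRegression(max_iter=200)   # one of the listed algorithms
model.fit(X_train, y_train)                # learn the mapping from inputs to outputs
print(model.predict(X_test[:5]))           # predictions on unseen data
```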
List out feature engineering techniques.
Feature engineering techniques:
1. Feature Selection
2. Feature Transformation
3. Feature Creation
4. Feature Scaling
5. Feature Extraction
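To make a few of these concrete, the following sketch (assuming scikit-learn) applies feature scaling, feature selection, and feature extraction to the same dataset; the specific choices (z-score scaling, SelectKBest, PCA) are illustrative, not the only options.

```python
# Illustrative sketch (assumes scikit-learn) of three techniques above:
# feature scaling, feature selection, and feature extraction.
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.decomposition import PCA

X, y = load_iris(return_X_y=True)

X_scaled = StandardScaler().fit_transform(X)                          # feature scaling (z-score)
X_selected = SelectKBest(f_classif, k=2).fit_transform(X_scaled, y)   # feature selection
X_extracted = PCA(n_components=2).fit_transform(X_scaled)             # feature extraction
print(X_selected.shape, X_extracted.shape)
```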
5. List out feature engineering techniques for text data.
Feature engineering techniques for text data:
1. Tokenization
2. Stopwords Removal
3. Stemming
4. Lemmatization
5. N-grams
6. Word Embeddings
7. Topic Modeling
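The sketch below (assuming scikit-learn) illustrates three of these techniques, tokenization, stopword removal, and n-grams, in one pass with CountVectorizer; the toy corpus is made up for illustration.

```python
# Sketch (assumes scikit-learn) of tokenization, stopword removal,
# and n-gram features on a hypothetical toy corpus.
from sklearn.feature_extraction.text import CountVectorizer

corpus = ["the cat sat on the mat", "the dog chased the cat"]

# Tokenizes each document, drops English stopwords, and builds
# unigram + bigram (n-gram) count features.
vectorizer = CountVectorizer(stop_words="english", ngram_range=(1, 2))
X = vectorizer.fit_transform(corpus)
print(vectorizer.get_feature_names_out())  # vocabulary of tokens and bigrams
print(X.toarray())                         # document-term count matrix
```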
Explain the steps involved in training a supervised learning model.
1. **Data Collection**: Gather the data needed to train the model. This data should include both
input features and corresponding target labels.
2. **Data Preprocessing**:
- **Cleaning**: Handle missing values, outliers, or errors in the dataset.
- **Feature Engineering**: Transform, select, or create features to represent the data effectively.
- **Normalization/Scaling**: Scale the features to ensure they are on a similar scale, which can
improve the performance of some algorithms.
- **Encoding**: Encode categorical variables into numerical format if necessary.
3. **Splitting the Data**: Divide the dataset into two or three subsets:
- **Training Set**: The portion of data used to train the model.
- **Validation Set**: (Optional) Used to tune hyperparameters and evaluate model performance
during training.
- **Test Set**: Used to evaluate the final model's performance after training.
4. **Model Selection**: Choose a suitable algorithm for the task, such as one of the supervised
learning algorithms listed earlier.
5. **Model Training**: Fit the chosen model to the training set so that it learns the mapping from
input features to target labels.
6. **Model Evaluation**:
- **Validation**: Evaluate the model's performance on the validation set. Adjust hyperparameters
if necessary to improve performance.
- **Test**: Evaluate the final trained model on the test set to assess its performance on unseen
data and ensure generalization.
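An end-to-end sketch of these steps, assuming scikit-learn: data collection, splitting, scaling (fit on the training set only, to avoid leaking test information into preprocessing), model training, and evaluation on the held-out test set.

```python
# End-to-end sketch (assumes scikit-learn) of the training workflow above.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = load_breast_cancer(return_X_y=True)                  # data collection
X_train, X_test, y_train, y_test = train_test_split(        # splitting the data
    X, y, test_size=0.2, random_state=0)

scaler = StandardScaler().fit(X_train)                      # scaling: fit on train only
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)  # model training
print(accuracy_score(y_test, model.predict(X_test)))             # model evaluation
```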
Explain the steps involved in feature extraction.
1. **Understanding the Data**: Begin by understanding the nature of the data and the problem at
hand. This involves analyzing the data's structure, distributions, relationships between variables, and
the specific requirements of the machine learning task.
2. **Feature Selection**:
- Identify relevant features that are likely to have predictive power for the target variable.
- Remove irrelevant features that do not contribute to the predictive performance or may introduce
noise into the model.
3. **Feature Transformation**:
- Scale or normalize numerical features to ensure they have similar ranges and distributions.
Common techniques include min-max scaling or z-score normalization.
- Transform skewed distributions using techniques like logarithmic or Box-Cox transformations to
make them more symmetrical.
4. **Feature Scaling**:
- Scale the features to ensure that they have similar magnitudes and do not dominate the model
training process. This is particularly important for algorithms sensitive to feature scales, such as
gradient descent-based optimization algorithms.
By following these steps, feature extraction can effectively transform raw data into informative
features that enable machine learning models to learn patterns and make accurate predictions on
new, unseen data.
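As a small worked example of the transformation and scaling steps above, assuming NumPy and scikit-learn: a log transform compresses a right-skewed feature, and min-max scaling then brings it to a common [0, 1] range. The feature values are made up for illustration.

```python
# Sketch (assumes NumPy and scikit-learn) of feature transformation
# and feature scaling on a hypothetical right-skewed feature.
import numpy as np
from sklearn.preprocessing import MinMaxScaler

skewed = np.array([[1.0], [10.0], [100.0], [1000.0]])   # right-skewed feature

log_transformed = np.log1p(skewed)                      # log transform: compress the long tail
scaled = MinMaxScaler().fit_transform(log_transformed)  # min-max scaling to [0, 1]
print(scaled.ravel())
```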