
Mldap

The document outlines a course on Machine Learning and Data Analytics using Python, detailing its objectives, modules, teaching methods, assessment criteria, and learning resources. It covers foundational concepts, supervised and unsupervised learning techniques, advanced methods, and real-world applications, with practical coding exercises and projects. The course aims to equip students with skills for data-driven decision-making and predictive modeling in industry settings.

Semester- II

Machine learning and Data analytics using Python


Course Code                          MMC201    CIE Marks      50
Teaching Hours/Week (L:P:SDA/T/T)    2:2:0     SEE Marks      50
Total Hours of Pedagogy              50        Total Marks    100
Credits                              04        Exam Hours     03
Course Learning Objectives:
1. Understand foundational concepts in machine learning and data analytics.
2. Gain proficiency in Python for data analysis and machine learning tasks.
3. Learn and apply various machine learning algorithms and techniques.
4. Develop skills in data preprocessing, visualization, and model evaluation.
5. Prepare students for industry roles involving data-driven decision making and
predictive modeling.
Module-1 08 Hours
Introduction to Machine Learning and Python:
Introduction to Machine Learning: Definition and importance of machine learning, Types
of machine learning: Supervised, unsupervised, and reinforcement learning, Applications of
machine learning in various domains.
Python for Data Analysis: Introduction to Python programming, Python libraries for data
analysis: NumPy, Pandas, Matplotlib, Data manipulation and visualization using Pandas
and Matplotlib.
Data Preprocessing: Data cleaning and transformation, Handling missing values and
outliers, Feature scaling and normalization.
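
The preprocessing steps listed above (handling missing values, feature scaling and normalization) can be illustrated with a minimal sketch. It uses only the Python standard library rather than Pandas so that it runs anywhere. The function names and the `ages` column are hypothetical, and mean imputation is just one of several possible strategies.

```python
from statistics import mean, pstdev

def impute_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]

def min_max_scale(values):
    """Rescale values linearly to the range [0, 1] (assumes max > min)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Shift to zero mean and unit population standard deviation (assumes non-constant data)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]

ages = [20, None, 30, 40]        # hypothetical column with one missing value
filled = impute_mean(ages)       # missing entry replaced by the mean of 20, 30, 40
scaled = min_max_scale(filled)   # normalization to [0, 1]
zscores = standardize(filled)    # standardization (z-scores)
print(filled, scaled, zscores)
```

In a Pandas workflow the same ideas would typically be applied column by column to a DataFrame.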

Teaching Learning Process:


Lectures with PowerPoint presentations, Hands-on coding exercises using Jupyter
notebooks, Interactive discussions and problem-solving sessions, Assignments and quizzes
for assessment.
Module-2 08 Hours
Supervised Learning:
Regression: Linear regression, Polynomial regression, Model evaluation metrics: MAE,
MSE, RMSE. Classification: Logistic regression, K-Nearest Neighbors (KNN), Decision
Trees and Random Forests, Model evaluation metrics: Accuracy, precision, recall, F1-score,
ROC-AUC.
Model Training and Evaluation: Train-test split and cross-validation, Hyperparameter
tuning using GridSearchCV, Overfitting and underfitting.
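
As a rough illustration of the evaluation metrics named in this module, the sketch below computes MAE, MSE and RMSE for a regression model and accuracy, precision, recall and F1-score for a binary classifier. It is written in plain Python, without scikit-learn, to make the formulas explicit. The function names and the sample values are hypothetical.

```python
import math

def regression_metrics(y_true, y_pred):
    """Mean absolute error, mean squared error and root mean squared error."""
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / len(errors)
    mse = sum(e * e for e in errors) / len(errors)
    return {"MAE": mae, "MSE": mse, "RMSE": math.sqrt(mse)}

def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1-score for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

# Hypothetical targets and predictions, for illustration only.
reg = regression_metrics([3.0, 5.0, 2.0, 7.0], [2.0, 5.0, 4.0, 8.0])
clf = classification_metrics([1, 0, 1, 1, 0, 0], [1, 0, 0, 1, 1, 0])
print(reg)
print(clf)
```

In practice these values would be computed on a held-out test split or averaged over cross-validation folds, not on the training data.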
Teaching Learning Process:
Lab exercises on regression and classification models, Practical coding sessions with real-
time problem-solving, Group projects on developing and evaluating supervised learning
models, Continuous assessment through quizzes and coding challenges.
Module-3 08 Hours
Unsupervised Learning:
Clustering: K-Means clustering, Hierarchical clustering, Evaluation of clustering results.
Dimensionality Reduction: Principal Component Analysis (PCA), Linear Discriminant
Analysis (LDA), t-Distributed Stochastic Neighbor Embedding (t-SNE).
Association Rule Learning: Apriori algorithm, Market Basket Analysis, Evaluation metrics
for association rules.
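
The syllabus does not name the evaluation metrics for association rules. The sketch below assumes the three most commonly used ones (support, confidence and lift) and computes them for a single rule over a toy market-basket data set. The baskets and function names are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)

def rule_metrics(transactions, antecedent, consequent):
    """Support, confidence and lift of the rule antecedent -> consequent."""
    s_a = support(transactions, antecedent)
    s_c = support(transactions, consequent)
    s_ac = support(transactions, set(antecedent) | set(consequent))
    confidence = s_ac / s_a      # estimate of P(consequent | antecedent)
    lift = confidence / s_c      # values above 1 suggest a positive association
    return {"support": s_ac, "confidence": confidence, "lift": lift}

# Hypothetical baskets, for illustration only.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
metrics = rule_metrics(baskets, {"bread"}, {"milk"})
print(metrics)
```

The Apriori algorithm uses a minimum support threshold of this kind to prune candidate itemsets before rules are generated.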
Teaching Learning Process:
Lab exercises on clustering and dimensionality reduction, Practical coding sessions with
unsupervised learning techniques, Group projects on applying unsupervised learning to real-
world data, Continuous assessment through quizzes and practical tests.
Module-4 08 Hours
Advanced Machine Learning Techniques:
Ensemble Methods: Bagging and Boosting, Gradient Boosting Machines (GBM), Extreme
Gradient Boosting (XGBoost).
Support Vector Machines (SVM): Linear and non-linear SVM, Kernel trick, Model
evaluation and tuning.
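
The kernel trick mentioned above can be made concrete with a small numerical check. The sketch assumes a homogeneous degree-2 polynomial kernel on 2-D inputs, chosen here only because its explicit feature map is short. It shows that evaluating the kernel in the original space gives the same value as a dot product in the expanded feature space. All names are hypothetical.

```python
import math

def dot(u, v):
    """Ordinary dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def poly2_kernel(x, y):
    """Homogeneous degree-2 polynomial kernel: k(x, y) = (x . y)^2."""
    return dot(x, y) ** 2

def poly2_features(x):
    """Explicit feature map for 2-D input whose dot product reproduces poly2_kernel."""
    x1, x2 = x
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)

x, y = (1.0, 2.0), (3.0, 4.0)
implicit = poly2_kernel(x, y)                          # computed in the 2-D input space
explicit = dot(poly2_features(x), poly2_features(y))   # computed in the 3-D feature space
print(implicit, explicit)
```

A non-linear SVM relies on this equivalence: it only ever needs kernel values between pairs of points, so the higher-dimensional features never have to be built explicitly.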
Neural Networks and Deep Learning: Introduction to neural networks, Building and
training neural networks using TensorFlow and Keras, Convolutional Neural Networks
(CNN) and Recurrent Neural Networks (RNN).
Teaching Learning Process:
Practical sessions on advanced machine learning techniques, Interactive coding exercises to
implement neural networks, Group projects on applying advanced techniques to complex
data problems, Continuous assessment through quizzes and practical tests.
Module-5 08 Hours
Data Analytics and Real-World Applications:
Exploratory Data Analysis (EDA): Data visualization techniques, Statistical analysis and
hypothesis testing, Identifying patterns and insights from data.
Time Series Analysis: Introduction to time series data, Time series forecasting using
ARIMA and Prophet, Evaluating time series models.
Integrating Machine Learning Models: Deployment of machine learning models, Building
web applications with Flask and Django, Case studies on real-world applications of machine
learning.

Teaching Learning Process:


Lab exercises on EDA and time series analysis, Practical sessions on deploying machine
learning models, Group discussions on real-world case studies, Final project presentation
and assessment.
Sl. No.  Experiments
1  Implement and demonstrate the FIND-S algorithm for finding the most specific hypothesis based on a given set of training data samples. Read the training data from a .CSV file.
2  For a given set of training data examples stored in a .CSV file, implement and demonstrate the Candidate-Elimination algorithm to output a description of the set of all hypotheses consistent with the training examples.
3  Write a program to demonstrate the working of the decision tree based ID3 algorithm. Use an appropriate data set for building the decision tree and apply this knowledge to classify a new sample.
4  Write a program to implement the naïve Bayesian classifier for a sample training data set stored as a .CSV file. Compute the accuracy of the classifier, considering a few test data sets.
5  Write a program to implement the k-Nearest Neighbour algorithm to classify the iris data set. Print both correct and wrong predictions.
6  Build an Artificial Neural Network by implementing the Backpropagation algorithm and test the same using appropriate data sets.
7  Write a program to demonstrate Regression analysis with residual plots on a given data set.
8  Write a program to compute summary statistics such as mean, median, mode, standard deviation and variance of the given different types of data.
9  Write a program to implement the k-Means clustering algorithm to cluster the set of data stored in a .CSV file.
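
As one possible starting point for Experiment 1, the sketch below implements FIND-S over CSV data. To stay self-contained it reads the CSV text from an in-memory string instead of a file. The weather-style rows, the column layout (attributes first, label last) and the label value "Yes" are all assumptions made for illustration.

```python
import csv
import io

def find_s(examples, positive_label="Yes"):
    """Return the most specific hypothesis covering all positive examples.

    Each example is (attribute_list, label); "?" in the result matches any value.
    """
    hypothesis = None
    for attributes, label in examples:
        if label != positive_label:
            continue  # FIND-S ignores negative examples
        if hypothesis is None:
            hypothesis = list(attributes)  # start from the first positive example
        else:
            # Generalize only the attributes that disagree with this example.
            hypothesis = [h if h == a else "?" for h, a in zip(hypothesis, attributes)]
    return hypothesis

# Hypothetical training data; in the lab this would come from a .CSV file on disk.
CSV_TEXT = """Sunny,Warm,Normal,Strong,Warm,Same,Yes
Sunny,Warm,High,Strong,Warm,Same,Yes
Rainy,Cold,High,Strong,Warm,Change,No
Sunny,Warm,High,Strong,Cool,Change,Yes
"""

rows = [row for row in csv.reader(io.StringIO(CSV_TEXT)) if row]
examples = [(row[:-1], row[-1]) for row in rows]
hypothesis = find_s(examples)
print(hypothesis)
```

To read from an actual file, `io.StringIO(CSV_TEXT)` could be replaced with `open(path, newline="")`, where `path` is whatever file name the lab uses.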
Assessment Details (both CIE and SEE)
The weightage of Continuous Internal Evaluation (CIE) is 50% and for Semester End Exam (SEE) is
50%. The minimum passing mark for the CIE is 50% of the maximum marks. Minimum passing
marks in SEE is 40% of the maximum marks of SEE. A student shall be deemed to have satisfied the
academic requirements and earned the credits allotted to each subject/ course if the student secures not
less than 50% (50 marks out of 100) in the sum total of the CIE (Continuous Internal Evaluation) and
SEE (Semester End Examination) taken together.
CIE for the theory component of IPCC
1. Two Tests each of 20 Marks
2. Two assignments each of 10 Marks/One Skill Development Activity of 20 marks
3. Total Marks of two tests and two assignments/one Skill Development Activity added will be CIE
for 60 marks, marks scored will be proportionally scaled down to 30 marks.
CIE for the practical component of IPCC
On completion of every experiment/program in the laboratory, the students shall be evaluated
and marks shall be awarded on the same day. The 15 marks are for conducting the experiment
and preparation of the laboratory record, the other 05 marks shall be for the test conducted at
the end of the semester.
The CIE marks awarded in the case of the Practical component shall be based on the
continuous evaluation of the laboratory report. Each experiment report can be evaluated for 10
marks. Marks of all experiments' write-ups are added and scaled down to 15 marks.
The laboratory test at the end /after completion of all the experiments shall be conducted for 50
marks and scaled down to 05 marks. Scaled-down marks of write-up evaluations and tests
added will be CIE marks for the laboratory component of IPCC for 20 marks.
SEE for IPCC
Theory SEE will be conducted by University as per the scheduled timetable, with common question
papers for the course (duration 03 hours)
1. The question paper will be set for 100 marks and marks scored will be scaled down proportionately
to 50 marks.
2. The question paper will have ten questions. Each question is set for 20 marks.
3. There will be 2 questions from each module. Each of the two questions under a module (with a
maximum of 3 sub-questions), should have a mix of topics under that module.
4. The students have to answer 5 full questions, selecting one full question from each module.
The theory portion of the IPCC shall be for both CIE and SEE, whereas the practical portion
will have a CIE component only. Questions in the SEE paper shall include questions
from the practical component.
The minimum marks to be secured in CIE to appear for SEE shall be 15 (50% of the maximum
marks, 30) in the theory component and 10 (50% of the maximum marks, 20) in the practical
component. The laboratory component of the IPCC shall be for CIE only. However, in SEE,
questions from the laboratory component shall be included. A maximum of 04/05
questions is to be set from the practical component of IPCC, and the total marks of all these
questions should not exceed 20 marks.
SEE will be conducted for 100 marks and students shall secure 40% of the maximum marks to qualify
in the SEE. Marks secured will be scaled down to 50. (The student has to secure an aggregate of 50% of
the maximum marks of the course, CIE + SEE.)
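
The scaling rules above can be followed with a short worked calculation. The sketch below is one reading of the scheme, not an official calculator. It assumes two tests out of 20, two assignments out of 10 (or one Skill Development Activity entered as two halves), write-ups out of 10 each, a lab test out of 50 and an SEE paper out of 100. The function name and sample marks are hypothetical.

```python
def course_result(tests, assignments, writeups, lab_test, see):
    """Apply the CIE/SEE scaling and pass rules described above."""
    theory_cie = (sum(tests) + sum(assignments)) * 30 / 60    # 60 marks scaled to 30
    writeup_part = sum(writeups) * 15 / (10 * len(writeups))  # write-ups scaled to 15
    lab_part = lab_test * 5 / 50                              # 50-mark lab test scaled to 5
    practical_cie = writeup_part + lab_part                   # out of 20
    cie = theory_cie + practical_cie                          # out of 50
    eligible = theory_cie >= 15 and practical_cie >= 10       # minimum CIE to appear for SEE
    see_scaled = see * 50 / 100                               # 100 marks scaled to 50
    total = cie + see_scaled                                  # out of 100
    passed = eligible and see >= 40 and total >= 50
    return {"theory_cie": theory_cie, "practical_cie": practical_cie,
            "cie": cie, "see_scaled": see_scaled, "total": total, "passed": passed}

# Hypothetical student marks, for illustration only.
result = course_result(tests=[16, 14], assignments=[8, 7],
                       writeups=[8] * 9, lab_test=40, see=56)
print(result)
```

With these sample marks the theory CIE is 22.5 out of 30, the practical CIE is 16 out of 20, the scaled SEE is 28 out of 50, and the total of 66.5 satisfies every minimum.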
Suggested Learning Resources:
Text Books:
1. "Practical Machine Learning for Data Analysis Using Python" by Abdulhamit Subasi.
2. "Advanced Machine Learning with Python" by John Hearty.

Reference Textbooks:
1. "Python Machine Learning: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow" by Sebastian Raschka and Vahid Mirjalili.
2. "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Aurélien Géron.
3. "Introduction to Machine Learning with Python".

Web links and Video Lectures (e-Resources):


https://youtu.be/7eh4d6sabA0?si=JWHNJRVS6NhQOPYe
https://youtu.be/kqtD5dpn9C8?si=LBep4HWaMFRrAPsD
https://youtu.be/4SJ7bEILPJk?si=5LurvjzUOuCew1W9
Skill Development Activities Suggested
The students with the help of the course teacher can take up technical activities
which will enhance their skill or the students should interact with industry (small, medium
and large), understand their problems or foresee what can be undertaken for study in the
form of research/testing/projects, and for creative and innovative methods to solve the
identified problem. The prepared report shall be evaluated for CIE marks.
Course outcome (Course Skill Set)
At the end of the course the student will be able to:
Sl. No.   Description                                                                      Blooms Level
CO1       Demonstrate an understanding of machine learning concepts and techniques.        L3
CO2       Perform data preprocessing and exploratory data analysis using Python.           L4
CO3       Develop and evaluate machine learning models using Python libraries.             L5
CO4       Apply machine learning algorithms to real-world data problems.                   L4
CO5       Integrate data analytics and machine learning models into practical applications. L5

Mapping of COs and POs


PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8
CO1 X X
CO2 X
CO3 X X
CO4 X X X
CO5 X X
Industry-Relevant Key Points:
Emphasis on coding standards and best practices.
Integration of version control systems (e.g., Git) in project work.
Exposure to industry-standard tools and frameworks.
Real-world application development projects.
Focus on collaborative development and agile methodologies.
