0% found this document useful (0 votes)

3 views19 pages

ML Lab Manual

The document outlines various Python programming experiments focused on statistical analysis, machine learning, and data visualization. It includes implementations of central tendency measures, linear regression, decision trees, KNN, logistic regression, and K-Means clustering, utilizing libraries such as NumPy, Pandas, and Scikit-learn. Each section provides code examples and expected outputs for better understanding of the concepts.

Uploaded by

Sofia tarannum

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views19 pages

ML Lab Manual

Uploaded by

Sofia tarannum

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 19

NAME OF THE EXPERIMENT PAGE

NO
1 python program to compute Central Tendency
Measures :Mean, Median, Mode Measures of
Dispersion: variance ,standard Deviation

2 .Study of Python Basic Libraries such as Statistics,

Math, Numpy and Scipy

3 Study of Python Libraries for ML application such

as Pandas and Matplotlib
4 Python Program for Simple Linear Regression.

5 Implementation of Multiple Linear Regression for

House Pricing Pricing Prediction using sklearn
6 Implementation of Decision tree using sklearn and
its parameter tuning

7 Implementation of KNN using sklearn

8 Implementation of Logistic Regression using

sklearn

9 Implementation of K-Means Clustering

import numpy as np

10 Performance analysis of Classification Algorithms

1
Program 1: python program to compute Central Tendency
Measures :Mean, Median, Mode Measures of Dispersion:
variance ,standard Deviation

import statistics as stats

def central_tendency_dispersion(data):

# Central Tendency Measures

mean = stats.mean(data)

median = stats.median(data)

try:

mode = stats.mode(data)

except stats.StatisticsError:

mode = "No unique mode found"

# Measures of Dispersion

variance = stats.variance(data)

std_dev = stats.stdev(data)

# Display results

print(f"Mean: {mean}")

print(f"Median: {median}")

print(f"Mode: {mode}")

print(f"Variance: {variance}")

print(f"Standard Deviation: {std_dev}")

# Example data

2
data = [10, 15, 14, 10, 15, 18, 20, 25, 30]

central_tendency_dispersion(data)

OUTPUT:

Mean: 17.444444444444443
Median: 15
Mode: 10
Variance: 44.52777777777778
Standard Deviation: 6.672913739722534

3
2.Study of Python Basic Libraries such as Statistics, Math, Numpy and
Scipy

Python provides a wide range of basic libraries that are essential for various computational
tasks. These libraries offer functionality to handle statistical calculations, mathematical
operations, and scientific computing. Here is an overview:

Statistics Module

 Used for statistical computations such as mean, median, mode, variance, etc.
 Example

import statistics

data = [1, 2, 2, 3, 4]

print("Mean:", statistics.mean(data))

print("Median:", statistics.median(data))

print("Mode:", statistics.mode(data))

Math Module

 Provides mathematical functions such as trigonometric calculations, logarithms,

factorials, and more.
 Example

import math

print("Square root of 16:", math.sqrt(16))

print("Factorial of 5:", math.factorial(5))

print("Cosine of 45 degrees:", math.cos(math.radians(45)))

Numpy Library

 Widely used for numerical computations with arrays, matrices, and linear algebra
functions.
 Example:

import numpy as np

array = np.array([1, 2, 3, 4, 5])

4
print("Mean of array:", np.mean(array))

print("Sum of array:", np.sum(array))

Scipy Library

 Built on Numpy, it provides additional functionality for optimization, integration, and

scientific computations.
 Example

from scipy import integrate

# Define a function to integrate

result, _ = integrate.quad(lambda x: x**2, 0, 1)

print("Integral of x^2 from 0 to 1:", result)

5
3. Study of Python Libraries for ML application such as Pandas and
Matplotlib

For machine learning and data analysis, Python libraries like Pandas and Matplotlib are
essential for data manipulation and visualization.

Pandas

 Provides data structures like Series and DataFrame for handling and analyzing data
efficiently.
 Example:

import pandas as pd

data = {'Name': ['Alice', 'Bob', 'Charlie'], 'Age': [25, 30, 35]}

df = pd.DataFrame(data)

print(df)

print("Mean Age:", df['Age'].mean())

Matplotlib

 A visualization library used for creating static, interactive, and animated plots.
 Example:

import matplotlib.pyplot as plt

x = [1, 2, 3, 4, 5]

y = [10, 20, 25, 30, 35]

plt.plot(x, y, marker='o', linestyle='--', color='r')

plt.title("Sample Line Plot")

plt.xlabel("X-axis")

plt.ylabel("Y-axis")

plt.show()

6
Program 4:Python Program for Simple Linear Regression.

import numpy as np

import matplotlib.pyplot as plt

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression

from sklearn.metrics import mean_squared_error, r2_score

# Generate some example data

np.random.seed(0)

X = 2 * np.random.rand(100, 1)

y = 4 + 3 * X + np.random.randn(100, 1)

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

random_state=42)

# Create and train the model

model = LinearRegression()

model.fit(X_train, y_train)

# Make predictions

y_pred = model.predict(X_test)

# Evaluate the model

mse = mean_squared_error(y_test, y_pred)

r2 = r2_score(y_test, y_pred)

7
print(f"Mean Squared Error: {mse:.2f}")

print(f"R-squared: {r2:.2f}")

# Plotting the results

plt.scatter(X_test, y_test, color="black", label="Actual data")

plt.plot(X_test, y_pred, color="blue", linewidth=2, label="Fitted line")

plt.xlabel("X")

plt.ylabel("y")

plt.title("Simple Linear Regression")

plt.legend()

plt.show()

OUTPUT:

8
program5: Implementation of Multiple Linear Regression for House
Pricing Pricing Prediction using sklearn

import numpy as np

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression

from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset

data = pd.read_csv('house_prices.csv')

# Display the first few rows of the dataset

print(data.head())

# Selecting features and target variable

X = data[['Size', 'Bedrooms', 'Age']]

y = data['Price']

# Handling missing data

X = X.fillna(X.mean())

y = y.fillna(y.mean())

# Splitting the data into training and testing sets

9
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

# Creating and training the model

model = LinearRegression()

model.fit(X_train, y_train)

# Making predictions on the testing set

y_pred = model.predict(X_test)

# Evaluating the model's performance

mse = mean_squared_error(y_test, y_pred)

r2 = r2_score(y_test, y_pred)

print(f'Mean Squared Error: {mse}')

print(f'R-squared: {r2}')

# Model coefficients

print("Intercept:", model.intercept_)

print("Coefficients:", model.coef_)

coefficients = pd.DataFrame(model.coef_, X.columns, columns=['Coefficient'])

print(coefficients)

10
6. Implementation of Decision tree using sklearn and its parameter tuning
11
Importing necessary libraries

import numpy as np

import pandas as pd

from sklearn.model_selection import train_test_split, GridSearchCV

from sklearn.tree import DecisionTreeClassifier

from sklearn.metrics import accuracy_score, classification_report

from sklearn.datasets import load_iris

# Load dataset (for example, the Iris dataset)

data = load_iris()

X = data.data

y = data.target

# Split dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

random_state=42)

# Initialize a basic DecisionTreeClassifier

clf = DecisionTreeClassifier(random_state=42)

# Fit the model with the training data

clf.fit(X_train, y_train)

# Predict on the test set

y_pred = clf.predict(X_test)

12
# Evaluate model performance

print("Accuracy without tuning: ", accuracy_score(y_test, y_pred))

print("Classification Report:\n", classification_report(y_test, y_pred))

# Parameter tuning using GridSearchCV

param_grid = {

'criterion': ['gini', 'entropy'], # Different criteria for splitting

'splitter': ['best', 'random'], # Split strategy

'max_depth': [None, 10, 20, 30], # Depth of tree

'min_samples_split': [2, 5, 10], # Minimum number of samples to split a

node

'min_samples_leaf': [1, 2, 4], # Minimum number of samples to be at a

leaf node

'max_features': [None, 'auto', 'sqrt', 'log2'] # Number of features to consider

for the best split

# Using GridSearchCV for parameter tuning

grid_search = GridSearchCV(estimator=clf, param_grid=param_grid, cv=5,

n_jobs=-1, verbose=1)

# Fit GridSearchCV

grid_search.fit(X_train, y_train)

# Best parameters from GridSearchCV

13
print("Best Parameters: ", grid_search.best_params_)

# Predict with the best estimator from grid search

best_clf = grid_search.best_estimator_

y_pred_best = best_clf.predict(X_test)

# Evaluate performance with the tuned model

print("Accuracy with tuning: ", accuracy_score(y_test, y_pred_best))

print("Classification Report:\n", classification_report(y_test, y_pred_best))

OUTPUT:

Accuracy with tuning: 1.0

Classification Report:
precision recall f1-score support

0 1.00 1.00 1.00 10

1 1.00 1.00 1.00 9
2 1.00 1.00 1.00 11

accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30

7. Implementation of KNN using sklearn

14
# Import necessary libraries

from sklearn.datasets import load_iris

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

from sklearn.metrics import accuracy_score

# Load the dataset (Iris dataset)

iris = load_iris()

X = iris.data # Features

y = iris.target # Target labels

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3,

random_state=42)

# Create the KNN model with k=3

knn = KNeighborsClassifier(n_neighbors=3)

# Train the model

knn.fit(X_train, y_train)

# Make predictions

y_pred = knn.predict(X_test)

# Evaluate the model's performance

accuracy = accuracy_score(y_test, y_pred)

print(f"Accuracy: {accuracy * 100:.2f}%")

OUTPUT:

Accuracy: 100.00%

8.Implementation of Logistic Regression using sklearn

15
# Import necessary libraries

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score, confusion_matrix,

classification_report

from sklearn.datasets import load_iris

# Load a sample dataset

# Here, we're using the Iris dataset for simplicity.

# We'll use only two classes (binary classification) for logistic regression.

iris = load_iris()

X = iris.data

y = iris.target

# For binary classification, we'll select only two classes (e.g., class 0 and 1)

X = X[y != 2] # Select only class 0 and 1

y = y[y != 2] # Select only class 0 and 1

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3,

random_state=42)

# Create a Logistic Regression model

log_reg = LogisticRegression()

# Train the model

log_reg.fit(X_train, y_train)

16
# Make predictions on the test set

y_pred = log_reg.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)

conf_matrix = confusion_matrix(y_test, y_pred)

class_report = classification_report(y_test, y_pred)

# Print the results

print("Accuracy:", accuracy)

print("\nConfusion Matrix:\n", conf_matrix)

print("\nClassification Report:\n", class_report)

OUTPUT:

Accuracy: 1.0

Confusion Matrix:

[[17 0]

[ 0 13]]

Classification Report:

precision recall f1-score support

0 1.00 1.00 1.00 17

1 1.00 1.00 1.00 13

accuracy 1.00 30

macro avg 1.00 1.00 1.00 30

weighted avg 1.00 1.00 1.00 30

9.Implementation of K-Means Clustering

17
import numpy as np

from sklearn.cluster import KMeans

from sklearn.datasets import make_blobs

import matplotlib.pyplot as plt

# Generate synthetic data with 4 clusters

X, y_true = make_blobs(n_samples=300, centers=4, cluster_std=0.60,

random_state=0)

# Create a KMeans model with the number of clusters set to 4

kmeans = KMeans(n_clusters=4, random_state=0)

# Fit the model to the data

kmeans.fit(X)

# Predict the cluster labels for each data point

y_kmeans = kmeans.predict(X)

# Plotting the clusters and their centroids

plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')

# Marking the centroids

centers = kmeans.cluster_centers_

plt.scatter(centers[:, 0], centers[:, 1], c='red', s=200, alpha=0.75, marker='X')

plt.title("K-Means Clustering")

18
plt.xlabel("Feature 1")

plt.ylabel("Feature 2")

plt.show()

OUTPUT:

Big Data Practical
No ratings yet
Big Data Practical
20 pages
ML Lab
No ratings yet
ML Lab
33 pages
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
No ratings yet
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
24 pages
ML Lab-1
No ratings yet
ML Lab-1
32 pages
ML Lab Record
No ratings yet
ML Lab Record
17 pages
ML Yogesh
No ratings yet
ML Yogesh
23 pages
R22 ML Lab Manual
No ratings yet
R22 ML Lab Manual
25 pages
21CSC305P ML - Lab Programs 1 - 9
No ratings yet
21CSC305P ML - Lab Programs 1 - 9
36 pages
27 KrishParasShah
No ratings yet
27 KrishParasShah
17 pages
Machinelearning - Lab Manual
No ratings yet
Machinelearning - Lab Manual
26 pages
Lab ML
No ratings yet
Lab ML
26 pages
ML Record
No ratings yet
ML Record
21 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
10 pages
ML Full For Print New 1
No ratings yet
ML Full For Print New 1
38 pages
ML Lab Record - 250625 - 105014
No ratings yet
ML Lab Record - 250625 - 105014
29 pages
Karmbir 19 ML
No ratings yet
Karmbir 19 ML
20 pages
Machine Learning Algorithms Are Generally Categorized Into Three Main Types
No ratings yet
Machine Learning Algorithms Are Generally Categorized Into Three Main Types
7 pages
ML Manual
No ratings yet
ML Manual
24 pages
Tushar ML
No ratings yet
Tushar ML
52 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
22 pages
ML Record
No ratings yet
ML Record
19 pages
Machine Learning LAB
No ratings yet
Machine Learning LAB
20 pages
Smec ML Lab Manual R22
No ratings yet
Smec ML Lab Manual R22
21 pages
Python科学计算
No ratings yet
Python科学计算
635 pages
ML Lab Programs 2
No ratings yet
ML Lab Programs 2
16 pages
Quantitative Economics With Python PDF
No ratings yet
Quantitative Economics With Python PDF
670 pages
ML Lab Manual
No ratings yet
ML Lab Manual
14 pages
Decision Tree
No ratings yet
Decision Tree
6 pages
Seminar Presentation
No ratings yet
Seminar Presentation
25 pages
ML File Syllabus
No ratings yet
ML File Syllabus
43 pages
Udacity Machine Learning Analysis Supervised Learning
100% (1)
Udacity Machine Learning Analysis Supervised Learning
504 pages
ML Lab Manual
No ratings yet
ML Lab Manual
28 pages
Regression Analysis - Cheatsheet
No ratings yet
Regression Analysis - Cheatsheet
9 pages
ML Lab
No ratings yet
ML Lab
23 pages
cp4252 Machine Learning Lab Manual
No ratings yet
cp4252 Machine Learning Lab Manual
21 pages
CSE455/CSE552 Machine Learning (Spring 2024) Homework #1: Hand-In Policy Collaboration Policy Grading
No ratings yet
CSE455/CSE552 Machine Learning (Spring 2024) Homework #1: Hand-In Policy Collaboration Policy Grading
2 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
ML Lab - Manual
No ratings yet
ML Lab - Manual
15 pages
Scipy Cookbook
No ratings yet
Scipy Cookbook
527 pages
Machine Learning Final Manual
No ratings yet
Machine Learning Final Manual
45 pages
LAB MANUAL For Machine Learning
No ratings yet
LAB MANUAL For Machine Learning
15 pages
Easy Pract ML
No ratings yet
Easy Pract ML
7 pages
Pandas
No ratings yet
Pandas
698 pages
Sahil ML
No ratings yet
Sahil ML
21 pages
ML With Python Practical
No ratings yet
ML With Python Practical
22 pages
Python Artificial Learning: Rahul Mula
No ratings yet
Python Artificial Learning: Rahul Mula
272 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
ML RECORD - Merged
No ratings yet
ML RECORD - Merged
33 pages
ML Cyber Lab
No ratings yet
ML Cyber Lab
16 pages
Unit 1
No ratings yet
Unit 1
164 pages
Final ML File
No ratings yet
Final ML File
34 pages
ML Practical File
No ratings yet
ML Practical File
30 pages
ML Lab (R22) Manual
No ratings yet
ML Lab (R22) Manual
25 pages
Sr. No. Practical No. Date Sign: Index
No ratings yet
Sr. No. Practical No. Date Sign: Index
11 pages
Ardent Report
No ratings yet
Ardent Report
62 pages
Python Slides PDF
No ratings yet
Python Slides PDF
35 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
LightGBM - Release 2.2.4 PDF
No ratings yet
LightGBM - Release 2.2.4 PDF
183 pages
ML - LAB - FILE Pankaj
No ratings yet
ML - LAB - FILE Pankaj
13 pages
ML - LAB - FILE Amrit
No ratings yet
ML - LAB - FILE Amrit
13 pages
Mca Final Year Project
100% (2)
Mca Final Year Project
76 pages
01 ASAP TimeSeriesForcasting Day1 2 Introduction
No ratings yet
01 ASAP TimeSeriesForcasting Day1 2 Introduction
62 pages
Python Interview Questions
100% (2)
Python Interview Questions
26 pages
Obspy Overview
No ratings yet
Obspy Overview
51 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
23 pages
Sean Jordan Synthesis Paper
No ratings yet
Sean Jordan Synthesis Paper
18 pages
Heart Analysis With Python (Part 3 - How To Flatten A Wandering EKG) - by Proto Bioengineering - Medium
No ratings yet
Heart Analysis With Python (Part 3 - How To Flatten A Wandering EKG) - by Proto Bioengineering - Medium
17 pages
Lab Experiments Vi Sem-1
No ratings yet
Lab Experiments Vi Sem-1
10 pages
Google Collab & Python
100% (1)
Google Collab & Python
50 pages
More On Pandas
No ratings yet
More On Pandas
47 pages
Important Questions
No ratings yet
Important Questions
4 pages
Object Detection and Identification A Project Report: November 2019
No ratings yet
Object Detection and Identification A Project Report: November 2019
45 pages
Day01 - Welcome To Data Science Fundamental
No ratings yet
Day01 - Welcome To Data Science Fundamental
30 pages
SELFI Id Match
No ratings yet
SELFI Id Match
17 pages
NumPy Essentials - Sample Chapter
50% (2)
NumPy Essentials - Sample Chapter
16 pages
Python FFT Filters
No ratings yet
Python FFT Filters
12 pages
Report Horn Detection
No ratings yet
Report Horn Detection
14 pages
1.1. Scientific Computing With Tools and Workflow: 1.1.1. Why Python?
No ratings yet
1.1. Scientific Computing With Tools and Workflow: 1.1.1. Why Python?
8 pages
An Exception Is An Error or Unexpected Event That Occurs During Execution of A Program
No ratings yet
An Exception Is An Error or Unexpected Event That Occurs During Execution of A Program
3 pages
Python Scientific
No ratings yet
Python Scientific
146 pages
20191120122749-Data Science Certification Training
No ratings yet
20191120122749-Data Science Certification Training
4 pages
1 Introduction Python Programming For Data Science
No ratings yet
1 Introduction Python Programming For Data Science
11 pages
Coding For Kids Python - A Comprehensive Guide That Can Teach Children To Code With Simple Methods
100% (10)
Coding For Kids Python - A Comprehensive Guide That Can Teach Children To Code With Simple Methods
45 pages
Sachin Shastri Resume
No ratings yet
Sachin Shastri Resume
1 page
Python for Data Science: Data Science Mastery by Nikhil Khan, #1
From Everand
Python for Data Science: Data Science Mastery by Nikhil Khan, #1
Nikhil Khan
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.