ML Lab Manual
ML Lab Manual
NO
1 python program to compute Central Tendency
Measures :Mean, Median, Mode Measures of
Dispersion: variance ,standard Deviation
1
Program 1: python program to compute Central Tendency
Measures :Mean, Median, Mode Measures of Dispersion:
variance ,standard Deviation
def central_tendency_dispersion(data):
mean = stats.mean(data)
median = stats.median(data)
try:
mode = stats.mode(data)
except stats.StatisticsError:
# Measures of Dispersion
variance = stats.variance(data)
std_dev = stats.stdev(data)
# Display results
print(f"Mean: {mean}")
print(f"Median: {median}")
print(f"Mode: {mode}")
print(f"Variance: {variance}")
# Example data
2
data = [10, 15, 14, 10, 15, 18, 20, 25, 30]
central_tendency_dispersion(data)
OUTPUT:
Mean: 17.444444444444443
Median: 15
Mode: 10
Variance: 44.52777777777778
Standard Deviation: 6.672913739722534
3
2.Study of Python Basic Libraries such as Statistics, Math, Numpy and
Scipy
Python provides a wide range of basic libraries that are essential for various computational
tasks. These libraries offer functionality to handle statistical calculations, mathematical
operations, and scientific computing. Here is an overview:
Statistics Module
Used for statistical computations such as mean, median, mode, variance, etc.
Example
import statistics
data = [1, 2, 2, 3, 4]
print("Mean:", statistics.mean(data))
print("Median:", statistics.median(data))
print("Mode:", statistics.mode(data))
Math Module
import math
Numpy Library
Widely used for numerical computations with arrays, matrices, and linear algebra
functions.
Example:
import numpy as np
4
print("Mean of array:", np.mean(array))
Scipy Library
5
3. Study of Python Libraries for ML application such as Pandas and
Matplotlib
For machine learning and data analysis, Python libraries like Pandas and Matplotlib are
essential for data manipulation and visualization.
Pandas
Provides data structures like Series and DataFrame for handling and analyzing data
efficiently.
Example:
import pandas as pd
df = pd.DataFrame(data)
print(df)
Matplotlib
A visualization library used for creating static, interactive, and animated plots.
Example:
x = [1, 2, 3, 4, 5]
plt.xlabel("X-axis")
plt.ylabel("Y-axis")
plt.show()
6
Program 4:Python Program for Simple Linear Regression.
import numpy as np
np.random.seed(0)
X = 2 * np.random.rand(100, 1)
y = 4 + 3 * X + np.random.randn(100, 1)
model = LinearRegression()
model.fit(X_train, y_train)
# Make predictions
y_pred = model.predict(X_test)
r2 = r2_score(y_test, y_pred)
7
print(f"Mean Squared Error: {mse:.2f}")
print(f"R-squared: {r2:.2f}")
plt.xlabel("X")
plt.ylabel("y")
plt.legend()
plt.show()
OUTPUT:
8
program5: Implementation of Multiple Linear Regression for House
Pricing Pricing Prediction using sklearn
import numpy as np
import pandas as pd
data = pd.read_csv('house_prices.csv')
print(data.head())
y = data['Price']
X = X.fillna(X.mean())
y = y.fillna(y.mean())
9
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)
model = LinearRegression()
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
r2 = r2_score(y_test, y_pred)
print(f'R-squared: {r2}')
# Model coefficients
print("Intercept:", model.intercept_)
print("Coefficients:", model.coef_)
print(coefficients)
10
6. Implementation of Decision tree using sklearn and its parameter tuning
11
Importing necessary libraries
import numpy as np
import pandas as pd
data = load_iris()
X = data.data
y = data.target
clf = DecisionTreeClassifier(random_state=42)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
12
# Evaluate model performance
param_grid = {
# Fit GridSearchCV
grid_search.fit(X_train, y_train)
13
print("Best Parameters: ", grid_search.best_params_)
best_clf = grid_search.best_estimator_
y_pred_best = best_clf.predict(X_test)
OUTPUT:
accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30
14
# Import necessary libraries
iris = load_iris()
X = iris.data # Features
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)
# Make predictions
y_pred = knn.predict(X_test)
OUTPUT:
Accuracy: 100.00%
15
# Import necessary libraries
import numpy as np
# We'll use only two classes (binary classification) for logistic regression.
iris = load_iris()
X = iris.data
y = iris.target
# For binary classification, we'll select only two classes (e.g., class 0 and 1)
log_reg = LogisticRegression()
log_reg.fit(X_train, y_train)
16
# Make predictions on the test set
y_pred = log_reg.predict(X_test)
print("Accuracy:", accuracy)
OUTPUT:
Accuracy: 1.0
Confusion Matrix:
[[17 0]
[ 0 13]]
Classification Report:
accuracy 1.00 30
17
import numpy as np
kmeans.fit(X)
y_kmeans = kmeans.predict(X)
centers = kmeans.cluster_centers_
plt.title("K-Means Clustering")
18
plt.xlabel("Feature 1")
plt.ylabel("Feature 2")
plt.show()
OUTPUT:
19