
RECORD NOTE BOOK

Name :…………………..………………..

Register No :…………….……………………….

Subject Code/Title: ………………………………………

Year/Semester : …….……………………………….
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

[CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING]


FOURTH SEMESTER

DHANALAKSHMI SRINIVASAN COLLEGE OF ENGINEERING
COIMBATORE-641105
BONAFIDE CERTIFICATE

Certified that this is the bonafide record of the work done by
Mr./Ms.…………………………………….……. in CS3491 - ARTIFICIAL INTELLIGENCE AND
MACHINE LEARNING of this institution, as prescribed by Anna University, Chennai,
for the Fourth Semester of B.E. Computer Science and Engineering during the
period March 2024 to May 2024.

Staff-In-Charge Head of the Department

Submitted for the University Practical Examination held on ……………………..


at Dhanalakshmi Srinivasan College of Engineering, Coimbatore-641105.

Register Number:

INTERNAL EXAMINER EXTERNAL EXAMINER


TABLE OF CONTENTS

Sl. No.   Date   Name of the Experiment                                         Page No.   Marks   Faculty Signature

1.               Implementation of Uninformed search algorithms (BFS, DFS)
2.               Implementation of Informed search algorithms (A*, memory-bounded A*)
3.               Implement naïve Bayes models
4.               Implement Bayesian Networks
5.               Build Regression models
6.               Build decision trees and random forests
7.               Build SVM models
8.               Implement ensembling techniques
9.               Implement clustering algorithms
10.              Implement EM for Bayesian networks
11.              Build simple NN models
12.              Build deep learning NN models
Ex. No: 1        Implementation of Uninformed Search Algorithms (BFS, DFS)
Date:

AIM:
To implement uninformed search algorithms such as BFS and DFS.

ALGORITHM (BFS):

Step 1: SET STATUS = 1 (ready state) for each node in G.
Step 2: Enqueue the starting node A and set its STATUS = 2 (waiting state).
Step 3: Repeat Steps 4 and 5 until the QUEUE is empty.
Step 4: Dequeue a node N. Process it and set its STATUS = 3 (processed state).
Step 5: Enqueue all the neighbours of N that are in the ready state (whose STATUS = 1) and set their STATUS = 2 (waiting state).
[END OF LOOP]
Step 6: EXIT

ALGORITHM (DFS):

Step 1: SET STATUS = 1 (ready state) for each node in G.
Step 2: Push the starting node A on the stack and set its STATUS = 2 (waiting state).
Step 3: Repeat Steps 4 and 5 until the STACK is empty.
Step 4: Pop the top node N. Process it and set its STATUS = 3 (processed state).
Step 5: Push on the stack all the neighbours of N that are in the ready state (whose STATUS = 1) and set their STATUS = 2 (waiting state).
[END OF LOOP]
Step 6: EXIT

PROGRAM (BFS):

from collections import defaultdict


class Graph:
    def __init__(self):
        # adjacency list representation of a directed graph
        self.graph = defaultdict(list)

    def addEdge(self, u, v):
        self.graph[u].append(v)

    def BFS(self, s):
        # mark all vertices as not visited
        visited = [False] * (len(self.graph))
        queue = []
        queue.append(s)
        visited[s] = True
        while queue:
            # dequeue a vertex and print it
            s = queue.pop(0)
            print(s, end=" ")
            # enqueue all unvisited neighbours of the dequeued vertex
            for i in self.graph[s]:
                if visited[i] == False:
                    queue.append(i)
                    visited[i] = True


g = Graph()
g.addEdge(0, 1)
g.addEdge(0, 2)
g.addEdge(1, 2)
g.addEdge(2, 0)
g.addEdge(2, 3)
g.addEdge(3, 3)

print("Following is Breadth First Traversal (starting from vertex 2)")
g.BFS(2)

OUTPUT(BFS):

Following is Breadth First Traversal (starting from vertex 2)


2 0 3 1

PROGRAM(DFS):

from collections import defaultdict


class Graph:
    def __init__(self):
        # adjacency list representation of a directed graph
        self.graph = defaultdict(list)

    def addEdge(self, u, v):
        self.graph[u].append(v)

    def DFSUtil(self, v, visited):
        # mark the current node as visited and print it
        visited.add(v)
        print(v, end=' ')
        # recur for all unvisited neighbours
        for neighbour in self.graph[v]:
            if neighbour not in visited:
                self.DFSUtil(neighbour, visited)

    def DFS(self, v):
        visited = set()
        self.DFSUtil(v, visited)


if __name__ == "__main__":
    g = Graph()
    g.addEdge(0, 1)
    g.addEdge(0, 2)
    g.addEdge(1, 2)
    g.addEdge(2, 0)
    g.addEdge(2, 3)
    g.addEdge(3, 3)
    print("Following is Depth First Traversal (starting from vertex 2)")
    g.DFS(2)
OUTPUT(DFS):

Following is Depth First Traversal (starting from vertex 2)


2 0 1 3

RESULT:

Thus the uninformed search algorithms BFS and DFS have been executed
successfully and the output verified.
Ex. no: 2 Implementation of Informed search algorithm (A*)
Date:

AIM:

To implement the informed search using A* algorithm.

ALGORITHM:

Step 1: Initialize the open list.
Step 2: Initialize the closed list; put the starting node on the open list.
Step 3: While the open list is not empty:
Step 4: Find the node with the least f on the open list, call it "q".
        a) Pop q off the open list.
        b) Generate q's successors and set their parents to q. For each successor:
Step 5: If the successor is the goal, stop the search; else compute both g and h for the successor:
        successor.g = q.g + distance between successor and q
        successor.h = distance from goal to successor
        successor.f = successor.g + successor.h
Step 6: If a node with the same position as the successor is in the OPEN list and has a lower f than the successor, skip this successor.
Step 7: If a node with the same position as the successor is in the CLOSED list and has a lower f than the successor, skip this successor; otherwise, add the node to the open list. [END OF FOR LOOP]
Step 8: Push q on the closed list. [END OF WHILE LOOP]

PROGRAM:

def aStarAlgo(start_node, stop_node):
    open_set = set(start_node)
    closed_set = set()
    g = {}                 # distance from the starting node
    parents = {}           # parent map used to reconstruct the path
    g[start_node] = 0
    parents[start_node] = start_node
    while len(open_set) > 0:
        n = None
        # choose the node with the lowest f() = g() + heuristic()
        for v in open_set:
            if n == None or g[v] + heuristic(v) < g[n] + heuristic(n):
                n = v
        if n == stop_node or Graph_nodes[n] == None:
            pass
        else:
            for (m, weight) in get_neighbors(n):
                # nodes not in open or closed set are added to the open set
                if m not in open_set and m not in closed_set:
                    open_set.add(m)
                    parents[m] = n
                    g[m] = g[n] + weight
                else:
                    # update parent if a shorter path to m is found
                    if g[m] > g[n] + weight:
                        g[m] = g[n] + weight
                        parents[m] = n
                        if m in closed_set:
                            closed_set.remove(m)
                            open_set.add(m)
        if n == None:
            print('Path does not exist!')
            return None
        # reconstruct the path once the goal is reached
        if n == stop_node:
            path = []
            while parents[n] != n:
                path.append(n)
                n = parents[n]
            path.append(start_node)
            path.reverse()
            print('Path found: {}'.format(path))
            return path
        open_set.remove(n)
        closed_set.add(n)
    print('Path does not exist!')
    return None


def get_neighbors(v):
    if v in Graph_nodes:
        return Graph_nodes[v]
    else:
        return None


def heuristic(n):
    H_dist = {
        'A': 11,
        'B': 6,
        'C': 5,
        'D': 7,
        'E': 3,
        'F': 6,
        'G': 5,
        'H': 3,
        'I': 1,
        'J': 0
    }
    return H_dist[n]


Graph_nodes = {
    'A': [('B', 6), ('F', 3)],
    'B': [('A', 6), ('C', 3), ('D', 2)],
    'C': [('B', 3), ('D', 1), ('E', 5)],
    'D': [('B', 2), ('C', 1), ('E', 8)],
    'E': [('C', 5), ('D', 8), ('I', 5), ('J', 5)],
    'F': [('A', 3), ('G', 1), ('H', 7)],
    'G': [('F', 1), ('I', 3)],
    'H': [('F', 7), ('I', 2)],
    'I': [('E', 5), ('G', 3), ('H', 2), ('J', 3)],
}

aStarAlgo('A', 'J')

OUTPUT:

Path found: ['A', 'F', 'G', 'I', 'J']

RESULT:

Thus the program to implement the informed search algorithm (A*) has been executed
successfully and the output verified.
Ex.no: 3
Implement Naïve Bayes models
Date:

AIM:

To diagnose heart patients and predict disease using the heart disease dataset with the
Naïve Bayes Classifier algorithm.

ALGORITHM:

Step 1: Read the training dataset T.
Step 2: Calculate the mean and standard deviation of the predictor variables in each class.
Step 3: Repeat: calculate the probability of fi using the Gaussian density equation in each class,
until the probability of all predictor variables (f1, f2, f3, ..., fn) has been calculated.
Step 4: Calculate the likelihood for each class.
Step 5: Get the greatest likelihood.

PROGRAM:

NB_from_scratch.py

import csv
import numpy as np
from sklearn.metrics import confusion_matrix, f1_score, roc_curve, auc
import matplotlib.pyplot as plt
from itertools import cycle
from scipy import interp
import warnings
import random
import math

# convert txt file to csv


with open('heartdisease.txt', 'r') as in_file:
    stripped = (line.strip() for line in in_file)
    lines = (line.split(",") for line in stripped if line)
    with open('heartdisease.csv', 'w', newline='') as out_file:
        writer = csv.writer(out_file)
        writer.writerow(('age', 'sex', 'cp', 'restbp', 'chol', 'fbs', 'restecg',
                         'thalach', 'exang', 'oldpeak', 'slope', 'ca', 'thal', 'num'))
        writer.writerows(lines)

warnings.filterwarnings("ignore")
# Example of Naive Bayes implemented from Scratch in Python

# calculating mean of column values belonging to one class


def mean(columnvalues):
    s = 0
    n = float(len(columnvalues))
    for i in range(len(columnvalues)):
        s = s + float(columnvalues[i])
    return s / n

# calculating standard deviation of column values belonging to one class


def stdev(columnvalues):
    avg = mean(columnvalues)
    s = 0.0
    num = len(columnvalues)
    for i in range(num):
        s = s + pow(float(columnvalues[i]) - avg, 2)
    variance = s / (float(num - 1))
    return math.sqrt(variance)

# Reading CSV file


filename = 'heartdisease.csv'
lines = csv.reader(open(filename, "r"))
dataset = list(lines)
dataset = dataset[1:]  # skip the header row
for i in range(len(dataset)):
    dataset[i] = [float(x) for x in dataset[i]]

for z in range(5):
    print("\n\n\nTest Train Split no. ", z + 1, "\n\n\n")
    trainsize = int(len(dataset) * 0.75)
    trainset = []
    testset = list(dataset)
    for i in range(trainsize):
        index = random.randrange(len(testset))
        trainset.append(testset.pop(index))

# separate the training rows according to class
classlist = {}
for i in range(len(trainset)):
    class_num = float(trainset[i][-1])
    row = trainset[i]
    if (class_num not in classlist):
        classlist[class_num] = []
    classlist[class_num].append(row)

# preparing data class wise: (mean, stdev) of every predictor column per class
class_data = {}
for class_num, rows in classlist.items():
    class_datarow = [(mean(columnvalues), stdev(columnvalues)) for columnvalues in zip(*rows)]
    class_datarow = class_datarow[0:13]  # drop the statistics of the class column itself
    class_data[class_num] = class_datarow

# Getting test vector


y_test = []
for j in range(len(testset)):
y_test.append(testset[j][-1])

# Getting prediction vector


y_pred = []
for i in range(len(testset)):
    class_probability = {}
    for class_num, row in class_data.items():
        class_probability[class_num] = 1
        for j in range(len(row)):
            calculated_mean, calculated_dev = row[j]
            x = float(testset[i][j])
            if (calculated_dev != 0):
                # Gaussian probability density of x under this class
                power = math.exp(-(math.pow(x - calculated_mean, 2) /
                                   (2 * math.pow(calculated_dev, 2))))
                probability = (1 / (math.sqrt(2 * math.pi) * calculated_dev)) * power
                class_probability[class_num] *= probability

    # pick the class with the greatest likelihood
    resultant_class, max_prob = -1, -1
    for class_num, probability in class_probability.items():
        if resultant_class == -1 or probability > max_prob:
            max_prob = probability
            resultant_class = class_num

    y_pred.append(resultant_class)

# Getting Accuracy
count = 0
for i in range(len(testset)):
if testset[i][-1] == y_pred[i]:
count += 1
accuracy = (count / float(len(testset))) * 100.0
print("\n\n Accuracy: ", accuracy, "%")

y1 = [float(k) for k in y_test]


y_pred1 = [float(k) for k in y_pred]
print("\n\n\n\nConfusion Matrix")
cf_matrix = confusion_matrix(y1, y_pred1)
print(cf_matrix)

print("\n\n\n\nF1 Score")
f_score = f1_score(y1, y_pred1, average='weighted')
print(f_score)

# Matrix from 1D array


y2 = np.zeros(shape=(len(y1), 5))
y3 = np.zeros(shape=(len(y_pred1), 5))
for i in range(len(y1)):
y2[i][int(y1[i])] = 1

for i in range(len(y_pred1)):
y3[i][int(y_pred1[i])] = 1

# ROC Curve generation


n_classes = 5

fpr = dict()
tpr = dict()
roc_auc = dict()
for i in range(n_classes):
    fpr[i], tpr[i], _ = roc_curve(y2[:, i], y3[:, i])
    roc_auc[i] = auc(fpr[i], tpr[i])

# Compute micro-average ROC curve and ROC area


fpr["micro"], tpr["micro"], _ = roc_curve(y2.ravel(), y3.ravel())
roc_auc["micro"] = auc(fpr["micro"], tpr["micro"])

# Compute macro-average ROC curve and ROC area


print("\n\n\n\nROC Curve")
# First aggregate all false positive rates
lw = 2
all_fpr = np.unique(np.concatenate([fpr[i] for i in range(n_classes)]))

# Then interpolate all ROC curves at this points


mean_tpr = np.zeros_like(all_fpr)
for i in range(n_classes):
mean_tpr += interp(all_fpr, fpr[i], tpr[i])

# Finally average it and compute AUC
mean_tpr /= n_classes

fpr["macro"] = all_fpr
tpr["macro"] = mean_tpr
roc_auc["macro"] = auc(fpr["macro"], tpr["macro"])

# Plot all ROC curves
plt.figure()
plt.plot(fpr["micro"], tpr["micro"],
         label='micro-average (area = {0:0.2f})'.format(roc_auc["micro"]),
         color='deeppink', linestyle=':', linewidth=4)

plt.plot(fpr["macro"], tpr["macro"],
         label='macro-average (area = {0:0.2f})'.format(roc_auc["macro"]),
         color='navy', linestyle=':', linewidth=4)

colors = cycle(['aqua', 'darkorange', 'cornflowerblue', 'red', 'black'])
for i, color in zip(range(n_classes), colors):
    plt.plot(fpr[i], tpr[i], color=color, lw=lw,
             label='ROC of class {0} (area = {1:0.2f})'.format(i, roc_auc[i]))

plt.plot([0, 1], [0, 1], 'k--', lw=lw)
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.05])
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Receiver operating characteristic for multi-class')
plt.legend(loc="lower right")
plt.savefig('Exp-8')
plt.show()

NB_from_Gaussian_Sklearn.py

import csv
import pandas as pd
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split
from sklearn import metrics
from sklearn.metrics import confusion_matrix, f1_score, roc_curve, auc
import matplotlib.pyplot as plt
from itertools import cycle
from scipy import interp

# converting txt file to csv file


with open('heartdisease.txt', 'r') as in_file:
stripped = (line.strip() for line in in_file)
lines = (line.split(",") for line in stripped if line)
with open('heartdisease.csv', 'w') as out_file:
writer = csv.writer(out_file)
writer.writerow(('age', 'sex', 'cp', 'restbp', 'chol', 'fbs', 'restecg',
'thalach', 'exang', 'oldpeak', 'slope', 'ca', 'thal', 'num'))
writer.writerows(lines)

# reading CSV using Pandas and storing in dataframe


df = pd.read_csv('heartdisease.csv', header=None)

training_x = df.iloc[1:df.shape[0], 0:13]
# print(training_set)

training_y = df.iloc[1:df.shape[0], 13:14]
# print(testing_set)

# converting dataframes into arrays
x = np.array(training_x)
y = np.array(training_y)

for z in range(5):
    print("\n\n\nTest Train Split no. ", z + 1, "\n\n\n")
    x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.25, random_state=None)

    # Gaussian Naive Bayes classifier from sklearn
    gnb = GaussianNB()
    gnb.fit(x_train, y_train.ravel())
    y_pred = gnb.predict(x_test)

    print("\n\nGaussian Naive Bayes model accuracy (in %):",
          metrics.accuracy_score(y_test, y_pred) * 100)

# convert 2D array to 1D array


y1 = y_test.ravel()
y_pred1 = y_pred.ravel()

print("\n\n\n\nConfusion Matrix")
cf_matrix = confusion_matrix(y1, y_pred1)
print(cf_matrix)

print("\n\n\n\nF1 Score")
f_score = f1_score(y1, y_pred1, average='weighted')
print(f_score)

# Matrix from 1D array


y2 = np.zeros(shape=(len(y1), 5))
y3 = np.zeros(shape=(len(y_pred1), 5))
for i in range(len(y1)):
y2[i][int(y1[i])] = 1

for i in range(len(y_pred1)):
y3[i][int(y_pred1[i])] = 1

# ROC Curve generation


n_classes = 5

fpr = dict()
tpr = dict()
roc_auc = dict()
for i in range(n_classes):
    fpr[i], tpr[i], _ = roc_curve(y2[:, i], y3[:, i])
    roc_auc[i] = auc(fpr[i], tpr[i])

# Compute micro-average ROC curve and ROC area


fpr["micro"], tpr["micro"], _ = roc_curve(y2.ravel(), y3.ravel())
roc_auc["micro"] = auc(fpr["micro"], tpr["micro"])

# Compute macro-average ROC curve and ROC area


print("\n\n\n\nROC Curve")
# First aggregate all false positive rates
lw = 2
all_fpr = np.unique(np.concatenate([fpr[i] for i in range(n_classes)]))

# Then interpolate all ROC curves at this points


mean_tpr = np.zeros_like(all_fpr)
for i in range(n_classes):
mean_tpr += interp(all_fpr, fpr[i], tpr[i])

# Finally average it and compute AUC
mean_tpr /= n_classes

fpr["macro"] = all_fpr
tpr["macro"] = mean_tpr
roc_auc["macro"] = auc(fpr["macro"], tpr["macro"])

# Plot all ROC curves
plt.figure()
plt.plot(fpr["micro"], tpr["micro"],
         label='micro-average (area = {0:0.2f})'.format(roc_auc["micro"]),
         color='deeppink', linestyle=':', linewidth=4)

plt.plot(fpr["macro"], tpr["macro"],
         label='macro-average (area = {0:0.2f})'.format(roc_auc["macro"]),
         color='navy', linestyle=':', linewidth=4)

colors = cycle(['aqua', 'darkorange', 'cornflowerblue', 'red', 'black'])
for i, color in zip(range(n_classes), colors):
    plt.plot(fpr[i], tpr[i], color=color, lw=lw,
             label='ROC of class {0} (area = {1:0.2f})'.format(i, roc_auc[i]))

plt.plot([0, 1], [0, 1], 'k--', lw=lw)
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.05])
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('Receiver operating characteristic for multi-class')
plt.legend(loc="lower right")
plt.show()

OUTPUT
RESULT:

Thus the program to diagnose heart patients and predict disease using the heart disease
dataset with the Naïve Bayes Classifier algorithm has been executed successfully and the
output verified.
Ex.no: 4
Implement Bayesian Networks
Date:

AIM:

To construct a Bayesian network to demonstrate the diagnosis of heart patients using
the standard Heart Disease Data Set.

ALGORITHM:

Step 1: Read the training dataset T.
Step 2: Calculate the mean and standard deviation of the predictor variables in each class.
Step 3: Repeat: calculate the probability of fi using the Gaussian density equation in each class,
until the probability of all predictor variables (f1, f2, f3, ..., fn) has been calculated.
Step 4: Calculate the likelihood for each class.
Step 5: Get the greatest likelihood.
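For intuition, the following minimal sketch (an illustration only, not part of the prescribed program) shows how one conditional probability table of the network could be estimated by counting with add-one (Dirichlet) smoothing; this is conceptually similar to what p_heartdisease.update() computes for the full heart-disease node in the program below. The helper estimate_cpt and the toy data are assumptions made for the example.

import numpy as np

def estimate_cpt(parent_values, child_values, n_parent_states, n_child_states):
    """Estimate P(child | parent) by counting, with add-one (Dirichlet) smoothing."""
    counts = np.ones((n_parent_states, n_child_states))   # Dirichlet(1) prior = add-one smoothing
    for p, c in zip(parent_values, child_values):
        counts[p, c] += 1
    return counts / counts.sum(axis=1, keepdims=True)     # normalize each row to a distribution

# toy data: cholesterol (0=High, 1=BorderLine, 2=Normal) -> heart disease (0=Yes, 1=No)
cholesterol = [0, 0, 1, 2, 2, 0, 1, 2]
heartdisease = [0, 0, 0, 1, 1, 1, 1, 1]
print(estimate_cpt(cholesterol, heartdisease, n_parent_states=3, n_child_states=2))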

PROGRAM:

import bayespy as bp
import numpy as np
import csv
from colorama import init
from colorama import Fore, Back, Style
init()

ageEnum = {'SuperSeniorCitizen':0, 'SeniorCitizen':1, 'MiddleAged':2, 'Youth':3, 'Teen':4}
genderEnum = {'Male':0, 'Female':1}
familyHistoryEnum = {'Yes':0, 'No':1}
dietEnum = {'High':0, 'Medium':1, 'Low':2}
lifeStyleEnum = {'Athlete':0, 'Active':1, 'Moderate':2, 'Sedetary':3}
cholesterolEnum = {'High':0, 'BorderLine':1, 'Normal':2}
heartDiseaseEnum = {'Yes':0, 'No':1}

with open('heart_disease_data.csv') as csvfile:
    lines = csv.reader(csvfile)
    dataset = list(lines)
    data = []
    for x in dataset:
        data.append([ageEnum[x[0]], genderEnum[x[1]], familyHistoryEnum[x[2]], dietEnum[x[3]],
                     lifeStyleEnum[x[4]], cholesterolEnum[x[5]], heartDiseaseEnum[x[6]]])
data = np.array(data)
N = len(data)

p_age = bp.nodes.Dirichlet(1.0*np.ones(5))
age = bp.nodes.Categorical(p_age, plates=(N,))
age.observe(data[:,0])
p_gender = bp.nodes.Dirichlet(1.0*np.ones(2))
gender = bp.nodes.Categorical(p_gender, plates=(N,))
gender.observe(data[:,1])

p_familyhistory = bp.nodes.Dirichlet(1.0*np.ones(2))
familyhistory = bp.nodes.Categorical(p_familyhistory, plates=(N,))
familyhistory.observe(data[:,2])

p_diet = bp.nodes.Dirichlet(1.0*np.ones(3))
diet = bp.nodes.Categorical(p_diet, plates=(N,))
diet.observe(data[:,3])

p_lifestyle = bp.nodes.Dirichlet(1.0*np.ones(4))
lifestyle = bp.nodes.Categorical(p_lifestyle, plates=(N,))
lifestyle.observe(data[:,4])

p_cholesterol = bp.nodes.Dirichlet(1.0*np.ones(3))
cholesterol = bp.nodes.Categorical(p_cholesterol, plates=(N,))
cholesterol.observe(data[:,5])

p_heartdisease = bp.nodes.Dirichlet(np.ones(2), plates=(5, 2, 2, 3, 4, 3))


heartdisease = bp.nodes.MultiMixture([age, gender, familyhistory, diet, lifestyle,
cholesterol], bp.nodes.Categorical, p_heartdisease)
heartdisease.observe(data[:,6])
p_heartdisease.update()

m = 0
while m == 0:
    print("\n")
    res = bp.nodes.MultiMixture([int(input('Enter Age: ' + str(ageEnum))),
                                 int(input('Enter Gender: ' + str(genderEnum))),
                                 int(input('Enter FamilyHistory: ' + str(familyHistoryEnum))),
                                 int(input('Enter dietEnum: ' + str(dietEnum))),
                                 int(input('Enter LifeStyle: ' + str(lifeStyleEnum))),
                                 int(input('Enter Cholesterol: ' + str(cholesterolEnum)))],
                                bp.nodes.Categorical,
                                p_heartdisease).get_moments()[0][heartDiseaseEnum['Yes']]
    print("Probability(HeartDisease) = " + str(res))
    m = int(input("Enter for Continue:0, Exit :1 "))

OUTPUT

Enter Age: {'SuperSeniorCitizen': 0, 'SeniorCitizen': 1, 'MiddleAged': 2, 'Youth': 3, 'Teen': 4}1


Enter Gender: {'Male': 0, 'Female': 1}0
Enter FamilyHistory: {'Yes': 0, 'No': 1}0
Enter dietEnum: {'High': 0, 'Medium': 1, 'Low': 2}2
Enter LifeStyle: {'Athlete': 0, 'Active': 1, 'Moderate': 2, 'Sedetary': 3}2
Enter Cholesterol: {'High': 0, 'BorderLine': 1, 'Normal': 2}1
Probability(HeartDisease) = 0.5
Enter for Continue:0, Exit :1 1

RESULT:

Thus the program to implement a Bayesian network on the given heart disease dataset
has been executed successfully and the output verified.
Ex.no: 5
Build Regression models
Date:

AIM:

To build regression models such as locally weighted linear regression and plot the
necessary graphs.

ALGORITHM:

Step 1: Read the given data sample into X and the target curve values (linear or non-linear).
Step 2: Set the value of the smoothening (free) parameter τ.
Step 3: Set the point of interest x0, which is a subset of X.
Step 4: Determine the weight matrix W, a diagonal matrix with w(i, i) = exp(-(x_i - x0)^2 / (2τ^2)).
Step 5: Determine the value of the model parameter β using β = (XᵀWX)⁻¹ XᵀWY.
Step 6: Prediction = x0 * β.
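As a quick illustration of Steps 4 to 6, the short sketch below (for illustration only; the query point x0 and bandwidth tau are made-up values) computes the locally weighted estimate at a single point using the closed-form normal equations. The lowess program that follows implements an iterative, robustified variant of the same idea.

import numpy as np

def lwr_predict(x, y, x0, tau=0.5):
    """Locally weighted linear regression at a single query point x0 (illustrative sketch)."""
    X = np.column_stack([np.ones_like(x), x])         # design matrix with a bias column
    w = np.exp(-(x - x0) ** 2 / (2 * tau ** 2))       # Gaussian weights centred on x0
    W = np.diag(w)                                    # diagonal weight matrix
    beta = np.linalg.pinv(X.T @ W @ X) @ X.T @ W @ y  # beta = (X'WX)^-1 X'Wy
    return np.array([1.0, x0]) @ beta                 # prediction at x0

x = np.linspace(0, 2 * np.pi, 100)
y = np.sin(x) + 0.3 * np.random.randn(100)
print(lwr_predict(x, y, x0=1.5))                      # estimate of sin(1.5) plus noise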

PROGRAM:

from math import ceil


import numpy as np
from scipy import linalg

def lowess(x, y, f, iterations):
    n = len(x)
    r = int(ceil(f * n))
    # distance to the r-th nearest neighbour of each point
    h = [np.sort(np.abs(x - x[i]))[r] for i in range(n)]
    w = np.clip(np.abs((x[:, None] - x[None, :]) / h), 0.0, 1.0)
    w = (1 - w ** 3) ** 3          # tricube weights
    yest = np.zeros(n)
    delta = np.ones(n)
    for iteration in range(iterations):
        for i in range(n):
            weights = delta * w[:, i]
            b = np.array([np.sum(weights * y), np.sum(weights * y * x)])
            A = np.array([[np.sum(weights), np.sum(weights * x)],
                          [np.sum(weights * x), np.sum(weights * x * x)]])
            beta = linalg.solve(A, b)
            yest[i] = beta[0] + beta[1] * x[i]

        # robustifying weights based on the residuals
        residuals = y - yest
        s = np.median(np.abs(residuals))
        delta = np.clip(residuals / (6.0 * s), -1, 1)
        delta = (1 - delta ** 2) ** 2

    return yest

import math

n = 100
x = np.linspace(0, 2 * math.pi, n)
y = np.sin(x) + 0.3 * np.random.randn(n)
f = 0.25
iterations = 3
yest = lowess(x, y, f, iterations)

import matplotlib.pyplot as plt


plt.plot(x,y,"r.")
plt.plot(x,yest,"b-")

OUTPUT

RESULT:

Thus the program to implement the non-parametric Locally Weighted Regression
algorithm, fitting data points with a graph visualization, has been executed
successfully.
Ex. No: 6        Build decision trees and random forests
Date:

AIM:

To implement the concept of decision trees with a suitable dataset from real-world
problems using the CART algorithm.

ALGORITHM:

Step 1: Begin with the original set S as the root node.
Step 2: On each iteration, iterate through every unused attribute of the set S and calculate the Gini index of this attribute.
Step 3: The Gini index works with a categorical target variable ("Success" or "Failure") and performs only binary splits.
Step 4: The set S is then split by the selected attribute to produce subsets of the data.
Step 5: The algorithm continues to recur on each subset, considering only attributes never selected before.
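As a small illustration of the Gini index used in Steps 2 and 3, the sketch below (illustrative only; the toy labels are assumptions) computes the Gini impurity of a binary split by hand. sklearn's DecisionTreeClassifier in the program below performs the equivalent computation internally.

from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels: 1 - sum(p_k^2)."""
    total = len(labels)
    return 1.0 - sum((count / total) ** 2 for count in Counter(labels).values())

def gini_of_split(left_labels, right_labels):
    """Weighted Gini impurity of a binary split."""
    n = len(left_labels) + len(right_labels)
    return (len(left_labels) / n) * gini(left_labels) + (len(right_labels) / n) * gini(right_labels)

# toy example: splitting 10 samples on some hypothetical attribute
print(gini(["yes"] * 5 + ["no"] * 5))                             # 0.5 (maximally impure)
print(gini_of_split(["yes"] * 4 + ["no"], ["no"] * 4 + ["yes"]))  # lower value means a better split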

PROGRAM:

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

data = pd.read_csv('/Users/ganesh/PycharmProjects/DecisionTree/Social_Network_Ads.csv')
data.head()

feature_cols = ['Age', 'EstimatedSalary']


x = data.iloc[:, [2, 3]].values
y = data.iloc[:, 4].values

from sklearn.model_selection import train_test_split


x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.25, random_state=0)

from sklearn.preprocessing import StandardScaler


sc_x = StandardScaler()
x_train = sc_x.fit_transform(x_train)
x_test = sc_x.transform(x_test)

from sklearn.tree import DecisionTreeClassifier

classifier = DecisionTreeClassifier()
classifier = classifier.fit(x_train, y_train)
y_pred = classifier.predict(x_test)

from sklearn import metrics
print('Accuracy Score:', metrics.accuracy_score(y_test, y_pred))

from sklearn.metrics import confusion_matrix


cm = confusion_matrix(y_test, y_pred)
print(cm)

from matplotlib.colors import ListedColormap


x_set, y_set = x_test, y_test

x1, x2 = np.meshgrid(np.arange(start=x_set[:, 0].min()-1, stop=x_set[:, 0].max()+1, step=0.01),
                     np.arange(start=x_set[:, 1].min()-1, stop=x_set[:, 1].max()+1, step=0.01))
plt.contourf(x1, x2, classifier.predict(np.array([x1.ravel(), x2.ravel()]).T).reshape(x1.shape),
             alpha=0.75, cmap=ListedColormap(("red", "green")))
plt.xlim(x1.min(), x1.max())
plt.ylim(x2.min(), x2.max())
for i, j in enumerate(np.unique(y_set)):
    plt.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                c=ListedColormap(("red", "green"))(i), label=j)

plt.title("Decision Tree (Test set)")
plt.xlabel("Age")
plt.ylabel("Estimated Salary")
plt.legend()
plt.show()

from sklearn.tree import export_graphviz


from six import StringIO
from IPython.display import Image
import pydotplus

dot_data = StringIO()
export_graphviz(classifier, out_file=dot_data, filled=True, rounded=True,
special_characters=True, feature_names=feature_cols, class_names=['0', '1'])
graph = pydotplus.graph_from_dot_data(dot_data.getvalue())
Image(graph.write_png('decisiontree.png'))

classifier = DecisionTreeClassifier(criterion="gini", max_depth=3)
classifier = classifier.fit(x_train, y_train)
y_pred = classifier.predict(x_test)
print("Accuracy:", metrics.accuracy_score(y_test, y_pred))

dot_data = StringIO()
export_graphviz(classifier, out_file=dot_data, filled=True, rounded=True,
special_characters=True, feature_names=feature_cols, class_names=['0', '1'])
graph = pydotplus.graph_from_dot_data(dot_data.getvalue())
Image(graph.write_png('opt_decisiontree_gini.png'))
OUTPUT:
Output of decision tree without pruning:

Optimized output of decision tree using Gini Index (CART):

RESULT
Thus the program to implement the concept of decision trees with a suitable
dataset from real-world problems using the CART algorithm has been executed
successfully.
Ex.no: 7
Build SVM models.
Date:

AIM:

To create a machine learning model which classifies the Spam and Ham E-Mails from
a given dataset using Support Vector Machine algorithm.

ALGORITHM:
Step 1: Import all the necessary libraries.
Step 2: Read the given CSV file, which contains e-mails that are both spam and ham.
Step 3: Gather all the words given in the dataset and identify the stop words and their distribution.
Step 4: Create an ML model using the Support Vector Classifier after splitting the dataset into training and test sets.
Step 5: Display the accuracy and F1 score and print the confusion matrix for the classification of spam and ham.

PROGRAM:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import string
from nltk.corpus import stopwords
import os
from wordcloud import WordCloud, STOPWORDS, ImageColorGenerator
from PIL import Image
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import roc_curve, auc
from sklearn import metrics
from sklearn import model_selection
from sklearn import svm
from nltk import word_tokenize
from sklearn.metrics import roc_auc_score
from matplotlib import pyplot
from sklearn.metrics import plot_confusion_matrix

class data_read_write(object):
    def __init__(self, file_link):
        self.data_frame = pd.read_csv(file_link)

    def read_csv_file(self, file_link):
        return self.data_frame

    def write_to_csvfile(self, file_link):
        self.data_frame.to_csv(file_link, encoding='utf-8', index=False, header=True)
        return

class generate_word_cloud(data_read_write):
    def __init__(self):
        pass

    def variance_column(self, data):
        return np.var(data)

    def word_cloud(self, data_frame_column, output_image_file):
        text = " ".join(review for review in data_frame_column)
        stopwords = set(STOPWORDS)
        stopwords.update(["subject"])
        wordcloud = WordCloud(width=1200, height=800, stopwords=stopwords,
                              max_font_size=50, margin=0,
                              background_color="white").generate(text)
        plt.imshow(wordcloud, interpolation='bilinear')
        plt.axis("off")
        plt.savefig("Distribution.png")
        plt.show()
        wordcloud.to_file(output_image_file)
        return

class data_cleaning(data_read_write):
    def __init__(self):
        pass

    def message_cleaning(self, message):
        # remove punctuation and stop words from a single message
        Test_punc_removed = [char for char in message if char not in string.punctuation]
        Test_punc_removed_join = ''.join(Test_punc_removed)
        Test_punc_removed_join_clean = [word for word in Test_punc_removed_join.split()
                                        if word.lower() not in stopwords.words('english')]
        final_join = ' '.join(Test_punc_removed_join_clean)
        return final_join

    def apply_to_column(self, data_column_text):
        data_processed = data_column_text.apply(self.message_cleaning)
        return data_processed

class apply_embeddding_and_model(data_read_write):
    def __init__(self):
        pass

    def apply_count_vector(self, v_data_column):
        vectorizer = CountVectorizer(min_df=2, analyzer="word", tokenizer=None,
                                     preprocessor=None, stop_words=None)
        return vectorizer.fit_transform(v_data_column)

    def apply_svm(self, X, y):
        X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
        params = {'kernel': 'linear', 'C': 2, 'gamma': 1}
        svm_cv = svm.SVC(C=params['C'], kernel=params['kernel'],
                         gamma=params['gamma'], probability=True)
        svm_cv.fit(X_train, y_train)
        y_predict_test = svm_cv.predict(X_test)
        cm = confusion_matrix(y_test, y_predict_test)
        sns.heatmap(cm, annot=True)
        print(classification_report(y_test, y_predict_test))
        print("test set")

        print("\nAccuracy Score: " + str(metrics.accuracy_score(y_test, y_predict_test)))
        print("F1 Score: " + str(metrics.f1_score(y_test, y_predict_test)))
        print("Recall: " + str(metrics.recall_score(y_test, y_predict_test)))
        print("Precision: " + str(metrics.precision_score(y_test, y_predict_test)))

        class_names = ['ham', 'spam']
        titles_options = [("Confusion matrix, without normalization", None),
                          ("Normalized confusion matrix", 'true')]
        for title, normalize in titles_options:
            disp = plot_confusion_matrix(svm_cv, X_test, y_test,
                                         display_labels=class_names,
                                         cmap=plt.cm.Blues,
                                         normalize=normalize)
            disp.ax_.set_title(title)
            print(title)
            print(disp.confusion_matrix)
        plt.savefig("SVM.png")
        plt.show()

        # ROC curve: compare the SVM against a no-skill baseline
        ns_probs = [0 for _ in range(len(y_test))]
        lr_probs = svm_cv.predict_proba(X_test)
        lr_probs = lr_probs[:, 1]
        ns_auc = roc_auc_score(y_test, ns_probs)
        lr_auc = roc_auc_score(y_test, lr_probs)
        print('No Skill: ROC AUC=%.3f' % (ns_auc))
        print('SVM: ROC AUC=%.3f' % (lr_auc))
        ns_fpr, ns_tpr, _ = roc_curve(y_test, ns_probs)
        lr_fpr, lr_tpr, _ = roc_curve(y_test, lr_probs)
        pyplot.plot(ns_fpr, ns_tpr, linestyle='--', label='No Skill')
        pyplot.plot(lr_fpr, lr_tpr, marker='.', label='SVM')
        pyplot.xlabel('False Positive Rate')
        pyplot.ylabel('True Positive Rate')
        pyplot.legend()
        pyplot.savefig("SVMMat.png")
        pyplot.show()
        return

data_obj = data_read_write("emails.csv")

data_frame = data_obj.read_csv_file("processed.csv")
data_frame.head()
data_frame.tail()
data_frame.describe()
data_frame.info()

data_frame.head()

data_frame.groupby('spam').describe()

data_frame['length'] = data_frame['text'].apply(len)
data_frame['length'].max()

sns.set(rc={'figure.figsize': (11.7, 8.27)})
ham_messages_length = data_frame[data_frame['spam'] == 0]
spam_messages_length = data_frame[data_frame['spam'] == 1]

ham_messages_length['length'].plot(bins=100, kind='hist', label='Ham')
spam_messages_length['length'].plot(bins=100, kind='hist', label='Spam')
plt.title('Distribution of Length of Email Text')
plt.xlabel('Length of Email Text')
plt.legend()

data_frame[data_frame['spam']==0].text.values

ham_words_length = [len(word_tokenize(title)) for title in
                    data_frame[data_frame['spam'] == 0].text.values]
spam_words_length = [len(word_tokenize(title)) for title in
                     data_frame[data_frame['spam'] == 1].text.values]
print(max(ham_words_length))
print(max(spam_words_length))

sns.set(rc={'figure.figsize': (11.7, 8.27)})
ax = sns.distplot(ham_words_length, norm_hist=True, bins=30, label='Ham')
ax = sns.distplot(spam_words_length, norm_hist=True, bins=30, label='Spam')
plt.title('Distribution of Number of Words')
plt.xlabel('Number of Words')
plt.legend()
plt.savefig("SVMGraph.png")
plt.show()

def mean_word_length(x):
    word_lengths = np.array([])
    for word in word_tokenize(x):
        word_lengths = np.append(word_lengths, len(word))
    return word_lengths.mean()

ham_meanword_length = data_frame[data_frame['spam'] == 0].text.apply(mean_word_length)
spam_meanword_length = data_frame[data_frame['spam'] == 1].text.apply(mean_word_length)

sns.distplot(ham_meanword_length, norm_hist = True, bins = 30, label = 'Ham')


sns.distplot(spam_meanword_length , norm_hist = True, bins = 30, label = 'Spam')
plt.title('Distribution of Mean Word Length')
plt.xlabel('Mean Word Length')
plt.legend()
plt.savefig("Graph.png")
plt.show()

from nltk.corpus import stopwords


stop_words = set(stopwords.words('english'))

def stop_words_ratio(x):
    num_total_words = 0
    num_stop_words = 0
    for word in word_tokenize(x):
        if word in stop_words:
            num_stop_words += 1
        num_total_words += 1
    return num_stop_words / num_total_words

ham_stopwords = data_frame[data_frame['spam'] == 0].text.apply(stop_words_ratio)
spam_stopwords = data_frame[data_frame['spam'] == 1].text.apply(stop_words_ratio)

sns.distplot(ham_stopwords, norm_hist=True, label='Ham')


sns.distplot(spam_stopwords, label='Spam')

print('Ham Mean: {:.3f}'.format(ham_stopwords.values.mean()))


print('Spam Mean: {:.3f}'.format(spam_stopwords.values.mean()))
plt.title('Distribution of Stop-word Ratio')
plt.xlabel('Stop Word Ratio')
plt.legend()

ham = data_frame[data_frame['spam']==0]
spam = data_frame[data_frame['spam']==1]
spam['length'].plot(bins=60, kind='hist')
ham['length'].plot(bins=60, kind='hist')
data_frame['Ham(0) and Spam(1)'] = data_frame['spam']
print( 'Spam percentage =', (len(spam) / len(data_frame) )*100,"%")
print( 'Ham percentage =', (len(ham) / len(data_frame) )*100,"%")
sns.countplot(data_frame['Ham(0) and Spam(1)'], label = "Count")

data_clean_obj = data_cleaning()
data_frame['clean_text'] = data_clean_obj.apply_to_column(data_frame['text'])

data_frame.head()

data_obj.data_frame.head()

data_obj.write_to_csvfile("processed_file.csv")

cv_object = apply_embeddding_and_model()
spamham_countvectorizer = cv_object.apply_count_vector(data_frame['clean_text'])
X = spamham_countvectorizer
label = data_frame['spam'].values
y = label
cv_object.apply_svm(X,y)

OUTPUT

              precision    recall  f1-score   support

           0       0.99      0.99      0.99       877
           1       0.98      0.97      0.98       269

    accuracy                           0.99      1146
   macro avg       0.99      0.98      0.99      1146
weighted avg       0.99      0.99      0.99      1146

test set
Accuracy Score: 0.9895287958115183
F1 Score: 0.9776119402985075
Recall: 0.9739776951672863
Precision: 0.9812734082397003
Normalized confusion matrix
[[0.99429875 0.00570125]
[0.0260223 0.9739777 ]]
RESULT:

Thus the program to create a machine learning model which classifies the
Spam and Ham e-mails from a given dataset using the Support Vector Machine
algorithm has been executed successfully.
Ex.no: 8 Implement Ensembling Techniques.
Date:

AIM:

To implement the ensembling technique of Blending with the given Alcohol QCM
Dataset.

ALGORITHM:

Step 1: Split the training dataset into train, test and validation dataset.
Step 2: Fit all the base models using train dataset.
Step 3: Make predictions on validation and test dataset.
Step 4: These predictions are used as features to build a second level model
Step 5: This model is used to make predictions on test and meta-features.

PROGRAM:

import pandas as pd
from sklearn.metrics import mean_squared_error
from sklearn.ensemble import RandomForestRegressor
import xgboost as xgb
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

df = pd.read_csv("train_data.csv")
target = df["target"]
train = df.drop("target", axis=1)

# 70% train, 20% validation, 10% test
train_ratio = 0.70
validation_ratio = 0.20
test_ratio = 0.10
x_train, x_test, y_train, y_test = train_test_split(
    train, target, test_size=1 - train_ratio)
x_val, x_test, y_val, y_test = train_test_split(
    x_test, y_test, test_size=test_ratio / (test_ratio + validation_ratio))

# level-0 (base) models
model_1 = LinearRegression()
model_2 = xgb.XGBRegressor()
model_3 = RandomForestRegressor()

model_1.fit(x_train, y_train)
val_pred_1 = pd.DataFrame(model_1.predict(x_val))
test_pred_1 = pd.DataFrame(model_1.predict(x_test))

model_2.fit(x_train, y_train)
val_pred_2 = pd.DataFrame(model_2.predict(x_val))
test_pred_2 = pd.DataFrame(model_2.predict(x_test))

model_3.fit(x_train, y_train)
val_pred_3 = pd.DataFrame(model_3.predict(x_val))
test_pred_3 = pd.DataFrame(model_3.predict(x_test))

# base-model predictions become extra features for the level-1 (meta) model
df_val = pd.concat([x_val, val_pred_1, val_pred_2, val_pred_3], axis=1)
df_test = pd.concat([x_test, test_pred_1, test_pred_2, test_pred_3], axis=1)

final_model = LinearRegression()
final_model.fit(df_val, y_val)
final_pred = final_model.predict(df_test)
print(mean_squared_error(y_test, final_pred))

OUTPUT

4790

RESULT:

Thus the program to implement the ensembling technique of Blending with the given
Alcohol QCM dataset has been executed successfully and the output verified.
Ex.no: 9 Implement Clustering Algorithms
Date:

AIM:

To implement the k-Nearest Neighbour algorithm to classify the Iris dataset.

ALGORITHM:

Step 1: Select the number K of the neighbors


Step 2: Calculate the Euclidean distance of K number of neighbors
Step 3: Take the K nearest neighbors as per the calculated Euclidean distance.
Step 4: Among these k neighbors, count the number of the data points in each
category.
Step 5: Assign the new data points to that category for which the number of the
neighbor is maximum.
Step 6: Our model is ready.
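The steps above can be illustrated with a tiny hand-written k-NN classifier (a sketch for illustration only, with made-up toy data); the program below uses sklearn's KNeighborsClassifier instead.

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3):
    """Classify one query point by majority vote among its k nearest neighbours."""
    distances = np.sqrt(((X_train - x_query) ** 2).sum(axis=1))  # Euclidean distances (Step 2)
    nearest = np.argsort(distances)[:k]                          # indices of the k closest points (Step 3)
    votes = Counter(y_train[nearest])                            # count labels in each category (Step 4)
    return votes.most_common(1)[0][0]                            # category with the maximum count (Step 5)

# toy usage with two classes in 2-D
X = np.array([[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [5.2, 4.8]])
y = np.array([0, 0, 1, 1])
print(knn_predict(X, y, np.array([1.1, 0.9])))   # expected: 0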

PROGRAM:

from sklearn.model_selection import train_test_split


from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report
from sklearn.metrics import confusion_matrix

import pandas as pd
import numpy as np
from sklearn import datasets

iris=datasets.load_iris()
iris_data=iris.data
iris_labels=iris.target

x_train, x_test, y_train, y_test=(train_test_split(iris_data, iris_labels, test_size=0.20))


classifier=KNeighborsClassifier(n_neighbors=6)
classifier.fit(x_train, y_train)
y_pred=classifier.predict(x_test)

print("accuracy is")
print(classification_report(y_test, y_pred))
OUTPUT
accuracy is
              precision    recall  f1-score   support

           0       1.00      1.00      1.00         9
           1       1.00      0.93      0.96        14
           2       0.88      1.00      0.93         7

    accuracy                           0.97        30
   macro avg       0.96      0.98      0.97        30
weighted avg       0.97      0.97      0.97        30

RESULT:

Thus the program to implement the k-Nearest Neighbour algorithm for clustering the Iris
dataset has been executed successfully and the output verified.
Ex.no: 10 Implement EM for Bayesian Networks.
Date:

AIM:

To implement the EM algorithm for clustering the given dataset.

ALGORITHM:

Step 1: Initialize θ randomly. Repeat Steps 2 and 3 until convergence.
Step 2: E-step: compute q(h) = P(H = h | E = e; θ) for each h (probabilistic inference), and create fully-observed weighted examples (h, e) with weight q(h).
Step 3: M-step: perform maximum likelihood estimation (count and normalize) on the weighted examples to get θ.
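To make the E-step/M-step loop concrete, here is a minimal hand-written EM iteration for a two-component one-dimensional Gaussian mixture (an illustrative sketch on synthetic data; the program below relies on sklearn's GaussianMixture, which runs the same procedure internally):

import numpy as np

rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(0, 1, 100), rng.normal(5, 1, 100)])  # toy 1-D data

# random initialisation of the mixture parameters θ = (pi, mu, sigma)
pi, mu, sigma = np.array([0.5, 0.5]), np.array([-1.0, 1.0]), np.array([1.0, 1.0])

for _ in range(50):
    # E-step: responsibility (posterior weight) of each component for each point
    dens = np.array([pi[k] / (sigma[k] * np.sqrt(2 * np.pi)) *
                     np.exp(-(x - mu[k]) ** 2 / (2 * sigma[k] ** 2)) for k in range(2)])
    resp = dens / dens.sum(axis=0)

    # M-step: re-estimate weights, means and variances from the weighted data
    Nk = resp.sum(axis=1)
    pi = Nk / len(x)
    mu = (resp * x).sum(axis=1) / Nk
    sigma = np.sqrt((resp * (x - mu[:, None]) ** 2).sum(axis=1) / Nk)

print(pi, mu, sigma)   # should approach [0.5, 0.5], [0, 5], [1, 1] (possibly in swapped order)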

PROGRAM:

from sklearn.cluster import KMeans


from sklearn import preprocessing
from sklearn.mixture import GaussianMixture
from sklearn.datasets import load_iris
import sklearn.metrics as sm
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

dataset=load_iris()
# print(dataset)

X=pd.DataFrame(dataset.data)
X.columns=['Sepal_Length','Sepal_Width','Petal_Length','Petal_Width']
y=pd.DataFrame(dataset.target)
y.columns=['Targets']
# print(X)

plt.figure(figsize=(14, 7))
colormap = np.array(['red', 'lime', 'black'])

# REAL PLOT
plt.subplot(1, 3, 1)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y.Targets], s=40)
plt.title('Real')

# K-MEANS PLOT
plt.subplot(1, 3, 2)
model = KMeans(n_clusters=3)
model.fit(X)
predY = np.choose(model.labels_, [0, 1, 2]).astype(np.int64)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[predY], s=40)
plt.title('KMeans')

# GMM PLOT
scaler = preprocessing.StandardScaler()
scaler.fit(X)
xsa = scaler.transform(X)
xs = pd.DataFrame(xsa, columns=X.columns)
gmm = GaussianMixture(n_components=3)
gmm.fit(xs)
y_cluster_gmm = gmm.predict(xs)
plt.subplot(1, 3, 3)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y_cluster_gmm], s=40)
plt.title('GMM Classification')

OUTPUT

RESULT:

Thus the program to implement the EM algorithm for clustering the given dataset
has been executed successfully and the output verified.
Ex.no: 11
Build simple NN models.
Date:

AIM:

To implement the neural network model for the given dataset.

ALGORITHM:

Step 1: Image Acquisition: acquire images of paper documents with the help of optical
scanners, so that an original image can be captured and stored.
Step 2: Pre-processing: optimize the noise level of the image and remove areas outside
the text.
Step 3: Pre-processing is especially vital for recognizing handwritten documents, which
are more sensitive to noise.
Step 4: Segmentation: group characters into meaningful chunks. There can be predefined
classes for characters, so images can be scanned for patterns that match the classes.
Step 5: Feature Extraction: split the input data into a set of features, that is, find the
essential characteristics that make one or another pattern recognizable.
Step 6: Train an MLP neural network: starting with the input layer, propagate the data
forward to the output layer (forward propagation).
Step 7: Based on the output, calculate the error (the difference between the predicted and
known outcome). The error needs to be minimized, so backpropagate the error.
Step 8: Find the derivative of the error with respect to each weight in the network, and
update the model.
Step 9: This stage is a process of refinement, as an OCR model can require some
corrections; it is not possible to achieve 100% recognition accuracy, since the
identification of characters heavily depends on the context.
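Steps 6 to 8 can be sketched with a single hidden-layer MLP trained by hand on a toy problem (an illustrative sketch only, with assumed XOR-style data and learning rate; the program below uses Keras, which performs these forward and backward passes automatically):

import numpy as np

# toy XOR-style data: 4 samples, 2 inputs, 1 binary output
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(1)
W1, b1 = rng.normal(size=(2, 8)), np.zeros(8)   # input -> hidden weights
W2, b2 = rng.normal(size=(8, 1)), np.zeros(1)   # hidden -> output weights
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

lr = 0.5
for epoch in range(5000):
    # Step 6: forward propagation
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)

    # Step 7: error at the output, backpropagated through the layers
    err = out - y                       # derivative of the squared error (up to a constant)
    d_out = err * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)

    # Step 8: gradient-descent weight updates
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0)

print(out.round(3))   # predictions should approach [0, 1, 1, 0]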

PROGRAM:

from __future__ import print_function
import numpy as np
import tensorflow as tf
from keras.models import Sequential
from keras.layers.core import Dense, Dropout, Activation
from keras.layers import Conv2D, MaxPooling2D, Flatten
from keras.optimizers import RMSprop, SGD, Adam
from keras.utils import np_utils
from emnist import list_datasets
from emnist import extract_training_samples
from emnist import extract_test_samples
import matplotlib
matplotlib.use('TkAgg')
import matplotlib.pyplot as plt

np.random.seed(1671)  # for reproducibility

# network and training parameters
NB_EPOCH = 30
BATCH_SIZE = 256
VERBOSE = 2
NB_CLASSES = 256  # number of outputs = number of classes
OPTIMIZER = Adam()
N_HIDDEN = 512
VALIDATION_SPLIT = 0.2  # how much of TRAIN is reserved for VALIDATION
DROPOUT = 0.20

print(list_datasets())
X_train, y_train = extract_training_samples('byclass')
print("train shape: ", X_train.shape)
print("train labels: ", y_train.shape)
X_test, y_test = extract_test_samples('byclass')
print("test shape: ", X_test.shape)
print("test labels: ", y_test.shape)

# for indexing from 0
y_train = y_train - 1
y_test = y_test - 1

RESHAPED = len(X_train[0]) * len(X_train[1])
X_train = X_train.reshape(len(X_train), RESHAPED)
X_test = X_test.reshape(len(X_test), RESHAPED)
X_train = X_train.astype('float32')
X_test = X_test.astype('float32')

# normalize
X_train /= 255
X_test /= 255
print(X_train.shape[0], 'train samples')
print(X_test.shape[0], 'test samples')

# convert class vectors to binary class matrices
Y_train = np_utils.to_categorical(y_train, NB_CLASSES)
Y_test = np_utils.to_categorical(y_test, NB_CLASSES)

# N_HIDDEN units in the first hidden layer, final stage is softmax
model = Sequential()
model.add(Dense(N_HIDDEN, input_shape=(RESHAPED,)))
model.add(Activation('relu'))
model.add(Dropout(DROPOUT))
model.add(Dense(256))
model.add(Activation('relu'))
model.add(Dropout(DROPOUT))
model.add(Dense(256))
model.add(Activation('relu'))
model.add(Dropout(DROPOUT))
model.add(Dense(256))
model.add(Activation('relu'))
model.add(Dropout(DROPOUT))
model.add(Dense(NB_CLASSES))
model.add(Activation('softmax'))
model.summary()

model.compile(loss='categorical_crossentropy',
              optimizer=OPTIMIZER,
              metrics=['accuracy'])

history = model.fit(X_train, Y_train,
                    batch_size=BATCH_SIZE, epochs=NB_EPOCH,
                    verbose=VERBOSE,
                    validation_split=VALIDATION_SPLIT)
score = model.evaluate(X_test, Y_test, verbose=VERBOSE)
print("\nTest score:", score[0])
print('Test accuracy:', score[1])

# list all data in history
print(history.history.keys())

# summarize history for accuracy
plt.plot(history.history['accuracy'])
plt.plot(history.history['val_accuracy'])
plt.title('model accuracy')
plt.ylabel('accuracy')
plt.xlabel('epoch')
plt.legend(['train', 'test'], loc='upper left')
plt.show()

# summarize history for loss
plt.plot(history.history['loss'])
plt.plot(history.history['val_loss'])
plt.title('model loss')
plt.ylabel('loss')
plt.xlabel('epoch')
plt.legend(['train', 'test'], loc='upper left')
plt.show()

OUTPUT:
['balanced', 'byclass', 'bymerge', 'digits', 'letters', 'mnist']
train shape: (697932, 28, 28)
train labels: (697932,)
test shape: (116323, 28, 28)
test labels: (116323,)
697932 train samples
116323 test samples
Model: "sequential"
Layer (type) Output Shape Param #
=================================================================
dense (Dense) (None, 512) 401920
activation (Activation) (None, 512) 0
dropout (Dropout) (None, 512) 0
dense_1 (Dense) (None, 256) 131328
activation_1 (Activation) (None, 256) 0
dropout_1 (Dropout) (None, 256) 0
dense_2 (Dense) (None, 256) 65792
activation_2 (Activation) (None, 256) 0
dropout_2 (Dropout) (None, 256) 0
dense_3 (Dense) (None, 256) 65792
activation_3 (Activation) (None, 256) 0
dropout_3 (Dropout) (None, 256) 0
dense_4 (Dense) (None, 256) 65792
activation_4 (Activation) (None, 256) 0
=================================================================
Total params: 730,624
Trainable params: 730,624
Non-trainable params: 0
RESULT:

Thus the program to implement the neural network model for the given dataset has been executed successfully and the output verified.
Ex.no: 12 Build deep learning NN models.
Date:

AIM:

To implement and build a Convolutional neural network model which predicts the age
and gender of a person using the given pre-trained models.

ALGORITHM:
Step 1: Choose the Dataset.
Step 2: Prepare the Dataset for training.
Step 3: Create training Data.
Step 4: Shuffle the Dataset.
Step 5: Assigning Labels and Features.
Step 6: Normalizing X and converting labels to categorical data.
Step 7: Split X and Y for use in CNN.
Step 8: Define, compile and train the CNN Model.
Step 9: Accuracy and Score of the model.
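Steps 7 and 8 ("define, compile and train the CNN model") can be sketched generically as follows (the 48x48x1 input shape, the two output classes and the random placeholder data are assumptions for illustration; the record's program below instead uses OpenCV's DNN module with pre-trained age and gender Caffe models):

import numpy as np
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout

# Step 8: define and compile a small CNN (input shape and class count are assumed)
model = Sequential([
    Conv2D(32, (3, 3), activation='relu', input_shape=(48, 48, 1)),
    MaxPooling2D((2, 2)),
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    Flatten(),
    Dense(128, activation='relu'),
    Dropout(0.5),
    Dense(2, activation='softmax'),   # e.g. two gender classes
])
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

# train on placeholder data just to show the call; X and Y would come from Steps 1-7
X = np.random.rand(32, 48, 48, 1)
Y = np.eye(2)[np.random.randint(0, 2, 32)]
model.fit(X, Y, epochs=1, batch_size=8, validation_split=0.25)
print(model.evaluate(X, Y))           # Step 9: accuracy and score of the model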

PROGRAM:

import cv2 as cv
import math
import time
from google.colab.patches import cv2_imshow


def getFaceBox(net, frame, conf_threshold=0.7):
    frameOpencvDnn = frame.copy()
    frameHeight = frameOpencvDnn.shape[0]
    frameWidth = frameOpencvDnn.shape[1]
    blob = cv.dnn.blobFromImage(frameOpencvDnn, 1.0, (300, 300), [104, 117, 123], True, False)
    net.setInput(blob)
    detections = net.forward()
    bboxes = []
    for i in range(detections.shape[2]):
        confidence = detections[0, 0, i, 2]
        if confidence > conf_threshold:
            x1 = int(detections[0, 0, i, 3] * frameWidth)
            y1 = int(detections[0, 0, i, 4] * frameHeight)
            x2 = int(detections[0, 0, i, 5] * frameWidth)
            y2 = int(detections[0, 0, i, 6] * frameHeight)
            bboxes.append([x1, y1, x2, y2])
            cv.rectangle(frameOpencvDnn, (x1, y1), (x2, y2), (0, 255, 0),
                         int(round(frameHeight / 150)), 8)
    return frameOpencvDnn, bboxes


faceProto = "/content/opencv_face_detector.pbtxt"
faceModel = "/content/opencv_face_detector_uint8.pb"
ageProto = "/content/age_deploy.prototxt"
ageModel = "/content/age_net.caffemodel"
genderProto = "/content/gender_deploy.prototxt"
genderModel = "/content/gender_net.caffemodel"

MODEL_MEAN_VALUES = (78.4263377603, 87.7689143744, 114.895847746)


ageList = ['(0-2)', '(4-6)', '(8-12)', '(15-20)', '(25-32)', '(38-43)', '(48-53)', '(60-100)']
genderList = ['Male', 'Female']

ageNet = cv.dnn.readNet(ageModel, ageProto)
genderNet = cv.dnn.readNet(genderModel, genderProto)
faceNet = cv.dnn.readNet(faceModel, faceProto)

padding = 20  # assumed margin (in pixels) around each detected face; not defined in the original listing

def age_gender_detector(frame):
    # Read frame
    t = time.time()
    frameFace, bboxes = getFaceBox(faceNet, frame)
    for bbox in bboxes:
        # crop the detected face with some padding around it
        face = frame[max(0, bbox[1] - padding):min(bbox[3] + padding, frame.shape[0] - 1),
                     max(0, bbox[0] - padding):min(bbox[2] + padding, frame.shape[1] - 1)]
        blob = cv.dnn.blobFromImage(face, 1.0, (227, 227), MODEL_MEAN_VALUES, swapRB=False)
        genderNet.setInput(blob)
        genderPreds = genderNet.forward()
        gender = genderList[genderPreds[0].argmax()]
        print("Gender : {}, conf = {:.3f}".format(gender, genderPreds[0].max()))
        ageNet.setInput(blob)
        agePreds = ageNet.forward()
        age = ageList[agePreds[0].argmax()]
        print("Age Output : {}".format(agePreds))
        print("Age : {}, conf = {:.3f}".format(age, agePreds[0].max()))
        label = "{},{}".format(gender, age)
        cv.putText(frameFace, label, (bbox[0], bbox[1] - 10), cv.FONT_HERSHEY_SIMPLEX,
                   0.8, (0, 255, 255), 2, cv.LINE_AA)
    return frameFace

from google.colab import files


uploaded = files.upload()
input = cv.imread("2.jpg")
output = age_gender_detector(input)
cv2_imshow(output)
OUTPUT

gender : Male, conf = 1.000


Age Output : [[2.8247703e-05 8.9249297e-05 3.0017464e-04 8.8183772e-03 9.3055397e-01
5.1735926e-02 7.6946630e-03 7.7927281e-04]]
Age: (25-32), conf = 0.873.

RESULT:
Thus the program to implement and build a Convolutional neural
network model which predicts the age and gender of a person using the
given pre-trained models has been executed successfully and the output
verified.
