Data science for Engineers

Department of Computer Science and Engineering

Indian Institute of Technology, Madras

Lecture - 47
K-nearest neighbours implementation in R

Hello all, welcome to this lecture on K-nearest neighbours implementation in R.

(Refer Slide Time: 00:23)

In this lecture, we will introduce a case study and use it to explain how to implement the knn
algorithm in R. We will start with the problem statement of the case study and then show how to
solve it using R.

In the process, we will show how to read data from a .csv file, how to understand the data that
is loaded into the R workspace, and how to implement the K-nearest neighbours algorithm in R
using the knn function. We will also talk about how to interpret the results that the knn
algorithm gives us.
(Refer Slide Time: 01:12)

Before we jump into the case study, let us review some key points from Prof. Raghu's previous
lecture. If you remember, knn is primarily used as a classification algorithm, and it is a
supervised learning algorithm, which means the data provided to you has to be labelled data.
knn is also a non-parametric method: no parameters of the classifier are extracted from the
data. And there is no explicit training phase involved in the knn algorithm.

knn is a lazy learning algorithm, because it does not do any computation until you ask it to
classify a point. Since we are dealing with nearest neighbours, the notion of distance is
important in this algorithm. The algorithm works by majority voting: given a test point, we
calculate its distance from all the data points in the given data, arrange the distances in
ascending order, and choose the k nearest neighbours. Each of these neighbours votes with its
own class label, and the test point is assigned the class that gets the most votes. That is
essentially how knn works.
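
The voting procedure just described can be sketched in a few lines of base R. This is only an illustration, not the implementation used later in the lecture: the function name knn_predict_one, the toy data and the choice of Euclidean distance are assumptions made for the example, and ties are simply resolved in favour of the first level.

```r
# Minimal k-nearest-neighbours classifier: Euclidean distance plus majority vote.
# The function name and the toy data are illustrative assumptions.
knn_predict_one <- function(train_x, train_labels, test_point, k) {
  # Subtract the test point from every training row, then take row-wise norms
  diffs <- sweep(as.matrix(train_x), 2, as.numeric(test_point))
  dists <- sqrt(rowSums(diffs^2))
  # Indices of the k closest training points (ascending distance)
  nearest <- order(dists)[seq_len(k)]
  # Majority vote among the labels of those neighbours
  votes <- table(train_labels[nearest])
  names(votes)[which.max(votes)]
}

train_x <- data.frame(x1 = c(1, 1.2, 0.8, 5, 5.5, 4.8),
                      x2 = c(1, 0.9, 1.1, 5, 4.9, 5.2))
train_labels <- factor(c("No", "No", "No", "Yes", "Yes", "Yes"))

pred_near_no  <- knn_predict_one(train_x, train_labels, c(1.1, 1.0), k = 3)
pred_near_yes <- knn_predict_one(train_x, train_labels, c(5.1, 5.0), k = 3)
print(c(pred_near_no, pred_near_yes))
```

Nothing is computed until a test point is supplied, which is what makes the method lazy.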

Now, let us define the problem statement of the case study. We have named it the automotive
service company case study.
(Refer Slide Time: 03:07)

Let us look at the problem statement. An automotive service chain is launching its grand new
service station this weekend. They offer service to a wide variety of cars. The current capacity
of the station is to check 315 cars thoroughly per day. As an inaugural offer, they claim to
check, free of charge, all the cars that arrive on their launch day, and to report whether each
car needs servicing or not.

Unexpectedly, they got 450 cars. Since they have the facility to test only 315 cars, they will
not be able to check all 450 cars thoroughly, and the service men will not work longer than the
normal working hours.

So, they have hired a data analyst to help them out of the situation. If you are the data
analyst hired by this automotive service station, how can you save the day for the new service
station? That is the problem statement.
(Refer Slide Time: 04:34)

Now, let us see how a data scientist can save the day for the service station. Since the station
has the capacity to thoroughly check 315 cars, they have thoroughly checked 315 cars and given
the data in the service train data csv file. For the rest of the 450 cars, they cannot do a
thorough check, so they have measured only those attributes which are easily measurable and given
them in the service test data csv file. So, essentially, the data scientist has training data
which contains a few attributes along with a label saying whether service is needed or not.

He also has data in which all the other attributes are present but the column saying whether
service is needed or not is missing. The question is how to use the service train data to say,
for the readings present in the service test data, whether service is needed or not. So, the
idea is to use the knn classification technique to classify the cars in the service test data
file, which cannot be tested manually, and say whether service is needed or not.

Now, let us see how to solve this case study in R.

(Refer Slide Time: 06:26)

First, you have to get things ready. By that I mean you have to set the working directory to the
directory in which the given data files are available. You can do that using the setwd command
with the corresponding path, or you can use the GUI option to set the working directory. The
next command here is used to clear all the variables in the environment of R. You can equally
use the brush button in the Environment/History pane to clear the variables in the workspace.

Another important thing: for this knn implementation, we need two external packages, caret and
class. You have to install the caret and class packages if you have not installed them already.
We have explained how to install packages in our R modules: you can install them from the
command window using the install.packages command with the package name and dependencies equal
to TRUE, or you can use the GUI. So, please install the caret and class packages. Once they are
installed, you can load them using the library command, as we have explained already. We will
see why these packages are important as we go along in this lecture.
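
The preparation steps above might look like the following sketch. The working-directory path is a placeholder, so that call is left commented out, and the install call is also commented out so that the snippet only reports what is missing; the variable names needed and missing_pkgs are assumptions made for illustration.

```r
# Clear all variables from the current workspace (same effect as the brush
# button in the Environment pane).
rm(list = ls())

# Set the working directory to the folder holding the data files.
# The path below is a placeholder, not a real location.
# setwd("path/to/your/data/folder")

# Check which of the two packages still need installing.
needed <- c("caret", "class")
installed <- vapply(needed, requireNamespace, logical(1), quietly = TRUE)
missing_pkgs <- needed[!installed]

if (length(missing_pkgs) > 0) {
  # Uncomment to install the missing packages:
  # install.packages(missing_pkgs, dependencies = TRUE)
  message("Not installed yet: ", paste(missing_pkgs, collapse = ", "))
} else {
  library(caret)  # provides confusionMatrix()
  library(class)  # provides knn()
}
```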

The caret library is for generating the confusion matrix, which Prof. Raghu would have talked
about when discussing the performance metrics of a classifier. The class library contains
several classification algorithms, and here we are going to use it for implementing knn. Now,
let us see how to read the data.

(Refer Slide Time: 08:32)

The data comes from the given files; in this case, it is provided in two files, as we have
already seen: the service train data csv file and the service test data csv file. To read the
data from the csv files, the function we use is read.csv. Let us look at what the read.csv
function takes and what it returns.
(Refer Slide Time: 08:59)

The read.csv function reads a file in table format and creates a data frame from it. The syntax
is as follows: read.csv takes the file name and the row names. Let us look at what the input
arguments file and row.names mean. file is the name of the file from which you want to read the
data. row.names is a vector of row names: it can be either a vector giving the actual row names,
or a single value specifying which column of the data set holds the row names.
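
To make the two forms of row.names concrete, here is a small self-contained sketch. It writes a tiny made-up CSV to a temporary file first; the column names and values are invented for the example and are not the course data.

```r
# Write a tiny CSV to a temporary file so the example is self-contained.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("CarId,OilQual,Service",
             "car1,26.7,No",
             "car2,71.4,Yes",
             "car3,55.0,No"), csv_path)

# Default call: rows are numbered and CarId is an ordinary column.
plain <- read.csv(csv_path)

# row.names as a single value: column 1 supplies the row names.
by_column <- read.csv(csv_path, row.names = 1)

# row.names as a vector: the actual row names are given directly.
by_vector <- read.csv(csv_path, row.names = c("a", "b", "c"))

print(class(plain))
print(rownames(by_column))
print(rownames(by_vector))
```

Note that when a column is used for the row names, it no longer appears among the data columns.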

Let us see how to read the data in this particular case.

(Refer Slide Time: 09:48)

As we have seen, the data has been given in two .csv files, and we can use the read.csv function
to read it. As we saw in the syntax of read.csv, we have to give the file name, so I give the
name of the service train data file from which I want to load the data. I am assigning the
result to a variable called service train. When you execute this command, it reads the data from
the service train data file and assigns it to this variable, which is a data frame.

Similarly, you read the data from the service test data file and assign it to the variable
service test, which is again a data frame. In the R environment, once you execute these
commands, you will see two data frames, service train and service test, which have 315
observations of 6 variables and 135 observations of 6 variables respectively.
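
A hedged sketch of this loading step follows. Because the course files are not available here, it first writes random stand-in data with the same shape to a temporary folder; the file names serviceTrainData.csv and serviceTestData.csv, the column names and the variable names ServiceTrain and ServiceTest are all assumptions, not taken from the slides.

```r
set.seed(42)

# Hypothetical stand-in for the course files: random readings, random labels.
make_cars <- function(n) {
  data.frame(OilQual     = runif(n, 0, 100),
             EnginePerf  = runif(n, 0, 100),
             NormMileage = runif(n, 0, 100),
             TyreWear    = runif(n, 0, 100),
             HVACwear    = runif(n, 0, 100),
             Service     = sample(c("No", "Yes"), n, replace = TRUE))
}
train_path <- file.path(tempdir(), "serviceTrainData.csv")  # assumed file name
test_path  <- file.path(tempdir(), "serviceTestData.csv")   # assumed file name
write.csv(make_cars(315), train_path, row.names = FALSE)
write.csv(make_cars(135), test_path, row.names = FALSE)

# stringsAsFactors = TRUE makes Service a factor. Since R 4.0.0 the default
# is FALSE, which would leave Service as a character column.
ServiceTrain <- read.csv(train_path, stringsAsFactors = TRUE)
ServiceTest  <- read.csv(test_path, stringsAsFactors = TRUE)

print(dim(ServiceTrain))
print(dim(ServiceTest))
```

With real files in the working directory, only the two read.csv calls would be needed.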

Remember why it is 315: that is the number of cars they can check thoroughly. Of the 6 variables
given for these 315 cars, 5 are the easily measurable attributes and one column says whether
service is needed or not. The 135 cars also have 6 variables: all 5 important attributes have
been measured, and the 6th attribute is also given here. We will see why the 6th attribute is
given as we go on in this lecture.

Now, let us see what is in the service train and service test data. One way to do that is to
use the View command.
(Refer Slide Time: 11:42)

The View command helps you see the data frames. For example, if you want to see what is in the
service train data frame, View of service train will show a table like this in your editor
environment. You can count the attributes: 1, 2, 3, 4, 5, 6. The first five are the attributes
measured for testing whether service is needed or not, and the last attribute says whether
service is needed or not.

Similarly, you can view the service test data set, which is shown here. For now, we will act as
if we do not know this last column, and we will come back to it later. If you observe, there are
135 entries which have not been thoroughly checked; only the 5 quantities were measured, and the
station wants to figure out whether service is needed or not using the knn algorithm. That is
the whole idea. So far, you have viewed what is in the service test and service train data sets.

The next question that comes to mind is whether there is any way to know the data types of the
attributes in service train and service test. Let us understand the data in a little more
detail.
(Refer Slide Time: 13:12)

What we have seen till now is that service train contains 315 observations of 6 variables and
service test contains 135 observations of 6 variables. The variables present in the data sets
are oil quality, engine performance, normal mileage, tyre wear, HVAC wear and service. As I
mentioned earlier, the first 5 attributes tell us about the condition of the car, and the last
attribute simply says whether service is needed or not.

The first five columns are the details about the car, and the last column is the label which
says whether service is needed or not. Now, let us ask this question: what are the data types of
each of these attributes, and how does one get them?

Since we have now understood the data, let us look at the structure of the data.
(Refer Slide Time: 14:11)

By the structure of the data, we mean which variables are in the data set and what their data
types are. The way you get the structure of data in R is using the str function. The str
function compactly displays the internal structure of an R object. Its syntax is as follows: str
takes one input argument, which is an object. This object is any R object about which you want
to have some information.

Now, let us see the structure of the two data frames we have read from the two .csv files.
(Refer Slide Time: 14:58)

You can see the structure of the service train data frame here. If you execute the command str
of service train, it gives the following information: service train is a data frame which
contains 315 observations of 6 variables. The variables are oil quality, engine performance and
so on. It says the data type of the first five attributes is numeric, and the last attribute,
service, is a factor with two levels, which means this attribute takes the values no or yes. The
1s and 2s shown are the integer codes of the entries: 1 corresponds to no, and 2 corresponds to
yes.
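
The relationship between factor levels and the integer codes shown by str can be checked on a small made-up data frame. The column names and values below are invented for illustration.

```r
cars <- data.frame(OilQual    = c(26.7, 71.4, 55.0),
                   EnginePerf = c(103.2, 88.1, 95.5),
                   Service    = factor(c("No", "Yes", "No")))

# Compact display: one line per variable with its type and first values.
str(cars)

# A factor stores integer codes that index into its levels.
print(levels(cars$Service))      # "No" "Yes"
print(as.integer(cars$Service))  # 1 2 1
```

Levels are sorted alphabetically by default, which is why no gets code 1 and yes gets code 2.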

Let us use the str command on the service test data and see what it has.
(Refer Slide Time: 15:54)

This is the output you see when you execute the command here. It says service test is also a
data frame, which contains 135 observations of 6 variables. These are the variables that are
available. The first 5 variables are of numeric type, and the service variable is a factor with
two levels, no and yes.

Since we have seen the structure of the data, let us ask this question: is there any way to get
a summary of the data I have read?
(Refer Slide Time: 16:27)

The answer is yes. The summary of the data is obtained with the summary function. Essentially,
it invokes particular methods depending on the class of the argument passed to it. For example,
summary gives a five-point summary for numeric attributes in the data. The syntax is as follows:
summary takes one argument, which is an object. This object is any R object about which you want
to get some information.

Let us use the summary function on our data frames which we have loaded and see what the
results are.
(Refer Slide Time: 17:15)

When we execute the command summary of service train, we get details about all the numeric
variables, namely the five-point summary together with the mean. For the service variable, which
is a categorical variable, it gives how many no values and how many yes values there are in
that attribute.
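
As a small illustration of how summary adapts to the class of its argument, the sketch below applies it to a numeric vector, a factor and a data frame. The values are made up for the example.

```r
readings <- c(10, 20, 30, 40, 100)
service  <- factor(c("No", "No", "Yes", "No", "Yes"))

# Numeric vector: minimum, quartiles, median, mean and maximum.
num_summary <- summary(readings)
print(num_summary)

# Factor: number of entries at each level.
fac_summary <- summary(service)
print(fac_summary)

# Data frame: summary() is applied column by column.
print(summary(data.frame(Reading = readings, Service = service)))
```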

(Refer Slide Time: 17:45)

You can use the same summary function on service test. It will return the five-point summary for
all the numeric variables, and the number of no values and yes values in service test.

Let us keep these numbers in mind: we have 99 no values and 36 yes values in service test. As I
said earlier, we are going to act as if we do not know the true yes and no values, and we will
use knn to predict which of them are yes and which are no.

(Refer Slide Time: 18:20)

Now, let us do the important task of this lecture, which is the implementation of K-nearest
neighbours in R. As I said earlier, the function we use to implement K-nearest neighbours is the
knn function. The knn function takes several arguments, but I have listed the few which are most
important as far as this course is concerned. The arguments are train, test, cl and k.

Let us see what each of these means. train is a matrix or data frame of the training set cases;
in this case, it comes from our service train data frame. test is a matrix or data frame of the
test set cases; in this case, it comes from our service test data frame. cl is a factor of the
true classifications of the training set, and k is the important parameter: the number of
neighbours to be considered when the algorithm applies its majority voting criterion.

Now, let us implement knn on our data. The way you do it is as follows.

(Refer Slide Time: 19:51)

There are certain comments here; let us study them. As we have seen in the previous lecture,
K-nearest neighbours is a lazy algorithm and can do prediction directly with the testing data
set. It accepts the training and testing data sets, the class variable of interest (that is, the
outcome categorical variable), and the parameter k, which, as I have mentioned, specifies the
number of nearest neighbours to be considered for the classification.

I implement the knn algorithm through the knn command. As the training data set, I give my
service train data set. Note that I have a negative 6 here; I will talk about it in a while. As
the test data set, I give the attributes in service test except the 6th column. And as the class
variable, I give the 6th column of service train as my classification factor.

Let us say I want to build a knn classifier which takes the number of nearest neighbours as 3.
These are the input arguments for the knn function. When I execute this whole command, it will
calculate the labels for the test data set and store them in predicted knn. I will show you the
results in the coming slide.

Meanwhile, let us interpret what service train followed by square brackets and minus 6 means.
Service train is a data frame, and if you remember from the data frames lecture, this statement
means: in the service train data frame, take all the rows and exclude column 6. So this command
gives the information in service train except the last column. Similarly, this command gives the
information in service test except the last column, and service train dollar service gives the
last column of the training data, which is the classification factor for the algorithm.

Once you give all these parameters and execute the command, knn will classify the test data
points and store the labels in predicted knn. Let us look at the results and at what predicted
knn contains.
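
Put together, the call described above might look like the sketch below. The data here is a synthetic stand-in with the same shape as the case study (the rule that cars with low average readings need service is invented), and the names ServiceTrain, ServiceTest, predictedknn and the column names are assumptions rather than the exact names on the slide. The knn function itself comes from the class package.

```r
library(class)  # provides knn()

set.seed(7)
# Hypothetical stand-in data: five readings per car and an invented rule
# for the Service label. This is not the course data.
make_cars <- function(n) {
  x <- as.data.frame(matrix(runif(n * 5, 0, 100), ncol = 5))
  names(x) <- c("OilQual", "EnginePerf", "NormMileage", "TyreWear", "HVACwear")
  x$Service <- factor(ifelse(rowMeans(x) < 50, "Yes", "No"),
                      levels = c("No", "Yes"))
  x
}
ServiceTrain <- make_cars(315)
ServiceTest  <- make_cars(135)

# [, -6] keeps every row and drops column 6 (the Service label).
predictedknn <- knn(train = ServiceTrain[, -6],
                    test  = ServiceTest[, -6],
                    cl    = ServiceTrain$Service,
                    k     = 3)

print(head(predictedknn))
print(length(predictedknn))  # one predicted label per test car
```

The result is a factor with the same levels as cl, one entry per row of the test set.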

(Refer Slide Time: 22:32)

As we have seen in the earlier slide, predicted knn is the output of the algorithm: a
categorical variable with values yes or no, indicating whether service is needed or not for each
case in the test data. When you print predicted knn, this is the output you see. It says that,
among these 135 values, no service is needed for the first car, no service is needed for the
second car, service is needed for the 23rd car, and so on.

So, that is what the knn algorithm does, and you have actually finished your job of classifying
the test cars according to whether service is needed or not. When you do not have the luxury of
knowing the true values, this is where you stop. But in our case, we already have the true
values of whether service is needed or not for this data set. When you have this luxury of
knowing the true classes, you can generate what is called a confusion matrix and see how well
your classifier is performing.

(Refer Slide Time: 23:51)

There are two ways of generating the confusion matrix. One is to generate it manually; the other
is to use the caret package, which can generate the confusion matrix along with a lot of other
parameters that Prof. Raghu talked about in his performance metrics lecture. Let us first see
how to generate the confusion matrix manually. Predicted knn holds the labels predicted using
the knn algorithm. And if you observe the command here, this is the last column of the service
test data frame, which holds the true labels of whether service is needed or not.

When I call table on these two, it generates a contingency table, and I store the result in the
confusion matrix variable. When I print this confusion matrix, the result is as follows: these
are the predicted no and yes, and these are the true no and yes. Recall that in the test data,
service is not needed for 99 cars and service is needed for 36 cars. knn has predicted all of
them correctly; this is what the confusion matrix shows.
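
The manual route can be sketched with table as below. Since the real predictions are not available here, the two vectors are stand-ins constructed to reproduce the counts reported in the lecture (99 no, 36 yes, all predicted correctly); the variable names are assumptions.

```r
# Stand-in vectors reproducing the counts reported in the lecture:
# 99 "No" cars and 36 "Yes" cars, every one predicted correctly.
actual    <- factor(rep(c("No", "Yes"), times = c(99, 36)))
predicted <- actual

# Rows are the predicted labels, columns are the true labels.
conf_matrix <- table(Predicted = predicted, Actual = actual)
print(conf_matrix)
```

Correct predictions land on the diagonal, so a perfect classifier leaves the off-diagonal cells at zero.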

(Refer Slide Time: 26:28)

What we have seen is the way to generate the confusion matrix manually. Once you have the
confusion matrix, you can calculate the accuracy.

How do you calculate the accuracy? The formula for accuracy is given in Prof. Raghu's
performance metrics lecture. Essentially, I take the diagonal elements, which are the correctly
predicted values, and divide their sum by the total number of entries in service test. Here 99
plus 36 is 135, and nrow of service test is also 135. The diag of the confusion matrix takes the
elements 99 and 36, and the sum command adds them up. When you divide that by the number of rows
in service test, that is 135 by 135, you get the value of the knn accuracy as 1.
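
The accuracy calculation can be written out as follows. The first part uses stand-in vectors matching the lecture's counts; the second part is a hypothetical variation with five deliberate mistakes, added only to show that the same formula gives a value below 1 when the classifier is not perfect.

```r
actual    <- factor(rep(c("No", "Yes"), times = c(99, 36)))
predicted <- actual
conf_matrix <- table(predicted, actual)

# Correct predictions sit on the diagonal of the confusion matrix.
knn_accuracy <- sum(diag(conf_matrix)) / length(actual)
print(knn_accuracy)  # 1, i.e. 135 / 135

# Hypothetical variation: flip 5 of the "No" predictions to "Yes".
predicted_with_errors <- predicted
predicted_with_errors[1:5] <- "Yes"
conf_with_errors <- table(predicted_with_errors, actual)
accuracy_with_errors <- sum(diag(conf_with_errors)) / length(actual)
print(accuracy_with_errors)  # 130 / 135
```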

Since knn has managed to predict all the no cases correctly as no and all the yes cases
correctly as yes, your accuracy is 1. This is how you generate the confusion matrix manually.
Now, let us see how to generate the confusion matrix using the caret package and the
confusionMatrix command.

The command to generate the confusion matrix in the caret package is confusionMatrix. The input
arguments you need to give are the predicted labels and the true labels. When you pass these two
arguments, this is the confusion matrix that is generated; along with the confusion matrix, it
generates a whole lot of other parameters. We have already calculated the accuracy manually and
seen that it is 1. You can now verify that the confusionMatrix function also gives the accuracy
as 1.

(Refer Slide Time: 27:12)

We will also get a lot of parameters such as sensitivity, specificity, etcetera.

The reason why you have sensitivity equal to 1 and specificity equal to 1 in this case is that
all the positive classes are correctly classified and all the negative classes are also
correctly classified. That is why you have the ideal values of 1 and 1 for sensitivity and
specificity.

The balanced accuracy is sensitivity plus specificity divided by 2, which is 2 by 2, so it is 1.
So, this is how one can implement the knn algorithm in R.
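
To see how sensitivity, specificity and balanced accuracy come out of the confusion matrix, here is a sketch with hypothetical predictions that contain a few mistakes, so the values differ from the ideal 1. The counts of mistakes are invented. The positive class is taken to be "No", the first factor level, which is what caret's confusionMatrix uses by default for two classes; the caret call is only run if the package is installed.

```r
# Hypothetical predictions with a few mistakes (not the lecture's result).
actual    <- factor(rep(c("No", "Yes"), times = c(99, 36)))
predicted <- actual
predicted[1:9]     <- "Yes"  # 9 "No" cars wrongly flagged for service
predicted[100:103] <- "No"   # 4 "Yes" cars missed

cm <- table(Predicted = predicted, Actual = actual)

# Positive class is "No" (first level).
sensitivity <- cm["No", "No"] / sum(cm[, "No"])      # 90 / 99
specificity <- cm["Yes", "Yes"] / sum(cm[, "Yes"])   # 32 / 36
balanced_accuracy <- (sensitivity + specificity) / 2

print(c(sensitivity = sensitivity,
        specificity = specificity,
        balanced    = balanced_accuracy))

# If caret is installed, confusionMatrix() reports the same quantities.
if (requireNamespace("caret", quietly = TRUE)) {
  result <- caret::confusionMatrix(data = predicted, reference = actual)
  print(result$byClass[c("Sensitivity", "Specificity", "Balanced Accuracy")])
}
```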
(Refer Slide Time: 27:46)

In summary, what we have seen in this lecture is how to read .csv files, how to use the str and
summary functions to know the data types and the summary of R objects, and how to implement the
K-nearest neighbours algorithm, which is a supervised learning algorithm that needs labelled
data, in R using the knn function.

With this, we end this tutorial session on how to implement the knn algorithm in R. In the next
lecture, Prof. Raghu will talk about the k-means clustering algorithm, after which I will come
back with a case study on how to implement k-means clustering.

Thank you.
