
Analysis of protein expression in mice

10th June, 2020

Shonil Dabreo, s3835204

Affiliations: Master of Data Science, RMIT University, s3835204@student.rmit.edu.au

I certify that this is all my own original work. If I took any parts from elsewhere, then they were non-
essential parts of the assignment, and they are clearly attributed in my submission. I will show I agree
to this honor code by typing "Yes": Yes.

Table of contents

Abstract
Introduction
Methodology
Results
Discussion
Conclusion
References

Abstract

Down Syndrome (DS) is a chromosomal disorder caused by the presence of an extra copy of chromosome 21, referred to as trisomy 21, which alters normal pathways and normal responses to stimulation, causing learning and memory deficits. Expression levels of 77 proteins were measured in the cerebral cortex of 8 classes of control and Down syndrome mice which were exposed to context fear conditioning (CFC), a task used to assess associative learning. The measurements were taken with and without injection of the drug memantine. This research attempts to understand the impacts of DS by analyzing protein expression levels in mice that could have affected the stimulated ability to learn among the trisomic mice. Two classification models are implemented: a K-Nearest Neighbors (KNN) classifier and a Decision Tree. It is observed that the selected feature subsets not only yield highly accurate classification results but are also composed of protein responses that are important for the learning and memory process and for the immune system. The results suggest that the KNN classification approach can identify the most important proteins, which may help identify more effective drugs to support the learning process in people with DS.

Introduction

The mice protein expression dataset was created to study learning in normal (control) mice and trisomic mice, i.e. mice with Down Syndrome (DS). DS has a global prevalence of about 1 in 1,000 live human births and is the most common genetically defined cause of intellectual disability. Humans are made up of trillions of cells, and each cell normally contains 23 pairs of chromosomes, 46 in total. DNA contains the specific instructions that make each type of living creature unique. Genes are segments of deoxyribonucleic acid (DNA) that carry the code for a specific protein that functions in one or more types of cells in the body, and chromosomes are the structures within cells that contain a person's genes. People with Down Syndrome have 47 chromosomes in their cells. The additional copy of chromosome 21 is known as trisomy of human chromosome 21 (hsa21), and DS can be diagnosed by observing this extra copy. Overexpression of the proteins encoded on the extra chromosome produces the symptoms of trisomy, i.e. Down Syndrome.

The expression levels of 77 proteins obtained from control mice with a normal genotype and from trisomic Ts65Dn mice were examined to find out which proteins were associated with successful recovery of learning ability and which were not. These proteins produced detectable signals in the nuclear fraction of the cortex. A total of 72 mice were used for the analysis, of which 38 were control mice and 34 were trisomic (Down syndrome) mice. The mice were separated into two behavioral groups: Context-Shock (CS) and Shock-Context (SC).

In the CS condition, mice are first placed in a training environment (context) and allowed to explore it for a few minutes before receiving a mild foot shock. The normal/control mice remember and are able to associate the chamber with the shock, whereas the trisomic mice are expected to fail to remember the association. In the SC condition, mice receive the mild foot shock immediately upon being placed in the chamber, before they have had the 3–5 minutes needed to explore the training environment (context); the shock generates a freezing response (fear), but both the control and the trisomic mice fail to remember the association.

In order to assess the effect of the drug memantine on recovering the ability to learn in trisomic mice, some mice were injected with memantine and others with saline before the training. In the experiments, 15 measurements of each protein were registered per sample/mouse. Therefore, for control mice there are 38 x 15 = 570 measurements, and for trisomic mice there are 34 x 15 = 510 measurements, giving a total of 1,080 measurements per protein. Each measurement can be considered an independent sample/mouse. The eight classes of mice are defined by genotype (control or trisomic), behavior (CS or SC) and treatment (memantine or saline). The classes are listed below:

Classes:
c-CS-s: control mice, stimulated to learn, injected with saline (9 mice)
c-CS-m: control mice, stimulated to learn, injected with memantine (10 mice)
c-SC-s: control mice, not stimulated to learn, injected with saline (9 mice)
c-SC-m: control mice, not stimulated to learn, injected with memantine (10 mice)
t-CS-s: trisomy mice, stimulated to learn, injected with saline (7 mice)
t-CS-m: trisomy mice, stimulated to learn, injected with memantine (9 mice)
t-SC-s: trisomy mice, not stimulated to learn, injected with saline (9 mice)
t-SC-m: trisomy mice, not stimulated to learn, injected with memantine (9 mice)

The aim is to understand which proteins contribute to the success or failure of learning in trisomic mice. The analysis is conducted by creating a model that predicts the 8 classes of mice based on their protein expression levels. We can then determine which proteins were significant in the predictions, i.e. support a hypothesis that a particular protein might affect learning in trisomic mice.

Methodology

Data analysis is performed in four steps:

• Data Pre-processing
• Data Exploration
• Data Modelling
• Testing the model

Data Pre-processing

Data pre-processing is extremely important because it improves the quality of the raw experimental data. The primary aim of preprocessing is to reduce the small contributions associated with experimental error.
First, the dataset and the required packages are imported into the kernel environment; in this case, the mice data was imported successfully. All of the columns/features had appropriate data types except for Genotype, Behavior, Treatment and class, which were converted to the category data type. The categorical features contained no null values, but there were 1,396 missing values in total across the protein columns. The missing values in each column were replaced by the mean value of that column, iterating through all the columns. For protein columns with only a few null values, mean imputation makes little difference to the distribution. For protein columns with many null values, mean imputation was still used, although it concentrates many measurements at the mean value and can understate the true variance of those proteins.
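
A minimal sketch of these pre-processing steps is shown below, assuming the data live in a CSV file (the filename Data_Cortex_Nuclear.csv is an assumption; adjust it to your copy of the dataset):

```python
import pandas as pd

# Load the dataset (filename assumed).
mice = pd.read_csv("Data_Cortex_Nuclear.csv")

# Convert the categorical descriptors to the 'category' dtype.
for col in ["Genotype", "Behavior", "Treatment", "class"]:
    mice[col] = mice[col].astype("category")

# Count missing values, then impute each numeric (protein) column
# with its own column mean.
print("Total missing values:", mice.isna().sum().sum())
protein_cols = mice.select_dtypes(include="number").columns
for col in protein_cols:
    mice[col] = mice[col].fillna(mice[col].mean())
```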

Data Exploration

The sums of the first 10 proteins were calculated and a graph displaying the sum for each protein/column was plotted. As Fig 1 below shows, the NR2A_N protein had the highest total signal values, whereas the pBRAF_N protein had the lowest.
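
A sketch of this step, reusing the `mice` frame and `protein_cols` from the pre-processing sketch above:

```python
import matplotlib.pyplot as plt

# Sum the first 10 protein columns and plot the totals as a bar chart.
sums = mice[protein_cols[:10]].sum()
sums.plot(kind="bar")
plt.ylabel("Sum of expression values")
plt.title("Total signal for the first 10 proteins")
plt.tight_layout()
plt.show()
```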

[Fig 1: sums of the first 10 proteins. Fig 2: box plot of ITSN1_N. Fig 3: box plot of NR1_N. Fig 4: box plot of pCAMKII_N.]

A series of box plots for those columns was plotted separately; Figs 2, 3 and 4 show the box plots for different proteins. The plotted points beyond the whiskers are outliers, visible in Fig 2 and Fig 3, i.e. for the proteins ITSN1_N and NR1_N; there appear to be no outliers in Fig 4, i.e. for the protein pCAMKII_N. The NR1_N protein (Fig 3) has three outliers, while the ITSN1_N protein (Fig 2) has many. The ITSN1_N values (Fig 2) cluster close to the central value (the horizontal line inside the box, i.e. the median), whereas the NR1_N and pCAMKII_N values show more variation. The boxes are not centered between the upper and lower fences, which suggests the data are not normally distributed.

[Fig 2.1: pie charts of CS/SC group shares within three binned ranges of DYRK1A_N expression.]

As shown in Fig 2.1, the DYRK1A_N protein was used to examine the behavior of the mice in both groups. The DYRK1A_N column was binned into categorical ranges (using cut bins) so that the relationship could be visualized with pie charts, since the raw values are nearly all unique. The first pie chart represents protein expression values between 0.0 and 0.9, the second the 0.9–1.7 range, and the third the 1.7–2.5 range. The hypothesis is that higher signal values of this protein correspond to a greater share of mice in the CS group. Fig 2.1 shows that the percentage of the CS group (47.6%) is initially lower than that of the SC group (52.4%), but in the second pie chart the CS share increases, and in the third chart there are no SC-group mice at all, i.e. the CS group makes up 100%. This pattern is consistent with the hypothesis.
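
A sketch of the binning step, with the bin edges taken from the ranges above:

```python
# Bin DYRK1A_N into three categorical ranges for the pie charts.
bins = [0.0, 0.9, 1.7, 2.5]
labels = ["0.0-0.9", "0.9-1.7", "1.7-2.5"]
mice["DYRK1A_N_bin"] = pd.cut(mice["DYRK1A_N"], bins=bins, labels=labels)

# Share of each behavior group (CS vs SC) within each bin.
print(pd.crosstab(mice["DYRK1A_N_bin"], mice["Behavior"], normalize="index"))
```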

[Fig 2.2: scatter plot of H3MeK4_N against GluR4_N. Fig 2.3: box plots of H3MeK4_N grouped by the eight classes.]

As shown in Fig 2.2, a scatter plot of the H3MeK4_N and GluR4_N proteins was drawn to check whether multiple proteins together could affect the ability to learn in trisomic mice given the injected drugs. The hypothesis is that as the signal value of the H3MeK4_N protein increases, the GluR4_N protein value decreases. In Fig 2.2, however, as the H3MeK4_N signal increases, the GluR4_N value remains stable between 0.1 and 0.2. Hence, there wasn't sufficient evidence to support the hypothesis, which also suggests that the GluR4_N protein is not effective in helping trisomic mice recover the ability to learn.

Fig 2.3 shows the values of the H3MeK4_N protein for the 8 different classes. A box plot was used to show the relationship between the protein and the class: the H3MeK4_N values were grouped by class, giving 8 different box plots for analysis. All of the box plots have outliers except the one for the c-CS-m class. The hypothesis is that trisomic mice with Shock-Context behavior treated with the memantine injection (i.e. t-SC-m) will show a stronger effect on H3MeK4_N than the other classes. As Fig 2.3 shows, the box plot of t-SC-m for H3MeK4_N sits above 0.25 signal values, higher than the box plots of the other classes. H3MeK4_N is therefore clearly elevated for the t-SC-m class, so we can say that the H3MeK4_N protein appears to have been effective in recovering the ability to learn in trisomic mice.

In addition, the magnitudes of the protein values vary because of these outliers: each mouse number has multiple measured versions within its class, which is what produces the outliers.

Data Modelling

Classification model

Decision Tree and K-Nearest Neighbors were used to build models that predict the mouse class and determine which proteins were critical for each class. Both are supervised machine learning methods whose goal is a model that predicts the class (or value) of a target variable from several input variables.

The decision tree algorithm solves the problem by representing a series of decisions as a tree. To predict a class label for a record, we start from the root of the tree and compare the value of the root attribute with the record's attribute. Based on this comparison, we follow the branch corresponding to that value and jump to the next node. Each internal node in the tree acts as a test case for some attribute, and each edge descending from the node corresponds to one of the possible answers to the test. This process is recursive and is repeated for every subtree rooted at the new node. The primary challenge in implementing a decision tree is identifying which attribute to test at the root node and at each level below it.

KNN is a non-parametric, lazy learning algorithm. In KNN, the model structure is determined from the data without making any assumptions about the underlying distribution. The training phase is minimal, as KNN does not generalize from the training points in advance. The algorithm is based on feature similarity: how closely the features of an out-of-sample point resemble those of the training set determines how the point is classified. The output is a class membership (i.e. it predicts a class): an object is classified by a majority vote of its neighbors, being assigned to the class most common among its k nearest neighbors. The main drawback is that KNN stores almost all of the training data, which requires substantial memory and computational power and can slow down the prediction process.
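
A minimal sketch of the two classifiers with scikit-learn (the library is assumed; the hyperparameter values here are illustrative placeholders, not the tuned ones):

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# KNN: a lazy learner; prediction is a majority vote among the
# k nearest training points in feature space.
knn = KNeighborsClassifier(n_neighbors=5)

# Decision tree: a chain of attribute tests from the root down to
# a leaf that assigns the class label.
tree = DecisionTreeClassifier(random_state=0)

# Both expose the same interface:
#   model.fit(X_train, y_train); y_pred = model.predict(X_test)
```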

Model Implementation

Feature selection:
Since labeled data are required, the Class column of the mice dataset was used as the target variable, and the remaining columns, i.e. the 77 protein expression levels together with Genotype, Behavior and Treatment, were used as the features from which the class is predicted. Class was selected as the target variable because it encodes all of the Genotype, Treatment and Behavior information for each mouse.
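
A sketch of this feature/target split, continuing from the pre-processing sketch (the MouseID identifier column name is an assumption):

```python
# Target: the 8-level class label; features: everything else except
# the identifier column.
y = mice["class"]
X = mice.drop(columns=["class", "MouseID"])

# One-hot encode the categorical descriptors so the models can use them.
X = pd.get_dummies(X, columns=["Genotype", "Behavior", "Treatment"])
```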

Model selection:
The dataset is divided into 70% training data and 30% test data to get a better indication of each model's performance on unseen data.
Ten-fold cross-validation scores were also calculated for both models to tune the parameters. For KNN, accuracy increased as the value of k decreased; however, a small k produces predictions with high variance and low bias, and k = 1 can yield near-100% accuracy through overfitting. Therefore k = 5 was chosen as the tuned value for K-Nearest Neighbors.
Likewise, trying different parameter values for the Decision Tree resulted in an average score of 76%, so the Decision Tree model was built with the default value for each parameter. The results were then interpreted.
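
A sketch of the split and the 10-fold cross-validation used for tuning, continuing the names above (the candidate k values are illustrative):

```python
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# 70/30 train/test split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# 10-fold cross-validation accuracy for each candidate k.
for k in [1, 3, 5, 7, 9]:
    scores = cross_val_score(
        KNeighborsClassifier(n_neighbors=k), X_train, y_train, cv=10)
    print(f"k={k}: mean accuracy {scores.mean():.3f}")
```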

Results

1. TN / True Negative: the case was negative and was predicted negative.
2. TP / True Positive: the case was positive and was predicted positive.
3. FN / False Negative: the case was positive but was predicted negative.
4. FP / False Positive: the case was negative but was predicted positive.

Precision = TP / (TP + FP): Precision is the ability of a classifier not to label as positive an instance that is actually negative. For each class it is defined as the ratio of true positives to the sum of true and false positives.

Recall = TP / (TP + FN): Recall is the ability of a classifier to find all positive instances. For each class it is defined as the ratio of true positives to the sum of true positives and false negatives.

F1 Score = 2 × (Precision × Recall) / (Precision + Recall): The F1 score is the harmonic mean of precision and recall, such that the best score is 1.0 and the worst is 0.0. F1 scores are generally lower than accuracy measures because they embed both precision and recall in their computation.

Classification-error rate: the percentage of observations in the test data that the model mislabeled.

A confusion matrix is used to gauge the accuracy of the model: the numbers on its diagonal correspond to correct predictions, and the off-diagonal values are the errors, i.e. the mislabeled cases.
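
A sketch of how these figures can be produced, using the fitted models and split from the sketches above:

```python
from sklearn.metrics import (accuracy_score, classification_report,
                             confusion_matrix)

knn.fit(X_train, y_train)
y_pred = knn.predict(X_test)

print("Accuracy:", accuracy_score(y_test, y_pred))
print(classification_report(y_test, y_pred))  # per-class precision/recall/F1
print(confusion_matrix(y_test, y_pred))       # diagonal = correct predictions
```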

[Fig 3.1: KNN classification report. Fig 3.2: KNN confusion matrix.]

For the KNN algorithm, the accuracy score was 90%. As Fig 3.2 shows, the KNN model made a total of 34 mislabeling errors. The class with the highest precision, i.e. the most correctly predicted class, was c-SC-s.

[Fig 4.1: Decision Tree classification report. Fig 4.2: Decision Tree confusion matrix.]

For the Decision Tree, the accuracy score was 83%. After tuning, the default parameter values were observed to give the best accuracy. The confusion matrix showed a total of 47 mislabeling errors. The highest precision score was for the t-SC-s class.

[Fig 5: box plot comparison of the accuracy scores of KNN and the Decision Tree.]

Fig 5 compares the accuracy of the two models using box plots. The KNN model shows higher accuracy than the Decision Tree model; for this dataset, KNN was the more effective model.

Discussion

Two classifiers, K-Nearest Neighbors and Decision Tree, were used for the analysis; their accuracies and confusion matrices are shown in the figures above.
The KNN model clearly performed better, with an accuracy of 90% compared to the Decision Tree model's 83%. Therefore, KNN should be preferred for this kind of dataset, where the features are numeric, since KNN is a distance-based algorithm.

Conclusion

• To sum up, the KNN model could predict the unlabelled data but could not determine which proteins contribute to successful or failed learning, because the KNN algorithm has no feature-importance method. The Decision Tree does provide a feature-importance method, but it did not achieve as high an accuracy score.
• The KNN model should be used when the features are numeric and the task is to find similar examples, whereas the Decision Tree model should be used when classifying a class variable from features containing binary data.

References

• Science Direct. Data Pre-processing. Available at: <https://www.sciencedirect.com/topics/engineering/data-preprocessing> [Accessed 9 June 2020].

• Saringat, M. Z., Mustapha, A. and Andeswari, R. (2018). Comparative Analysis of Mice Protein Expression: Clustering and Classification Approach. International Journal of Integrated Engineering, 10(6). Available at: <https://publisher.uthm.edu.my/ojs/index.php/ijie/article/view/2779> [Accessed 10 June 2020].

• Kulan, H. and Dag, T. (2019). In silico identification of critical proteins associated with learning process and immune system for Down syndrome. PLoS ONE, 14(1): e0210954. Available at: <https://doi.org/10.1371/journal.pone.0210954> [Accessed 10 June 2020].

• Bronshtein, A. (2017). A Quick Introduction to K-Nearest Neighbors Algorithm. Noteworthy – The Journal Blog. Available at: <https://blog.usejournal.com/a-quick-introduction-to-k-nearest-neighbors-algorithm-62214cea29c7> [Accessed 10 June 2020].

• Chauhan, N. S. (2019). Decision Tree Algorithm – Explained. Towards Data Science. Available at: <https://towardsdatascience.com/decision-tree-algorithm-explained-83beb6e78ef4> [Accessed 10 June 2020].
