Summary
TRAINING AND TESTING DATASET
This model is developed for predicting variable-length B-cell epitopes. It is developed on the LBtope_Variable dataset, which contains 14876 unique B-cell epitopes and 23321 unique non-B-cell epitopes of variable length. All epitopes common to both datasets are removed, as are epitopes with length less than 5 or greater than 50.
REFERENCE : B Cell Epitopes
METHODOLOGY
Using the TF-IDF vectorizer (which converts all the sequences to numerical features):
- split the dataset into training and testing sets - LBtope_Variable
- vectorize x_train (fit and transform) and x_test (transform only)
Other parameters:
- kernel: linear and rbf
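The pipeline above can be sketched with scikit-learn as follows. This is a minimal illustration, not the original code: the function name is invented, and the char-level n-gram analyzer is an assumption about how peptide strings are fed to TF-IDF.

```python
# Sketch of the TF-IDF + SVM pipeline described above (scikit-learn).
# Char-level analysis of the peptide strings is an assumption here.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

def train_tfidf_svm(sequences, labels, kernel="linear"):
    # split the dataset into training and testing sets
    x_train, x_test, y_train, y_test = train_test_split(
        sequences, labels, test_size=0.2, random_state=42)
    # fit + transform on x_train, transform only on x_test
    vectorizer = TfidfVectorizer(analyzer="char", ngram_range=(1, 2))
    x_train_vec = vectorizer.fit_transform(x_train)
    x_test_vec = vectorizer.transform(x_test)
    clf = SVC(kernel=kernel)          # kernel: "linear" or "rbf"
    clf.fit(x_train_vec, y_train)
    return clf, vectorizer, clf.score(x_test_vec, y_test)
```

Fitting the vectorizer only on the training split (and merely transforming the test split) avoids leaking test-set vocabulary statistics into training.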
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
ROC curve
Results: testing data, LBtope_Variable
Accuracy: 0.6100785340314137
INTERPRETATION
This model is only memorizing the data provided, not learning the features from it.
METHODOLOGY
Using the TF-IDF vectorizer (which converts all the sequences to numerical features):
- split the dataset into training and testing sets - LBtope_Variable
- vectorize x_train (fit and transform) and x_test (transform only)
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
ROC curve
Results: testing data, LBtope_Variable
Accuracy: 0.618455497382199
METHODOLOGY
Using the TF-IDF vectorizer (which converts all the sequences to numerical features):
- split the dataset into training and testing sets - LBtope_Variable
- vectorize x_train (fit and transform) and x_test (transform only)
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
Results: testing data, LBtope_Variable
Accuracy: 0.6100785340314137
MODEL
1. Embedding layer: the input dimension is the vocabulary size plus 1 (for padding); the output dimension is a 16-dimensional vector.
2. SimpleRNN(16): this layer consists of 16 units; it processes the sequence data one step at a time and maintains a hidden state.
3. Dense layer: a fully connected layer with 16 units and ReLU activation; it processes the output from the RNN layer and applies a non-linear transformation.
4. Output Dense layer: the output layer with a single unit and sigmoid activation, used for binary classification.
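The recurrence in step 2 can be made concrete with a minimal NumPy forward pass. This is an illustration only: the weights below are random placeholders, but the update rule (tanh of input projection plus hidden-state projection) matches what a Keras SimpleRNN computes.

```python
import numpy as np

def simple_rnn_forward(x_seq, w_x, w_h, b):
    """Minimal SimpleRNN forward pass: the sequence is processed one
    timestep at a time, and the hidden state h is carried between steps
    (tanh activation, as in Keras's SimpleRNN)."""
    h = np.zeros(w_h.shape[0])
    for x_t in x_seq:                        # one embedded token per step
        h = np.tanh(x_t @ w_x + h @ w_h + b)
    return h                                 # final hidden state (16-dim here)

rng = np.random.default_rng(0)
emb_dim, units, seq_len = 16, 16, 10
h = simple_rnn_forward(
    rng.normal(size=(seq_len, emb_dim)),     # embedded input sequence
    rng.normal(size=(emb_dim, units)),       # input-to-hidden weights
    rng.normal(size=(units, units)),         # hidden-to-hidden weights
    np.zeros(units))
```

The final hidden state is what the downstream Dense(16, ReLU) layer consumes.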
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
RESULTS (test)
• The AUC is 0.70, which means the model has a moderate ability to distinguish between the classes, performing better than random guessing but still with room for improvement.
METHOD - 5 : CNN
METHODOLOGY
Using the TF-IDF vectorizer (which converts all the sequences to numerical features):
- split the dataset into training and testing sets - LBtope_Variable
- vectorize x_train (fit and transform) and x_test (transform only)
MODEL
1. Embedding layer: the input dimension is the vocabulary size plus 1 (for padding); the output dimension is a 1000-dimensional vector.
2. Convolutional layer: 128 filters with a window size of 5 and ReLU activation.
3. Global max pooling layer: a single value per feature map (128 values in total, since the convolutional layer has 128 filters).
4. Dense layer: 128 neurons, with ReLU activation applied to the layer's output.
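Steps 2 and 3 (convolution then global max pooling) reduce every sequence to one value per filter. The NumPy sketch below illustrates that mechanism with random placeholder weights; it is not the trained model, just the same arithmetic a Keras Conv1D + GlobalMaxPooling1D pair performs.

```python
import numpy as np

def conv1d_global_max_pool(x, filters, bias):
    """x: (seq_len, emb_dim) embedded sequence.
    filters: (n_filters, window, emb_dim), e.g. 128 filters of window 5.
    Valid 1-D convolution + ReLU, then global max pooling, which keeps
    a single value per feature map (so 128 values for 128 filters)."""
    n_filters, window, _ = filters.shape
    n_pos = x.shape[0] - window + 1
    feature_maps = np.empty((n_pos, n_filters))
    for i in range(n_pos):
        patch = x[i:i + window]                               # (window, emb_dim)
        act = np.tensordot(filters, patch, axes=([1, 2], [0, 1])) + bias
        feature_maps[i] = np.maximum(0.0, act)                # ReLU
    return feature_maps.max(axis=0)                           # (n_filters,)

rng = np.random.default_rng(0)
pooled = conv1d_global_max_pool(
    rng.normal(size=(50, 16)),           # toy 50-step, 16-dim embedding
    rng.normal(size=(128, 5, 16)),       # 128 filters, window size 5
    np.zeros(128))
```

Global max pooling is what makes the architecture length-independent: however long the epitope, the Dense layer always receives a fixed 128-value vector.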
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
RESULTS (epochs = 15, batch_size = 32)
Testing accuracy: 0.689397931098938
Validation accuracy: 0.8396193161790624
Results: testing data, LBtope_Variable
MCC: 0.7165044856049663
INTERPRETATION
• The validation accuracy shows a slight increase initially but then flattens and fluctuates around 0.63.
• This indicates that while the model is improving on the training data, it is not showing similar improvement on the validation data.
METHOD - 6 : CNN with KFold - 5
Results: testing data, LBtope_Variable
MCC (cv = 5): 0.3725244342365087
MCC (cv = 10): 0.3845715900895881
Accuracy: 0.8709904829044766
Confusion matrix:
[[1665  130]
 [ 236  806]]
Classification report:
              precision  recall  f1-score  support
accuracy                          0.87      2837
macro avg        0.87     0.85    0.86      2837
weighted avg     0.87     0.87    0.87      2837
INTERPRETATION
• The model shows higher precision and recall for the majority class (class 0) than for the minority class (class 1), indicating a bias towards the majority class for cv = 5.
• While the model performs well, focusing on improving recall for the positive class and further tuning could enhance its performance for cv = 10.
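The fold loop behind these numbers can be sketched with scikit-learn. A stand-in linear classifier is used below in place of the CNN (the cross-validation logic is identical); the function name is illustrative.

```python
import numpy as np
from sklearn.metrics import matthews_corrcoef
from sklearn.model_selection import StratifiedKFold

def kfold_mcc(make_model, X, y, n_splits=5):
    """Train a fresh model on each fold and average the MCC across folds.
    StratifiedKFold keeps the epitope/non-epitope ratio in every fold."""
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=42)
    scores = []
    for train_idx, test_idx in skf.split(X, y):
        model = make_model()                      # fresh model per fold
        model.fit(X[train_idx], y[train_idx])
        preds = model.predict(X[test_idx])
        scores.append(matthews_corrcoef(y[test_idx], preds))
    return float(np.mean(scores))
```

Stratification matters here because the two classes are imbalanced (1795 vs 1042 in the confusion matrix above); plain KFold could produce folds with skewed class ratios.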
METHOD - 7: PFeature - AAC
METHODOLOGY
• To preprocess the training dataset and extract relevant features, we employed CD-HIT for sequence clustering and redundancy removal, followed by Pfeature for feature extraction based on amino acid composition, which results in 20 distinct features.
• The obtained features are then trained with LazyClassifier to find the best-fit model for classification.
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
Results: testing data, LBtope_Variable (best-fit model: Extra Trees Classifier)
MCC: 0.363363972584853
Accuracy: 0.6868533171028606
Confusion matrix:
[[1429  350]
 [ 684  823]]
With validation data (LBtope_Fixed):
Accuracy: 0.700740218540712, MCC: 0.38371043785935655
METHODOLOGY
• To preprocess the training dataset and extract relevant features, we employed CD-HIT for sequence clustering and redundancy removal, followed by Pfeature for feature extraction based on amino acid composition, which results in 400 distinct features (alternate, with a gap of 1).
• The obtained features are then trained with LazyClassifier to find the best-fit model for classification.
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
Results: LBtope_Variable (best-fit model: Random Forest Classifier)
MCC: 0.363363972584853
Accuracy: 0.7096774193548387
Confusion matrix:
[[1490  289]
 [ 665  842]]
With validation data (LBtope_Fixed):
MCC: 0.402687364
Accuracy: 0.7151921043355658
METHODOLOGY
• To preprocess the training dataset and extract relevant features, we employed CD-HIT for sequence clustering and redundancy removal, followed by Pfeature for feature extraction based on amino acid composition, which results in 400 distinct features (with a gap of 2).
• The obtained features are then trained with LazyClassifier to find the best-fit model for classification.
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
Results: LBtope_Variable (best-fit model: Extra Trees Classifier)
MCC: 0.4106882852918963
Accuracy: 0.7087644552647596
Confusion matrix:
[[1406  373]
 [ 584  923]]
With validation data (LBtope_Fixed):
Accuracy: 0.697567853366232, MCC: 0.36
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
Results: LBtope_Variable
MCC: 0.3823098
Accuracy: 0.70
Confusion matrix:
[[1366  417]
 [ 585  918]]
With validation data (LBtope_Fixed):
MCC: 0.50040267
Accuracy: 0.77
METRICS
Accuracy score, Matthews correlation coefficient, recall, precision, F1-score
ROC curve
Results: LBtope_Variable
Training:
Accuracy of last epoch: 0.94059735
Loss of last epoch: 0.159418595
Testing:
Accuracy of last epoch: 0.701178010
Loss of last epoch: 1.02653