Tweaking 2
PRESENTATION
Robustness of Credibility Assessment
Team Members:
-Naga Dheeraj P
-Ravi Teja
Project Description
Our aim is to create ‘adversarial examples’
by making small modifications to each text snippet
in the attack dataset that change the
victim classifier’s decision without altering the text meaning.
Evaluation
Evaluation is done using the BODEGA score, which is the product of three numbers
computed for each adversarial example: a confusion score (whether the attack changed the
victim's decision), a semantic similarity score between the original and the modified text,
and a character-level similarity score.
The final ranking will be based on the BODEGA scores, averaged first over all examples in the dataset, and
then over the five domains and three victims. The number of queries to victim models needed to generate
each example will not influence the ranking but will be included in the analysis.
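As a rough illustration of how the ranking metric is aggregated (a sketch, not the official evaluation script; the per-example component scores are assumed to be given):

```python
import numpy as np

def bodega_score(confusion, semantic_sim, char_sim):
    # BODEGA score of a single adversarial example: the product of its three components.
    return confusion * semantic_sim * char_sim

def ranking_score(scores_by_setting):
    # scores_by_setting: dict mapping (domain, victim) -> list of per-example BODEGA scores.
    # Average over the examples first, then over the domain/victim combinations.
    per_setting_means = [np.mean(scores) for scores in scores_by_setting.values()]
    return float(np.mean(per_setting_means))
```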
ROADMAP
We are given 3 classifier models (BERT, BiLSTM, Surprise Classifier).
We are supposed to use the train dataset to train the Surprise Classifier and to
understand how the other models work.
After that, we need to create a methodology to make changes to the text (in dev.tsv),
using synonym replacement, rephrasing, and character-level changes until the
classifier's decision changes (a sketch of this attack loop follows the roadmap).
After tweaking the texts, we use the tweaked texts to train the given models to
make sure that the models are exposed to these changes.
Finally, we evaluate the models on attack.tsv, using the BODEGA score.
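A minimal sketch of such an attack loop, assuming a victim exposed as a predict(text) function and a list of tweak functions (both are illustrative interfaces, not the shared-task code):

```python
import random

def attack(text, victim_predict, tweaks, max_queries=100):
    # Keep applying small tweaks until the victim's decision flips
    # or the query budget is exhausted.
    original_label = victim_predict(text)
    candidate = text
    for _ in range(max_queries):
        candidate = random.choice(tweaks)(candidate)  # e.g. synonym swap, rephrase, typo
        if victim_predict(candidate) != original_label:
            return candidate  # decision changed: adversarial example found
    return candidate  # no flip within the budget
```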
Dataset
[Table: for each domain, the number of rows in the dev dataset and the size of the attack dataset]
[Figure: sample of tokenization output]
Stemming
Stemming is used to reduce words to their base or root form.
The goal is to strip away prefixes, suffixes, and inflections to obtain the word stem, which
might not always be a valid word itself.
For example, the words "running," "runner," and "ran" can all be reduced to the stem "run".
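For instance, NLTK's Porter stemmer (one common rule-based stemmer) collapses regular inflections onto a shared stem; irregular forms such as "ran" usually need lemmatization instead:

```python
from nltk.stem import PorterStemmer

stemmer = PorterStemmer()

# Rule-based suffix stripping: regular inflected forms collapse onto a common stem.
for word in ["running", "runs", "runner"]:
    print(word, "->", stemmer.stem(word))
# running -> run
# runs    -> run
# runner  -> runner  (not every related form reaches the same stem)
```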
Term frequency (TF) of a term in a document = (number of times the term appears in the document) / (total number of words in the document)
Inverse document frequency (IDF) of a term = log(total number of documents in the collection / number of documents in the collection that contain the term)
TF-IDF = TF * IDF
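A small worked example of these formulas on a toy three-document corpus (plain Python; the documents are made up for illustration):

```python
import math

docs = [
    "free prize claim your prize now".split(),
    "meeting scheduled for monday".split(),
    "claim free gift now".split(),
]

def tf(term, doc):
    # Times the term appears in the document / total words in the document.
    return doc.count(term) / len(doc)

def idf(term, docs):
    # log(total documents / documents that contain the term).
    containing = sum(1 for d in docs if term in d)
    return math.log(len(docs) / containing)

def tf_idf(term, doc, docs):
    return tf(term, doc) * idf(term, docs)

print(tf_idf("prize", docs[0], docs))  # high: frequent here, appears in no other document
print(tf_idf("now", docs[0], docs))    # lower: also appears in another document
```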
Spam emails often contain specific keywords or phrases that are not commonly used in legitimate emails.
By analyzing the TF-IDF weights of words in an email, we can identify words that are more indicative of spam.
The classifier learns to associate high TF-IDF weights for certain words with a higher probability that the email is spam.
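A minimal sketch of this idea with scikit-learn (the tiny inline dataset is invented for illustration):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy data, invented for illustration: 1 = spam, 0 = legitimate.
emails = [
    "win a free prize claim now",
    "lowest price on meds buy now",
    "meeting moved to monday at noon",
    "please review the attached report",
]
labels = [1, 1, 0, 0]

# TF-IDF features feed a linear classifier; words that carry high weight in
# spam emails end up with large positive coefficients.
model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(emails, labels)

print(model.predict(["claim your free prize"]))  # expected: [1]
```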
Correlation Analysis (to identify words that strongly influence the labels)
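One simple way to do this (a sketch; the toy emails and labels are the same invented ones as above) is to correlate each word's TF-IDF column with the label vector:

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

emails = [
    "win a free prize claim now",
    "lowest price on meds buy now",
    "meeting moved to monday at noon",
    "please review the attached report",
]
labels = [1, 1, 0, 0]

vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(emails).toarray()
y = np.array(labels)

# Pearson correlation between each word's TF-IDF column and the label.
correlations = {
    word: np.corrcoef(X[:, idx], y)[0, 1]
    for word, idx in vectorizer.vocabulary_.items()
}

# The words with the strongest correlations are the ones most tied to the label.
for word, corr in sorted(correlations.items(), key=lambda kv: -abs(kv[1]))[:5]:
    print(word, round(corr, 2))
```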
Original sentence : water shortage!! we have to save it!! people are dying
Adversarial sentence: h2o shortfall ! ! we have to economize it ! ! people are fail
The replace_with_synonyms method tokenizes input text, identifies words based on their part
of speech, and replaces them with synonyms retrieved from WordNet. This preserves the
sentence's structure while altering its content.
The rephrase method rearranges the order of words in the input text, offering a simple form of
sentence transformation.
Additionally, the character_level_changes method introduces random typographical errors
into the text, simulating small alterations at the character level.
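A minimal sketch of what these three methods could look like, assuming NLTK with the punkt, averaged_perceptron_tagger, and wordnet data downloaded (names and details are illustrative, not the exact project code):

```python
import random
import nltk
from nltk.corpus import wordnet

def replace_with_synonyms(text):
    # Tokenize, tag parts of speech, and swap content words for WordNet synonyms.
    tokens = nltk.word_tokenize(text)
    out = []
    for word, tag in nltk.pos_tag(tokens):
        synsets = wordnet.synsets(word)
        if tag[0] in ("N", "V", "J", "R") and synsets:
            lemmas = [l.name().replace("_", " ") for s in synsets for l in s.lemmas()]
            candidates = [l for l in lemmas if l.lower() != word.lower()]
            out.append(random.choice(candidates) if candidates else word)
        else:
            out.append(word)
    return " ".join(out)

def rephrase(text):
    # A very simple transformation: rearrange the order of the words.
    words = text.split()
    random.shuffle(words)
    return " ".join(words)

def character_level_changes(text, n_typos=2):
    # Introduce a few random typographical errors by swapping adjacent characters.
    chars = list(text)
    for _ in range(n_typos):
        if len(chars) < 2:
            break
        i = random.randrange(len(chars) - 1)
        chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)
```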
Tweaking the Statements (Code)
Evaluation (Final Phase)
As described above, the final ranking is based on the BODEGA scores, averaged first over all
examples in the dataset and then over the five domains and three victims; the number of queries
to the victim models is reported in the analysis but does not affect the ranking.