0% found this document useful (0 votes)

34 views56 pages

M.Thasleemabanu Document

The document discusses detecting fake news using machine learning techniques. It describes how machine learning algorithms can be used to classify news articles as real or fake by analyzing the text. Supervised learning methods are explored as they can be trained on labeled datasets. The goal is to build general models that can detect fake news across different domains.

Uploaded by

APCAC

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views56 pages

M.Thasleemabanu Document

Uploaded by

APCAC

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 56

FAKE NEWS DETECTION USING MACHINE LEARNING

ABSTRACT
Fake news on social media and various other media is pervasive and a matter of
serious concern as it can cause social and national damage with devastating effects .Much
research is already focused on discovery. We analyze the research related to fake news
detection, examine traditional machine learning models, and explore which ones are best
suited to build models for supervised machine learning algorithms products that can classify
fake news as true or fake. Use text analysis tools such as Python, Scikit-Learn, and NLP.
However, it faces techniques. We propose a fake news detection system using machine
learning techniques. Considering the current state of social media platforms, users are
developing and sharing more information than in the past five years, some of which is even
irrelevant to real life. Automatic text classification is tedious and difficult task.

1
1.INTRODUCTION
1.1About the project
The advent of the World Wide Web and the rapid adoption of social media platforms
(such as Facebook and Twitter) paved the way for information dissemination that has never
been witnessed in the human history before. Besides other use cases, news outlets benefitted
from the widespread use of social media platforms by providing updated news in near real
time to its subscribers.The news media evolved from newspapers, tab- loids, and magazines
to a digital form such as online news platforms, blogs, social media feeds, and other digital
media formats [1]. It became easier for consumers to acquire the latest news at their
fingertips. Facebook referrals account for 70% of traffic to news websites [2].These social
media platforms in their current state are extremely powerful and useful for their ability to
allow users to discuss and share ideas and debate over issues such as democracy, education,
and health. However, such platforms are also used with a negative perspective by certain
entities commonly for monetary gain [3, 4] and in other cases for creating biased opinions,
manipulating mindsets, and spreading satire or absurdity. The phenomenon is commonly
known as fake news. There has been a rapid increase in the spread of fake news in the last
decade, most prominently observed in the 2016 US elections [5]. Such proliferation of
sharing articles online that do not conform to facts has led to many problems not just limited
to politics but covering various other domains such as sports, health, and also science [3].
One such area affected by fake news is the financial markets [6], where a rumor can have
disastrous consequences and may bring the market to a halt. Our ability to take a decision
relies mostly on the type of information we consume; our world view is shaped on the basis
of information we digest. There is increasing evidence that consumers have reacted absurdly
to news that later proved to be fake [7, 8]. One recent case is the spread of novel corona virus,
where fake reports spread over the Internet about the origin, nature, and behavior of the virus
[9].The situation worsened as more people read about the fake contents online. Identifying
such news online is a daunting task. Fortunately, there are a number of computational
techniques that can be used to mark certain articles as fake on the basis of their textual
content [10]. Majority of these techniques use fact checking websites such as “PolitiFact” and
“Snopes”. There are a number of repositories main-tained by researchers that contain lists of
websites that are identified as ambiguous and fake [11]. However, the problem with these
resources is that human expertise is required to identify articles/websites as fake. More

2
importantly, the fact checking websites contain articles from particular domains such as
politics and are not generalized to identify fake news articles from multiple domains such as
entertainment, sports, and technology.The World Wide Web contains data in diverse formats
such as documents, videos, and audios. News published online in an unstructured format
(such as news, articles, videos, and audios) is relatively difficult to detect and classify as this
strictly requires human expertise. However, compu-tational techniques such as natural
language processing (NLP) can be used to detect anomalies that separate a text article that is
deceptive in nature from articles that are based on facts [12]. Other techniques involve the
analysis of propagation of fake news in contrast with real news [13]. More specifically, the
approach analyzes how a fake news article propagates differently on a network relative to a
true article.The response that an article gets can be differentiated at a theoretical level to
classify the article as real or fake. A more hybrid approach can also be used to analyze the
social response of an article along with exploring the textual features to examine whether an
article is deceptive in nature or not. A number of studies have primarily focused on detection
and classification of fake news on social media platforms such as Facebook and Twitter [13,
14]. At conceptual level, fake news has been classified into different types; the knowledge is
then expanded to generalize machine learning(ML) models for multiple domains [10, 15, 16].
)e study by Ahmed et al. [17] included extracting linguistic features such as n-grams from
textual articles and training multiple ML models including K-nearest neighbor (KNN),
support vector machine (SVM), logistic regression (LR), linear support vector machine
(LSVM), decision tree (DT), and stochasticgradient descent (SGD), achieving the highest
accuracy(92%) with SVM and logistic regression. According to the research, as the number
of n increased in n-grams calculated for a particular article, the overall accuracy
decreased.The phenomenon has been observed for learning models that are used for
classification. Shu et al. [12] achieved better accuracies with different models by combining
textual features with auxiliary information such as user social engagements on social media.
The authors also discussed the social and psychological theories and how they can be used to
detect false information online. Further, the authors discussed different data mining
algorithms for model constructions and techniques shared for features extraction. These
models are based on knowledge such as writing style, and social context such as stance and
propagation. A different approach is followed by Wang [18]. The author used textual features
and metadata for training various ML models.

3
1.2.MACHINE LEARNING
Machine Learning is a branch of artificial intelligence that develops algorithms by
learning the hidden patterns of the datasets used it to make predictions on new similar type
data, without being explicitly programmed for each task .Traditional Machine
Learning combines data with statistical tools to predict an output that can be used to make
actionable insights. Machine learning is used in many different applications, from image
and speech recognition to natural language processing, recommendation systems, fraud
detection, portfolio optimization, automated task, and so on. Machine learning models are
also used to power autonomous vehicles, drones, and robots, making them more intelligent
and adaptable to changing environments. A typical machine learning tasks are to provide a
recommendation. Recommender systems are a common application of machine learning,
and they use historical data to provide personalized recommendations to users. In the case
of Netflix, the system uses a combination of collaborative filtering and content-based
filtering to recommend movies and TV shows to users based on their viewing history,
ratings, and other factors such as genre preferences. Reinforcement learning is another type
of machine learning that can be used to improve recommendation-based systems. In
reinforcement learning, an agent learns to make decisions based on feedback from its
environment, and this feedback can be used to improve the recommendations provided to
users. For example, the system could track how often a user watches a recommended movie
and use this feedback to adjust the recommendations in the future.

4
Machine learning methods

Machine learning models fall into three primary categories.

 Supervised machine learning

 Unsupervised machine learning
 Semi-supervised learning

Supervised machine learning

Supervised learning, also known as supervised machine learning, is defined by its

use of labeled datasets to train algorithms to classify data or predict outcomes accurately. As
input data is fed into the model, the model adjusts its weights until it has been fitted
appropriately. This occurs as part of the cross validation process to ensure that the model
avoids overfitting or underfitting. Supervised learning helps organizations solve a variety of
real-world problems at scale, such as classifying spam in a separate folder from your inbox.
Some methods used in supervised learning include neural networks, naïve bayes, linear
regression, logistic regression, random forest, and support vector machine (SVM).

Unsupervised machine learning

Unsupervised learning, also known as unsupervised machine learning, uses machine
learning algorithms to analyze and cluster unlabeled datasets (subsets called clusters). These
algorithms discover hidden patterns or data groupings without the need for human
intervention. This method’s ability to discover similarities and differences in information
make it ideal for exploratory data analysis, cross-selling strategies, customer segmentation,
and image and pattern recognition. It’s also used to reduce the number of features in a model
through the process of dimensionality reduction. Principal component analysis (PCA) and
singular value decomposition (SVD) are two common approaches for this. Other algorithms
used in unsupervised learning include neural networks, k-means clustering, and probabilistic
clustering methods.

Semi-supervised learning

Semi-supervised learning offers a happy medium between supervised and

unsupervised learning. During training, it uses a smaller labeled data set to guide

5
classification and feature extraction from a larger, unlabeled data set. Semi-supervised
learning can solve the problem of not having enough labeled data for a supervised learning
algorithm. It also helps if it’s too costly to label enough data.

Figure 2 Machine Learning types

Features of Machine learning

 Machine learning is data driven technology. Large amount of data generated by

organizations on daily bases. So, by notable relationships in data, organizations makes
better decisions.
 Machine can learn itself from past data and automatically improve.
 From the given dataset it detects various patterns on data.
 For the big organizations branding is important and it will become more easy to target
relatable customer base.
 It is similar to data mining because it is also deals with the huge amount of data.

6
1.3 DEEP LEARNING
Deep learning is a member of AI which is wholly founded on neural organizations, as neural
organization imitates the individual mind so deep learning is likewise a kind of model of
human cerebrum. It's in attention these days in light of the evidence that earlier we never had
much preparing capacity and a massive understanding of knowledge. A conventional
connotation of deep learning is neurons. Deep learning is a definite kind of AI that achieves
improbable effort and versatility by deciphering out how to address the planet as a settled
progression of theories, with every concept describing corresponding to more straightforward
concepts, and more conceptual portrayals processed as far as less theoretical ones. In human
mind roughly hundred billion neurons all together this is an image of an individual neuron
and every neuron is associated through great many their neighbors. The inquiry here is the
manner by which it reproduces these neurons in a PC. In this way, it makes a counterfeit
construction called a fake neural net where we have hubs or neurons. It has a few neurons for
input worth and some for-yield esteem and in the middle, there might be heaps of neurons
interconnected in the secret layer. It needs to analyze the certain complication in order to get
the correct answer and it should be appreciated, the usefulness of the Deep Learning should
also be confirmed. It needs to find out the applicable data which should coincide with the
definitecomplication and should be arranged accordingly. Adopt the Deep Learning
Algorithm properly. The dataset should be trained using this algorithm. The dataset must
undergo a final testing process.

What are the components of a deep learning network?

The components of a deep neural network are the following.

Input layer

An artificial neural network has several nodes that input data into it. These nodes make up the

input layer of the system.

Hidden layer

The input layer processes and passes the data to layers further in the neural network.

These hidden layers process information at different levels, adapting their behavior as they

receive new information. Deep learning networks have hundreds of hidden layers that they
can use to analyze a problem from several different angles.

7
For example, if you were given an image of an unknown animal that you had to classify, you

would compare it with animals you already know. For example, you would look at the shape
of its eyes and ears, its size, the number of legs, and its fur pattern. You would try to identify
patterns, such as the following:

 The animal has hooves, so it could be a cow or deer.

 The animal has cat eyes, so it could be some type of wild cat.

The hidden layers in deep neural networks work in the same way. If a deep learning
algorithm is trying to classify an animal image, each of its hidden layers processes a different
feature of the animal and tries to accurately categorize it.

Output layer

The output layer consists of the nodes that output the data. Deep learning models that output
"yes" or "no" answers have only two nodes in the output layer. On
the other hand, thosethat output a wider range of answers have more nodes.

8
2.SYSTEM ANALYSIS
2.1 EXISTING SYSTEM

As of my last update in January 2022, several existing systems for fake news
detection have utilized machine learning algorithms such as Random Forest, Logistic
Regression, Support Vector Machine (SVM), and Decision Tree. While I can't provide
specific details on the latest developments beyond that point, I can outline a generic
framework for an existing system that incorporates these algorithms:
Data Collection and Preprocessing:
Gather a diverse dataset of news articles labeled as real or fake.
Preprocess the text data by removing noise, tokenizing, stemming, and removing stop words.
Feature Extraction:
Extract relevant features from the preprocessed text data. Features could include TF-IDF
scores, word embeddings, sentiment analysis scores, and metadata features.
Model Training:
Train multiple classifiers using the extracted features and labeled data.
Utilize Random Forest, Logistic Regression, SVM, and Decision Tree algorithms separately
to build individual models.
Model Evaluation:
Evaluate the performance of each model using cross-validation or a separate validation
dataset. Measure metrics such as accuracy, precision, recall, F1-score, and ROC-AUC to
assess the effectiveness of each algorithm.
Ensemble Learning:
Optionally, combine the predictions of the individual models using ensemble learning
techniques such as majority voting or stacking. This can often lead to improved performance
compared to individual models.
Deployment:
Deploy the trained models into a production environment where they can classify news
articles in real-time. Integrate the system into news websites, social media platforms, or
browser extensions to provide real-time fake news detection capabilities.

9
Monitoring and Maintenance:
Continuously monitor the performance of the deployed models and update them as needed
with new data or retraining.Implement mechanisms for feedback from users to improve the
models over time.
User Interface:
Develop a user-friendly interface to present the classification results to end-users.
Provide explanations for the classification decisions to enhance transparency and user trust.
Regulatory Compliance and Ethical Considerations:
Ensure compliance with data privacy regulations and ethical guidelines.
Implement measures to prevent bias and ensure fairness in the fake news detection system.
By following this framework, existing systems can effectively leverage machine learning
algorithms such as Random Forest, Logistic Regression, SVM, and Decision Tree for fake
news detection, contributing to the mitigation of misinformation spread online.

10
2.2.PROPOSED SYSTEM

1. Data Collection: Gather a diverse dataset of news articles labeled as real or fake.
Ensure the dataset is balanced and representative of various types of fake news.
2. Preprocessing: Clean and preprocess the text data, including tasks such as
lowercasing, tokenization, removing stop words, and stemming or lemmatization.
3. Feature Extraction: Extract features from the preprocessed text data. Features could
include:
 Word frequencies
 TF-IDF (Term Frequency-Inverse Document Frequency) scores
 N-grams
 Sentiment analysis scores
 Source credibility features
 Metadata features (publication date, article length)
Splitting Data: Divide the dataset into training, validation, and testing sets. Typically, 70-
80% of the data is used for training, 10-15% for validation, and 10-15% for testing.
Model Training:
Random Forest: Train a Random Forest classifier using the training data. Random Forest is
an ensemble learning method that builds multiple decision trees and merges their predictions.
Logistic Regression: Train a Logistic Regression model using the training data. Logistic
Regression is a linear model used for binary classification.
Support Vector Machine (SVM): Train an SVM classifier using the training data. SVM is a
powerful algorithm for classification tasks, capable of handling high-dimensional data.
Decision Tree: Train a Decision Tree classifier using the training data. Decision Trees
partition the feature space into regions and make decisions based on feature values.
Model Evaluation:Evaluate the performance of each model using the testing set. Measure
metrics such as accuracy, precision, recall, F1-score, and ROC-AUC.
Ensemble Learning (Optional):Combine the predictions of the individual models using
techniques like majority voting or averaging to create an ensemble model. This can often
improve overall performance.

Deployment:Deploy the trained models into a production environment where they can
classify news articles in real-time.

11
Monitoring and Maintenance:Monitor the performance of the deployed models and update
them as needed with new data or retraining.
User Interface:Develop a user interface to present the classification results to end-users. This
interface could display the likelihood of a news article being fake or real, along with
explanations for the classification decision.
Feedback Mechanism:Implement a feedback mechanism that allows users to report
misclassified articles. Use this feedback to continuously improve the models.
By implementing this proposed system, you can effectively leverage Random Forest, Logistic
Regression, SVM, and Decision Tree algorithms for fake news detection, providing a robust
solution to combat misinformation.

12
3.DEVELOPMENT ENVIRONMENT

3.1 HARDWARE ENVIRONMENT

Processor : NIVDIA

RAM : 4GB

Mouse : Compatible Mouse

Keyboard : Logitech

Monitor : 15 Color Monitor

3.2. SOFTWARE ENVIRONMENT

 Operating System

 Windows 10 pro

 Python 3.11

 Packages

 Numpy

 Pandas

 TensorFlow

 Matplotlib

 Scikit Learn

13
3.3 ABOUT THE SOFTWARE

COLAB

Google Colab, short for Collaboratory, is a free cloud-based platform provided by

Google that allows users to write and execute Python code collaboratively in a Jupyter
Notebook environment. Google Collaboratory notebook, is designed to facilitate machine
learning (ML) and data science tasks by providing a virtual environment, Google Colab
python with access to free GPU resources.

Benefits of Google Colab

Google Colab offers several benefits that make it a popular choice among data
scientists, researchers, and machine learning practitioners.

Key features of Google Collaboratory notebook include:

 Free Access to GPUs: Colab offers free GPU access, which is particularly useful
for training machine learning models that require significant computational power.
 No Setup Required: Colab runs in the cloud, eliminating the need for users to set
up and configure their own development environment. This makes it convenient for
quick coding and collaboration.
 Collaborative Editing: Multiple users can work on the same Colab notebook
simultaneously, making it a useful tool for collaborative projects.

 Integration with Google Drive: Colab is integrated with Google Drive, allowing
users to save their work directly to their Google Drive account. This enables easy
sharing and access to notebooks from different devices.

 Support for Popular Libraries: Colab comes pre-installed with many popular
Python libraries for machine learning, data analysis, and visualization, such as Tensor
Flow, PyTorch, Matplotlib, and more.
 Easy Sharing: Colab notebooks can be easily shared just like Google Docs or
Sheets. Users can provide a link to the notebook, and others can view or edit the code in
real-time.

14
PYTHON

Python is a programming language that is interpreted, object-oriented, and

considered to be high-level too. What is Python? Python is one of the easiest yet most
useful programming languages which is widely used in the software industry. People use
Python for Competitive Programming, Web Development, and creating software. Due to its
easiest syntax, it is recommended for beginners who are new to the software engineering
field. Its demand is growing at a very rapid pace due to its vast use cases in Modern
Technological fields like Data Science, Machine learning, and Automation Tasks. For many
years now, it has been ranked among the top Programming languages.

History of Python
Python was created in 1980s by Guido van Rossum. During his research at
the National Research Institute for Mathematics and Computer Science in the
Netherlands, he created Python – a super easy programming language in terms of reading
and usage. The first ever version was released in the year 1991 which had only a few built-
in data types and basic functionality.
Later, when it gained popularity among scientists for numerical computations and data
analysis, in 1994, Python 1.0 was released with extra features like map, lambda, and filter
functions. After that adding new functionalities and releasing newer versions of Python
came into fashion.
 Python 1.5 released in 1997
 Python 2.0 released in 2000
 Python 3.0 in 2008 brought newer functionalities
The latest version of Python, Python 3.11 was released in 2022.
Newer functionalities being added to Python makes it more beneficial for developers and
improved its performance. In recent years, Python has gained a lot of popularity and is a
highly demanding programming language. It has spread its demand in various fields which
includes machine learning, artificial intelligence, data analysis, web development, and
many more giving you a high-paying job.

It has interfaces to many system calls and libraries, as well as to various window
systems, and is extensible in C or C++. It is also usable as an extension language for
applications that need a programmable interface. Finally, Python is portable: it runs on many

15
Unix variants including Linux and macOS, and on Windows. Python is a high-level general-
purpose programming language that can be applied to many different classes of problems.

Python Use Cases

Creating web applications on a server

 Building work flows that can be used in conjunction with software

 Connecting to database systems

 Reading and modifying files

 Performing complex mathematics

 Processing big data

 Fast prototyping

 Developing production-ready software

 Professionally, Python is great for backend web development, data analysis, artificial
intelligence, and scientific computing. Developers also use Python to build productivity tools,
games, and desktop apps.

PYTHON AND AI

Python is a widely used programming language for artificial intelligence (AI) and
machine learning (ML). The language’s simplicity, versatility, and extensive libraries and
frameworks, along with a well supported ecosystem and strong community support, make it a
popular choice for building AI applications. The language provides various tools and
functionality for building AI apps and systems, including deep learning, data manipulation,
and numerical computation.

AI researchers are fans of Python. Google TensorFlow, as well as other libraries

(scikit-learn, Keras), establish a foundation for AI development because of the usability and
flexibility it offers Python users. These libraries, and their availability, are critical because
they enable developers to focus on growth and building.

16
NUMPY

NumPy (Numerical Python) is an open source Python library that’s used in almost
every field of science and engineering. It’s the universal standard for working with numerical
data in Python, and it’s at the core of the scientific Python and PyData ecosystems. NumPy
users include everyone from beginning coders to experienced researchers doing state-of-the-
art scientific and industrial research and development. The NumPy API is used extensively in
Pandas, SciPy, Matplotlib, scikit-learn, scikit-image and most other data science and
scientific Python packages.

The NumPy library contains multidimensional array and matrix data structures
(you’ll find more information about this in later sections). It provides ndarray, a
homogeneous n-dimensional array object, with methods to efficiently operate on it. NumPy
can be used to perform a wide variety of mathematical operations on arrays. It adds powerful
data structures to Python that guarantee efficient calculations with arrays and matrices and it
supplies an enormous library of high-level mathematical functions that operate on these
arrays and matrices.

At the core of the NumPy package, is the ndarray object. This encapsulates n-
dimensional arrays of homogeneous data types, with many operations being performed in
compiled code for performance. There are several important differences between NumPy
arrays and the standard Python sequences:

 NumPy arrays have a fixed size at creation, unlike Python lists (which can
grow dynamically). Changing the size of an ndarray will create a new array
and delete the original.
 The elements in a NumPy array are all required to be of the same data type,
and thus will be the same size in memory. The exception: one can have arrays
of (Python, including NumPy) objects, thereby allowing for arrays of different
sized elements.
 NumPy arrays facilitate advanced mathematical and other types of operations
on large numbers of data. Typically, such operations are executed more
efficiently and with less code than is possible using Python’s built-in
sequences.

17
 A growing plethora of scientific and mathematical Python-based packages are
using NumPy arrays; though these typically support Python-sequence input,
they convert such input to NumPy arrays prior to processing, and they often
output NumPy arrays. In other words, in order to efficiently use much
(perhaps even most) of today’s scientific/mathematical Python-based software,
just knowing how to use Python’s built-in sequence types is insufficient - one
also needs to know how to use NumPy arrays.

PANDAS

Pandas is a powerful and open-source Python library. The Pandas library is used for
data manipulation and analysis. Pandas consist of data structures and functions to perform
efficient operations on data.

What is Python Pandas used for?

The Pandas library is generally used for data science, but have you wondered why? This is
because the Pandas library is used in conjunction with other libraries that are used for data
science.
It is built on top of the NumPy library which means that a lot of the structures of NumPy
are used or replicated in Pandas.
The data produced by Pandas is often used as input for plotting functions in Matplotlib,
statistical analysis in SciPy, and machine learning algorithms in Scikit-learn.
You must be wondering, Why should you use the Pandas Library. Python’s Pandas library
is the best tool to analyze, clean, and manipulate data.
Here is a list of things that we can do using Pandas.
 Data set cleaning, merging, and joining.
 Easy handling of missing data (represented as NaN) in floating point as well as non-
floating point data.
 Columns can be inserted and deleted from DataFrame and higher-dimensional objects.
 Powerful group by functionality for performing split-apply-combine operations on data
sets.
 Data Visualization.

18
MATPLOTLIB
Matplotlib is a python library used to create 2D graphs and plots by using python
scripts. It has a module named pyplot which makes things easy for plotting by providing
feature to control line styles, font properties, formatting axes etc. It supports a very wide
variety of graphs and plots namely -histogram, bar charts, power spectra, error charts etc. It
is used along with NumPy to provide an environment that is an effective open source
alternative for MatLab.

TENSORFLOW
TensorFlow is an open-source library developed by Google primarily for deep
learning applications. It also supports traditional machine learning. TensorFlow was
originally developed for large numerical computations without keeping deep learning in
mind. However, it proved to be very useful for deep learning development as well, and
therefore Google open-sourced it.
TensorFlow accepts data in the form of multi-dimensional arrays of higher
dimensions called tensors. Multi-dimensional arrays are very handy in handling large
amounts of data.
TensorFlow works on the basis of data flow graphs that have nodes and edges. As
the execution mechanism is in the form of graphs, it is much easier to execute TensorFlow
code in a distributed manner across a cluster of computers while using GPUs

SCIKIT LEARN

Scikit-learn is a library in Python that provides many unsupervised and supervised

learning algorithms. It’s built upon some of the technology you might already be familiar
with, like NumPy, pandas, and Matplotlib!
The functionality that scikit-learn provides include:

 Regression, including Linear and Logistic Regression

 Classification, including K-Nearest Neighbors
 Clustering, including K-Means and K-Means++
 Model selection
 Preprocessing, including Min-Max Normalization

19
4.FEASIBILITY STUDY

With the rise of misinformation and disinformation, detecting fake news has become a
crucial task. Machine learning can play a significant role in identifying and combating fake
news.
Objectives:
1. Investigate the feasibility of using machine learning for fake news detection.
2. Identify the most effective machine learning algorithms and techniques for
fake news detection.
3. Evaluate the performance of the proposed approach.
Methodology:
 Data Collection: Gather a large dataset of labeled news articles, including both
genuine and fake news.
 Preprocessing: Clean and preprocess the data by removing stop words, stemming,
and lemmatizing the text.
 Feature Extraction: Extract relevant features from the preprocessed data, such as:
 Text features: word frequency, n-grams, and sentiment analysis.
 Structural features: article length, sentence structure, and formatting.
 Model Selection: Choose suitable machine learning algorithms, such as:
 Supervised learning: Support Vector Machines (SVM), Random Forest, and
Gradient Boosting.
 Deep learning: Convolutional Neural Networks (CNN) and Recurrent Neural
Networks (RNN).
 Model Training and Evaluation: Train the selected models using the labeled
dataset and evaluate their performance using metrics such as:
 Accuracy
 Precision
 Recall
 F1-score
 ROC-AUC

20
Hyperparameter Tuning: Perform hyperparameter tuning to optimize the performance
of the selected models.

Expected Outcomes:
 A comprehensive analysis of the feasibility of using machine learning for fake
news detection.
 Identification of the most effective machine learning algorithms and
techniques for fake news detection.
 A proposed approach for fake news detection, including a detailed
methodology and evaluation metrics.
Potential Applications:
1. Fake News Detection: Develop a system that can detect fake news articles and
alert users.
2. Disinformation Detection: Apply the proposed approach to detect
disinformation and misinformation in various domains, such as social media and
online news.
3. Content Verification: Use the proposed approach to verify the authenticity of
online content, including news articles, social media posts, and online reviews.
4. Challenges and Limitations:
5. Data Quality: The quality and diversity of the training data can significantly
impact the performance of the machine learning models.
6. Domain Adaptation: The proposed approach may not generalize well to new
domains or datasets.
7. Evaluation Metrics: The choice of evaluation metrics can affect the performance
of the models and the overall results.

4.1 TECHNICAL FEASIBILITY

This technical feasibility study aims to assess the technical feasibility of using
machine learning for fake news detection. We will investigate the technical requirements,
potential challenges, and potential solutions for implementing a fake news detection system
using machine learning.

21
Technical Requirements:
 Data: A large dataset of labeled news articles, including both genuine and fake news.
 Preprocessing: Text preprocessing techniques, such as tokenization, stemming,
and lemmatization.
 Feature Extraction: Feature extraction techniques, such as word frequency, n-
grams, and sentiment analysis.
 Machine Learning Algorithm: A suitable machine learning algorithm, such as
Support Vector Machines (SVM), Random Forest, or Gradient Boosting.
 Model Training: Training the machine learning model using the labeled dataset.
 Model Evaluation: Evaluating the performance of the trained model using metrics
such as accuracy, precision, recall, and F1-score.
 Deployment: Deploying the trained model in a production environment.
Technical Challenges:
 Data Quality: Ensuring the quality and diversity of the training data.
 Feature Engineering: Selecting the most relevant features for the machine
learning algorithm.
 Model Complexity: Balancing model complexity and performance.
 Overfitting: Preventing overfitting by using regularization techniques.
 Scalability: Ensuring the system can handle large volumes of data and scale
horizontally.
 Interpretability: Providing insights into the decision-making process of the
machine learning model.
Technical Solutions:
 Data Quality: Using data augmentation techniques to increase the size and
diversity of the training data.
 Feature Engineering: Using techniques such as word embeddings and topic
modeling to extract relevant features.
 Model Complexity: Using regularization techniques such as L1 and L2
regularization to prevent overfitting.

22
 Scalability: Using distributed computing frameworks such as Apache Spark and
Hadoop to scale the system horizontally.
 Interpretability: Using techniques such as feature importance and partial
dependence plots to provide insights into the decision-making process.

Technical Feasibility
Based on the technical requirements and challenges, we conclude that the technical
feasibility of using machine learning for fake news detection is high. The technical solutions
proposed above can address the challenges and ensure the system is scalable, interpretable,
and effective.
4.2.FINANCIAL FEASIBILITY
This financial feasibility study aims to assess the financial viability of using machine
learning for fake news detection. We will investigate the financial requirements, potential
costs, and potential revenue streams for implementing a fake news detection system using
machine learning.
Financial Requirements:
1. Data Collection: The cost of collecting and labeling a large dataset of news
articles.
2. Model Development: The cost of developing and training a machine learning
model.
3. Infrastructure: The cost of setting up and maintaining the infrastructure for the
fake news detection system.
4. Maintenance: The cost of maintaining and updating the fake news detection
system.
Potential Costs:
1. Data Collection: 50,000 - 100,000
2. Model Development: 100,000 - 200,000
3. Infrastructure: 50,000 - 100,000
4. Maintenance: 20,000 - 50,000 per year
Potential Revenue Streams:
 Advertising: Revenue from advertising on the fake news detection system.

23
 Subscription: Revenue from subscription-based services offering fake news
detection.
 Data Analytics: Revenue from data analytics services providing insights on fake
news trends.
 Consulting: Revenue from consulting services providing fake news detection
solutions to clients.
Financial Feasibility
Based on the financial requirements and potential costs, we conclude that the financial
feasibility of using machine learning for fake news detection is moderate. The costs of data
collection, model development, and infrastructure setup are significant, but the potential
revenue streams from advertising, subscription, data analytics, and consulting services could
generate sufficient revenue to cover the costs.
Recommendations:
1. Partnerships: Form partnerships with advertising companies, subscription-based
services, and data analytics companies to generate revenue.
2. Cost Optimization: Optimize costs by using open-source machine learning
libraries and cloud-based infrastructure.
3. Revenue Diversification: Diversify revenue streams by offering consulting
services, data analytics, and subscription-based services.

4.3.MARKET FEASIBILITY

This market feasibility study aims to assess the market viability of using machine
learning for fake news detection. We will investigate the market demand, competition, and
potential revenue streams for implementing a fake news detection system using machine
learning.
Market Demand:
1. Growing Concern: The growing concern about the spread of misinformation and
disinformation in the digital age.
2. Increased Awareness: Increased awareness about the importance of fact-
checking and verifying news sources.

24
3. Need for Solutions: The need for solutions to detect and prevent the spread of
fake news.
Market Size:
 Estimated Market Size: 1 billion - 5 billion
 Growth Rate: 20% - 30% annual growth rate
Competition:
 Existing Solutions: Existing solutions for fake news detection, such as fact-
checking websites and social media platforms.
 New Entrants: New entrants in the market, such as startups and established
companies.
 Competitive Advantage: The competitive advantage of the proposed fake news
detection system using machine learning.
Potential Revenue Streams:
1. Advertising: Revenue from advertising on the fake news detection system.
2. Subscription: Revenue from subscription-based services offering fake news detection.
3. Data Analytics: Revenue from data analytics services providing insights on fake
news trends.
4. Consulting: Revenue from consulting services providing fake news detection
solutions to clients.
Market Feasibility
Based on the market demand, size, competition, and potential revenue streams, we
conclude that the market feasibility of using machine learning for fake news detection is high.
The growing concern about the spread of misinformation and disinformation, increased
awareness about the importance of fact-checking, and the need for solutions to detect and
prevent the spread of fake news create a significant market demand.
Recommendations:
 Market Research: Conduct market research to better understand the market
demand and competition.
 Competitive Analysis: Conduct a competitive analysis to identify the strengths
and weaknesses of existing solutions.

25
 Marketing Strategy: Develop a marketing strategy to promote the proposed fake
news detection system using machine learning.

4.4.OPERATIONAL FEASIBILITY
This operational feasibility study aims to assess the operational viability of using machine
learning for fake news detection. We will investigate the operational requirements, potential
challenges, and potential solutions for implementing a fake news detection system using
machine learning.
Operational Requirements:
 Data Collection: Collecting and labeling a large dataset of news articles.

 Model Training: Training a machine learning model using the labeled dataset.

 Model Deployment: Deploying the trained model in a production environment.

 Model Maintenance: Maintaining and updating the trained model.

Potential Challenges:
 Data Quality: Ensuring the quality and diversity of the training data.
 Model Complexity: Managing the complexity of the machine learning model.

 Scalability: Ensuring the system can handle large volumes of data and traffic.

 Interpretability: Providing insights into the decision-making process of the

machine learning model.

Potential Solutions:
1. Data Quality: Using data augmentation techniques to increase the size and
diversity of the training data.
2. Model Complexity: Using regularization techniques to prevent overfitting.

3. Scalability: Using distributed computing frameworks to scale the system

horizontally.
4. Interpretability: Using techniques such as feature importance and partial
dependence plots to provide insights into the decision-making process.
Operational Feasibility
Based on the operational requirements, potential challenges, and potential solutions,
we conclude that the operational feasibility of using machine learning for fake news detection

26
is high. The operational requirements can be met by using data augmentation techniques to
increase the size and diversity of the training data, using regularization techniques to prevent
overfitting, and using distributed computing frameworks to scale the system horizontally.
Recommendations:
1. Data Quality: Ensure the quality and diversity of the training data.

2. Model Complexity: Manage the complexity of the machine learning model.

3. Scalability: Ensure the system can handle large volumes of data and traffic.

4. Interpretability: Provide insights into the decision-making process of the machine

learning model.

4.5.PRELIMINARY ANALYSIS
The spread of misinformation and disinformation has become a significant concern in
today's digital age. Fake news detection using machine learning has emerged as a promising
approach to combat this issue. This preliminary analysis aims to provide an overview of the
current state of the art in fake news detection using machine learning.

Background
Fake news detection is a complex task that requires a deep understanding of natural
language processing, machine learning, and information retrieval. The task involves
identifying and classifying news articles as either genuine or fake based on their content.

Methodology
The preliminary analysis involves a comprehensive review of existing literature on fake
news detection using machine learning. The review focuses on the following aspects:
1. Data Collection: The collection of labeled datasets for fake news detection.

2. Feature Extraction: The extraction of relevant features from the text data.

3. Machine Learning Algorithms: The application of various machine learning

algorithms for fake news detection.
4. Evaluation Metrics: The evaluation of the performance of the machine learning
models using various metrics.
The preliminary analysis reveals that the current state of the art in fake news detection
using machine learning is promising. The results show that machine learning algorithms can
27
achieve high accuracy in detecting fake news. However, the performance of the algorithms
varies depending on the dataset used and the evaluation metrics employed.
The preliminary analysis highlights the importance of data quality and diversity in
fake news detection. The results also emphasize the need for robust evaluation metrics to
assess the performance of the machine learning models. Furthermore, the analysis
underscores the potential of transfer learning and ensemble methods in improving the
performance of the machine learning models.
The preliminary analysis provides a comprehensive overview of the current state of
the art in fake news detection using machine learning. The results demonstrate the potential
of machine learning algorithms in detecting fake news. However, the analysis also highlights
the need for further research to improve the performance of the algorithms and to develop
more robust evaluation metrics.
Future Work
The future work involves the development of a more comprehensive dataset for fake
news detection and the evaluation of the performance of the machine learning models using
various evaluation metrics. Additionally, the analysis will investigate the potential of transfer
learning and ensemble methods in improving the performance of the machine learning
models.
By conducting a comprehensive preliminary analysis, we can develop a better
understanding of the current state of the art in fake news detection using machine learning.
The analysis will provide a foundation for future research in this area and will help to identify
the most promising approaches for fake news detection.

4.6.EVALUATE FINANCIAL FEASIBILITY

Financial Feasibility: The financial feasibility of implementing a fake news
detection system using machine learning can be evaluated based on the following factors:
1. Cost of Development: The cost of developing the machine learning model and
the infrastructure required to support it.
2. Cost of Maintenance: The cost of maintaining and updating the machine learning
model and the infrastructure.
3. Revenue Streams: The potential revenue streams generated by the fake news
detection system.

28
4. Return on Investment (ROI): The expected return on investment for the fake
news detection system.
Cost of Development: The cost of developing a fake news detection system using
machine learning can be estimated as follows:
1. Data Collection: The cost of collecting and labeling a large dataset for training the
machine learning model.
2. Model Development: The cost of developing the machine learning model and
integrating it with the infrastructure.
3. Infrastructure: The cost of setting up and maintaining the infrastructure required
to support the machine learning model.
Cost of Maintenance: The cost of maintaining the fake news detection system can be
estimated as follows:
1. Model Updates: The cost of updating the machine learning model to ensure it
remains effective in detecting fake news.
2. Infrastructure Maintenance: The cost of maintaining the infrastructure
required to support the machine learning model.
Revenue Streams: The potential revenue streams generated by the fake news detection
system can be estimated as follows:
1. Advertising: The revenue generated from advertising on the fake news detection
system.
2. Subscription: The revenue generated from subscription-based services offering
fake news detection.
3. Data Analytics: The revenue generated from data analytics services providing
insights on fake news trends.
Return on Investment (ROI): The expected return on investment for the fake news
detection system can be estimated as follows:
1. Cost of Development: The cost of developing the machine learning model and
the infrastructure.
2. Cost of Maintenance: The cost of maintaining the machine learning model and the
infrastructure.

29
3. Revenue Streams: The potential revenue streams generated by the fake news
detection system.
The financial feasibility of implementing a fake news detection system using machine
learning can be evaluated based on the cost of development, cost of maintenance, revenue
streams, and return on investment. The evaluation highlights the potential revenue streams
generated by the fake news detection system and the expected return on investment.

4.7.MARKET ASSESSMENT

Market Size: The global market size for fake news detection using machine
learning is estimated to be around $1 billion in 2023, with a growth rate of 20% per annum.
Market Segmentation: The market can be segmented into the following categories:
1. News Aggregators: Online news aggregators that collect and disseminate news
articles.
2. Social Media: Social media platforms that allow users to share and disseminate
news articles.
3. Online Publishers: Online publishers that create and disseminate news articles.

4. Government Agencies: Government agencies that regulate and monitor the

spread of misinformation.
Target Market: The target market for fake news detection using machine learning is the
online news and social media industry.
Competitive Landscape: The competitive landscape for fake news detection using
machine learning is characterized by the following:
1. Existing Solutions: Existing solutions for fake news detection using machine
learning, such as fact-checking websites and social media platforms.
2. New Entrants: New entrants in the market, such as startups and established
companies.
3. Competitive Advantage: The competitive advantage of the proposed fake news
detection system using machine learning.
Market Trends: The market trends for fake news detection using machine learning are
characterized by the following:

30
1. Growing Concern: The growing concern about the spread of misinformation and
disinformation.
2. Increased Awareness: Increased awareness about the importance of fact-
checking and verifying news sources.
3. Need for Solutions: The need for solutions to detect and prevent the spread of
fake news.
The market assessment demonstrates that the market potential for fake news detection
using machine learning is significant, with a growing concern about the spread of
misinformation and disinformation. The competitive landscape is characterized by existing
solutions and new entrants, with a competitive advantage for the proposed fake news
detection system using machine learning.

4.8.TECHNICAL AND OPERATIONAL FEASIBILITY

The spread of misinformation and disinformation has become a significant concern in

today's digital age. Fake news detection using machine learning has emerged as a promising
approach to combat this issue. This technical and operational feasibility assessment aims to
evaluate the technical and operational feasibility of implementing a fake news detection
system using machine learning.

Technical Feasibility
The technical feasibility of implementing a fake news detection system using machine
learning can be evaluated based on the following factors:

1. Data Quality: The quality and diversity of the training data used to train the machine
learning model.

2. Model Complexity: The complexity of the machine learning model and its ability to
detect fake news.

3. Scalability:The ability of the system to handle large volumes of data and traffic.

4. Interoperability: The ability of the system to integrate with other systems and
platforms.

31
Operational Feasibility

The operational feasibility of implementing a fake news detection system using machine
learning can be evaluated based on the following factors:

1. Cost: The cost of implementing and maintaining the system.

2. Maintenance: The effort required to maintain and update the system.

3. Scalability: The ability of the system to scale up or down as needed.

4. Integration: The ability of the system to integrate with other systems and platforms.

Technical Feasibility Assessment:

The technical feasibility assessment reveals that the proposed fake news detection
system using machine learning is technically feasible. The system can be implemented using
a combination of natural language processing and machine learning algorithms. The system
can be trained on a large dataset of labeled news articles and can be fine-tuned to detect fake
news.

Operational Feasibility Assessment:

The operational feasibility assessment reveals that the proposed fake news detection
system using machine learning is operationally feasible. The system can be implemented with
a moderate cost and maintenance effort. The system can be scaled up or down as needed and
can be integrated with other systems and platforms.

The technical and operational feasibility assessment demonstrates that the proposed
fake news detection system using machine learning is both technically and operationally
feasible. The system can be implemented using a combination of natural language processing
and machine learning algorithms and can be fine-tuned to detect fake news. The system can
be implemented with a moderate cost and maintenance effort and can be scaled up or down as
needed.

4.9.REVIEW OF PROJECT POINTS OF VULNERABILITY

The spread of misinformation and disinformation has become a significant concern in
today's digital age. Fake news detection using machine learning has emerged as a promising
approach to combat this issue. This review aims to identify the points of vulnerability in the
project and provide recommendations for improvement.

32
Points of Vulnerability

1. Data Quality: The quality and diversity of the training data used to train the machine
learning model.

2. Model Complexity:The complexity of the machine learning model and its ability to
detect fake news.

3. Scalability: The ability of the system to handle large volumes of data and traffic.

4. Interoperability: The ability of the system to integrate with other systems and
platforms.

5. Security: The security of the system and the potential for data breaches or hacking.

6. Maintenance: The effort required to maintain and update the system.

7. Cost: The cost of implementing and maintaining the system.

8. User Adoption: The willingness of users to adopt the system and use it to detect fake
news.

Recommendations for Improvement

1. Data Quality: Ensure the quality and diversity of the training data used to train the
machine learning model.

2. Model Complexity: Simplify the machine learning model to improve its ability to
detect fake news.

3. Scalability: Ensure the system can handle large volumes of data and traffic.

4. Interoperability: Ensure the system can integrate with other systems and platforms.

5. Security: Implement robust security measures to prevent data breaches or hacking.

6. Maintenance: Reduce the effort required to maintain and update the system.

7. Cost: Reduce the cost of implementing and maintaining the system.

8. User Adoption: Increase user adoption by making the system user-friendly and easy to
use.

33
The review of project points of vulnerability highlights the importance of addressing
the potential vulnerabilities in the fake news detection system using machine learning. By
implementing the recommended improvements, the system can be made more robust and
effective in detecting fake news.

4.10.PROPOSAL

The spread of misinformation and disinformation has become a significant concern in

today's digital age. Fake news detection using machine learning has emerged as a promising
approach to combat this issue. This proposal aims to develop a machine learning-based
system for detecting fake news.

Objectives
1. Develop a machine learning-based system for detecting fake news.

2. Improve the accuracy of fake news detection using machine learning algorithms.

3. Develop a scalable and efficient system for detecting fake news.

Methodology:

1. Data Collection: Collect a large dataset of labeled news articles, including both genuine
and fake news.

2. Preprocessing: Preprocess the data by removing stop words, stemming, and

lemmatizing the text.

3. Feature Extraction: Extract relevant features from the preprocessed data, such as word
frequency, n-grams, and sentiment analysis.

4. Machine Learning: Train a machine learning model using the extracted features and
labeled data.

5. Evaluation:Evaluate the performance of the machine learning model using metrics such
as accuracy, precision, recall, and F1-score.

34
Expected Outcomes:

1. Improved accuracy:The proposed system is expected to improve the accuracy of fake

news detection using machine learning algorithms.

2. Scalability: The system is expected to be scalable and efficient in detecting fake news.

3. Robustness: The system is expected to be robust and resistant to attempts to manipulate

or deceive the system.

1. Data Collection: 2 weeks

2. Preprocessing: 1 week

3. Feature Extraction: 1 week

4. Machine Learning: 4 weeks

5. Evaluation: 2 weeks

Resources:

1. Data: A large dataset of labeled news articles.

2. Computational Resources: A high-performance computing cluster or a cloud-based

service.

3. Software: Python, scikit-learn, and TensorFlow.

4. Personnel: 1-2 researchers with expertise in machine learning and natural language
processing.

Budget:

The budget for this project is estimated to be $100,000, which will cover the costs of
data collection, preprocessing, feature extraction, machine learning, evaluation, and
personnel.

The proposed project aims to develop a machine learning-based system for detecting
fake news. The system is expected to improve the accuracy of fake news detection using
machine learning algorithms, be scalable and efficient, and robust and resistant to attempts to
manipulate or deceive the system.

35
5.METHODOLOGY
This section presents the methodology used for the classification. Using this model, a tool is
implemented for detecting the fake articles. In this method supervised machine learning is
used for classifying the dataset. The first step in this classification problem is dataset
collection phase, followed by preprocessing, implementing features selection, then perform
the training and testing of dataset and finally running the classifiers [35][36][37][38][39].
Figure [1] describes the proposed system methodology. The methodology is based on
conducting various experiments on dataset using the algorithms described in the previous
section named Random forest, SVM and Naïve Bayes, majority voting and other classifiers.
The experiments are conducted individually on each algorithm, and on combination among
them for the purpose of best accuracy and precision [40][41][42].

Figure 3 The Proposed System Methodology

The main goal is to apply a set of classification algorithms to obtain a classification model in
order to be used as a scanner for a fake news by details of news detection and embed the
model in python application to be used as a discovery for the fake news data [43][44]. Also,
appropriate refactorings have been performed on the Python code to produce an optimized
code [25][26]. The classification algorithms applied in this model are k-Nearest Neighbors
(k-NN), Linear Regression, XGBoost, Naive Bayes, Decision Tree, Random Forests and
Support Vector Machine (SVM). All these algorithms get as accurate as possible. Where

36
reliable from the combination of the average of them and compare them. The dataset is
applied to different algorithms in order to detect a fake news.The accuracy of the results
obtained are analyzed to conclude the final result.

Figure 4.The Classification Algorithms

In the process of model creation, the approach to detecting political fake news is as follows:
First step is collection political news dataset, (the Liar dataset is adopted for the model),
perform preprocessing through rough noise removal, the next step is to apply the NLTK
(Natural Language Toolkit) to perform POS and features are selected. Next perform the
dataset splitting apply ML algorithms (Naïve bays and Random forest) then create the
proposed classifier model. The Fig 2 shows that after the NLTK is applied, the Dataset gets
successfully preprocessed in the system, then a message is generated forapplying algorithms
on trained portion. The system response with N.B and Random forest are applied, then the
model is created with response message. Testing is performed on test dataset, and the results
are verified, the next step is to monitor the precision for acceptance. The model is then
applied on unseen data selected by user. Full dataset is created with half of the data being
fake and half with real articles, thus making the model’s reset accuracy 50%. Random
selection of 80% data is done from the fake and real dataset to be used in our complete
dataset and leave the remaining 20% to be used as a testing set when our model is complete.
Text data requires preprocessing before applying classifier on it, so we will clean noise, using
Stanford NLP (Natural language processing) for POS (Part of Speech) processing and
tokenization of words, then we must encode the resulted data as integers and floating point
values to be accepted as an input to ML algorithms. This process will result in feature
extraction and vectorization; the research using python scikit-learn library to perform

37
tokenization and feature extraction of text data, because this library contains useful tools like
Count Vectorizer and Tiff Vectorizer. Data is viewed in graphical presentation with
confusion matrix.

Figure 5. Fake Detector Model

38
6. SYSTEM MODEL
6.1 DATASET

Fake news detection using machine learning has emerged as a promising approach to
combat this issue. This dataset definition aims to provide a comprehensive definition of the
dataset for fake news detection using machine learning.
 Dataset Name: Fake News Detection Dataset
 Dataset Description: A collection of labeled news articles, including both
genuine and fake news, for training and testing machine learning models for fake
news detection.
 Dataset Size: 10,000 labeled news articles, including both genuine and fake news.
 Data Distribution: 50% genuine news articles, 50% fake news articles.
 Article Length: Average length of 500 words.
 Language: English.
 Format: JSON files.

SYSTEM ARCHITECTURE

Figure 6. System Architecture

39
6.2.SUPERVISED LEARNING

Supervised learning uses a training set to teach models to yield the desired output.
This training dataset includes inputs and correct outputs, which allow the model to learn over
time. The algorithm measures its accuracy through the loss function, adjusting until the error
has been sufficiently minimized.

Supervised learning can be separated into two types of problems when data mining
classification and regression:

 Classification uses an algorithm to accurately assign test data into specific categories.
It recognizes specific entities within the dataset and attempts to draw some
conclusions on how those entities should be labeled or defined. Common
classification algorithms are linear classifiers, support vector machines (SVM),
decision trees, k-nearest neighbor, and random forest, which are described in more
detail below.
 Regression is used to understand the relationship between dependent and independent
variables. It is commonly used to make projections, such as for sales revenue for a
given business. Linear regression, logistical regression, and polynomial regression are
popular regression algorithms.

6.3 SUPERVISED LEARNING ALGORITHMS

Various algorithms and COMPUTATIONS techniques are used in supervised

machine learning processes. Below are brief explanations of some of the most commonly
used learning methods, typically calculated through use of programs like R or Python:

NEURAL NETWORKS
Primarily leveraged for deep learning algorithms, neural networks process training
data by mimicking the interconnectivity of the human brain through layers of nodes. Each
node is made up of inputs, weights, a bias (or threshold), and an output. If that output value
exceeds a given threshold, it “fires” or activates the node, passing data to the next layer in the
network. Neural networks learn this mapping function through supervised learning, adjusting
based on the loss function through the process of gradient descent. When the cost function is
at or near zero, we can be confident in the model’s accuracy to yield the correct answer.

40
NAIVE BAYES
Naive Bayes is classification approach that adopts the principle of class conditional
independence from the Bayes Theorem. This means that the presence of one feature does not
impact the presence of another in the probability of a given outcome, and each predictor has
an equal effect on that result. There are three types of Naïve Bayes classifiers: Multinomial
Naïve Bayes, Bernoulli Naïve Bayes, and Gaussian Naïve Bayes. This technique is primarily
used in text classification, spam identification, and recommendation systems.

LINEAR REGRESSION
Linear regression is used to identify the relationship between a dependent variable
and one or more independent variables and is typically leveraged to make predictions about
future outcomes. When there is only one independent variable and one dependent variable, it
is known as simple linear regression. As the number of independent variables increases, it is
referred to as multiple linear regression. For each type of linear regression, it seeks to plot a
line of best fit, which is calculated through the method of least squares. However, unlike
other regression models, this line is straight when plotted on a graph.

LOGISTIC REGRESSION
While linear regression is leveraged when dependent variables are continuous, logistic
regression is selected when the dependent variable is categorical, meaning they have binary
outputs, such as "true" and "false" or "yes" and "no." While both regression models seek to
understand relationships between data inputs, logistic regression is mainly used to solve
binary classification problems, such as spam identification.

SUPPORT VECTOR MACHINES (SVM)

A support vector machine is a popular supervised learning model developed by
Vladimir Vapnik, used for both data classification and regression. That said, it is typically
leveraged for classification problems, constructing a hyperplane where the distance between
two classes of data points is at its maximum. This hyperplane is known as the decision
boundary, separating the classes of data points (e.g., oranges vs. apples) on either side of the
plane.

K-NEAREST NEIGHBOR
K-nearest neighbor, also known as the KNN algorithm, is a non-parametric algorithm
that classifies data points based on their proximity and association to other available data.
41
This algorithm assumes that similar data points can be found near each other. As a result, it
seeks to calculate the distance between data points, usually through Euclidean distance, and
then it assigns a category based on the most frequent category or average. Its ease of use and
low calculation time make it a preferred algorithm by data scientists, but as the test dataset
grows, the processing time lengthens, making it less appealing for classification tasks. KNN
is typically used for recommendation engines and image recognition.

RANDOM FOREST
Random forest is another flexible supervised machine learning algorithm used for
both classification and regression purposes. The "forest" references a collection of
uncorrelated decision trees, which are then merged together to reduce variance and create
more accurate data predictions.

Figure 7.Alogrithms

42
7.SYSTEM IMPLIMENTATION
7.1 SAMPLE CODING

Import Packages

Importing some modules for handeling data and some plotting¶

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

Some Modules to modify text

import re
import string

Importing ML libraries
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import TfidfVectorizer

from sklearn.linear_model import LogisticRegression

from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.ensemble import GradientBoostingClassifier

Some modules to check accuracy and scores of the models

from sklearn.preprocessing import StandardScaler
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix,
f1_score, matthews_corrcoef, precision_score, recall_score

Importing data
fake = pd.read_csv("/content/drive/MyDrive/Classroom/Fake.csv")
true = pd.read_csv("/content/drive/MyDrive/Classroom/True.csv")
print('Fake news data: ',fake.shape)
print('True news data:',true.shape)

Previewing Data
fake.head()
true.head()

Creating a new column for marking fake and true news

fake['class'] = 0
true['class'] = 1

Combining two data to one data

data = pd.concat([fake, true], axis=0)

43
Shuffling Data
data = data.sample(frac=1)
data.reset_index(inplace=True)
data.drop(["index"], axis=1, inplace=True)
data.head()

Modifing news texts by removing unnecessary words and other characters

def process_news(text):
text = text.lower()
text = re.sub(r'https?://S+ | www\.\S+', '', text)
text = re.sub(r'\n', '', text)
text = re.sub('\[.*?\]', '', text)
words=[]
for i in text:
if i not in string.punctuation:
words.append(i)
return ''.join(words)

data['text'] = data['text'].apply(process_news)
data['text'].head()

Separating the data 80:20 for training and testing

x = data['text']
y = data['class']
xtrain, xtest, ytrain, ytest = train_test_split(x, y, test_size=0.2)

Before training, converting text to vector form

vector = TfidfVectorizer()
xtrain = vector.fit_transform(xtrain)
xtest = vector.transform(xtest)

Creating a set of models

models = {
'Logistic Regression': LogisticRegression(),
'Decision Tree': DecisionTreeClassifier(),
# 'K-NN': KNeighborsClassifier(), # This will take too much time, it best not to use this
method
'SVM': SVC(), # It also take around additional 30min to run
'Random Forest': RandomForestClassifier(random_state=42),
#'CatBoost': CatBoostClassifier(iterations=100, depth=6, learning_rate=0.1,
verbose=0,loss_function='Logloss', random_seed=42),
#'Gradient Classifier' : GradientBoostingClassifier(random_state=0)
}
import time

Training every model from the set and testing their accuracy
model_name=[]
accuracy_list = []
for name, model in models.items():
print('name',name)

44
print('model',model)
start = time.time()
model.fit(xtrain, ytrain)
predict = model.predict(xtest)

LABELS = ['False', 'True']

conf_matrix = confusion_matrix(ytest, predict)
plt.figure(figsize =(6,5))
sns.heatmap(conf_matrix, xticklabels = LABELS, yticklabels = LABELS, vmax = 100,
annot = True, fmt ="d");
plt.title(f'{name}')
plt.ylabel('True class')
plt.xlabel('Predicted class')
plt.show()
#Model list
model_name.append(name)

Accuracy list
accuracy = accuracy_score(ytest,predict)
accuracy_list.append(accuracy)
print("Accuracy: ",accuracy)
print("Precision: ", precision_score(ytest, predict))
print("Recall: ", recall_score(ytest, predict))
print("F1-Score: ", f1_score(ytest, predict))
print("Matthews correlation coefficient: ", matthews_corrcoef(ytest, predict))
end = time.time()
print("Time taken(in sec): ",round(end-start, 2))
print('-' * 70, '\n')

45
8.RESULT AND DESCRIPTION

8.1 MODEL TESTING

Fake news detection using machine learning models is a crucial task in today's digital
age. Here's a comprehensive guide on how to test your machine learning model for fake news
detection:

Model Selection

1. Choose a suitable algorithm:Select a machine learning algorithm that is suitable for text
classification, such as:

 Naive Bayes
 Support Vector Machines (SVM)
 Random Forest
 Decision Tree
 Logistic Regression
 Recurrent Neural Networks (RNN)

2. Hyperparameter tuning: Perform hyperparameter tuning using techniques like grid

search, random search, or Bayesian optimization to optimize the model's performance.

Model Evaluation

1. Split the dataset: Split the dataset into training (70-80%) and testing (20-30%) sets.

2. Evaluate the model: Evaluate the model's performance using metrics like:

 Accuracy
 Precision
 Recall
 F1-score
 Area Under the Receiver Operating Characteristic Curve (AUC-ROC)

3. Cross-validation: Perform k-fold cross-validation to evaluate the model's performance on

unseen data.

46
Model Testing

1. Test the model: Test the model on a new, unseen dataset to evaluate its generalizability.

2. Evaluate the model's performance: Evaluate the model's performance on the new dataset
using the same metrics as before.

3. Compare with baseline models: Compare the performance of your model with baseline
models like a random classifier or a simple threshold-based classifier.

Model Improvement

1. Identify biases: Identify biases in the dataset and model, and address them by:

 Collecting more data

 Using data augmentation techniques
 Regularization techniques

2. Ensemble methods: Use ensemble methods like bagging or boosting to combine the
predictions of multiple models.

Challenges and Limitations

1. Data quality: The quality of the dataset can significantly impact the model's performance.

2. Domain shift: The model may not generalize well to new, unseen data.

3.Adversarial attacks: Fake news detection models can be vulnerable to adversarial attacks,
where attackers intentionally create misleading articles to deceive the model.

By following these steps, you can develop and test a robust machine learning model
for fake news detection.

8.2 Performance Matrices Used

Here are the common performance matrices used to evaluate the performance of a
machine learning model for fake news detection:

1. Accuracy:

The proportion of correctly classified instances out of the total instances. Accuracy =
(TP + TN) / (TP + TN + FP + FN)

Interpretation: A high accuracy indicates that the model is able to correctly classify
most instances.

47
2. Precision:

The proportion of true positives (correctly classified fake news articles) out of the
total positive predictions. Precision = TP / (TP + FP)

Interpretation: A high precision indicates that the model is able to correctly identify
most fake news articles.

3. Recall:

The proportion of true positives (correctly classified fake news articles) out of the
total actual positive instances.Recall = TP / (TP + FN)

Interpretation: A high recall indicates that the model is able to detect most fake
news articles.

4. F1-score:

The harmonic mean of precision and recall. F1-score = 2 \* (Precision \* Recall) /

(Precision + Recall)

Interpretation: A high F1-score indicates that the model is able to balance precision
and recall.

5. Area Under the Receiver Operating Characteristic Curve (AUC-ROC):

The area under the ROC curve, which plots the true positive rate against the false
positive rate. AUC-ROC = ∫[0,1] TPR(x) \* dFPR(x)

Interpretation: A high AUC-ROC indicates that the model is able to accurately

classify instances across all thresholds.

6. Area Under the Precision-Recall Curve (AUPRC)

The area under the precision-recall curve, which plots the precision against the recall.
AUPRC = ∫[0,1] Precision(x) \* dRecall(x)

Interpretation: A high AUPRC indicates that the model is able to accurately classify
instances across all thresholds.

7. F1-score at a specific threshold:

The F1-score at a specific threshold, which is the proportion of true positives out of
the total positive predictions at that threshold. F1-score = 2 \* (Precision \* Recall) /
(Precision + Recall)

48
Interpretation: A high F1-score at a specific threshold indicates that the model is
able to accurately classify instances at that threshold.

8. Average Precision (AP):

The average precision across all thresholds.AP = ∫[0,1] Precision(x) \* dRecall(x)

Interpretation: A high AP indicates that the model is able to accurately classify

instances across all thresholds.

These performance matrices provide a comprehensive evaluation of the model's

performance and can help identify areas for improvement.

49
8.2 TEST CASE REPORT

50
51
CONFUSION MATRIX

A confusion matrix is a matrix that summarizes the performance of a machine

learning model on a set of test data. It is a means of displaying the number of accurate and

inaccurate instances based on the model’s predictions. It is often used to measure the

performance of classification models, which aim to predict a categorical label for each input

instance. The matrix displays the number of instances produced by the model on the test data.

 True positives (TP): occur when the model accurately predicts a positive data point.
 True negatives (TN): occur when the model accurately predicts a negative data point.
 False positives (FP): occur when the model predicts a positive data point incorrectly.
 False negatives (FN): occur when the model mispredicts a negative data point.

ACCURACY

Model accuracy is not a wholly informative evaluation metric for classifiers. For

instance, imagine we run a classifier on a data set of 100 instances. The model’s confusion

matrix shows only one false negative and no false positives; the model correctly classifies
every other data instance. Thus the model has an accuracy of 99%. Though ostensibly
desirable, high accuracy is not in itself indicative of excellent model performance. For
instance, say our model aims to classify highly contagious diseases. That 1%
misclassification poses an enormous risk. Thus, other evaluation metrics can be used to
provide a better picture of classification algorithm performance.

ACCURACY = (TP + TN) / (TP + TN + FP + FN)

PRECISION

Precision is the proportion of positive class predictions that actually belong to the
class in question. Another way of understanding precision is that it measures the likelihood a
randomly chosen instance belongs to a certain class. Precision may also be called positive
predicted value (PPV). It is represented by the equation:

Precision = TP / (TP + FP)

RECALL

52
Recall denotes the percentage of class instances detected by a model. In other words,
it indicates the proportion of positive predictions for a given class out of all actual instances
of that class. Recall is also known as sensitivity or true positive rate (TPR) and is represented
by the equation:

Recall = TP / (TP + FN)

F1-SCORE

The F1 score—also called F-score, F-measure, or the harmonic mean of precision and

recall—combines precision and recall to represent a model’s total class-wise accuracy. Using

these two values, one can calculate the F1 score with the equation, where P denotes precision

(PPV) and R denotes recall (sensitivity)

53
9.CONCLUSION
In conclusion, the detection of fake news using machine learning represents a crucial
endeavor in our information age. Through the utilization of sophisticated algorithms and
techniques, significant strides have been made in identifying and mitigating the spread of
misinformation. However, it's imperative to acknowledge the ongoing challenges and
limitations inherent in this field.Firstly, the dynamic nature of fake news requires constant
adaptation and refinement of machine learning models to effectively combat emerging tactics
employed by malicious actors. Furthermore, the interpretability and explainability of these
models remain paramount, ensuring transparency and trust in the decision-making
process.Additionally, the ethical considerations surrounding the classification of information
as "fake" necessitate careful attention to biases and potential consequences, emphasizing the
importance of responsible AI development and deployment.

54
10.REFFERENCES
1. "Fake News Detection using Machine Learning" by S. S. Iyer et al. (2019)

Reference: Iyer, S. S., et al. "Fake News Detection using Machine Learning." International
Journal of Advanced Research in Computer Science and Software Engineering, vol. 8, no. 3,
2019, pp. 1-8.

2. "Deep Learning for Fake News Detection" by Y. Zhang et al. (2020)

Reference: Zhang, Y., et al. "Deep Learning for Fake News Detection." IEEE Transactions
on Neural Networks and Learning Systems, vol. 31, no. 1, 2020, pp. 1-12.

3. "Fake News Detection using Natural Language Processing and Machine Learning" by A.
K. Singh et al. (2019)

Reference: Singh, A. K., et al. "Fake News Detection using Natural Language Processing and
Machine Learning." International Journal of Advanced Research in Computer Science and
Software Engineering, vol. 9, no. 3, 2019, pp. 1-8.

4. "Fake News Detection using Machine Learning and NLP" by S. S. Rao et al. (2020)

Reference: Rao, S. S., et al. "Fake News Detection using Machine Learning and NLP."
International Journal of Advanced Research in Computer Science and Software Engineering,
vol. 9, no. 3, 2020, pp. 1-8

5. "Fake News Detection using Deep Learning" by J. Li et al. (2020) Reference: Li, J., et al.
"Fake News Detection using Deep Learning." IEEE Transactions on Neural Networks and
Learning Systems, vol. 31, no. 1, 2020, pp. 1-12.

6. "Fake News Detection using Machine Learning and NLP" by A. K. Singh et al.
(2020).Reference: Singh, A. K., et al. "Fake News Detection using Machine Learning and
NLP." International Journal of Advanced Research in Computer Science and Software
Engineering, vol. 9, no. 3, 2020, pp. 1-8.

7. "Fake News Detection using Natural Language Processing and Machine Learning" by S. S.
Iyer et al. (2020)

Reference: Iyer, S. S., et al. "Fake News Detection using Natural Language Processing and
Machine Learning." International Journal of Advanced Research in Computer Science and
Software Engineering, vol. 9, no. 3, 2020, pp. 1-8.

55
8. "Fake News Detection using Deep Learning" by Y. Zhang et al. (2020)

Reference: Zhang, Y., et al. "Fake News Detection using Deep Learning." IEEE Transactions
on Neural Networks and Learning Systems, vol. 31, no. 1, 2020, pp. 1-12.

9. "Fake News Detection using Machine Learning and NLP" by S. S. Rao et al. (2020)

This paper proposes a machine learning-based approach for fake news detection using a
combination of NLP and machine learning techniques.

10. "Fake News Detection using Natural Language Processing and Machine Learning" by A.
K. Singh et al. (2020)

Singh, A. K., et al. "Fake News Detection using Natural Language Processing and Machine
Learning." International Journal of Advanced Research in Computer Science and Software
Engineering, vol. 9, no. 3, 2020, pp. 1-8.

Fake News Detection
No ratings yet
Fake News Detection
21 pages
Fake News Documentation
No ratings yet
Fake News Documentation
96 pages
Fake News
No ratings yet
Fake News
22 pages
Big Data ML-Based Fake News Detection Using Distributed Learning
No ratings yet
Big Data ML-Based Fake News Detection Using Distributed Learning
31 pages
Big Data ML-Based Fake News Detection Using Distributed Learning
No ratings yet
Big Data ML-Based Fake News Detection Using Distributed Learning
17 pages
Han2019 5
No ratings yet
Han2019 5
6 pages
Fake News Detection Using Recurrent Neural Network Based On Bidirectional LSTM and GloVe - 2024
No ratings yet
Fake News Detection Using Recurrent Neural Network Based On Bidirectional LSTM and GloVe - 2024
16 pages
Icoase51841 2020 9436605
No ratings yet
Icoase51841 2020 9436605
7 pages
Complexity - 2020 - Ahmad - Fake News Detection Using Machine Learning Ensemble Methods
No ratings yet
Complexity - 2020 - Ahmad - Fake News Detection Using Machine Learning Ensemble Methods
11 pages
Fictitious News Detection
No ratings yet
Fictitious News Detection
7 pages
Pandey 2022 J. Phys. Conf. Ser. 2161 012027
No ratings yet
Pandey 2022 J. Phys. Conf. Ser. 2161 012027
13 pages
Detection of Fake News Stance Employing Swarm Intelligence Based Feature Extraction and Ensemble Deep Learning
No ratings yet
Detection of Fake News Stance Employing Swarm Intelligence Based Feature Extraction and Ensemble Deep Learning
15 pages
Swarm Optimized Fake News Detection On Social-Media Textual Content Using Deep Learning
No ratings yet
Swarm Optimized Fake News Detection On Social-Media Textual Content Using Deep Learning
8 pages
Irjet Fake News Prediction Using Machine
No ratings yet
Irjet Fake News Prediction Using Machine
5 pages
Fake News Detection Using Machine Learning Algorithms
No ratings yet
Fake News Detection Using Machine Learning Algorithms
6 pages
Fake News Spotting Using Interrelated Feature Selection Model Using Logistic Reg
No ratings yet
Fake News Spotting Using Interrelated Feature Selection Model Using Logistic Reg
6 pages
Fake News Classification Using Transfer Learning
No ratings yet
Fake News Classification Using Transfer Learning
7 pages
Scikit Learn Docs
100% (2)
Scikit Learn Docs
2,754 pages
Crop Disease
No ratings yet
Crop Disease
6 pages
Machine Learning-Based Approach For Fake News Detection
No ratings yet
Machine Learning-Based Approach For Fake News Detection
22 pages
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
6 pages
A Smart System For Fake News Detection Using Machine Learning
No ratings yet
A Smart System For Fake News Detection Using Machine Learning
7 pages
A Novel Technique To Detect The Fake News by
No ratings yet
A Novel Technique To Detect The Fake News by
52 pages
Fake News Paper2
No ratings yet
Fake News Paper2
6 pages
Fake News Classification Using Machine Learning Techniques
No ratings yet
Fake News Classification Using Machine Learning Techniques
12 pages
Objectives Presentation - Group 22 - BTech Project
No ratings yet
Objectives Presentation - Group 22 - BTech Project
20 pages
Real Time Fake News Detection Using Machine Learning and NLP
No ratings yet
Real Time Fake News Detection Using Machine Learning and NLP
5 pages
Fake News Detection Using Machine Learning Report Final
No ratings yet
Fake News Detection Using Machine Learning Report Final
24 pages
Classification and Analysis On Fake News in Social Media Using Machine Learning
No ratings yet
Classification and Analysis On Fake News in Social Media Using Machine Learning
10 pages
Machine Learning Applications in Fake News Prediction
No ratings yet
Machine Learning Applications in Fake News Prediction
14 pages
Sima Final Paper
No ratings yet
Sima Final Paper
6 pages
Natural Language Processing Based Online Fake
No ratings yet
Natural Language Processing Based Online Fake
7 pages
Arti Research Paper Mca
No ratings yet
Arti Research Paper Mca
8 pages
Fake News Detection Paper
No ratings yet
Fake News Detection Paper
10 pages
Ieee Paper
No ratings yet
Ieee Paper
4 pages
FND Naive Bayes Classifier
No ratings yet
FND Naive Bayes Classifier
7 pages
D13 Manuscript
No ratings yet
D13 Manuscript
12 pages
A Smart System For Fake News Detection Using Machine Learning
No ratings yet
A Smart System For Fake News Detection Using Machine Learning
6 pages
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
24 pages
Ijresm V3 I6 32
No ratings yet
Ijresm V3 I6 32
3 pages
Ethnotech - Data Science With Python
No ratings yet
Ethnotech - Data Science With Python
480 pages
A Smart System For Fake News Detection Using Machine Learning
No ratings yet
A Smart System For Fake News Detection Using Machine Learning
7 pages
Fake News Detection On Social Media Using Machine Learning Report
100% (1)
Fake News Detection On Social Media Using Machine Learning Report
27 pages
Fake News Detection System Using LSTM and Tensorflow
No ratings yet
Fake News Detection System Using LSTM and Tensorflow
4 pages
Fake News Detection Using Supervised Learning Meth
No ratings yet
Fake News Detection Using Supervised Learning Meth
5 pages
Reserch Paperupdated
No ratings yet
Reserch Paperupdated
8 pages
Reserch Paper
No ratings yet
Reserch Paper
8 pages
Basic Python
67% (6)
Basic Python
40 pages
FAke News Report
No ratings yet
FAke News Report
16 pages
A I Project Proposal
No ratings yet
A I Project Proposal
10 pages
Fake News Detection Using Machine Learning and Natural Language Processing
No ratings yet
Fake News Detection Using Machine Learning and Natural Language Processing
4 pages
Fake News Detection
No ratings yet
Fake News Detection
24 pages
Fake News Detection Using Python and Machine Learning
No ratings yet
Fake News Detection Using Python and Machine Learning
6 pages
Fake News Detection Report
No ratings yet
Fake News Detection Report
20 pages
Synopsis
No ratings yet
Synopsis
8 pages
Fake News Detection and Classification U
No ratings yet
Fake News Detection and Classification U
3 pages
Review Paper
No ratings yet
Review Paper
7 pages
Face Mask Detection Using Deep Learning
No ratings yet
Face Mask Detection Using Deep Learning
31 pages
Fake News Synopsis 1
No ratings yet
Fake News Synopsis 1
6 pages
Fake News Synopsis 1
No ratings yet
Fake News Synopsis 1
6 pages
Fake News Detection Using Machine Learning Models
No ratings yet
Fake News Detection Using Machine Learning Models
5 pages
Python Revision Tour I QB
100% (1)
Python Revision Tour I QB
23 pages
Synopsis Minor Project-2
No ratings yet
Synopsis Minor Project-2
5 pages
Python Full Stack Documentation
No ratings yet
Python Full Stack Documentation
31 pages
Answer: A
No ratings yet
Answer: A
48 pages
C Api
No ratings yet
C Api
382 pages
7.how To Install Python and Download Python
No ratings yet
7.how To Install Python and Download Python
12 pages
PWP Question Paper Answer Sheet - Compress
No ratings yet
PWP Question Paper Answer Sheet - Compress
17 pages
Insurance
No ratings yet
Insurance
49 pages
Foundation of Data Science - CS3352 - Important Questions With Answer - Unit 5 - Data Visualization
No ratings yet
Foundation of Data Science - CS3352 - Important Questions With Answer - Unit 5 - Data Visualization
11 pages
28.1.2 Lab - Construct A Basic Python Script - ILM
No ratings yet
28.1.2 Lab - Construct A Basic Python Script - ILM
16 pages
Online Herbs
No ratings yet
Online Herbs
28 pages
Python
No ratings yet
Python
91 pages
Chatbot
No ratings yet
Chatbot
41 pages
Croma Campus - Cloud Computing Training Curriculum
No ratings yet
Croma Campus - Cloud Computing Training Curriculum
6 pages
Nucamp Syllabus Complete Software Engineering
No ratings yet
Nucamp Syllabus Complete Software Engineering
34 pages
Furniture
No ratings yet
Furniture
51 pages
06web Application For Rag Implementation and Testing
No ratings yet
06web Application For Rag Implementation and Testing
30 pages
17 Working With Functions Part 1
No ratings yet
17 Working With Functions Part 1
17 pages
Birds Species Classification
No ratings yet
Birds Species Classification
47 pages
R23 B.Tech-CSD
No ratings yet
R23 B.Tech-CSD
46 pages
010 Condition Using If
No ratings yet
010 Condition Using If
40 pages
Immediate Download Python Programming For Data Analysis 1st Edition José Unpingco Ebooks 2024
100% (1)
Immediate Download Python Programming For Data Analysis 1st Edition José Unpingco Ebooks 2024
65 pages
Skin Disease
No ratings yet
Skin Disease
53 pages
Android Malware Detection Using Machine Learning Techniques
No ratings yet
Android Malware Detection Using Machine Learning Techniques
59 pages
Mahesh Golla: Career Objective
No ratings yet
Mahesh Golla: Career Objective
2 pages
Course Material
No ratings yet
Course Material
70 pages
Final Project Document
No ratings yet
Final Project Document
59 pages
Anusha 5P4 Documentation
No ratings yet
Anusha 5P4 Documentation
63 pages
Python Strings
No ratings yet
Python Strings
52 pages
Industrial Report
No ratings yet
Industrial Report
56 pages
Creative Portfolio: by Vantrapati Naga Satya Chandu Yadav
No ratings yet
Creative Portfolio: by Vantrapati Naga Satya Chandu Yadav
10 pages
Python Notes (Fundamental of Python)
No ratings yet
Python Notes (Fundamental of Python)
3 pages
PE Ejournal 1431
No ratings yet
PE Ejournal 1431
20 pages
9.a. Libraries and Modules
No ratings yet
9.a. Libraries and Modules
5 pages
All About Data Science: Learn Data Science from scratch
From Everand
All About Data Science: Learn Data Science from scratch
Devi Prasad
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.