0% found this document useful (0 votes)

8 views23 pages

D (1) (1) Report2 Srushti

The document is a technical seminar report submitted by Srushti Siddappa Menasinakai on 'A Dynamic Reward-Based Deep Reinforcement Learning For IoT Intrusion Detection' as part of her Bachelor of Engineering in Electronics and Communication Engineering at Visvesvaraya Technological University. It discusses the challenges of IoT security and proposes a deep reinforcement learning-based intrusion detection system that achieves 99% accuracy using the Bot-IoT dataset. The report includes acknowledgments, a declaration of originality, and an overview of the system architecture and methodologies employed in the research.

Uploaded by

veerannadesai305

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views23 pages

D (1) (1) Report2 Srushti

Uploaded by

veerannadesai305

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

VISVESVARAYATECHNOLOGICALUNIVERSITY

BELAGAVI – 590018, KARNATAKA

A Technical Seminar Report on

“A Dynamic Reward-Based Deep Reinforcement Learning For

IOT Intrusion Detection ”
Submitted toVisvesvarayaTechnological University in the partial
fulfilment of requirement for the award of
Bachelor of Engineering in
ELECTRONICSANDCOMMUNICATIONENGINEERING
Submittedby
Srushti Siddappa Menasinakai 2SR21EC050
Under the Guidance of
Mr. Chandrashekhar.B.G
Asst.Prof., Dept. of ECE

2024-2025

DEPARTMENTOFELECTRONICSANDCOMMUNICATION
ENGINEERING
SRITARALABALUJAGADGURUINSTITUTEOF TECHNOLOGY
RANEBENNUR-581115
SRITARALABALUJAGADGURUINSTITUTEOF TECHNOLOGY
RANEBENNUR-581 115
(Affiliated toVisvesvaraya Technological University,Belagavi)
2024-2025

CERTIFICATE
It is to certify that the Seminar on “A Dynamic Reward-Based Deep Reinforcement
Learning For IOT Intrusion Detection” has been successfully presented at STJ INTITUTE
OF TECHNOLOGY by Srushti Siddappa Menasinakai, 2SR21EC050, in partial fulfilment of
the requirements for the degree of Bachelor of Electronics and communication Engineering
of Visvesvaraya Technological University, Belagavi during academic year 2024-2025. It is
certified that all correction indicated for Internal Assessment have been incorporated in the
report deposited in the department library. The Seminar report Sasitsatisfies theAcademic
requirements in respect of Seminar work for the above degree.

Signatureof Guide SignatureofCo-ordinator SignatureofHOD

Mr.Chandrashekhar Miss.Swetha.C.M Dr.MaheshwarappaBPhD
Asst.Prof., Dept.ofECE Asst.Prof., Dept.ofECE HeadoftheDepartment,ECE
ACKNOWLEDGEMENT

I express my sincere gratitude to our guide,Mr.Chandrashekhar.B.G, Asst.Prof., Dept.

of Electronics & Communication Engineering, STJIT, Ranebennur, for her stimulating
guidance , continuous encouragement and supervision throughout the course of present work.
I am thankful to our Coordinator, Miss.Swetha.C.M, Asst.Prof., Dept. of Electronics
and Communication Engineering, for their constant cooperation and support.
Iwouldlike to place on record our deep sense of gratitude to Prof.
Dr.MaheshwarappaBPh.D, HOD Dept. of Electronics & Communication Engineering, STJIT,
Ranebennur, for his generous guidance, help and useful suggestions.
I also wish to extend our thanks to all staff for attending our seminars and for their
insightful comments and constructive suggestions to improve the quality of this project work.
I am extremely thankful to Dr.S G Makanur Ph.D.,,Principal,STJIT, RANEBENNUR,
for providing us infrastructural facilities to work in, without which this work would not have
been possible.

Submitted by

Srushti Siddappa Menasinakai[2SR21EC050]

DECLARATION

I, Srushti Siddappa Menasinakai [2SR21EC050], student of 8th semester, BE in electronics and

communication Engineering, S T J Institute of Technology, Ranebennur, here by declare that the Technical
Seminar entitled “A Dynamic Reward-Based Deep Reinforcement Learning For IOT Intrusion Detection”
has been successfully submitted to the Vivesvaraya Technological University during the academic year
2024-2025, is a record of an original work done by me under the guidance of Mr.Chandrashekar.B.G,
Department of Electronics and Communication Engineering. This Technical Seminar is submitted in the
partial fulfillment of the requirements for the award of the degree of Bachelor of engineering in Electronics
and Communication Engineering. The results embodied in this report have not been submitted to any other
University or Institute for the award of any degree.

Date:
Place:Ranebennur

Submittedby

Srushti Siddappa Menasinakai[2SR21EC050]

ABSTRACT

The rapid proliferation of Internet of Things (IoT) technology has enhanced human quality of life
while simultaneously introducing significant cybersecurity challenges. IoT devices are constrained
by limited computational resources, storage capacity, and power supply, rendering them susceptible
to botnet exploitation and Distributed Denial of Service (DDoS) attacks. Conventional signature-
based intrusion detection systems (IDS) are frequently deployed to mitigate network attacks in IoT
environments. However, these systems heavily rely on manual expert knowledge and exhibit limited
adaptability to emerging threats, particularly when confronted with novel attacks such as zero-day
exploits. Deep reinforcement learning (DRL), which enables agents to make autonomous decisions
through policy function approximation, has demonstrated promising results in network attack
identification. This paper proposes a DRL-based intrusion detection system for identifying diverse
attack vectors in IoT environments. To enhance the model’s sensitivity to multi- class samples, we
design a dynamic reward function, thereby improving the overall network attack recognition
capabilities. The proposed model is validated using the Bot-IoT dataset. Experimental results
indicate that the model achieves 99% accuracy in classification.
LISTOF FIGURES
FIGN0. PAGE NO.

3.1 EVOLUTION OF IOT Intrusion Detection 5

3.2 FEATURE OF IOT Intrusion Detection 5
3.3 Deep Reinforcement Learning 6
3.4 DRL Feauters Selection 6
3.5 Feauters of the Bot-Iot Dataset 7
3.6 Data Distrubution Chart 8
3.7 DRL-IDS Training Algoritm 9
3.8(A) Binary Classification 10

3.8(B) Multi Classification 10

TABLEOF CONTENTS

SlNo. Title PageNo.

1 INTRODUCTION 1
1.1 HISTORY 1
1.2PROBLEM STATEMENT 2
1.3 OBJECTIVES 2

2 LITERATURESURVEY 3

3 SYSTEM ARCHITECTURE 4-6

3.1 WORKING 7-9
3.2 Challenges 10
3.3 Results 11

4 ADVANTAGES, DISADVANTAGES AND 12-14

APPLICATIONS

5 CONCLUSIONANDFUTUREWORK 15

REFERENCES
A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

CHAPTER1:
INTRODUCTION

The rapid proliferation of IoT technology has enhanced efficiency in smart cities, homes, and industrial
automation. However, sophisticated cyber attacks pose unprecedented challenges to IoT security.
Existing intrusion detection systems lack scalability and adaptability for the increasing number of
connected devices and emerging attack vectors, necessitating more robust defensive mechanisms.
Intrusion Detection Systems (IDS) originated from model. Conventional IDS include signature- based
and anomaly-based approaches. Signature-based IDS identify known threats but struggle with zero-day
attacks, while anomaly-based IDS detect deviations from baseline behavior, enabling discovery of
unknown threats. Machine learning (ML) has advanced intrusion detection by analyzing network traffic
patterns. Deep learning (DL) leverages multi-layer neural networks to approximate complex functions,
demonstrating superior capabilities in attack classification

1.1 History:

The field of intrusion detection has evolved significantly over the years, beginning with rule-based expert
systems in the early 1980s. These early IDS models relied on predefined signatures of known attacks to detect
intrusions. While effective against well-documented threats, they lacked the capability to detect zero-day
attacks and sophisticated malware.

In the 1990s and early 2000s, the rise of network-based intrusion detection systems (NIDS) introduced
anomaly detection techniques, which analyzed deviations from normal network behavior. These methods
employed statistical models, clustering algorithms, and basic machine learning classifierslike decision trees
and support vector machines (SVM). However, as IoT networks became more complex, these approaches
struggled with scalability and false positive rates.

With the rapid growth of deep learning in the 2010s, researchers began integrating neural networks into IDS.
Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) improved the accuracy of
anomaly detection, but these models required large labeled datasets and extensive computational resources.
More recently, reinforcement learning has gained attention due to its ability to *adapt to evolving attack
strategies without extensive retraining.
The latest advancements in Deep Reinforcement Learning (DRL)have paved the way for highly adaptive and
self-learning IDS models. These models leverage real-time feedback loops to adjust their detection strategies
dynamically, making them well-suited for IoT environments where security threats evolve continuously.

Department Of ECE, STJIT, Ranebennur 1

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

The proposed dynamic reward-based DRL approach builds on this historical progression, introducing a more
efficient and adaptable intrusion detection mechanism that addresses the limitations of previous systems.

Uses the Bot-IoT dataset, which is a benchmark dataset for IoT security.z
Achieves 99% accuracy in attack detection by addressing class imbalance with negative sampling and
SARSA encoding.

Modern Advancements:
Dynamic Reward-Based Deep Reinforcement Learning (DRL) for IoT Intrusion Detection have evolved
significantly over the past decade. In 2015, early works focused on basic anomaly detection using traditional
machine learning models. By 2018, researchers integrated Deep Q-Networks (DQN) and Convolutional Neural
Networks (CNN) to improve accuracy and adaptability. In 2020, Principal Component Analysis (PCA) and
Recursive Feature Elimination (RFE) were introduced for efficient feature selection, reducing computational
overhead. From 2021 onwards, adaptive reward mechanisms and hybrid AI approaches (e.g., combining DRL
with anomaly-based IDS) have enhanced real-time intrusion detection. Recent works in 2023-2024emphasize
cloud-based IDS solutions and federated learning, enabling scalable and privacy-preserving security in smart
homes, industrial IoT, and critical infrastructure.

1.2 Problemstatement:

With the increasing number of IoT devices, cybersecurity threats such as malware, unauthorized access, and
denial-of-service (DoS) attacks are becoming more common. Traditional intrusion detection systems (IDS)
struggle to handle the dynamic and complex nature of IoT networks due to their limited adaptability and
predefined rule-based mechanisms. These systems often produce high false positive rates and fail to detect
new or evolving threats efficiently.

1.3 Objectives:

1.Improve detection accuracy by leveraging reinforcement learning to adapt to new attack patterns.

2. Reduce false positives through an optimized reward mechanism that refines the learning process.

3.Enhance adaptability by allowing the IDS to learn and evolve dynamically without manual intervention.

Department Of ECE, STJIT, Ranebennur 2

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

CHAPTER2
LITERATURESURVEY

[1] D. Denning, “An Intrusion-Detection Model,” IEEE Transactions on Software Engineering, vol. SE-13,
pp. 222–232, Feb. 1987. Conference Name: IEEE Transactions on Software Engineering.

[2] B. Sudharsan, J. G. Breslin, and M. I. Ali, “ML-MCU: A Framework to Train ML Classifiers on MCU-
Based IoT Edge Devices,” IEEE Internet of Things Journal, vol. 9, pp. 15007–15017, Aug. 2022. Conference
Name: IEEE Internet of Things Journal.

[3] Z. Bao, Y. Lin, S. Zhang, Z. Li, and S. Mao, “Threat of Adversarial Attacks on DL-Based IoT Device
Identification,” IEEE Internet of Things Journal, vol. 9, pp. 9012–9024, June 2022. Conference Name: IEEE
Internet of Things Journal.

[4] A. Uprety and D. B. Rawat, “Reinforcement Learning for IoT Security: A Comprehensive Survey,” IEEE
Internet of Things Journal, vol. 8, pp. 8693–8706, June 2021. Conference Name: IEEE Internet of Things
Journal.

[5] Y. Wang, Y. Jia, Y. Tian, and J. Xiao, “Deep reinforcement learning with the confusion-matrix-based
dynamic reward function for customer credit scoring,” Expert Systems with Applications, vol. 200, p. 117013,
Aug. 2022.

[6] M. Shafiq, Z. Tian, A. K. Bashir, X. Du, and M. Guizani, “CorrAUC: A Malicious Bot-IoT Traffic
Detection Method in IoT Network Using Machine-Learning Techniques,” IEEE Internet of Things Journal,
vol. 8, pp. 3242–3254, Mar. 2021. Conference Name: IEEE Internet of Things Journal.

[7] H. Qiu, Q. Zheng, T. Zhang, M. Qiu, G. Memmi, and J. Lu, “Toward Secure and Efficient Deep Learning
Inference in Dependable IoT Systems,” IEEE Internet of Things Journal, vol. 8, pp. 3180–3188, Mar. 2021.
Conference Name: IEEE Internet of Things Journal.

[8] N. Koroniotis, N. Moustafa, E. Sitnikova, and B. Turnbull, “Towards the development of realistic botnet
dataset in the Internet of Things for network forensic analytics: Bot-IoT dataset,” Future Generation Computer
Systems, vol. 100, pp. 779–796, Nov. 2019.

Department Of ECE, STJIT, Ranebennur 3

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

CHAPTER3
SYSTEMARCHITECTURE

1. Data Collection Layer (IoT Devices)

 This layer includes smart home devices, industrial IoT (IIoT) sensors, medical devices, and
autonomous systems.
 Network traffic logs are collected continuously for intrusion analysis.
 Data sources include protocols like MQTT, CoAP, and HTTP.

2. Preprocessing & Feature Extraction

 Raw data undergoes normalization, filtering, and transformation.
 Feature selection techniques (e.g., Principal Component Analysis (PCA), Chi-Square Test) are applied
to remove redundant features.
 Essential traffic features such as packet flow, network latency, and device behavior are extracted.

3. Deep Reinforcement Learning (DRL) Model

 Uses an AI agent that learns from IoT traffic patterns through state-action-reward training.
 Implements Proximal Policy Optimization (PPO) and Deep Q-Networks (DQN) to improve threat
detection.
 The dynamic reward mechanism enhances IDS efficiency by rewarding correct classifications and
penalizing false detections.

4. Intrusion Detection & Classification

 Anomaly detection and signature-based classification are performed using neural networks.
 Attacks are categorized into DDoS, botnet activities, phishing, and privilege escalation.
 The IDS generates alerts and mitigation actions upon detecting malicious activity.

5. Feedback & Continuous Learning

 The DRL model updates itself based on previous detection performance.
 Self-learning mechanisms reduce false positives and improve real-time adaptability.
 Continuous retraining ensures evolving cyber threats are detected efficiently.

Department Of ECE, STJIT, Ranebennur 4

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

Fig3.1: Evolution of Dynamic Reward-Based Reward Intrusion Detection System

The Dynamic Reward-Based Reward Intrusion Detection System enhances IoT security by learning and
adapting to cyber threats. It uses deep reinforcement learning (PPO, DQN) with a dynamic reward mechanism
to improve detection accuracy. The system analyzes real-time network traffic, detects intrusions like DDoS
and botnets, and reduces false positives. A self-learning feedback loopensures continuous improvement for
proactive threat defense.

Fig3.2: Features of Dynamic Reward-Based Reward IOT Intrusion Detection System

Department Of ECE, STJIT, Ranebennur 5

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

1) Deep Reinforcement Learnig(DRL):

An IoT Intrusion Detection System (IDS) using Deep Reinforcement Learning (DRL). The system
starts with a ToN_IoT dataset, which undergoes preprocessing (normalization and scaling) to clean
and standardize the data. After sampling, key features are extracted using log-likelihood sliding
principal component analysis, followed by training a Dynamic Reward-Based Reinforcement
Learning model to detect cyber threats.

Fig3.3: Deep Reinforcement Learning

2) The DRL Feauters Selection:

This process, including data cleaning, feature selection (decision tree and recursive elimination), and mini-
batch training using a Deep Q-Network (DQN) with convolutional neural networks (CNN). The model
predicts normal vs attack traffic, ensuring an adaptive IDS that continuously learns to
enhance cybersecurity.

Fig3.4: DRL Feautre Selection

Department Of ECE, STJIT, Ranebennur 6

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

3.1 Working :

The IoT ecosystem encompasses a diverse array of devices, including cameras, sensors, vehicles,
and industrial equipment, all of which are vulnerable to network attacks. The proposed model
aggregates both malicious and benign network traffic to construct a comprehensive dataset for IDS
training. During the preprocessing phase, the data undergoes a series of operations including
integration, cleaning, transformation, and normalization (as formulated in equation 1). The
processed data is subsequently structured in the SARSA format [st , at , r, st+1, at+1] to conform to
the input requirements of the reinforcement learning paradigm. The preprocessed data serves as
input for the Deep Reinforcement Learning (DRL) model during the training phase. The model
iteratively optimizes its parameters based on the reward signals provided by the simulated
environment while concurrently making predictions on the input data to classify potential network
attacks. Leveraging neural networks, the model approximates the policy function intrinsic to
reinforcement learning, thereby achieving efficient and adaptive attack identification.

B. Dataset and Preprocessing :

The Bot-IoT dataset was collaboratively developed by the University of New South Wales, Australia, and
the Cyber Security Cooperative Research Centre, with a primary focus on botnet attack research in IoT
environments. This comprehensive dataset incorporates both authentic and simulated network traffic,
encompassing a diverse range of attack vectors. It comprises over 72 million records, each characterized by
42 features. The high-fidelity labeling of this dataset renders it an invaluable resource for research in intrusion
detection and network anomaly detection.

Fig 3.5: Feauters of the Bot-Iot Dataset

Department Of ECE, STJIT, Ranebennur 7

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

Fig3.6: Data Distrubution Chart

As per above figure Using the Bot-IoT dataset as a case study, the preprocessing pipeline includes: data
integration, feature selection, data cleaning, data transformation, and normalization. Approximately 3.5
million records are extracted and consolidated from the original log files, with about 20 features selected
based on the recommendations from the original paper [8]. Post-cleaning, approximately 3.3 million
valid records are retained, with categorical variables encoded using label Encoder. The ”attack” and
”daddr” columns, which exhibit high correlation with attack traffic, are excluded to enhance model
generalization. Finally, all numerical values are scaled to the range of 0-1 using min-max normalization.

C. Deep Reinforcement Learning :

Deep Reinforcement Learning (DRL) synergistically integrates the strengths of reinforcement learning and deep
neural networks. Its theoretical foundation is rooted in Markov Decision Processes (MDPs), which are formally
defined by a tuple (S, A,T, R, γ), where S represents the set of possible states, A denotes the set of available
actions, T : S × A × S → [0, 1] is the state transition probability function, R : S × A → R is the reward function,
and γ ∈ [0, 1] is the discount factor. MDPs adhere to the Markov property, which posits that the probability
distribution of future states is conditionally independent of past states given the present state.

The DQN algorithm combines Q-Learning with deep neural networks to approximate the action-value
function Q(s, a), and optimizes for maximum cumulative reward based on the Bellman equation
(Equation 2). The policy is derived from the Q-function as follows: π(st+1) = argmaxaQ(st+1,a)

Department Of ECE, STJIT, Ranebennur 8

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

Fig3.7: DRL-IDS Training Algorithm

This enhancement aims to augment the model’s capability to detect underrepresented attack vectors and
improve overall classification accuracy in the context of anomaly detection. And show in algorithm 1.

Department Of ECE, STJIT, Ranebennur 9

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

3.2 Challenges:

"A Dynamic Reward-Based Deep Reinforcement Learning for IoT Intrusion Detection" highlights several key
challenges in developing an effective intrusion detection system using deep reinforcement learning (DQN-
based model). Some of the main challenges include:

1. Class Imbalance in the Dataset:

 The Bot-IoT dataset used in the study has an inherent imbalance, where certain attack categories (such
as Information Theft) have significantly fewer samples than others (like DDoS and DoS).
 This imbalance creates difficulties in training the model effectively, leading to reduced accuracy for
underrepresented attack types.

2. Binary vs. Multi-Class Classification Complexity:

 The model is tested in both binary classification (benign vs. malicious) and multi-class classification
(benign, DDoS, DoS, Reconnaissance, Information Theft).
 While binary classification achieves high accuracy (close to 1.0 for benign and malicious traffic),
multi-class classification introduces additional complexity due to overlapping attack patterns.

3. Recognition of Rare Attack Categories:

 The model performs well for most attack types but struggles with Information Theft, which has fewer
samples in the dataset.
 This leads to a lower recognition rate for this category compared to others like DDoS and
Reconnaissance.

4. Generalization to Real-World IoT Networks:

 Training the model on a specific dataset may not ensure optimal performance when applied to different real-
world IoT environments.
 IoT devices vary in architecture, behavior, and security vulnerabilities, making it difficult to generalize the
model’s performance across different networks.

5. Computational Complexity & Training Time:

 Deep reinforcement learning models require high computational power and large training times, making
deployment challenging for resource-constrained IoT devices.
 Optimizing the model for real-time detection while maintaining efficiency remains a challenge.

Department Of ECE, STJIT, Ranebennur 10

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

3.3 RESULTS:

This study utilizes the Bot-IoT dataset for empirical evaluation, where the inherent class imbalance poses
significant challenges for model training and performance assessment. We train the proposed DQN-based
model using preprocessed data and evaluate its efficacy. The experimental framework encompasses two
classification paradigms: binary classification (discriminating between benign and malicious traffic) and
multi-class classification (categorizing traffic into five distinct classes: benign,DDoS, DoS, Reconnaissance,
and Information Theft). This experimental design enables a comprehensive evaluation of the model’s
performance across tasks of varying complexity and granularity.

The model achieves performance scores exceeding 0.99 for the Normal, DDoS, DoS, and Reconnaissance
categories, while the Information Theft category shows slightly lower but still robust performance above
0.95, potentially attributable to the limited sample size in this class. (b) illustrating multi-class classification.
In these matrices, rows represent predicted classes, columns denote true classes, diagonal elements indicate
correct classification proportions, and color intensity corresponds to proportion levels. In binary
classification, the model attains recognition rates of 1.0 and 0.99 for benign and malicious traffic,
respectively.

Fig.3.8: (a)Binary Classification (b)Multi Classification

(a) depicting binary classification and(b) illustrating multi-class classification. In these matrices, rows represent
predicted classes, columns denote true classes, diagonal elements indicate correct classification proportions, and color
intensity corresponds to proportion levels. In binary classification, the model attains recognition rates of 1.0 and 0.99
for benign and malicious traffic, respectively. Multi- class results corroborate the findings , revealing a relatively
weaker recognition rate for the Information Theft category, primarily due to its underrepresentation in the original
dataset.

Department Of ECE, STJIT, Ranebennur 11

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

CHAPTER4:

ADAVNTAGES,DISADAVNTAGESANDAPPLICATIONS

Adavntages:

1. Smart Cities & Smart Homes :

 Protects IoT-enabled smart home devices (e.g., cameras, thermostats) from cyber threats.
 Enhances smart city security by detecting attacks on IoT infrastructure like traffic control systems.

2. Industrial IoT (IoT) Security :

 Secures industrial automation systems, sensors, and SCADA networks from network intrusions.
 Prevents DDoS attacks on manufacturing plants and energy grids.

3. Healthcare IoT Security :

 Enhances security for connected medical devices (e.g., pacemakers, remote patient monitoring).
 Prevents cyber threats in telehealth and smart hospital networks.

4. Automotive & Connected Vehicles :

 Protects self-driving cars and vehicle-to-vehicle (V2V) communication from cyber intrusions.
 Detects anomalies in smart traffic systems.

5. Enterprise IoT & Financial Security :

 Secures IoT-based financial transaction systems against fraud and cyber attacks.
 Enhances security for IoT-connected office devices like smart printers, security cameras, and
access control systems.

6. Critical Infrastructure Protection :

 Defends power grids, water treatment plants, and public utility services from cyber intrusions.
 Detects network anomalies in government and military IoT networks.

Department Of ECE, STJIT, Ranebennur 12

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

7. Military & Defense Applications :

 Enhances security in IoT-based surveillance systems, drone networks, and battlefield

communications.
 Detects advanced persistent threats (APTs) targeting national defense IoT infrastructure.

Disadavntages:

1.Computational Complexity :
 Training DRL models requires high processing power and memory, which may not be feasible for
low-power IoT devices.

2. Longer Training Time :

 DRL models take longer to train compared to conventional Machine Learning (ML) or rule-based
IDS due to the need for continuous learning and policy optimization.

3. Data Dependency :
 The model's performance depends on large labeled datasets like Bot-IoT. If real-world IoT traffic
differs significantly, retraining is required.

4. Overfitting Risks :
 The model may become too specialized to training data, leading to poor generalization in real-world
IoT networks.

Department Of ECE, STJIT, Ranebennur 13

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

Applications:

1. Smart Cities & Smart Homes :

 Protects IoT-enabled smart home devices (e.g., cameras, thermostats) from cyber threats.
 Enhances smart city security by detecting attacks on IoT infrastructure like traffic control systems.

2. Industrial IoT (IoT) Security :

 Secures industrial automation systems, sensors, and SCADA networks from network intrusions.
 Prevents DDoS attacks on manufacturing plants and energy grids.

3. Healthcare IoT Security :

 Enhances security for connected medical devices (e.g., pacemakers, remote patient monitoring).
 Prevents cyber threats in telehealth and smart hospital networks.

4. Automotive & Connected Vehicles :

 Protects self-driving cars and vehicle-to-vehicle (V2V) communication from cyber intrusions.
 Detects anomalies in smart traffic systems.

5. Enterprise IoT & Financial Security :

Department Of ECE, STJIT, Ranebennur 14

A Dynamic Reward-Based Deep Reinforcement Learning for IOT Intrusion Detection

CHAPTER5:
CONCLUSION AND FUTUREWORK

Conclusion:
This study proposes a novel Network Intrusion Detec- tion System (NIDS) for IoT environments, leveraging
deep reinforcement learning techniques. The system employs a dynamic reward function to enhance the
model’s capability in addressing class imbalance and improving multi-class attack detection. Empirical
evaluations on the Bot-IoT dataset demonstrate the model’s exceptional performance in both binary and multi-
class classification tasks, achieving detection accuracy exceeding 99% for the majority of attack vectors. The
enhanced DQN model exhibits superior performance compared to conventional methodologies in mitigating
class imbalance issues, particularly in the detection of DDoS and DoS attacks, thus offering a robust solution
for IoT network security.

This study proposes a novel Network Intrusion Detection System (NIDS) for IoT environments, leveraging
deep reinforcement learning techniques. The system employs a dynamic reward function to enhance the
model’s capability in addressing class imbalance and improving multi-class attack detection. Empirical
evaluations on the Bot-IoT dataset demonstrate the model’s exceptional performance in both binary and multi-
class classification tasks, achieving detection accuracy exceeding 99% for the majority of attack vectors. The
enhanced DQN model exhibits superior performance compared to conventional methodologies in mitigating
class imbalance issues, particularly in the detection of DDoS and DoS attacks, thus offering a robust solution
for IoT network security.

Futurework:

A Dynamic Reward-Based Deep Reinforcement Learning for IoT Intrusion Detection focuses on enhancing
the model’s performance, scalability, and real-world applicability. Key directions include improving the deep
reinforcement learning framework with advanced architectures, optimizing computational efficiency for large-
scale IoT networks, and testing the model in real-time environments with evolving cyber threats. Additionally,
future research aims to generalize the system across diverse IoT communication protocols (e.g., MQTT,
CoAP) and integrate hybrid detection mechanisms such as federated learning or blockchain for enhanced
security. Addressing adversarial attacks and refining the reward mechanism for better adaptability are also
crucial areas for further exploration.

Department Of ECE, STJIT, Ranebennur 15

References
[1].Y. Wang, Y. Jia, Y. Tian, and J. Xiao, “Deep reinforcement learning with the confusion-
matrix-based dynamic reward function for customer credit scoring,” Expert Systems with
Applications, vol. 200, p. 117013, Aug. 2022.

[2] M. Shafiq, Z. Tian, A. K. Bashir, X. Du, and M. Guizani, “CorrAUC: A Malicious Bot-IoT
Traffic Detection Method in IoT Network Using Machine-Learning Techniques,” IEEE Internet
of Things Journal, vol. 8, pp. 3242–3254, Mar. 2021. Conference Name: IEEE Internet of
Things Journal.

[3] H. Qiu, Q. Zheng, T. Zhang, M. Qiu, G. Memmi, and J. Lu, “Toward Secure and Efficient
Deep Learning Inference in Dependable IoT Systems,” IEEE Internet of Things Journal, vol. 8,
pp. 3180–3188, Mar. 2021. Conference Name: IEEE Internet of Things Journal.

[4] N. Koroniotis, N. Moustafa, E. Sitnikova, and B. Turnbull, “Towards the development of

realistic botnet dataset in the Internet of Things for network forensic analytics: Bot-IoT dataset,”
Future Generation Computer Systems, vol. 100, pp. 779–796, Nov. 2019.

[5] A. M. Seid, G. O. Boateng, B. Mareri, G. Sun, and W. Jiang, “Multi-Agent DRL for Task
Offloading and Resource Allocation in Multi-UAV Enabled IoT Edge Network,” IEEE
Transactions on Network and Service Management, vol. 18, pp. 4531–4547, Dec. 2021.
Conference Name: IEEE Transactions on Network and Service Management.

[6] K. Kuru, “Definition of Multi-Objective Deep Reinforcement Learning Reward Functions

for Self-Driving Vehicles in the Urban Environment,” IEEE Transactions on Intelligent
Transportation Systems, Sept. 2023. Publisher: Institute of Electrical and Electronics Engineers
(IE

24UG-AI006 PROJECT DOCUMENTATION FINALv
No ratings yet
24UG-AI006 PROJECT DOCUMENTATION FINALv
68 pages
Master - Thesis - V4
No ratings yet
Master - Thesis - V4
67 pages
CBIT B E AIDS Major Project Part 1 Report Template 2
No ratings yet
CBIT B E AIDS Major Project Part 1 Report Template 2
62 pages
Group 12
No ratings yet
Group 12
54 pages
Hybrid Deep Learning Model For Attack Detection in Internet of Things
No ratings yet
Hybrid Deep Learning Model For Attack Detection in Internet of Things
20 pages
GRU IoT IDS
No ratings yet
GRU IoT IDS
63 pages
Thesis
No ratings yet
Thesis
233 pages
DDOS Attack Final
No ratings yet
DDOS Attack Final
41 pages
Network Intrusion Detection System Using Deep Learning
No ratings yet
Network Intrusion Detection System Using Deep Learning
9 pages
LCL Ewm BBP Document
100% (2)
LCL Ewm BBP Document
18 pages
1ds20ee013 Technical Report
No ratings yet
1ds20ee013 Technical Report
29 pages
Mini Project Report
No ratings yet
Mini Project Report
42 pages
Internship Report 11
No ratings yet
Internship Report 11
23 pages
A Deep Transfer Learning Approach For Iot/iiot Cyber Attack Detection Using Telemetry Data
No ratings yet
A Deep Transfer Learning Approach For Iot/iiot Cyber Attack Detection Using Telemetry Data
20 pages
1 s2.0 S0952197623016998 Main
No ratings yet
1 s2.0 S0952197623016998 Main
21 pages
Real
No ratings yet
Real
77 pages
Cyber-Security and Reinforcement Learning - A Brief Survey
No ratings yet
Cyber-Security and Reinforcement Learning - A Brief Survey
18 pages
Ids Uisng Deep Learning Techniques
No ratings yet
Ids Uisng Deep Learning Techniques
70 pages
Final Doc Mine
No ratings yet
Final Doc Mine
95 pages
ADML IoT 1-0-1
No ratings yet
ADML IoT 1-0-1
115 pages
Network-Based Detection of Iot Botnet Attacks Using Deep Autoencoders
No ratings yet
Network-Based Detection of Iot Botnet Attacks Using Deep Autoencoders
45 pages
Group1 Document
No ratings yet
Group1 Document
54 pages
Abi Bharu Athi Arul Anitha Project
No ratings yet
Abi Bharu Athi Arul Anitha Project
48 pages
DL Ids
No ratings yet
DL Ids
72 pages
Hierarchical Adversarial Attacks Against Graph-Neural-Network-Based IoT Network Intrusion Detection System
No ratings yet
Hierarchical Adversarial Attacks Against Graph-Neural-Network-Based IoT Network Intrusion Detection System
10 pages
Anomaly Detection in Iot Systems Using Unsupervised Learning
No ratings yet
Anomaly Detection in Iot Systems Using Unsupervised Learning
42 pages
Batch Reinforcement Learning Approach Using Recursive Feature Elimination For Network Intrusion Detection
No ratings yet
Batch Reinforcement Learning Approach Using Recursive Feature Elimination For Network Intrusion Detection
16 pages
Final 4-2
No ratings yet
Final 4-2
60 pages
Uzoezie Fidelis C. Cyb Project
No ratings yet
Uzoezie Fidelis C. Cyb Project
83 pages
ElysiumPro - Cloud Computing - Title With Abstract PDF 2024-25 V 2.0.1
No ratings yet
ElysiumPro - Cloud Computing - Title With Abstract PDF 2024-25 V 2.0.1
22 pages
Keerthi Page Number Edit
No ratings yet
Keerthi Page Number Edit
21 pages
Transactions On Emerging Telecommunications Technologies - 2019 - Otoum - DL IDS A Deep Learning Based Intrusion Detection
No ratings yet
Transactions On Emerging Telecommunications Technologies - 2019 - Otoum - DL IDS A Deep Learning Based Intrusion Detection
14 pages
NEJE KZ Board Schematic
0% (1)
NEJE KZ Board Schematic
1 page
Missing Child Identification System
No ratings yet
Missing Child Identification System
88 pages
Project Final Report 2
No ratings yet
Project Final Report 2
69 pages
Concept Map
No ratings yet
Concept Map
37 pages
Design of An Intrusion Detection Model For IoT-Enabled Smart Home
No ratings yet
Design of An Intrusion Detection Model For IoT-Enabled Smart Home
18 pages
CYBER ATTACKS DETECTION USING GoogleNet MODEL FOR ENVIRONMENTAL AWARE SMART CITY APPLICATIONS
No ratings yet
CYBER ATTACKS DETECTION USING GoogleNet MODEL FOR ENVIRONMENTAL AWARE SMART CITY APPLICATIONS
10 pages
1234 Pagenumber
No ratings yet
1234 Pagenumber
42 pages
22mdt1038 Capstone Final
No ratings yet
22mdt1038 Capstone Final
63 pages
Finalsynospi
No ratings yet
Finalsynospi
15 pages
Sensors 22 08417
No ratings yet
Sensors 22 08417
12 pages
Cps Lit Review
No ratings yet
Cps Lit Review
3 pages
Om 3
No ratings yet
Om 3
10 pages
Report I
No ratings yet
Report I
47 pages
Principles of Programming Languages: Eliezer A. Albacea
No ratings yet
Principles of Programming Languages: Eliezer A. Albacea
68 pages
Arpit Jain Seminar Report
No ratings yet
Arpit Jain Seminar Report
31 pages
Internet of Things LTP
No ratings yet
Internet of Things LTP
3 pages
DRL For IoT Systems SI CFP IEEEIoTJournal2019
No ratings yet
DRL For IoT Systems SI CFP IEEEIoTJournal2019
1 page
Applsci 11 08383 v2
No ratings yet
Applsci 11 08383 v2
24 pages
218R1A0433
No ratings yet
218R1A0433
39 pages
LBDMIDS LSTM Based Deep Learning Model F
No ratings yet
LBDMIDS LSTM Based Deep Learning Model F
7 pages
Elevating Cybersecurity Using AI and Deep Learning For Intrusion Detection Reinforcement
No ratings yet
Elevating Cybersecurity Using AI and Deep Learning For Intrusion Detection Reinforcement
9 pages
Infrastructure Pentesting PDF
No ratings yet
Infrastructure Pentesting PDF
13 pages
Reinforcement Learning For Intrusion Detection More Model Longness and Fewer Update
No ratings yet
Reinforcement Learning For Intrusion Detection More Model Longness and Fewer Update
11 pages
Major Stage 1
No ratings yet
Major Stage 1
34 pages
Summary of Articles
No ratings yet
Summary of Articles
9 pages
2021 A - Lightweight - Optimized - Deep - Learning - Ba
No ratings yet
2021 A - Lightweight - Optimized - Deep - Learning - Ba
8 pages
Internship - SHEETAL
No ratings yet
Internship - SHEETAL
26 pages
Deep Learning Algorithms For Intrusion D
No ratings yet
Deep Learning Algorithms For Intrusion D
8 pages
Computers 12 00034 v2
No ratings yet
Computers 12 00034 v2
17 pages
178p1a0427 (Seminar Report)
No ratings yet
178p1a0427 (Seminar Report)
37 pages
Wistron LA710 - 17513 PDF
No ratings yet
Wistron LA710 - 17513 PDF
107 pages
1 s2.0 S0045790623000514 Main
No ratings yet
1 s2.0 S0045790623000514 Main
14 pages
Major Project Second Review
No ratings yet
Major Project Second Review
17 pages
Intrusion Detection Using Deep Neural Network Algorithm On The Internet of Things
No ratings yet
Intrusion Detection Using Deep Neural Network Algorithm On The Internet of Things
4 pages
A Project Report
No ratings yet
A Project Report
7 pages
BI
No ratings yet
BI
17 pages
Directory of Sugar Refineries CY 2018 2019
No ratings yet
Directory of Sugar Refineries CY 2018 2019
2 pages
4 Wire Data E&M Fiber Link Card System: User Guide
No ratings yet
4 Wire Data E&M Fiber Link Card System: User Guide
6 pages
Test: Jfo Section 2 Quiz
0% (2)
Test: Jfo Section 2 Quiz
2 pages
PCB & Electronics Hardware
No ratings yet
PCB & Electronics Hardware
84 pages
SPPL Ca 2022 V18a
No ratings yet
SPPL Ca 2022 V18a
284 pages
E-Boda Project Proposal
No ratings yet
E-Boda Project Proposal
11 pages
Pa600 UpgradeManual v2.1 EFGIC
No ratings yet
Pa600 UpgradeManual v2.1 EFGIC
38 pages
Mahesh (7 0)
No ratings yet
Mahesh (7 0)
6 pages
Thinkvision t34w 30 Curved Usermanual
No ratings yet
Thinkvision t34w 30 Curved Usermanual
40 pages
Capstone Project
No ratings yet
Capstone Project
24 pages
Wooden Gear Clock - 9 Steps (With Pictures) - Instructables
No ratings yet
Wooden Gear Clock - 9 Steps (With Pictures) - Instructables
9 pages
Virtual Elements in CDS Views 1731724717
No ratings yet
Virtual Elements in CDS Views 1731724717
7 pages
Implementation of GSM Based Water Meter A Step Towards Automation in Billing System
No ratings yet
Implementation of GSM Based Water Meter A Step Towards Automation in Billing System
4 pages
AIRVO2 Quick Guide
No ratings yet
AIRVO2 Quick Guide
8 pages
Ii-Ii Supply Paid List Dec-2024 (R18)
No ratings yet
Ii-Ii Supply Paid List Dec-2024 (R18)
9 pages
Co-Creating An Industry Standard For Sharing Agricultural Data - Aug 20
No ratings yet
Co-Creating An Industry Standard For Sharing Agricultural Data - Aug 20
13 pages
Judge Eloida R. de Leon-Diaz v. Atty. Ronaldo Antonio v. Calayan Almacen PDF Attorney-Client Privilege Notary Public
No ratings yet
Judge Eloida R. de Leon-Diaz v. Atty. Ronaldo Antonio v. Calayan Almacen PDF Attorney-Client Privilege Notary Public
1 page
Software Project Management Unit-3 - 1 PDF
No ratings yet
Software Project Management Unit-3 - 1 PDF
2 pages
01LAB Anonuevo Justin
No ratings yet
01LAB Anonuevo Justin
5 pages
Hacktricks-Cloud:basic-Github-Information - MD at Master Carlospolop:hacktricks-Cloud GitHub
No ratings yet
Hacktricks-Cloud:basic-Github-Information - MD at Master Carlospolop:hacktricks-Cloud GitHub
1 page
ICT - Tools in Education by Roshan - Sai
No ratings yet
ICT - Tools in Education by Roshan - Sai
1 page
Introduction to Quantum Computing & Machine Learning Technologies: 1, #1
From Everand
Introduction to Quantum Computing & Machine Learning Technologies: 1, #1
M. Sreedevi
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

D (1) (1) Report2 Srushti

Uploaded by

D (1) (1) Report2 Srushti

Uploaded by

VISVESVARAYATECHNOLOGICALUNIVERSITY

BELAGAVI – 590018, KARNATAKA

A Technical Seminar Report on

“A Dynamic Reward-Based Deep Reinforcement Learning For

Signatureof Guide SignatureofCo-ordinator SignatureofHOD

I express my sincere gratitude to our guide,Mr.Chandrashekhar.B.G, Asst.Prof., Dept.

Srushti Siddappa Menasinakai[2SR21EC050]

I, Srushti Siddappa Menasinakai [2SR21EC050], student of 8th semester, BE in electronics and

Srushti Siddappa Menasinakai[2SR21EC050]

3.1 EVOLUTION OF IOT Intrusion Detection 5

3.8(B) Multi Classification 10

SlNo. Title PageNo.

3 SYSTEM ARCHITECTURE 4-6

4 ADVANTAGES, DISADVANTAGES AND 12-14

Department Of ECE, STJIT, Ranebennur 1

Department Of ECE, STJIT, Ranebennur 2

Department Of ECE, STJIT, Ranebennur 3

1. Data Collection Layer (IoT Devices)

2. Preprocessing & Feature Extraction

3. Deep Reinforcement Learning (DRL) Model

4. Intrusion Detection & Classification

5. Feedback & Continuous Learning

Department Of ECE, STJIT, Ranebennur 4

Fig3.1: Evolution of Dynamic Reward-Based Reward Intrusion Detection System

Fig3.2: Features of Dynamic Reward-Based Reward IOT Intrusion Detection System

Department Of ECE, STJIT, Ranebennur 5

1) Deep Reinforcement Learnig(DRL):

Fig3.3: Deep Reinforcement Learning

2) The DRL Feauters Selection:

Fig3.4: DRL Feautre Selection

Department Of ECE, STJIT, Ranebennur 6

B. Dataset and Preprocessing :

Fig 3.5: Feauters of the Bot-Iot Dataset

Department Of ECE, STJIT, Ranebennur 7

Fig3.6: Data Distrubution Chart

C. Deep Reinforcement Learning :

Department Of ECE, STJIT, Ranebennur 8

Fig3.7: DRL-IDS Training Algorithm

Department Of ECE, STJIT, Ranebennur 9

1. Class Imbalance in the Dataset:

2. Binary vs. Multi-Class Classification Complexity:

3. Recognition of Rare Attack Categories:

4. Generalization to Real-World IoT Networks:

5. Computational Complexity & Training Time:

Department Of ECE, STJIT, Ranebennur 10

Fig.3.8: (a)Binary Classification (b)Multi Classification

Department Of ECE, STJIT, Ranebennur 11

1. Smart Cities & Smart Homes :

2. Industrial IoT (IoT) Security :

3. Healthcare IoT Security :

4. Automotive & Connected Vehicles :

5. Enterprise IoT & Financial Security :

6. Critical Infrastructure Protection :

Department Of ECE, STJIT, Ranebennur 12

7. Military & Defense Applications :

 Enhances security in IoT-based surveillance systems, drone networks, and battlefield

2. Longer Training Time :

Department Of ECE, STJIT, Ranebennur 13

1. Smart Cities & Smart Homes :

2. Industrial IoT (IoT) Security :

3. Healthcare IoT Security :

4. Automotive & Connected Vehicles :

5. Enterprise IoT & Financial Security :

Department Of ECE, STJIT, Ranebennur 14

Department Of ECE, STJIT, Ranebennur 15

[4] N. Koroniotis, N. Moustafa, E. Sitnikova, and B. Turnbull, “Towards the development of

[6] K. Kuru, “Definition of Multi-Objective Deep Reinforcement Learning Reward Functions

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.