0% found this document useful (0 votes)

5 views8 pages

Articles Search Project

The document outlines a project aimed at developing a search query system using Deep Learning, specifically Recurrent Neural Networks (RNNs), to improve article retrieval accuracy. It details the methodologies employed, including data preprocessing, model architecture selection, and training processes, while comparing the RNN approach to traditional search methods. The results indicate that the RNN-based model shows potential for enhanced performance in understanding and processing user queries.

Uploaded by

edam.koubaa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views8 pages

Articles Search Project

Uploaded by

edam.koubaa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Report of articles search using Deep Learning project

Directed by: Ahmed Benameur∗ Siwar Ben Gharsallah† Sarra Ben Hadj Slama‡

∗ Author
† Author
‡ Author

1
Contents

1 The problematic 4
1.1 Recurrent Neural Networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 Word Embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Semantic Matching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2 The Project 5
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Data Base used . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.3 Generating data set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.4 Data Preprocessing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.5 Model Architecture Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.6 Model Development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.7 Model Training . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

3 Results and Conclusion 8

2
List of Figures
1 Bibliomatrix sample data base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2 Keywords . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3 Data preprocessing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4 Defining the model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
5 Training the model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
6 Performing a query . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

3
1 The problematic
The goal of this project is to develop a search query system that can understand and process user queries
to retrieve the most relevant articles from a given corpus. Traditional search systems often rely on
keyword matching and simple statistical methods, which can fall short when dealing with complex queries
or nuanced language. By utilizing DL architectures, our system aims to grasp the contextual meaning
of search queries and articles, thereby enhancing retrieval accuracy. The following are 3 approaches to
tackle the problematic of this project.

1.1 Recurrent Neural Networks

Recurrent Neural Networks are a class of artificial neural networks where connections between nodes
form a directed graph along a temporal sequence. This feature allows them to exhibit temporal dynamic
behavior, making RNNs particularly well-suited for tasks involving sequential data, such as natural
language processing (NLP).

1.2 Word Embeddings

Word embeddings are a type of word representation that allows words to be represented as vectors
in a continuous vector space, capturing semantic relationships between words based on their usage in
context. Unlike traditional keyword matching techniques, word embeddings can understand the nuanced
meanings and relationships between words, making them particularly effective for natural language
processing (NLP) tasks.

1.3 Semantic Matching

Semantic matching involves understanding and matching the meaning behind words and phrases rather
than relying solely on exact keyword matches. This approach allows for a deeper comprehension of the
context and intent behind search queries and articles, leading to more accurate and relevant retrieval of
information. By focusing on the semantics of the content, our system aims to overcome the limitations
of traditional keyword-based search methods.

4
2 The Project
2.1 Introduction
Throughout this report, we will detail the steps taken to develop and evaluate our RNN-based search
query system. This includes data preprocessing, model architecture selection, training and optimization
processes, and performance evaluation. We will also compare our RNN approach with traditional search
methods to highlight its advantages and areas for further improvement.
The findings from this project demonstrate the potential of RNNs to significantly enhance search
query performance in article retrieval, paving the way for more intelligent and efficient information
retrieval systems.

2.2 Data Base used

The data base used is generated from the library biblimatrix using biblioshiny interface.

Figure 1: Bibliomatrix sample data base.

5
2.3 Generating data set
Through performing keywords queries, and observing their answers, we managed to generate a dataset
to train our RNN model with.

Figure 2: Keywords

6
2.4 Data Preprocessing
It consisted of three main steps:
Tokenization: Split the sentences into individual words or subword units to create a vocabulary.
Numerical Encoding: Convert words or subword units into numerical representations using the vo-
cabulary.
Padding: Ensure all sequences have the same length by padding shorter sentences with special tokens.

Figure 3: Data preprocessing.

2.5 Model Architecture Selection

After preprocessing the data, We had to choose an appropriate model architecture for the translation
task. We opted for a sequence-to-sequence model based on recurrent neural networks (RNNs).

2.6 Model Development

this model takes input text, processes it through an embedding layer and an LSTM layer to capture
sequential dependencies, passes it through fully connected dense layers to extract higher-level features,
and finally outputs the probability of each class using a sigmoid activation function.

Figure 4: Defining the model

7
2.7 Model Training
Model training was one of the most critical steps in my project. We used a training dataset to adjust
the model’s weights by minimizing the loss function of each layer.

Figure 5: Training the model

3 Results and Conclusion

Figure 6: Performing a query

The search queries model we developed demonstrated promising performance, although further im-
provement is possible with ongoing training and additional adjustments.

LLM Book
No ratings yet
LLM Book
161 pages
Towards AI Search Paradigm
No ratings yet
Towards AI Search Paradigm
63 pages
Panel Kapasitor Bank-Model - PDF 1
No ratings yet
Panel Kapasitor Bank-Model - PDF 1
1 page
7700e SPM
No ratings yet
7700e SPM
2 pages
Aca 21 JDC
No ratings yet
Aca 21 JDC
54 pages
23141091,18201115,19301124,19101116 Cse
No ratings yet
23141091,18201115,19301124,19101116 Cse
53 pages
Mapping The Enterprise: Modeling The Enterprise As Services With Enterprise Canvas 1 / Converted Edition Tom Graves
No ratings yet
Mapping The Enterprise: Modeling The Enterprise As Services With Enterprise Canvas 1 / Converted Edition Tom Graves
54 pages
Rupam's Master Thesis
No ratings yet
Rupam's Master Thesis
58 pages
Full Text 01
No ratings yet
Full Text 01
33 pages
Os Practical
No ratings yet
Os Practical
23 pages
AIP491 SP23AI08 Capstone Project Report
No ratings yet
AIP491 SP23AI08 Capstone Project Report
91 pages
1.machine Learning and Its Applications
No ratings yet
1.machine Learning and Its Applications
75 pages
Thesis Darius Dragnea
No ratings yet
Thesis Darius Dragnea
64 pages
CSE 21-131 Carlsson Lindgren
No ratings yet
CSE 21-131 Carlsson Lindgren
78 pages
Data Sampel Properti & Real Estate
No ratings yet
Data Sampel Properti & Real Estate
6 pages
Automatic Question & Answer Generation Using Generative Large Language Model (LLM)
No ratings yet
Automatic Question & Answer Generation Using Generative Large Language Model (LLM)
52 pages
ENIT TIC Report Template
No ratings yet
ENIT TIC Report Template
35 pages
Improving Retrieval Augmented Generation
No ratings yet
Improving Retrieval Augmented Generation
33 pages
Book Genre Classification Using ML
No ratings yet
Book Genre Classification Using ML
46 pages
(HK241) Convolution Operation
No ratings yet
(HK241) Convolution Operation
6 pages
Wang Asu 0010N 21448
No ratings yet
Wang Asu 0010N 21448
81 pages
Full Text 01
No ratings yet
Full Text 01
66 pages
Sec Sheet 3 Carnot Cycle
No ratings yet
Sec Sheet 3 Carnot Cycle
3 pages
Thesis Philippe Saade
No ratings yet
Thesis Philippe Saade
69 pages
LECTURE 3 - Corporate Image
No ratings yet
LECTURE 3 - Corporate Image
10 pages
Nokia 7730 SXR 1 Series Service Interconnect Routers Data Sheet EN
No ratings yet
Nokia 7730 SXR 1 Series Service Interconnect Routers Data Sheet EN
9 pages
PhoCLIP 232 Specialized Project OFFICIAL
No ratings yet
PhoCLIP 232 Specialized Project OFFICIAL
105 pages
David Coimbra - Dissertacao
No ratings yet
David Coimbra - Dissertacao
74 pages
Report
No ratings yet
Report
55 pages
Evaluation of Text Transformers For Classifying Sentiment of Revi
No ratings yet
Evaluation of Text Transformers For Classifying Sentiment of Revi
104 pages
A M3 RD Ipjn Yd Ps GKF
No ratings yet
A M3 RD Ipjn Yd Ps GKF
20 pages
Condensed Summaries
No ratings yet
Condensed Summaries
419 pages
Introduction (BT4222) YL
No ratings yet
Introduction (BT4222) YL
48 pages
On The Applicability of Deep Learning To Construct Process Models From Natural Text 16 05
No ratings yet
On The Applicability of Deep Learning To Construct Process Models From Natural Text 16 05
66 pages
Unit Test Generation Using Machine Master Thesis Laurence Saes PDF
No ratings yet
Unit Test Generation Using Machine Master Thesis Laurence Saes PDF
64 pages
Aust Cse Thesis Final Book
No ratings yet
Aust Cse Thesis Final Book
72 pages
Deep Learning Based Sentiment
No ratings yet
Deep Learning Based Sentiment
62 pages
Statistical Machine Learning For Information Retrieval - Adam Berger PDF
No ratings yet
Statistical Machine Learning For Information Retrieval - Adam Berger PDF
147 pages
Clapingo Android Internship Assignment
No ratings yet
Clapingo Android Internship Assignment
5 pages
SentimentAnalysisOfIMDBMovie Reviews
No ratings yet
SentimentAnalysisOfIMDBMovie Reviews
60 pages
16 Mikami
No ratings yet
16 Mikami
27 pages
Malware - Detection - Using - Neural - Networks (Main Paper)
No ratings yet
Malware - Detection - Using - Neural - Networks (Main Paper)
51 pages
A Survey On Large Language Models With Some Insights
No ratings yet
A Survey On Large Language Models With Some Insights
174 pages
Aman
No ratings yet
Aman
71 pages
STR-W6753: Universal-Input/58 W Off-Line Quasi-Resonant Flyback Switching Regulator
No ratings yet
STR-W6753: Universal-Input/58 W Off-Line Quasi-Resonant Flyback Switching Regulator
8 pages
Master Thesis
No ratings yet
Master Thesis
100 pages
1 AI - Introduction and ML
No ratings yet
1 AI - Introduction and ML
32 pages
Design and Analysis of CNN-Based Skin Disease Detection System With Preliminary Diagnosis
No ratings yet
Design and Analysis of CNN-Based Skin Disease Detection System With Preliminary Diagnosis
13 pages
Thesis RAG Retrieval Augmented Generation For The IR-Anthology
No ratings yet
Thesis RAG Retrieval Augmented Generation For The IR-Anthology
83 pages
FYP Proposal
No ratings yet
FYP Proposal
18 pages
BERT Model
No ratings yet
BERT Model
69 pages
Review Exercise On Analytics: 1. Define AI and Specify The Role of AI
No ratings yet
Review Exercise On Analytics: 1. Define AI and Specify The Role of AI
5 pages
Computer Science 2
No ratings yet
Computer Science 2
66 pages
Cristian-Stefan Tutuianu PDF
No ratings yet
Cristian-Stefan Tutuianu PDF
40 pages
Sentiment Classification With Deep Neural Networks: Yi Zhou
No ratings yet
Sentiment Classification With Deep Neural Networks: Yi Zhou
58 pages
Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach To AWS
No ratings yet
Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach To AWS
1 page
Training The Application of LLM
No ratings yet
Training The Application of LLM
68 pages
CS985 Project FrankMitchell BiP Solutions
No ratings yet
CS985 Project FrankMitchell BiP Solutions
66 pages
Deep Learning of Semantic Word Representations To Implement A Content-Based Recommender For The Recsys Challenge'14
No ratings yet
Deep Learning of Semantic Word Representations To Implement A Content-Based Recommender For The Recsys Challenge'14
5 pages
Case Study
No ratings yet
Case Study
11 pages
Lebanese International University: CSCI345 - Digital Logic Assignment 1
No ratings yet
Lebanese International University: CSCI345 - Digital Logic Assignment 1
5 pages
Hik ProConnect Mobile Client User Manual
No ratings yet
Hik ProConnect Mobile Client User Manual
44 pages
Complete NLP Guide - From Fundamentals To Deep Learning With TensorFlow
No ratings yet
Complete NLP Guide - From Fundamentals To Deep Learning With TensorFlow
13 pages
Predicting User Interaction On Social Media Using Machine Learnin
No ratings yet
Predicting User Interaction On Social Media Using Machine Learnin
76 pages
Deep Learning Notes (1) 2
No ratings yet
Deep Learning Notes (1) 2
54 pages
Neural Transfer Learning For NLP
No ratings yet
Neural Transfer Learning For NLP
329 pages
Unidad de Corte 5510
No ratings yet
Unidad de Corte 5510
20 pages
Rapport
No ratings yet
Rapport
106 pages
IEC 61850 Process Bus
No ratings yet
IEC 61850 Process Bus
3 pages
REPORT-MTechPESJul23BGrp2-3 (22-02-25)
No ratings yet
REPORT-MTechPESJul23BGrp2-3 (22-02-25)
15 pages
Augmenting LLMs Survey
No ratings yet
Augmenting LLMs Survey
33 pages
Sayiqa - AI Engineer
No ratings yet
Sayiqa - AI Engineer
4 pages
Content-Based Image Retrieval Using Deep Learning
No ratings yet
Content-Based Image Retrieval Using Deep Learning
44 pages
Introduction To: Energy Modelling & Building Simulation
No ratings yet
Introduction To: Energy Modelling & Building Simulation
14 pages
3 Hproblems
No ratings yet
3 Hproblems
8 pages
Thesis Anum Afzal
No ratings yet
Thesis Anum Afzal
127 pages
2018 Grade 11 Mathematics Third Term Test Paper Sabaragamuwa Province
No ratings yet
2018 Grade 11 Mathematics Third Term Test Paper Sabaragamuwa Province
12 pages
B V M Catalogue
No ratings yet
B V M Catalogue
24 pages
Extreme Privacy - Mobile Devices
100% (6)
Extreme Privacy - Mobile Devices
135 pages
LAB6
50% (2)
LAB6
5 pages
Mariem Abidi Rapport PFE 2020 Final
No ratings yet
Mariem Abidi Rapport PFE 2020 Final
101 pages
Book Rust Devils
92% (12)
Book Rust Devils
39 pages
527260-002F CE840 UserGuide
No ratings yet
527260-002F CE840 UserGuide
100 pages
GEA Marine Purifiers For Motor Yachts - tcm11-83673
No ratings yet
GEA Marine Purifiers For Motor Yachts - tcm11-83673
6 pages
Đề luyện thi học sinh giỏi 2
No ratings yet
Đề luyện thi học sinh giỏi 2
17 pages
Tos Tle Cookery Third Quarter Bahian
100% (1)
Tos Tle Cookery Third Quarter Bahian
2 pages
Deep Learning Based Recommendation Systems
No ratings yet
Deep Learning Based Recommendation Systems
47 pages
Internship Presentation
No ratings yet
Internship Presentation
16 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Articles Search Project

Uploaded by

Articles Search Project

Uploaded by

Report of articles search using Deep Learning project

3 Results and Conclusion 8

1.1 Recurrent Neural Networks

1.2 Word Embeddings

1.3 Semantic Matching

2.2 Data Base used

Figure 1: Bibliomatrix sample data base.

Figure 3: Data preprocessing.

2.5 Model Architecture Selection

2.6 Model Development

Figure 4: Defining the model

Figure 5: Training the model

3 Results and Conclusion

Figure 6: Performing a query

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.