PROJECT REPORT
Introduction
This code is an implementation of a next word prediction model using a GRU (Gated Recurrent Unit)
neural network architecture. The goal of the model is to predict the next word in a given sequence of
words.
The code starts by importing the necessary libraries and loading the data from a text file. The data is
then cleaned and tokenized using the Keras Tokenizer class. The sequences of words are split into
input/output pairs, where the input is a single word and the output is the next word in the sequence.
The GRU model is then defined using the Keras Sequential API. The model consists of an embedding layer,
a GRU layer with 128 units, and a dense output layer with softmax activation. The model is compiled
with the categorical cross-entropy loss and the Adam optimizer.
The model is trained on the input/output pairs using the fit() function, and several callbacks are used to
monitor the training process and save the best model. The model is then used to predict the next word
in a given input sequence.
Overall, this code demonstrates how to build and train a GRU-based next word prediction model using
Keras.
Literature Review
Next Word Prediction has become an important task in Natural Language Processing. It has various
applications in text completion, machine translation, and speech recognition. The prediction model can
be built using various techniques such as n-grams, Hidden Markov Models, and neural networks.
Recently, Recurrent Neural Networks (RNNs) have become very popular for Next Word Prediction
because of their ability to model sequential data. Long Short-Term Memory (LSTM) and
Gated Recurrent Unit (GRU) are the most commonly used RNN architectures for this task. LSTM is
better suited to longer sequences, while GRU is less computationally expensive and has shown better
performance in some cases.
Tokenization is a crucial step in Next Word Prediction. It involves converting the input text into
numerical sequences; the Tokenizer class in Keras can be used for this purpose. The sequences are then
split into input (X) and output (y) variables, which are used to train the LSTM and GRU models.
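As a minimal illustration of this step, the Keras Tokenizer maps words to integer ids and converts text into sequences of those ids. The toy sentence below is invented for the example and is not the project's dataset:

from tensorflow.keras.preprocessing.text import Tokenizer

# Toy corpus used only to illustrate tokenization; not the project's dataset.
corpus = ["one morning gregor samsa woke from troubled dreams"]

tokenizer = Tokenizer()
tokenizer.fit_on_texts(corpus)                     # builds the word -> integer index
sequence = tokenizer.texts_to_sequences(corpus)[0]

print(tokenizer.word_index)                        # e.g. {'one': 1, 'morning': 2, ...}
print(sequence)                                    # the sentence as a list of integer ids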
Methodology
Libraries: The required libraries (the Tokenizer and to_categorical utilities, the Keras model and layer classes, the training callbacks, and pickle) are imported at the top of the script.
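Based on the classes and functions referenced in the sections below, the import block looks roughly like this (exact module paths can differ between TensorFlow versions):

import pickle
import numpy as np

from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.utils import to_categorical
from tensorflow.keras.models import Sequential, load_model
from tensorflow.keras.layers import Embedding, LSTM, GRU, Dense
from tensorflow.keras.callbacks import ModelCheckpoint, ReduceLROnPlateau, TensorBoard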
Data Preprocessing: The Tokenizer class from keras.preprocessing.text is used to tokenize the data.
The tokenizer is fitted on the cleaned text and saved using the pickle library. The sequences variable
is created by sliding a window of size 2 over the tokenized data, so each window becomes a pair of
integers in which the first integer is the input word and the second is the output word. The inputs are
stored in the variable X and the outputs in the variable y; the outputs are converted into categorical
(one-hot) format using the to_categorical function from keras.utils.
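A sketch of this preprocessing step, assuming the cleaned corpus is available as a single string; the file and variable names are assumptions made for the illustration:

import pickle
import numpy as np
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.utils import to_categorical

data = open("metamorphosis_clean.txt", encoding="utf-8").read()   # file name is an assumption

tokenizer = Tokenizer()
tokenizer.fit_on_texts([data])
pickle.dump(tokenizer, open("tokenizer.pkl", "wb"))               # save the fitted tokenizer

tokens = tokenizer.texts_to_sequences([data])[0]
vocab_size = len(tokenizer.word_index) + 1                        # word indices start at 1

# Slide a window of size 2 over the token stream: [current word, next word].
sequences = np.array([tokens[i - 1:i + 1] for i in range(1, len(tokens))])

X = sequences[:, :1]                                              # input word, shape (N, 1)
y = to_categorical(sequences[:, 1], num_classes=vocab_size)       # next word, one-hot encoded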
Model Architecture: Two different models are created for the same task with different architectures:
LSTM and GRU. The architecture of the LSTM model consists of an embedding layer, two LSTM
layers, and two Dense layers. The architecture of the GRU model consists of an embedding layer, a
GRU layer, and a Dense layer.
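Sketches of the two architectures follow. The 128-unit GRU layer is taken from the Introduction; the embedding dimension and the LSTM/Dense layer sizes are assumptions, and vocab_size is the vocabulary size computed in the preprocessing step:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, GRU, Dense

vocab_size = len(tokenizer.word_index) + 1   # from the preprocessing step above

# LSTM model: embedding layer, two LSTM layers, two Dense layers.
lstm_model = Sequential([
    Embedding(vocab_size, 10),
    LSTM(1000, return_sequences=True),       # return the full sequence for the second LSTM layer
    LSTM(1000),
    Dense(1000, activation="relu"),
    Dense(vocab_size, activation="softmax"),
])

# GRU model: embedding layer, one 128-unit GRU layer, one Dense output layer.
gru_model = Sequential([
    Embedding(vocab_size, 10),
    GRU(128),
    Dense(vocab_size, activation="softmax"),
])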
Callbacks: The following callbacks are used during training; a configuration sketch follows the list:
• ModelCheckpoint is used to save the best model based on the loss value.
• ReduceLROnPlateau is used to reduce the learning rate when the loss value does not improve
for 3 epochs.
• TensorBoard is used to visualize the training process.
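A configuration along these lines would set up the three callbacks; the checkpoint path, the reduction factor, and the log directory are assumptions:

from tensorflow.keras.callbacks import ModelCheckpoint, ReduceLROnPlateau, TensorBoard

checkpoint = ModelCheckpoint("next_word_model.h5",   # file name is an assumption
                             monitor="loss",
                             save_best_only=True,    # keep only the best model by loss value
                             verbose=1)
reduce_lr = ReduceLROnPlateau(monitor="loss",
                              factor=0.2,            # reduction factor is an assumption
                              patience=3,            # 3 epochs without improvement
                              verbose=1)
tensorboard = TensorBoard(log_dir="logs")            # log directory is an assumption

callbacks = [checkpoint, reduce_lr, tensorboard]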
Compilation and Training
The model is compiled with its compile method, using the categorical_crossentropy loss function and
the Adam optimizer, and trained with its fit method.
Model Training: We train the model with the Adam optimizer and the categorical cross-entropy loss. We
monitor performance on a validation set during training to check that the model is not overfitting, and
we experiment with different hyperparameters to find suitable values.
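Putting compilation and training together for the GRU model (the number of epochs, batch size, and validation split are assumptions; the LSTM model is compiled and trained the same way):

gru_model.compile(loss="categorical_crossentropy",
                  optimizer="adam",
                  metrics=["accuracy"])

history = gru_model.fit(X, y,
                        epochs=50,               # assumption
                        batch_size=64,           # assumption
                        validation_split=0.1,    # held-out data to watch for overfitting
                        callbacks=callbacks)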
Prediction: Either of the two trained models, LSTM or GRU, can be selected for next word
prediction. The user enters a sentence, and the last word of the sentence is used as input to predict the
next word. The load_model function from tensorflow.keras.models is used to load the selected model,
and the saved tokenizer is loaded using pickle. The Predict_Next_Words function takes the loaded
model, the loaded tokenizer, and the input text as input and predicts the next word of the input text. The
user can choose to stop the script by typing "stop the script".
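A sketch of this prediction loop; the model and tokenizer file names are assumptions, while the Predict_Next_Words function and the "stop the script" command follow the description above:

import pickle
import numpy as np
from tensorflow.keras.models import load_model

def Predict_Next_Words(model, tokenizer, text):
    # Predict the next word given a single input word.
    encoded = tokenizer.texts_to_sequences([text])[0]
    if not encoded:                                   # word was not seen during training
        return None
    sequence = np.array([encoded[:1]])                # shape (1, 1): one sample, one word
    predicted_id = int(np.argmax(model.predict(sequence, verbose=0)))
    for word, index in tokenizer.word_index.items():  # map the index back to its word
        if index == predicted_id:
            return word
    return None

model = load_model("next_word_model.h5")              # file name is an assumption
tokenizer = pickle.load(open("tokenizer.pkl", "rb"))

while True:
    text = input("Enter a sentence (or type 'stop the script' to exit): ").strip()
    if text.lower() == "stop the script":
        break
    if not text:
        continue
    last_word = text.split()[-1]                      # only the last word is used as input
    print(Predict_Next_Words(model, tokenizer, last_word))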
Model Improvement: We explore ways to improve the performance of the model. For example, we
can use a larger training dataset, fine-tune the hyperparameters, or incorporate additional features such
as part-of-speech tags or named entities. We also experiment with different architectures, such as adding
more LSTM layers or using a different type of recurrent neural network.
Overall, these methods provide a framework for building a next word prediction AI model using the
TensorFlow library. By following these steps and experimenting with different approaches, we can
build a model that accurately predicts the next word in a sequence and generates high-quality text.
Conclusion
The project demonstrates how to train a model for next word prediction using LSTM and GRU
architectures. The model is trained on the text corpus of the book "Metamorphosis" by Franz Kafka.
The trained models can then be used to predict the next word for a given input word.
References
• Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural computation, 9(8),
1735-1780.
• Cho, K., Van Merriënboer, B., Bahdanau, D., & Bengio, Y. (2014). On the properties of neural
machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259.
• Chung, J., Gulcehre, C., Cho, K., & Bengio, Y. (2014). Empirical evaluation of gated recurrent
neural networks on sequence modeling. arXiv preprint arXiv:1412.3555.
• Karpathy, A. (2015). The unreasonable effectiveness of recurrent neural networks. Andrej
Karpathy blog.
• Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning (Vol. 1). MIT Press.