Stock Price Prediction NEPSE
A Project Report
Submitted by:
Submitted to:
College of Applied Business and Technology
Department of Science and Technology
Table of Contents
ACKNOWLEDGEMENT
ABSTRACT
LIST OF ABBREVIATIONS
CHAPTER 1: INTRODUCTION
    Introduction
    Objectives
    Report Organization
CHAPTER 2: REQUIREMENT ANALYSIS AND FEASIBILITY STUDY
    Literature Review
    Problem Statement
    Objective of Study
    Non-Functional Requirements
    Data Requirements
    Hardware Requirements
    Software Requirements
    Deployment Requirements
    Testing Requirements
    Maintenance Requirements
    Feasibility Study
        Technical Feasibility
        Operational Feasibility
        Economic Feasibility
CHAPTER 3: SYSTEM DESIGN
    Methodology
    Data Collection
    LSTM
    Random Forest
    XGBoost
    LightGBM
CHAPTER 4: IMPLEMENTATION AND TESTING
    Implementation
    Tools Used
    Evaluation and Testing
CHAPTER 5: CONCLUSION AND RECOMMENDATION
    Conclusion
    Future Enhancements
REFERENCES
APPENDIX
ACKNOWLEDGEMENT
It gives us immense pleasure to express our deepest sense of gratitude and sincere thanks to
our highly respected and esteemed guide Mr. Tekendra Nath Yogi, for his valuable guidance,
encouragement, and help in completing this work. His useful suggestions for this whole work
and co-operative behavior are sincerely acknowledged.
Finally, we would like to express our sincere thanks to all our friends and others who
helped us directly or indirectly during this project work.
Alish Dahal (102)
Bibek Mainali (107)
Nischal Shrestha (119)
Sayastha Karki (127)
ABSTRACT
Stock Price Prediction Using Machine Learning: An Overview
Stock price prediction using machine learning is a complex yet promising endeavor in the
realm of financial markets. The volatile and dynamic nature of stock markets, influenced by
various factors such as economic indicators, geopolitical events, and company performance,
makes accurate prediction challenging. However, the application of machine learning
techniques, particularly Long Short-Term Memory (LSTM) networks, has shown potential in
forecasting stock prices to a certain extent.
Keywords: machine learning, Python, Google Colab
LIST OF ABBREVIATIONS

API: Application Programming Interface
CNN: Convolutional Neural Network
CSS: Cascading Style Sheets
EFB: Exclusive Feature Bundling
GBDT: Gradient Boosting Decision Tree
GOSS: Gradient-based One-Side Sampling
GUI: Graphical User Interface
HTML: HyperText Markup Language
LSTM: Long Short-Term Memory
NEPSE: Nepal Stock Exchange
RMSE: Root Mean Squared Error
RNN: Recurrent Neural Network
SVM: Support Vector Machine
INTRODUCTION
Introduction
Stock price prediction using machine learning is a fascinating field that leverages advanced
computational techniques to forecast the future values of stocks traded on the financial
markets. It's a complex and challenging task due to the inherent volatility and uncertainty in
stock markets, influenced by various factors such as economic indicators, market sentiment,
geopolitical events, and company performance. Machine learning models, particularly Long Short-Term Memory (LSTM) networks, offer a promising approach to analyzing historical stock data and making informed predictions about future stock prices. We will use LSTM, SVM, Random Forest, and Gradient Boosting methods (XGBoost and LightGBM), trained on historical stock price data, to predict future stock closing prices.
Objectives
The objectives of this project are to:
Achieve financial gain by forecasting future closing prices to inform trading decisions.
Support risk management by anticipating price movements so that investment risk can be reduced.
Increase prediction accuracy through model comparison and hyperparameter tuning.
Report Organization
The report is organized into five chapters, each of which discusses a different aspect of the project, from the problem statement, objectives, and scope to the recommendations for future work.

Chapter 1: Introduction
This chapter introduces the project, including the objectives and scope, and gives a brief overview of how the report is organized.

Chapter 2: Requirement Analysis and Feasibility Study
This chapter reviews prior work on stock price prediction, states the problem and the objective of the study, and analyzes the requirements of the system. It also assesses the technical, operational, and economic feasibility of the project.

Chapter 3: System Design
This chapter describes the methodology used for this project, including data collection, the LSTM, SVM, Random Forest, XGBoost, and LightGBM models, and the training and evaluation procedures.

Chapter 4: Implementation and Testing
This chapter describes the implementation of the system, the tools used, and the evaluation and testing of the models on the NEPSE dataset.

Chapter 5: Conclusion and Recommendation
This chapter summarizes the main findings of the project and provides recommendations for future work.
REQUIREMENT ANALYSIS AND FEASIBILITY STUDY
Literature Review
Stock price prediction refers to predicting the prices at which trading will occur at some time in the future, based on historical and real-time data from the stock market and a chosen forecasting model. Such prediction plays an important and positive role in improving the efficiency of the trading market and in making market signals useful. An accurate stock price forecast can help investors adjust their trading strategies in time and effectively avoid investment risks, so as to obtain higher returns. Price prediction has long been practiced in all kinds of trading markets. However, because of the influence of many factors, including not only the internal dynamics of the stock market but also sudden external shocks, the predictions of some existing stock price models are far from perfect. Using existing techniques and improving existing algorithms, the predictions can be brought closer to the actual situation. Therefore, we need to further improve the algorithms and models, make use of the historical data available, and extract valuable information from it to achieve more accurate stock price prediction.
Problem Statement
While there are many success stories of investors making huge profits in the stock market,
there are also stories of investors losing everything due to poor decisions. The issue is the strategy that investors bring to the stock market, rather than the stock market itself. Many investors are unclear about whether it is advisable to buy a particular stock or what is required to be successful in the market.
Objective of Study
To use machine learning methods to predict stock prices using historical market data.
Non-Functional Requirements
The stock price prediction system must meet the following non-functional requirements:
Accuracy: The system must produce accurate stock price predictions with a high confidence score.
Speed: The system must be able to generate predictions in near real time.
Robustness: The system must be able to handle variations in market conditions, such as crashes and rallies.
Scalability: The system must be able to scale to handle a large number of prediction requests.
Data Requirements
The stock price prediction system will require a dataset of historical stock prices. The dataset should cover a variety of times, days, and years, and should include the actual closing prices, which serve as the ground-truth targets for training and evaluation.
Hardware Requirements
The stock price prediction system will require a computer with a GPU. The GPU will be used to accelerate training of the models.
Software Requirements
The stock price prediction system will require the following software libraries:
Pandas
NumPy
scikit-learn (SVM and Random Forest models)
XGBoost and LightGBM (gradient boosting)
TensorFlow/Keras (LSTM model)
Deployment Requirements
The stock price prediction system can be deployed on a variety of platforms, including:
On-premises servers
Cloud-based platforms
Testing Requirements
The stock price prediction system will be tested using the following methods:
Unit testing
Integration testing
System testing
User acceptance testing
Maintenance Requirements
The stock price prediction system will require regular maintenance to ensure that it continues to function properly. The maintenance tasks will include retraining the models as new market data becomes available and monitoring prediction accuracy over time.
Feasibility Study
This feasibility study assesses the feasibility of developing a stock price prediction system using historical price data. The study considers the following factors:
Technical Feasibility
There are a number of publicly available datasets of historical stock prices that can be used for training and testing.
The project uses a number of Python libraries, including NumPy, Pandas, Pickle, Matplotlib, scikit-learn, XGBoost, LightGBM, and Keras. These libraries provide a variety of features that are essential for the project, such as data manipulation, data visualization, and model training.
Operational Feasibility
The operational feasibility of the project is also high. The RNN-based stock price prediction
system can be deployed on a variety of platforms, including embedded systems, cloud-based
systems, and mobile devices. Additionally, the system can be easily integrated with existing
stock management systems.
Economic Feasibility
The economic feasibility of the project is also good. The development costs of the project will depend on the complexity of the system, the hardware platform that is used, and the size of the data sets. However, in general, the development of such a system can be relatively cost-effective.
SYSTEM DESIGN

Methodology
First, we need to load the stock data from a CSV file. We will be using the 'NEPSE_10years' file from the NEPSE website, covering 2013 to 2023. The data will be stored in a Pandas DataFrame, and we will extract the 'Close' column for further processing.
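A minimal sketch of this loading step, assuming the export is saved as NEPSE_10years.csv with a 'Close' column:

import pandas as pd

df = pd.read_csv('NEPSE_10years.csv')  # assumed file name for the NEPSE export
close_prices = df['Close']             # closing prices for further processing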
Next we move on to some data pre-processing steps. Pre-processing is essential for improving data quality and handling missing values and outliers. It involves normalization to bring features to a common scale, which enhances model performance and ensures the data are compatible with the analysis techniques. Here we reshape the data to have a single feature and normalize it to the range [0, 1]. This normalization step ensures that all features have the same scale and improves the model's convergence. Before processing the data, NumPy is imported for its efficient numerical computing capabilities: it provides N-dimensional arrays for handling large datasets and a wide range of mathematical functions for array operations.
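The reshaping and normalization just described, as a short sketch (dividing by the series maximum maps the positive prices into (0, 1]):

import numpy as np

data = close_prices.values.reshape(-1, 1)  # a single feature column
data_normalized = data / np.max(data)      # scale into the [0, 1] range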
We then split the data into training and testing sets. Here, we assign 80% of the data for training and the remaining 20% for testing. The training data will be used to train our models, while the testing data will be used to evaluate their performance. The training data are used to learn patterns and features, and the test data are used to evaluate each model's performance on unseen data. This ensures the models' generalization capability.
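A sketch of the split; note that it preserves time order, since shuffling a time series would leak future information into training:

train_size = int(len(data_normalized) * 0.8)  # 80% for training
train_data = data_normalized[:train_size]
test_data = data_normalized[train_size:]      # remaining 20% for testing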
LSTM Model: We start with a Long Short-Term Memory (LSTM) model, a type of recurrent neural network (RNN) suitable for sequential data like stock prices. LSTM models are known for their ability to capture temporal dependencies and make accurate predictions. In this code, TensorFlow, an open-source deep learning library developed by Google, is used to build and train the models efficiently using neural networks.

Sequential is a class in TensorFlow's Keras API that creates a linear stack of layers in a neural network. Dense is a layer type used for fully connected layers, where every neuron is connected to every neuron in the previous and next layers. Dropout is a regularization technique used to prevent overfitting: it randomly sets a fraction of input units to zero during training, which reduces the co-adaptation of neurons and improves the generalization of the model. Adam is an optimization algorithm for training neural networks; it is an adaptive learning rate method that combines the benefits of AdaGrad and RMSprop, dynamically adjusting the learning rate for each parameter, which makes it well suited to a wide range of deep learning tasks. We define a function, create_lstm_model, to create the LSTM model with the given hyperparameters. The function takes the number of LSTM units, the activation function, and the learning rate as inputs. The model consists of an LSTM layer followed by a dense output layer. We use the Adam optimizer and mean squared error loss for training the model.
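A sketch of create_lstm_model consistent with that description and the appendix code (the (1, 1) input shape assumes one time step with one feature):

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.optimizers import Adam

def create_lstm_model(units, activation, learning_rate):
    # One LSTM layer followed by a dense output layer, as described above
    model = Sequential([
        LSTM(units, activation=activation, input_shape=(1, 1)),
        Dense(1),
    ])
    model.compile(optimizer=Adam(learning_rate=learning_rate),
                  loss='mean_squared_error')
    return model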
Next, we define a set of hyperparameters for tuning the LSTM model. We specify different values for the number of LSTM units, activation functions, and learning rates, and perform a grid search to find the best combination of hyperparameters. We initialize a variable, best_rmse, with a high value to keep track of the best root mean squared error (RMSE) achieved by the LSTM model. Root mean squared error is calculated with the following formula:

$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}$$

where $y_i$ is the actual closing price, $\hat{y}_i$ the predicted price, and $n$ the number of predictions.
We will update this value as we find better models during the grid search. Now, we iterate
over all possible combinations of hyperparameters and train the LSTM models. For each
combination, we create the LSTM model, train it on the training data, and make predictions
on the testing data. We calculate the RMSE between the actual and predicted values and
update the best_rmse and best_lstm_model variables if we find a model with a lower RMSE.
After finding the best LSTM model, we make predictions on the entire dataset and inverse
normalize the predictions to obtain the actual stock prices.
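A sketch of the search loop follows. The learning rates, epoch count, and batch size come from the appendix; the unit and activation candidates are illustrative assumptions, and the inputs are one-step-lagged windows built from the train and test sets above:

import itertools

import numpy as np
from sklearn.metrics import mean_squared_error

# One-step-lagged inputs/targets: predict the price at t+1 from the price at t
X_train, y_train = train_data[:-1].reshape(-1, 1, 1), train_data[1:]
X_test, y_test = test_data[:-1].reshape(-1, 1, 1), test_data[1:]

lstm_units_grid = [50, 100]          # illustrative candidates
activations = ['tanh', 'relu']       # illustrative candidates
learning_rates = [0.001, 0.01, 0.1]  # from the appendix

best_rmse = float('inf')             # start high; the first model always improves on it
best_lstm_model = None
for units, activation, lr in itertools.product(lstm_units_grid, activations, learning_rates):
    model = create_lstm_model(units, activation, lr)
    model.fit(X_train, y_train, epochs=100, batch_size=32, verbose=0)
    test_predictions = model.predict(X_test, verbose=0)
    rmse = np.sqrt(mean_squared_error(y_test, test_predictions))
    if rmse < best_rmse:             # keep the best model found so far
        best_rmse, best_lstm_model = rmse, model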
SVM Model: We now explore Support Vector Machines (SVM). In its regression form (SVR), the SVM finds the hyperplane (or, with a non-linear kernel, the function) that best fits the data points within a tolerance margin. We initialize an SVM model and define a set of hyperparameters to tune using a grid search. The grid search selects the best hyperparameters based on the negative mean squared error score. We fit the SVM model to the data and make predictions on the entire dataset.
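A sketch under the same lagging assumption, flattened to the 2-D arrays scikit-learn expects; the parameter grid is the one listed in the appendix:

from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVR

# Lagged features/targets over the whole normalized series
X = data_normalized[:-1]          # price at time t
y = data_normalized[1:].ravel()   # price at time t+1

svm_params = {
    'C': [0.1, 1, 10],
    'gamma': [0.01, 0.1, 1],
}
svm_search = GridSearchCV(SVR(), svm_params, scoring='neg_mean_squared_error', cv=3)
svm_search.fit(X, y)
svm_predictions = svm_search.predict(X)  # predictions on the entire dataset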
Random Forest Model: Next, we explore the Random Forest model, which is an ensemble
model consisting of multiple decision trees. Random Forest models are known for their
robustness and ability to handle complex relationships in the data. We initialize a Random
Forest model and define a set of hyperparameters for tuning. Similar to the SVM model, we
use a grid search to find the best combination of hyperparameters based on the negative mean
squared error. We fit the Random Forest model to the data and make predictions on the entire
dataset.
Gradient Boosting Methods: Lastly, we explore two popular gradient boosting methods,
XGBoost and LightGBM. These models are known for their efficiency and excellent
performance in many machine learning tasks. We follow a similar approach for both
XGBoost and LightGBM models. We initialize the models and define a set of
hyperparameters for tuning using a grid search. We fit the models to the data and make
predictions on the entire dataset.
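The same grid-search pattern applies to all three ensemble models; a sketch with the parameter grids from the appendix, reusing the lagged X and y from the SVM step:

from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV
from xgboost import XGBRegressor
from lightgbm import LGBMRegressor

models_and_grids = {
    'Random Forest': (RandomForestRegressor(),
                      {'n_estimators': [50, 100, 200], 'max_depth': [None, 5, 10]}),
    'XGBoost':       (XGBRegressor(),
                      {'learning_rate': [0.1, 0.01, 0.001], 'max_depth': [3, 5, 7]}),
    'LightGBM':      (LGBMRegressor(),
                      {'learning_rate': [0.1, 0.01, 0.001], 'max_depth': [3, 5, 7]}),
}

predictions_by_model = {}
for name, (estimator, grid) in models_and_grids.items():
    search = GridSearchCV(estimator, grid, scoring='neg_mean_squared_error', cv=3)
    search.fit(X, y)
    predictions_by_model[name] = search.predict(X)  # predictions on the entire dataset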
To evaluate the models' performance, we calculate the root mean squared error (RMSE) between the actual and predicted stock prices. Lower RMSE values indicate better model performance. We plot the actual and predicted values for each model to visualize their predictions, as sketched below. Finally, we plot the predictions made by the best LSTM model alongside the actual stock prices.
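A sketch of the evaluation loop, matching the plotting fragment in the appendix and reusing data, y, and predictions_by_model from the sketches above; rescaling by the series maximum undoes the earlier normalization:

import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import mean_squared_error

scaling_factor = float(np.max(data))   # undo the max-normalization
actual_prices = y * scaling_factor

for model_name, model_prediction in predictions_by_model.items():
    predicted_prices = model_prediction * scaling_factor
    model_rmse = np.sqrt(mean_squared_error(actual_prices, predicted_prices))
    plt.plot(actual_prices, label='Actual')
    plt.plot(predicted_prices, label='Predicted')
    plt.title(f"{model_name} - RMSE: {model_rmse:.2f}")
    plt.xlabel('Time')
    plt.ylabel('Stock Price')
    plt.legend()
    plt.show()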
So far, we have explored various machine-learning models for predicting stock prices. We
started with an LSTM model to capture temporal dependencies in the data. Then, we explored
other models like Support Vector Machines, Random Forest, XGBoost, and LightGBM. We
evaluated their performance using the root mean squared error (RMSE) metric and visualized
the predictions. The LSTM model demonstrates superior predictive performance, exhibiting the lowest root mean squared error (RMSE) among all the models. This highlights the strength and effectiveness of the LSTM model in forecasting stock prices accurately. Next, we will forecast future prices using the LSTM model.
Data Collection
The preprocessed dataset was downloaded from the Nepal Stock Exchange (NEPSE) website. The data set spans the ten years from 2013 to 2023. The dataset was divided into a train set and a test set: the train set was used to train the models and the test set was used to test the accuracy of the models.

LSTM
A traditional RNN has a single hidden state that is passed through time, which can
make it difficult for the network to learn long-term dependencies. LSTMs address
this problem by introducing a memory cell, which is a container that can hold
information for an extended period of time. The memory cell is controlled by three
gates: the input gate, the forget gate, and the output gate. These gates decide what
information to add to, remove from, and output from the memory cell.
The input gate controls what information is added to the memory cell. The forget
gate controls what information is removed from the memory cell. And the output
gate controls what information is output from the memory cell. This allows LSTM
networks to selectively retain or discard information as it flows through the network,
which allows them to learn long-term dependencies.
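In standard notation, with $\sigma$ the logistic sigmoid, $\odot$ elementwise multiplication, $x_t$ the input, $h_t$ the hidden state, and $c_t$ the memory cell, the gate updates can be written as:

$$
\begin{aligned}
f_t &= \sigma\left(W_f [h_{t-1}, x_t] + b_f\right) && \text{(forget gate)}\\
i_t &= \sigma\left(W_i [h_{t-1}, x_t] + b_i\right) && \text{(input gate)}\\
o_t &= \sigma\left(W_o [h_{t-1}, x_t] + b_o\right) && \text{(output gate)}\\
\tilde{c}_t &= \tanh\left(W_c [h_{t-1}, x_t] + b_c\right) && \text{(candidate memory)}\\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t && \text{(memory cell update)}\\
h_t &= o_t \odot \tanh(c_t) && \text{(hidden state output)}
\end{aligned}
$$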
LSTMs can be stacked to create deep LSTM networks, which can learn even more complex patterns in sequential data. LSTMs can also be used in combination with other neural network architectures, such as Convolutional Neural Networks (CNNs), for image and video analysis. An LSTM has a chain structure that contains four neural network layers and different memory blocks called cells.

Random Forest
Random Forest is a popular machine learning algorithm that belongs to the supervised learning techniques. It can be used for both classification and regression problems in ML. It is based on the concept of ensemble learning, the process of combining multiple learners to solve a complex problem and improve the performance of the model. Instead of relying on one decision tree, the random forest takes the prediction from each tree and predicts the final output based on the majority vote of the predictions (or, for regression, their average).
XGBoost
XGBoost is an implementation of gradient-boosted decision trees. XGBoost models frequently dominate in Kaggle competitions.
In this algorithm, decision trees are created in sequential form. Weights play an important
role in XGBoost. Weights are assigned to all the independent variables which are then fed
into the decision tree which predicts results. The weight of variables predicted wrong by the
tree is increased and these variables are then fed to the second decision tree. These
individual classifiers/predictors then ensemble to give a strong and more precise model. It
can work on regression, classification, ranking, and user-defined prediction problems.
The prediction scores of the individual decision trees are summed to produce the final prediction; an important fact is that the trees try to complement each other. Mathematically, we can write the model in the form

$$\hat{y}_i = \sum_{k=1}^{K} f_k(x_i), \quad f_k \in \mathcal{F}$$

where $K$ is the number of trees, each $f_k$ is a function in the functional space $\mathcal{F}$, and $\mathcal{F}$ is the set of all possible CARTs. The objective function for the above model is given by

$$\mathrm{obj}(\theta) = \sum_{i=1}^{n} l\left(y_i, \hat{y}_i\right) + \sum_{k=1}^{K} \Omega\left(f_k\right)$$

where the first term is the loss function and the second is the regularization term.
LightGBM
LightGBM is a gradient-boosting framework based on decision trees, designed to increase the efficiency of the model and reduce memory usage.
It uses two novel techniques:
Gradient-based One-Side Sampling (GOSS)
Exclusive Feature Bundling (EFB)
These techniques address the limitations of the histogram-based algorithm that is primarily used in all GBDT (Gradient Boosting Decision Tree) frameworks. GOSS and EFB, described below, form the defining characteristics of the LightGBM algorithm. Together they make the model work efficiently and give it an edge over other GBDT frameworks.
Different data instances have varied roles in the computation of information gain. The
instances with larger gradients (i.e., under-trained instances) will contribute more to the
information gain. GOSS keeps those instances with large gradients (e.g., larger than a
predefined threshold, or among the top percentiles), and only randomly drops those
instances with small gradients to retain the accuracy of information gain estimation. This
treatment can lead to a more accurate gain estimation than uniformly random sampling,
with the same target sampling rate, especially when the value of information gain has a
large range.
High-dimensional data are usually very sparse which provides us the possibility of
designing a nearly lossless approach to reduce the number of features. Specifically, in a
sparse feature space, many features are mutually exclusive, i.e., they never take nonzero
values simultaneously. The exclusive features can be safely bundled into a single feature
(called an Exclusive Feature Bundle). Hence, the complexity of histogram building
changes from O(data × feature) to O(data × bundle), where bundle ≪ feature. Hence, the
speed of the training framework is improved without hurting accuracy.
IMPLEMENTATION AND TESTING
Implementation
A simple web GUI was developed with a file-chooser feature for uploading historical price data for prediction. To develop a working system, implementation was done in three phases. The first phase was the implementation of the prediction models along with the preprocessing of the price data. The second phase was the implementation of the backend API along with the database. Finally, the third phase was the implementation of the frontend, i.e. a website for the prediction system.
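As a rough illustration of the second phase, here is a minimal sketch of a Flask endpoint of the kind described (the /predict route, payload shape, and run_model helper are all assumptions; the conclusion notes that Flask was the framework used):

from flask import Flask, jsonify, request
import pandas as pd

app = Flask(__name__)

@app.route('/predict', methods=['POST'])
def predict():
    # Assumed contract: the frontend uploads a CSV of historical prices
    df = pd.read_csv(request.files['file'])
    closes = df['Close'].values
    forecasts = run_model(closes)  # hypothetical helper wrapping the trained LSTM model
    return jsonify({'predictions': [float(p) for p in forecasts]})

if __name__ == '__main__':
    app.run(debug=True)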
Tools Used
HTML
HTML was used to design the website for the prediction system. HTML gave the structure for the webpages and was used to format the text and images of the web pages. Formatting pages, creating hyperlinks, and building web forms are all done using HTML.
CSS
CSS was used to style the website. Arranging the images and text of the webpages in an organized manner, decorating HTML elements, and managing the layout are all done using CSS.
JavaScript
JavaScript was used to request data from the backend and to display it properly on the website. The AJAX principle was followed in order to consume the backend API's data. It was used not only to request data but also to manage the active user's session.
Evaluation and Testing
To evaluate the models' performance, we calculate the root mean squared error (RMSE)
between the actual and predicted stock prices. Lower RMSE values indicate better model
performance. We plot the actual and predicted values for each model to visualize their
predictions.
Finally, we plot the predictions made by the best LSTM model alongside the actual stock
prices over 100 days.
CONCLUSION AND RECOMMENDATION
Conclusion
In this report, the development of a system to predict stock closing prices from historical NEPSE data is documented. Historical data were used to train the models, and the resulting predictions are displayed to the user. The main goal of our project, predicting stock prices from historical data, was achieved by using LSTM and gradient boosting models. Using the Flask web framework of Python saved a lot of the time required to develop the backend, which left spare time to focus on the non-functional requirements too.
Future Enhancements
The system can be further enhanced by adding more features or more data and by optimizing the algorithms for better efficiency.
REFERENCES
Sonkavde, G., Dharrao, D. S., Bongale, A. M., Deokate, S. T., Doreswamy, D., & Bhat, S. K. (2023). Forecasting Stock Market Prices Using Machine Learning and Deep Learning Models: A Systematic Review, Performance Analysis and Discussion of Implications. International Journal of Financial Studies, 11(3), 94. https://doi.org/10.3390/ijfs11030094

Faruque, M. O., Strasser, T., Lauss, G., Marandi, V., Forsyth, P., Dufour, C., Dinavahi, V., Monti, A., Kotsampopoulos, P., Martinez, J., Strunz, K., Saeedifard, M., Wang, X., Shearer, D., Paolone, M., Brandl, R., Matar, M., Davoudi, A., & Iravani, R. (2015). Real-Time Simulation Technologies for Power Systems Design, Testing, and Analysis. IEEE Power and Energy Technology Systems Journal, 2, 63-73. https://doi.org/10.1109/JPETS.2015.2427370
APPENDIX
Code for model:
import pandas as pd
import numpy as np

# Load the stock data (file name assumed; the CSV is assumed newest-first, so it is reversed)
df = pd.read_csv('NEPSE_10years.csv')
close_prices_NEPSE_reverse = df['Close'][::-1]

# Reset index to maintain the correct time series order in the plot
close_prices_NEPSE_reverse.reset_index(drop=True, inplace=True)

# Data preprocessing
data = close_prices_NEPSE_reverse.values.reshape(-1, 1)  # Reshape the data
data_normalized = data / np.max(data)  # Normalize the data
learning_rates = [0.001, 0.01, 0.1]
epochs = 100
batch_size = 32
# Calculate RMSE
rmse = np.sqrt(mean_squared_error(test_data[1:], test_predictions))
svm_params = {
'C': [0.1, 1, 10],
'gamma': [0.01, 0.1, 1]
}
rf_params = {
'n_estimators': [50, 100, 200],
'max_depth': [None, 5, 10]
}
xgb_params = {
'learning_rate': [0.1, 0.01, 0.001],
'max_depth': [3, 5, 7]
}
lgbm_params = {
'learning_rate': [0.1, 0.01, 0.001],
'max_depth': [3, 5, 7]
}
plt.plot(model_prediction_truncated, label='Predicted')
plt.title(f"{model_name} - RMSE: {model_rmse:.2f}")
plt.xlabel('Time')
plt.ylabel('Stock Price')
plt.legend()
plt.show()
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVR
from sklearn.ensemble import RandomForestRegressor
from xgboost import XGBRegressor
from lightgbm import LGBMRegressor
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.optimizers import Adam
# 80/20 train-test split
train_size = int(len(data_normalized) * 0.8)
train_data = data_normalized[:train_size]
test_data = data_normalized[train_size:]
# Calculate the scaling factor based on the maximum value of the original data
scaling_factor = np.max(close_prices_NEPSE_reverse)
# Iteratively forecast future prices: each prediction is fed back in as the next input
def forecast_future(model, last_data_point, num_predictions):
    predictions = []
    for _ in range(num_predictions):
        # Predict the next time step
        prediction = model.predict(last_data_point.reshape(1, 1, 1), verbose=0)
        predictions.append(prediction[0, 0])
        # Update last_data_point to include the predicted value for the next iteration
        last_data_point = np.append(last_data_point[1:], prediction)
    return predictions

# Reconstructed glue code: forecast 100 days and rescale to actual prices
# (assumes best_lstm_model from the hyperparameter search above)
lstm_predictions = [p * scaling_factor
                    for p in forecast_future(best_lstm_model, data_normalized[-1:], 100)]
# Print the predicted stock prices for the next 100 days using LSTM
print("Predicted stock prices for the next 100 days:")
for i, prediction in enumerate(lstm_predictions, start=1):
print(f"Day {i}: {prediction:.2f}")
Output: the predicted closing prices for Day 1 through Day 100.