Literature Survey
Literature Survey
Machine Learning:
LITERATURE SURVEY:
Weather conditions around the world change rapidly and continuously. Correct forecasts are
essential in today’s daily life. From agriculture to industry, from traveling to daily
commuting, we are dependent on weather forecasts heavily. As the entire world is suffering
from the continuous climate change and its side effects, it is very important to predict the
weather without any error to ensure easy and seamless mobility, as well as safe day to day
operations. The current weather prediction models heavily depend on complex physical
models and need to be run on large computer systems involving hundreds of HPC nodes. The
computational power of these large systems is required to solve the models that describe the
atmosphere. Despite using these costly and complex devices, there are often inaccurate
forecasts because of incorrect initial measurements of the conditions or an incomplete
understanding of atmospheric processes. Moreover, it generally takes a long time to solve
complex models like these.
As weather systems can travel a long way over time in all directions, the weather of
one place depends on that of others considerably. In this work, we propose a method to utilize
surrounding city’s historical weather data along with a particular city’s data to predict its
weather condition. We combine these data and use it to train simple machine learning
models, which in turn, can predict correct weather conditions for the next few days
BACKGROUNDS:
This section briefly presents how machine learning can be used in weather forecasting and
the related works in the literature on this fast-growing research topic
Methodology:
In this project, we aim to use ML techniques to predict the temperature and humidity of the
next day at any particular hour based on the weather data of the current day. First, we
combine the weather observations at a particular timestamp from all the cities that we
consider to create a single record. That is, each record in the data will contain the
temperature, humidity, wind direction, atmospheric pressure, condition, and etc. data of all
the cities. The target variable for this record is assigned as the temperature at the same
timestamp of the next day. Thus, we predict the temperature and humidity of the next day
given the weather observation of the current day.
Algorithms:
• SVM: A Support Vector Machine (SVM) is a discriminative classifier formally
defined by a separating hyperplane. In other words, given labelled training data
(supervised learning), the algorithm outputs an optimal hyperplane which categorizes
new examples. In two-dimensional space this hyperplane is a line dividing a plane in
two parts where in each class lay in either side.
• ANN: Artificial Neural Networks are a special type of machine learning algorithms
that are modelled after the human brain. That is, just like how the neurons in our
nervous system are able to learn from the past data, similarly, the ANN is able to learn
from the data and provide responses in the form of predictions or classifications. ANN
is nonlinear statistical model which display a complex relationship between the inputs
and outputs to discover a new pattern.
Libraries:
• Pandas: Pandas is a Python library used for working with data sets. It has functions
for analysing, cleaning, exploring, and manipulating data. Pandas allows us to
analyse big data and make conclusions based on statistical theories.
• NumPy: NumPy is a Python library used for working with arrays. It also has functions
for working in domain of linear algebra, Fourier transform, and matrices. NumPy
aims to provide an array object that is up to 50x faster than traditional Python lists.
• Sklearn: Scikit-learn (Sklearn) is the most useful and robust library for machine
learning in Python. It provides a selection of efficient tools for machine learning and
statistical modelling including classification, regression, clustering and dimensionality
reduction via a consistence interface in Python.
Platforms:
• Python
• Jupyter Notebook