0% found this document useful (0 votes)
437 views20 pages

Internship Report Anthony and Joshil PDF

This internship report summarizes a project on price prediction of real estate. It describes the implementation of machine learning algorithms like random forest regression to predict real estate prices based on attributes of properties. The report includes sections on system design with functional and non-functional requirements, implementation details of the random forest algorithm, the methodology used to collect and analyze data, results of the predictions, and conclusions. The goal is to build an accurate model that can help predict property prices and be a useful tool for real estate companies.

Uploaded by

Akhil
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
437 views20 pages

Internship Report Anthony and Joshil PDF

This internship report summarizes a project on price prediction of real estate. It describes the implementation of machine learning algorithms like random forest regression to predict real estate prices based on attributes of properties. The report includes sections on system design with functional and non-functional requirements, implementation details of the random forest algorithm, the methodology used to collect and analyze data, results of the predictions, and conclusions. The goal is to build an accurate model that can help predict property prices and be a useful tool for real estate companies.

Uploaded by

Akhil
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

INTERNSHIP PROJECT REPORT

ON

“PRICE PREDICTION OF REAL ESTATE”

SUBMITTED BY

Anthony Lasrado
(4SO18CS067)
Joshil Fernandes
(4SO18CS058)

AT

ZEPHYR TECHNOLOGIES

2nd FLOOR,OBERLE TOWERS


BALMATTA,MANGALURU
KARNATAKA-575002, INDIA
ACKNOWLEGEMENT

This dissertation would not have been possible without the guidance and the help of several
individuals and organizations who in one way or another contributed and extended their
valuable assistance during this internship project.

I would like to express our gratitude to employees of Zephyr technologies & Solutions Pvt.
Ltd for providing this internship opportunity under whom I executed this project. Their
constant guidance and willingness to share their vast knowledge made us understand this
project and its manifestations in great depths and helped me to complete the assigned tasks.

I would like to extend my special thanks to Mr. Vedanth Shenoy, Faculty at Zephyr
technologies for his constant guidance throughout this internship.

Finally, I would like to thank my family and friends for their blessings, and for helping me in
all aspects and appreciating me to spend all the time in the work during my internship period
and lending their helping hand in successfully completing the project.

Anthony Lasrado
(4SO18CS067)
Joshil Fernandes
(4SO18CS058)
ABOUT THE COMPANY

ZEPHYR TECHNOLOGIES & SOLUTIONS PVT. LTD is a software company delivering


high quality, cost effective, reliable result-oriented web and e-commerce solutions on time for
a global clientele. Professionalism, Skill and Expertise are the tools we use to make the web
work for your business bringing in maximum return on your investment in shortest possible
time. We have delivered on IT projects of varying complexities for their very demanding and
internet clients spread across the globe. They develop unique web solutions which ensures
increased efficiency and competitive advantage for your business and thus to your end users.

Their tools are professionalism, skills and expertise that translate into delivering quality work
at every step for any project we undertake. They work towards getting better than the best out
of every team member at ZEPHYR TECHNOLOGIES, which means when you hire them all
round quality is assured off as you want it. Their Advantage Quality includes protection of
intellectual for the source codes developed specifically for your business. They do not sell the
source codes to the third parties and all elements that they create for your web solution belongs
to you. ZEPHYR TECHNOLOGIES project managers and business analysts place great
value for building a clean communication link with you as they consider it the key ingredient
for the success of any project at hand.
ABSTRACT

Real estate involves the purchase, sale, and development of land, residential and
non-residential buildings.The main players in the real estate market are the
landlords, developers, builders, real estate agents, tenants, buyers etc. The
activities of the real estate sector encompass the housing and construction
sectors also.

The real estate sector in India has assumed growing importance with the
liberalization of the economy. The consequent increase in business
opportunities and migration of the labour force has, in turn, increased the
demand for commercial and housing space, especially rental housing.

Developments in the real estate sector are being influenced by the developments
in the retail, hospitality and entertainment [e.g., hotels, resorts, cinema theatres)
industries, economic services (e.g., hospitals, schools) and information
technology (IT)-enabled services (like call centres) etc. and vice versa.

The real estate sector is a major employment driver, being the second largest
employer next only to agriculture. This is because of the chain of backward and
forward linkages that the sector has with the other sectors of the economy,
specially with the housing and construction sector. About 250 ancillary
industries such as cement, steel, brick, timber, building materials are dependent
on the real estate industry.

The Indian real estate markets, as compared to the other more developed Asian
and Western markets is characterized by smaller size, lower availability of good
quality space and higher prices.
PRICE PREDICTION OF REAL ESTATE

TABLE OF CONTENTS
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

2. System Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2.1 Functional Requirements . . . . . . . . . . . . . . . . . . . . . . . 7

2.2 Non-Functional Requirements . . . . . . . . . . . . . . . . . . . 7

2.3 System Requirements . . . . . . . . . . . . . . . . . . . . . . . . . 7

3. Implementation Details . . . . . . . . . . . . . . . . . . . . . . . . . 8

4. Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

5. Results . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . . 17

6. Conclusion . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . 19

5
PRICE PREDICTION OF REAL ESTATE

CHAPTER 1
INTRODUCTION

The real estate sector is the second largest employer after agriculture and
experts have stated that the sector is poised to grow around 20 percent over the
next decade. The real estate sector comprises four sub sectors - housing, retail,
hospitality, and commercial. For the past decades, the high growth of the sector
is matched by the growth of the corporate environment, since there is a demand
for office space as well as urban and semi-urban accommodations. The
construction industry in India ranks third among the 14 primary sectors in terms
of direct, indirect and induced effects in the economy. It is expected that real
sector will incur more non-resident Indian (NRI) investments
We will understand the problem of a real estate company from its CEO and then
apply ML to solve it . In this project I will be walking you through analyzing the
problem from collecting data, importing it to a Jupyter notebook, looking for
promising attributes, finding out correlations, plotting graphs, creating a pipeline,
dealing with missing values.

6
PRICE PREDICTION OF REAL ESTATE

CHAPTER 2
SYSTEM DESIGN

2.1 Functional Requirements


• The model should be able to give accurate and trustworthy predictions.
• The application must show graphical visualization of the predicted results and data in
general.

• The user should be able to enter the input values for prediction.

2.2 Non-Functional Requirements


Non-Functional Requirements will describe how a system should behave and what limits are
constrained on its functionality. It generally specifies the system’s quality attributes or
characteristics.

• Availability: The system should be available to any transaction verification system.


• Correctness: The accuracy of the system should be as maximum as possible for better
prediction.

• Maintainability: The system should maintain correct history of records.


• Usability: The system should satisfy a maximum number of banking system needs.

2.3 System Requirements


• Software requirements
• Languages: Python
• Operating Systems: Windows, Linux, etc.
• Back End Software: Anaconda, Jupyter notebook.
• Hardware Requirements:
• CPU: Intel Pentium IV 600MHz
• Hard disk space: 20 GB or more
• Memory: 4 GB RAM
7
PRICE PREDICTION OF REAL ESTATE

CHAPTER 3
IMPLEMENTATION DETAILS

Algorithms
Ability of system to automatically learn and improve from experience without being
explicitly programmed is called machine learning and it focuses on the development of
computer programs that can access data and use it to learn by themselves. And classifier can
be stated as an algorithm that is used to implement classification especially in concrete
implementation, it also refers to a mathematical function implemented by algorithm that will
map input data into category. It is an instance of supervised learning i.e., where training set
of correctly identified observations is available.

A. Random forest regression:

The Decision Tree is an easily understood and interpreted algorithm and


hence a single tree may not be enough for the model to learn the features
from it. On the other hand, Random Forest is also a “Tree”-based algorithm
that uses the qualities features of multiple Decision Trees for making
decisions.

Therefore, it can be referred to as a ‘Forest’ of trees and hence the name


“Random Forest”. The term ‘Random’ is due to the fact that this algorithm is
a forest of ‘Randomly created Decision Trees’.

8
PRICE PREDICTION OF REAL ESTATE

The Decision Tree algorithm has a major disadvantage in that it causes over-
fitting. This problem can be limited by implementing the Random Forest
Regression in place of the Decision Tree Regression. Additionally, the
Random Forest algorithm is also very fast and robust than other regression
models.

Random Forest Algorithm (Source)

To summarize in short, The Random Forest Algorithm merges the output of


multiple Decision Trees to generate the final output.

9
PRICE PREDICTION OF REAL ESTATE

Problem Analysis

The same data set that was used for the Decision Tree Regression is utilized
in this where we have one independent variable Temperature and one
independent variable Revenue which we have to predict. In this problem, we
have to build a Random Forest Regression Model which will study the
correlation between the Temperature and Revenue of the Ice Cream Shop and
predict the revenue for the ice cream shop based on the temperature on a
particular day.

Step 1: Importing the libraries

As usual, the NumPy, matplotlib and the Pandas libraries are imported.
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

Step 2: Importing the dataset

The data set is imported using the function ‘pd.read_csv’ from my github
repository. In this, we assign the independent variable (X) to the
‘Temperature’ column and the dependent variable (y) to the
‘Revenue’ column.
dataset = pd.read_csv('https://raw.githubusercontent.com/mk-
gurucharan/Regression/master/IceCreamData.csv')X =
dataset['Temperature'].values
y = dataset['Revenue'].valuesdataset.head(5)>>Temperature Revenue
24.566884 534.799028
26.005191 625.190122
27.790554 660.632289
20.595335 487.706960
11.503498 316.240194

10
PRICE PREDICTION OF REAL ESTATE

Step 3: Splitting the dataset into the Training set and Test set

Similar to the Decision Tree Regression Model, we will split the data set, we
use test_size=0.05 which means that 5% of 500 data rows (25 rows) will only
be used as test set and the remaining 475 rows will be used as training set for
building the Random Forest Regression Model.
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.05)

Step 4: Training the Random Forest Regression model on the training set

In this step, to train the model, we import the RandomForestRegressor class


and assign it to the variable regressor. We then use the .fit() function to fit the
X_train and y_train values to the regressor by reshaping it accordingly.
# Fitting Random Forest Regression to the dataset
from sklearn.ensemble import RandomForestRegressor
regressor = RandomForestRegressor(n_estimators = 10, random_state = 0)
regressor.fit(X_train.reshape(-1,1), y_train.reshape(-1,1))

Step 5: Predicting the Results

In this step, we predict the results of the test set with the model trained on the
training set values using the regressor.predict function and assign it to
‘y_pred’.
y_pred = regressor.predict(X_test.reshape(-1,1))

Step 6: Comparing the Real Values with Predicted Values

In this step, we shall compare and display the values of y_test as ‘Real
Values’ and y_pred as ‘Predicted Values’ in a Pandas dataframe.
df = pd.DataFrame({'Real Values':y_test.reshape(-1), 'Predicted
Values':y_pred.reshape(-1)})
df>>
Real Values Predicted Values

11
PRICE PREDICTION OF REAL ESTATE

534.622865 510.602018
542.608070 558.764770
618.457277 653.356430
460.402500 449.302331
759.377432 728.037404
631.318237 649.712332
572.672047 583.685756
494.627437 503.075097
250.131728 239.372956
594.651009 635.653662
383.956240 384.531416
491.230603 503.075097
875.019348 933.984685
273.073342 224.659296
500.925064 498.355934
191.623312 193.223331
691.516541 726.817925
421.621505 420.997198
636.298374 653.945550
321.848273 276.772845
283.679657 275.805778
608.936345 589.542982
212.591740 239.372956
594.804871 541.164031
500.065779 524.649546

From the above values, we infer that the model is able to predict the values of
the y_test with a good accuracy though it can be improved by tuning the
hyper-parameters such as n_estimators and max_depth. I leave it to you all to
play with those parameters and improve the accuracy of the Random Forest
Regression Model.

Step 7: Visualizing the Random Forest Regression Results

12
PRICE PREDICTION OF REAL ESTATE

Temperature vs Revenue (Random Forest Regression)

In this graph, the Real values are plotted with “Red” color and the Predicted
values are plotted with “Green” color. The plot of the Decision Tree
Regression model is also drawn in “Black” color.

13
PRICE PREDICTION OF REAL ESTATE

CHAPTER 4
METHODOLODY

Step 1: Importing Libraries.

 Pandas numpy
 Scikit learn
 Matpolotlip.pyplot

Step 2: Data Collection.

14
PRICE PREDICTION OF REAL ESTATE

Step 3: Data Preprocessing.

15
PRICE PREDICTION OF REAL ESTATE

16
PRICE PREDICTION OF REAL ESTATE

Step 4: Experimenting and Trying to get the best accuracy. (In our Project, we have used
RandomForestRegressor method). gives the best accuracy.

Step 5: Training and Evaluating the Machine Learning Model.

Step 6: Interpreting the ML Model.

Step 7: Building Predictive System and evaluating.

17
PRICE PREDICTION OF REAL ESTATE

CHAPTER 5

RESULTS
Algorithm evaluation:

Using Cross validation:

Using the Model:

18
PRICE PREDICTION OF REAL ESTATE

Saving the model and testing the model on test data:

CONCLUSION

19
PRICE PREDICTION OF REAL ESTATE

In this project, we tried predicting the realestate price using the various parameters that
were provided in the data about the real estate .On implementation, the prediction results show
the correlation among different attributes considered. Multiple instances, parameters and
various factors can be used to make this sales prediction more innovative and successful.
Accuracy, which plays a key role in prediction-based systems, can be significantly increased
as the number of parameters used are increased. The project can be further collaborated in a
web-based application or in any device supported with an in-built intelligence by virtue of
Internet of Things (IoT), to be more feasible for use. There is a further need of experiments for
proper measurements of both accuracy and resource efficiency to assess and optimize correctly
.

20

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy