
World Happiness Report

December 22, 2020

1 Happiness Report
1.1 Introduction
The motivation to investigate this work comes from the desire to understand countries' characteristics with respect to a few factors, and hopefully to follow the examples of nations that have managed to foster happiness among their citizens. What can we infer about happiness when taking economy, generosity, or freedom into account? Can Linear Regression be used to predict the Happiness Score?
The dataset, covering the World Happiness Report for the years 2015 to 2019, is based on the World Happiness Report data on Kaggle (originally found here). The .csv files and the "ucsd" module created for this project can be found here.
[67]: import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
from numpy.random import random
import ucsd #module created in order to shorten the Jupyter Notebook
from math import sqrt
%matplotlib inline

[68]: #Importing the datasets
data2015 = pd.read_csv('~/Git/UCSanDiegoX/Week 9 and 10 - Final Project/Happiness/2015.csv', index_col = 'Happiness Rank')
data2016 = pd.read_csv('~/Git/UCSanDiegoX/Week 9 and 10 - Final Project/Happiness/2016.csv', index_col = 'Happiness Rank')
data2017 = pd.read_csv('~/Git/UCSanDiegoX/Week 9 and 10 - Final Project/Happiness/2017.csv', index_col = 'Happiness Rank')
data2018 = pd.read_csv('~/Git/UCSanDiegoX/Week 9 and 10 - Final Project/Happiness/2018.csv', index_col = 'Happiness Rank')
data2019 = pd.read_csv('~/Git/UCSanDiegoX/Week 9 and 10 - Final Project/Happiness/2019.csv', index_col = 'Happiness Rank')
regions = pd.read_csv('~/Git/UCSanDiegoX/Week 9 and 10 - Final Project/Happiness/regions.csv')

[69]: #Creating columns for those dataframes that do not have Year, Region and/or Colormap columns
data2015['Year'] = 2015
data2016['Year'] = 2016
data2017['Year'] = 2017
data2018['Year'] = 2018
data2019['Year'] = 2019

data2017['Region'] = 0
data2018['Region'] = 0
data2019['Region'] = 0

1.2 Intersection to find the common parameters:


[70]: commonParameters = set(data2015.head(0)).intersection(
set(data2016.head(0)),
set(data2017.head(0)),
set(data2018.head(0)),
set(data2019.head(0)))

1.3 Removing Columns


It is necessary to eliminate the columns on which no analysis will be performed.
[71]: #this function deletes the columns from dataframe1 that are not in dataframe2
def remove_column(dataframe1, dataframe2):
    for parameter in list(dataframe1.head(0)):
        if parameter not in dataframe2:
            del dataframe1[parameter]

remove_column(data2015,commonParameters)
remove_column(data2016,commonParameters)
remove_column(data2017,commonParameters)
remove_column(data2018,commonParameters)
remove_column(data2019,commonParameters)

1.4 Assigning a Region value to the country


For this, a Python module was created that fills in the Region value according to the Country (a sketch of such a helper is shown below).

Example:
target_dataframe.loc[target_dataframe['Country'] == 'Nepal', 'Region'] = 'Southern Asia'
assigns 'Southern Asia' to the Region field wherever the Country == 'Nepal' condition is met.
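The ucsd module itself is not reproduced in this report; a minimal sketch of what its setRegionToDataFrame helper might look like, assuming a country-to-region dictionary (the mapping shown is illustrative, not the full table), is:

# Hypothetical sketch of the ucsd helper; countryToRegion is an assumed
# dictionary built from the files that already carry a Region column.
countryToRegion = {'Nepal': 'Southern Asia', 'Norway': 'Western Europe'}

def setRegionToDataFrame(target_dataframe):
    #replace the placeholder Region value country by country
    for country, region in countryToRegion.items():
        target_dataframe.loc[target_dataframe['Country'] == country, 'Region'] = region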
[72]: ucsd.setRegionToDataFrame(data2017)
ucsd.setRegionToDataFrame(data2018)
ucsd.setRegionToDataFrame(data2019)

1.4.1 Defining plot function
The scatterplot function below takes the several attributes needed for a proper plot. Sometimes the dot radius is too small to be seen, so the slope and exponent factors are used to scale a desirable characteristic as a sort of third-dimension variable.
For example, suppose we want to investigate how the Economy GDP behaves in the Generosity x Happiness Score plot. We can see that, for the year 2015, Generosity does not directly influence the Happiness Score and, albeit considered an outlier here, the most generous country (Myanmar) has a low Happiness Score. On the other hand, we can clearly see how countries with lower GDP occupy lower positions on the plot. The bigger the dot size, the bigger the GDP per capita; the dot radius here is multiplied by a factor of 40 for the sake of visualization.
[73]: def scatterplot(dataframe, x, y, sizeVariable, slope=1, exponent=1, xmax=1.8, ymax=8):
    regions = set(dataframe['Region'])
    title = str(dataframe['Year'].iloc[0])
    i = 0
    if sizeVariable != 'None':
        for item in regions:
            xaxis = dataframe[x].loc[dataframe['Region'] == item]
            yaxis = dataframe[y].loc[dataframe['Region'] == item]
            #dot size scales with the chosen variable: slope * value**exponent
            plt.scatter(xaxis, yaxis,
                        s = slope*dataframe[sizeVariable].loc[dataframe['Region'] == item]**exponent,
                        label = list(regions)[i], alpha = 0.7)
            i += 1
    else:
        for item in regions:
            xaxis = dataframe[x].loc[dataframe['Region'] == item]
            yaxis = dataframe[y].loc[dataframe['Region'] == item]
            plt.scatter(xaxis, yaxis, label = list(regions)[i], alpha = 0.7)
            i += 1
    plt.xlabel(x)
    plt.ylabel(y)
    plt.title(title)
    plt.legend(loc = (1.05, 0))
    plt.axis([0, xmax, 0, ymax])
    plt.show()

[74]: #Plot 1
linearfactor = 40
exponentialfactor = 1
xmax = 1
scatterplot(data2015, 'Generosity', 'Happiness Score', 'Economy (GDP per Capita)', linearfactor, exponentialfactor, xmax)

The Economy (GDP per Capita) and Freedom (Plots 2 and 3) have more influence on the final happiness outcome than Generosity.
For the subsequent years, the plots behave similarly under the same analysis.
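One quick numerical check of this visual impression (an addition, not part of the original notebook) is to correlate each factor with the Happiness Score for 2015:

# Correlation of each factor with the Happiness Score (2015 data)
cols = ['Economy (GDP per Capita)', 'Freedom', 'Generosity', 'Happiness Score']
print(data2015[cols].corr()['Happiness Score'])

If the reading of the plots is correct, GDP per capita and Freedom should show noticeably stronger correlations than Generosity.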
[75]: #Plot 2
linearfactor = 1
exponentialfactor = 1
xmax = 1.8
scatterplot(data2015, 'Economy (GDP per Capita)', 'Happiness Score', 'None', linearfactor, exponentialfactor, xmax)

[76]: #Plot 3
linearfactor = 1
exponentialfactor = 1
xmax = 0.7
scatterplot(data2015, 'Freedom', 'Happiness Score', 'None', linearfactor, exponentialfactor, xmax)

1.5 Multiple Linear Regression for Happiness Score Prediction
[77]: #importing the Machine Learning libraries
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error
from sklearn.metrics import mean_squared_error

[80]: # Concatenating the datasets


df = pd.concat([data2015, data2016, data2017, data2018, data2019])

1.5.1 Filtering Outliers


Since Myanmar and Syria show peculiar behavior with regard to Generosity, it is appropriate to leave them out of this investigation and set them aside for future analysis.
[81]: # Filtering out the Myanmar and Syria as a Generosity outliers
df = df[df.Country != 'Myanmar']
df = df[df.Country != 'Syria']

1.5.2 Setting the data

[82]: #Setting the independent (x) and dependent (y) variables


y = df['Happiness Score']
x = df[['Economy (GDP per Capita)', 'Freedom', 'Generosity']]

[83]: #Splitting the training and testing data
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size = 0.33, random_state = 42)

[84]: # Training the model and predicting the response


linear_model = LinearRegression()
linear_model.fit(x_train,y_train)
y_pred = linear_model.predict(x_test)
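To see how the fitted model weighs each factor, one can inspect the learned coefficients (an addition to the original notebook; the three inputs live on broadly similar 0-to-2 scales, so the magnitudes are loosely comparable):

# Printing the fitted coefficients; a larger magnitude means a stronger
# influence of that factor on the predicted Happiness Score
for name, coef in zip(x.columns, linear_model.coef_):
    print(f'{name}: {coef:.3f}')
print(f'Intercept: {linear_model.intercept_:.3f}')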

[85]: y_test.describe()

[85]: count    255.000000
mean       5.442976
std        1.111163
min        2.839000
25%        4.610500
50%        5.472000
75%        6.215500
max        7.769000
Name: Happiness Score, dtype: float64

1.5.3 Root Mean Squared Error

[86]: #calculating the Root Mean Squared Error:


rmse = sqrt(mean_squared_error(y_true = y_test, y_pred=y_pred))
rmse

[86]: 0.567465700011897
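The Decision Tree comparison mentioned in the discussion below is not shown in this notebook; a minimal sketch of how it could be reproduced on the same split (an assumption, not the author's original code) is:

# Hypothetical baseline: a decision-tree regressor on the same train/test
# split, to compare its RMSE against the linear model's 0.567
from sklearn.tree import DecisionTreeRegressor

tree_model = DecisionTreeRegressor(random_state=42)
tree_model.fit(x_train, y_train)
sqrt(mean_squared_error(y_true=y_test, y_pred=tree_model.predict(x_test)))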

1.6 Results Discussions


We can see that, in this particular case, the multiple linear regression has a lower error, therefore giving better predictions than the Decision Tree model. An RMSE of about 0.57 is also roughly half the standard deviation of the test-set scores (1.11), so the model captures a substantial part of the variation.
We see that Freedom and Economy (GDP per Capita) play a major role in this particular analysis and, in fact, the 2018 and 2019 data suggest that the world is getting less generous as the years pass by.
The limitations of this work lie in the simplification of the huge number of possible variables that would be needed to obtain, perhaps, an accurate result for happiness prediction. Also, a much broader set of years would be necessary to investigate the impact of global crises such as wars, economic collapses, or pandemics. And, although real-world data is messy and noisy, a standardization of the measured variables through the years would also be of great value.
