Cardio Good Fitness Project
The document provides context and sample data for a data analysis project for a cardio fitness company. The objectives are to generate customer profiles, perform univariate and multivariate analyses, and provide insights. The data contains the variables product, age, gender, education, marital status, usage, fitness, income, and miles. Based on the data, Product TM195 appears to be popular among both male and female customers with lower income and lower expected mileage. Miles shows strong positive correlations with usage and with fitness, and a regression analysis models miles as a linear function of usage and fitness.
Project 1: Data Analysis
Objectives
• Preliminary Data Analysis
• Generate customer profile (characteristics of a customer) of the different products
• Perform univariate and multivariate analyses
• Generate a set of insights and recommendations that will help the company in targeting new customers

Context
Data contains the following variables:
• Product - the model no. of the treadmill
• Age - in no. of years, of the customer
• Gender - of the customer
• Education - in no. of years, of the customer
• Marital Status - of the customer
• Usage - Avg. # times the customer wants to use the treadmill every week
• Fitness - Self-rated fitness score of the customer (5 - very fit, 1 - very unfit)
• Income - of the customer
• Miles - expected to run

Context (Sample Data)

   Product  Age  Gender  Education  Marital Status  Usage  Fitness  Income  Miles
0  TM195     18  Male           14  Single              3        4   29562    112
1  TM195     19  Male           15  Single              2        3   31836     75
2  TM195     19  Female         14  Partnered           4        3   30699     66
3  TM195     19  Male           12  Single              3        3   32973     85
4  TM195     20  Male           13  Partnered           4        2   35247     47
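As a minimal sketch of the univariate step, the five sample rows above can be summarized with the Python standard library. The key name `MaritalStatus` and the helper `summarize` are illustrative choices, not names from the original project, and five rows are far too few to draw conclusions from.

```python
from statistics import mean, median

# Column names follow the variable list above; "MaritalStatus" is an
# illustrative key name, not necessarily the one used in the project.
COLUMNS = ["Product", "Age", "Gender", "Education", "MaritalStatus",
           "Usage", "Fitness", "Income", "Miles"]

# The five sample rows shown above, copied as-is.
SAMPLE_ROWS = [
    ("TM195", 18, "Male", 14, "Single", 3, 4, 29562, 112),
    ("TM195", 19, "Male", 15, "Single", 2, 3, 31836, 75),
    ("TM195", 19, "Female", 14, "Partnered", 4, 3, 30699, 66),
    ("TM195", 19, "Male", 12, "Single", 3, 3, 32973, 85),
    ("TM195", 20, "Male", 13, "Partnered", 4, 2, 35247, 47),
]

records = [dict(zip(COLUMNS, row)) for row in SAMPLE_ROWS]


def summarize(rows, column):
    """Return min, median, mean and max of one numeric column."""
    values = [row[column] for row in rows]
    return {
        "min": min(values),
        "median": median(values),
        "mean": mean(values),
        "max": max(values),
    }


for column in ("Age", "Income", "Miles"):
    print(column, summarize(records, column))
```

On these five rows the mean age is 19 and the mean expected mileage is 77. The same function would apply unchanged to the full data set once it is loaded into the same list-of-dicts shape.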
Frequency Distributions
[Figures: Histograms; Histograms by Gender; Box Plots]

Product Analysis

Income Comparisons to Products

                 Marital Status
Product  Gender  Partnered      Single
TM195    Female  46153.777778   45742.384615
TM195    Male    50028.000000   43265.842105
TM498    Female  49724.800000   48920.357143
TM498    Male    49378.285714   47071.800000
TM798    Female  84972.250000   58516.000000
TM798    Male    81431.368421   68216.428571
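The income figures above look like group averages by product, gender, and marital status. That reading is an assumption, since the slides do not name the aggregation. Under it, the grouping step might be sketched as follows on the five sample rows only; `group_means` is a hypothetical helper, and the full data set would be needed to reproduce the numbers in the table.

```python
from collections import defaultdict
from statistics import mean

# (Product, Gender, Marital Status, Income) taken from the five sample rows.
SAMPLE_ROWS = [
    ("TM195", "Male", "Single", 29562),
    ("TM195", "Male", "Single", 31836),
    ("TM195", "Female", "Partnered", 30699),
    ("TM195", "Male", "Single", 32973),
    ("TM195", "Male", "Partnered", 35247),
]


def group_means(rows):
    """Average the last field of each row per (product, gender, status)."""
    groups = defaultdict(list)
    for product, gender, marital_status, value in rows:
        groups[(product, gender, marital_status)].append(value)
    return {key: mean(values) for key, values in groups.items()}


income_means = group_means(SAMPLE_ROWS)
for key in sorted(income_means):
    print(key, round(income_means[key], 2))
```

Passing Miles instead of Income as the last field would give the corresponding mileage comparison.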
Miles Comparisons to Products

                 Marital Status
Product  Gender  Partnered    Single
TM195    Female   74.925926    78.846154
TM195    Male     80.190476    99.526316
TM498    Female   94.000000    80.214286
TM498    Male     87.238095    91.100000
TM798    Female  215.000000   133.333333
TM798    Male    176.315789   147.571429

Based on the above data, Product TM195 is in demand among both males and females, for low-income and low-mileage customers.

[Figure: Pair Plots for Various Data Attributes]

Correlations
Note: There is a strong positive correlation between Miles and Usage (0.76), and between Miles and Fitness (0.79).

           Age       Education  Usage     Fitness   Income    Miles
Age        1.000000  0.280496   0.015064  0.061105  0.513414  0.036618
Education  0.280496  1.000000   0.395155  0.410581  0.625827  0.307284
Usage      0.015064  0.395155   1.000000  0.668606  0.519537  0.759130
Fitness    0.061105  0.410581   0.668606  1.000000  0.535005  0.785702
Income     0.513414  0.625827   0.519537  0.535005  1.000000  0.543473
Miles      0.036618  0.307284   0.759130  0.785702  0.543473  1.000000

Regression Analysis
Miles Predicted = -56.74 + 20.21*Usage + 27.20*Fitness
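To get a feel for the fitted equation, it can be evaluated against the five sample rows. This is only an illustrative check: the function name `predict_miles` is hypothetical, the coefficients are copied from the line above, and the Usage, Fitness and Miles values come from the sample data.

```python
def predict_miles(usage, fitness):
    """Evaluate the regression equation reported above."""
    return -56.74 + 20.21 * usage + 27.20 * fitness


# (Usage, Fitness, actual Miles) from the five sample rows.
SAMPLE = [
    (3, 4, 112),
    (2, 3, 75),
    (4, 3, 66),
    (3, 3, 85),
    (4, 2, 47),
]

for usage, fitness, actual in SAMPLE:
    predicted = predict_miles(usage, fitness)
    residual = actual - predicted
    print(f"Usage={usage} Fitness={fitness} "
          f"predicted={predicted:.2f} actual={actual} residual={residual:+.2f}")
```

For sample rows 0 and 3 the prediction lands within one mile of the stated value (112.69 vs 112, and 85.49 vs 85), while row 2 is over-predicted by about 40 miles (105.70 vs 66), so the equation is best read as an approximate trend rather than a per-customer forecast.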