0% found this document useful (0 votes)
52 views1 page

DIY Project - Data Mining and Analytics2

Uploaded by

priyadevanaga
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
52 views1 page

DIY Project - Data Mining and Analytics2

Uploaded by

priyadevanaga
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 1

DIY Project for Data Mining and Analytics - DIY 2

This DIY project is to work on the world's most sensitive situation that has occurred and analysis of
data using Data Mining approaches to develop a model is absolutely necessary. This is based on the
Corona Virus pandemic and its threat.

You are provided with 2 real life data-sets in the format of CSV files, called : covid_19_data.csv and
covid19_line_list_data_modified.

The fields of covid_19_data.csv are as follows:

1. SNo 2. ObservationDate 3.Province/State 4.Country/Region 5. Last Update 6. Confirmed 7. Deaths


8.Recovered

[ All the fields are self explanatory from the name]

The fields of covid19_line_list_data_modified are as follows:

1. id 2.case_in_country 3. reporting date 4. summary 5. location 6. country 7. gender 8. age

9. symptom_onset 10.If_onset_approximated 11.hosp_visit_date 12.exposure_start 13.exposure_end

14. visiting Wuhan 15.from Wuhan 16. death 17.recovered 18.symptom

[ All the fields are self explanatory from the name]

You are advised to perform the following:

1) Clean, filter and Load data as necessary for analysis.

2) Develop appropriate models using Clustering techniques.

3) Use Data Analysis and mining techniques to develop solutions to queries as given below:

i) Which is the highest affected area and what is the number. Group from the model, the second
highest affected area along with number.

ii) What is the mortality Vs. recovery ratio.

iii) Is there any general tendency towards particular age, gender or random?

iv) What is the mortality rate among different age groups?

4) Develop a simple User Interface including all the queries and processes above to make it a
functional system.

Use Python based libraries for data mining and analytics project development.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy