List of Topic For ML Project
List of Topic For ML Project
If you wish to explore the climate data for a different year, you can use
the GHCN_data_preprocessing.ipynb notebook to download and perform the
preprocessing described above. Please be advised that depending on the
dataset size for a given year, GHCN_data_preprocessing.ipynb may not run on
DataHub. We will not be providing infrastructural support for running the
notebook, but you are welcome to run it on a different machine you have
access to or ask a GSI to dump the data for you.
The data contains only the (latitude, longitude) coordinates for the weather
stations. To map the coordinates to geographical locations, the reverse-
geocoder package mentioned in the References section might be helpful.
You can access all the data within the Topic 3/Dataset A directory on Google
Drive. It includes the following reports:
kepler_exoplanet_search.csv contains data collected by NASA from the
Kepler Space Observatory as part of a long-term study on finding
habitable exoplanets from over 10,000 candidates. (source)
kelper_planetary_system_composite.csv contains data collected by NASA
from the Kelper Space Observatory as part of an ongoing study that
tabulates all confirmed planetary systems outside the solar system.
You are encouraged to use the composite data in conjunction with the
exoplanet search results above. (source)
nasa_neows.csv contains data collected from NASA’s NeoWs (Near Earth
Object Web Service) that collects information on near earth asteroids.
You can access all the data within the Topic 3/Dataset B directory on Google
Drive. It includes the following reports: