Data Science
Data Science
• The third step is Clean and Process Data. After the data is
collected from multiple sources, it is time to clean the data. Clean data
means data that is free from misspellings, redundancies, and irrelevance.
Clean data largely depends on data integrity. There might be duplicate data
or the data might not be in a format, therefore the unnecessary data is
removed and cleaned. There are different functions provided by SQL and
Excel to clean the data. This is one of the most important steps in Data
Analysis as clean and formatted data helps in finding trends and solutions.
The most important part of the Process phase is to check whether your data
is biased or not.
Analyzing The Data: