Data Processing and Data Analysis - 104910 1
Data Processing and Data Analysis - 104910 1
analysis
Data processing
• Data processing occurs when data is collected and translated
into usable information.
• Six stages of data processing
• 1. Data collection
• Collecting data is the first step in data processing. Data is
pulled from available sources, including data lakes and data
warehouses. It is important that the data sources available are
trustworthy and well-built so the data collected is of the
highest possible quality.
• 2. Data preparation.
• the data is collected, it then enters the data preparation
stage. Data preparation, often referred to as “pre-
processing” is the stage at which raw data is cleaned up and
organized for the following stage of data processing.
• During preparation, raw data is diligently checked for any
errors. The purpose of this step is to eliminate bad data (
redundant, incomplete, or incorrect data) and begin to
create high-quality data for the best business intelligence.
• 3. Data input
• The clean data is then entered into its destination, and
translated into a language that it can understand. Data input
is the first stage in which raw data begins to take the form of
usable information.
• 4. Processing
• During this stage, the data inputted to the computer in the
previous stage is actually processed for interpretation. though
the process itself may vary slightly depending on the source
of data being processed (data lakes, social networks,
connected devices etc.) and its intended use.
• 5. Data output/interpretation
• The output/interpretation stage is the stage at which data is
finally usable to non-data scientists. It is translated, readable,
and often in the form of graphs, videos, images, plain text,
etc.).
• 6. Data storage
• The final stage of data processing is storage. After all of the
data is processed, it is then stored for future use. When data
is properly stored, it can be quickly and easily accessed by
members of the organization when needed.
Data Analysis in reseach
• Research data analysis is a process used by
researchers to reduce data to a story and interpret it
to derive insights. The data analysis process helps
reduce a large chunk of data into smaller fragments,
which makes sense.
Why we analyze data in research?
• Researchers rely heavily on data as they have a story to
tell or research problems to solve. It starts with a
question, and data is nothing but an answer to that
question.
• But, what if there is no question to ask? Well! It is
possible to explore data even without a problem – we call
it ‘Data Mining’, which often reveals some interesting
patterns within the data that are worth exploring.