01 DM BI Intro
01 DM BI Intro
and
Business Intelligence
Introduction
Created/Adopted/Modified for
Data Mining and Business Intelligence – MCA II Semester
Vidya Vikas Institute of Engineering & Technology
Mysore
2023-24
GPD
Why Data Mining?
1. Data Generation:
Generation Raw data is generated through various sources
such as transactions, sensors, social media, or customer interactions.
It can be structured, semi-structured, or unstructured.
2. Data Collection:
Collection Capturing data in its original form without any
significant modifications.
3. Data Storage:
Storage The data is stored in databases, data lakes, or other
storage systems, in a secure and organized manner to ensure easy
retrieval and efficient processing. This can involve data warehouses,
data marts, or cloud-based storage solutions.
Life Cycle of Data
From Raw Data to Valuable Information
The journey from raw data to valuable information involves the lifecycle of data.
4. Data Pre-processing:
Pre-processing Data undergoes cleaning, transformation,
and integration processes. Data quality is improved, inconsistencies
are resolved, and relevant attributes are selected for analysis.
5. Data Analysis:
Analysis In this stage, various analytical techniques such as
statistical analysis,
analysis data mining,
mining machine learning,
learning and visualization
are applied to extract insights and patterns from the processed data.
6. Data Interpretation:
Interpretation The analyzed data is interpreted to derive
meaningful information and actionable insights. This step involves
identifying trends, correlations, and relationships within the data.
Life Cycle of Data
From Raw Data to Valuable Information
The journey from raw data to valuable information involves the lifecycle of data.