Unit-1 DM
Unit-1 DM
Business Intelligence
Why reporting and Analysing data, Raw data to valuable information Lifecycle
of Data – What is Business Intelligence – BI and DW in today’s perspective –
What is data warehousing – The building Blocks: Defining Features – Data
warehouses and data Imarts – Overview of the components – Metadata
warehousing – Trends in data warehousing.
* Data reporting: Gathering data into one place and presenting it in visual
representations.
Respondents mentioned these six main differences between data analysis and
reporting:
1. Reqiured skills
2. Order Of Operation
4. Ease of Automation
5. Impact on Strategy
6. Data Context
1. Required Skills
2. Order of Operations
Since data reporting organizes and visualizes data, it must happen before you can
perform data analysis. Even if you decide to complete only your data analysis in-
house, you’ll need to have some form of data to work with, such as an external
report. Stevens explains how analysis follows reporting. “Reports don’t give
conclusions, but a proper analysis of the question raised in the reports will. Data
analysis provides answers to the why, and also gives the way forward.”
After you get a finished data report or analysis, someone will need to implement that
information. Depending on your business structure, it can take longer to implement
one than the other. So, when you consider the time you’ll need to take performing
and implementing data analysis and reporting, keep your industry, work style and
team structure in mind.
4. Ease of Automation
Since data analysis requires a human touch, it’s harder to automate than data
reporting. On the other hand, analysis’s need for human involvement lends it its
strengths. How do these concepts apply to data? “Close reading” data works the
same way. It transforms cold information into warm insight and actionable
intelligence.
5. Impact on Strategy
Most businesses collect data to inform some kind of strategy, whether for customer
retention, finances, lead generation, or another aspect of the business. Most of the
survey respondents who contributed their opinions agree that analysis has a much
bigger role to play in building strategies than reporting. Data reports give you a look
into your organization’s current performance. Meanwhile, analysis turns that data
into actionable insights to guide your future actions.
6.Data Context
While reporting provides data without context so you can draw your own
conclusions, analysis makes those conclusions for you to deliver context.
Raw data is data collected from one or multiple sources that remains in its
unaltered initial state. At this point, the data might contain human, machine, or
instrumental errors—depending on the collection method. Collecting raw data is
Defining Goals
Define the information you want to extract to lay the groundwork for your raw
data-gathering goals. For example, if the desired data is user-base and customer
information, online and in-person surveys focused on a specific age and
geographical demographic can be used to gather it.
Other types of raw data may require advance planning. For instance, collecting
data from log records would require having a monitoring system in place for
anywhere from a few weeks to a year to collect data before being able to pull it.
Choosing the appropriate raw data collection method can reduce the percentage
of human or machine errors you’d have to scrub out when cleaning a raw
database. Generally, electronic collecting methods tend to result in lower error
rates—manual collection can introduce variables that leave room for
interpretation, such as illegible handwriting or hard-to-understand accents in
audio or video recordings.
Collecting Data
Raw data tends to be large in volume and highly complex. During the collection
process, the overall volume of data is only an estimate—once you process the
data by cleaning it of errors and invalid data points, you’ll have a more accurate
sense of scope.
1. The first task BI has to do is to gather the necessary data about the business.
The key to this is automating the process. Gathering data was very time and
money consuming in the past, but todays with the usage of modern computers,
it’s much easier to collect data from various sources.
2. The second task is to analyse the collected data and then further extract
information from it. The extracted information is then transformed into
knowledge.
3. The final task is to use the newly gathered knowledge to improve the business.
There are many business intelligence tools to complete the process of gathering
knowledge.
Data Marts
As corporate-wide data warehouses came into use, it was discovered that in
many situations full-blown data warehouse was overkill for applications. Data
marts evolved to solve this problem. A data mart is a special type of a data
warehouse. It is focused on a single subject (or functional area), such as Sales,
Finance, or Marketing. Whereas data warehouses have an enterprise- wide
depth, the information in data marts pertains to a single department. The primary
use for a data mart is Business Intelligence (BI) applications. Implementing a
data mart can be less expensive than implementing a data warehouse, thus
making it more practical for the small business.
Types of Data Mart
There are three common types of data marts:
• Independent Data Mart
• Dependent Data Mart
• Hybrid Data Mart
1. Independent Data Mart
An independent data mart is created and maintained separately from the data
warehouse. It is created to satisfy the particular needs of a specific business unit
or department. Independent data marts are typically smaller in size and more
rapidly and readily set up. They offer flexibility and agility since they are not
constrained by the challenges of the centralized data warehouse. Nevertheless,
data redundancy and inconsistency may result if it is replicated over several
different data marts.
2. Dependent Data Mart
A dependent data mart is generated right out of a data warehouse. It takes some
of the data from the data warehouse and arranges it to meet the needs of a
specific industry. Dependent data marts, which profit from the data integration,
data quality, and consistency provided by the data warehouse, allow for the
centralization and preservation of all data in a single source of truth. They are
often developed to serve particular reporting and analytical needs, and they are
frequently updated from the data warehouse. Dependent data marts offer data
consistency and prevent data duplication because they rely on the data
warehouse as their main source of data.
3. Hybrid Data Mart
Both independent and dependent data mart components can be found in a hybrid
data mart. As well as combining additional data sources particular to a given
business unit or department, it makes use of the centralized data warehouse for
the integration and consistency of the core data. By offering flexibility and
agility for department-specific needs while keeping the integrity and
consistency of shared data from the data warehouse, hybrid data marts offer the
benefits of both strategies. This strategy creates a balance between localized
data management and centralized control.
Types of Metadata:
There are many types of metadata that can be used to describe different aspects
of data, such as its content, format, structure, and provenance. Some common
types of metadata inclu de:
1. Descriptive metadata: This type of metadata provides information
about the content, structure, and format of data, and may include
elements such as title, author, subject, and keywords. Descriptive
metadata helps to identify and describe the content of data and can be
used to improve the discoverability of data through search engines and
other tools.
2. Administrative metadata: This type of metadata provides
information about the management and technical characteristics of
data, and may include elements such as file format, size, and creation
date. Administrative metadata helps to manage and maintain data over
time and can be used to support data governance and preservation.
3. Structural metadata: This type of metadata provides information
about the relationships and organization of data, and may include
elements such as links, tables of contents, and indices. Structural
metadata helps to organize and connect data and can be used to
facilitate the navigation and discovery of data.
4. Provenance metadata: This type of metadata provides information
about the history and origin of data, and may include elements such as
the creator, date of creation, and sources of data. Provenance metadata
helps to provide context and credibility to data and can be used to
support data governance and preservation.
5. Rights metadata: This type of metadata provides information about
the ownership, licensing, and access controls of data, and may include
elements such as copyright, permissions, and terms of use. Rights
metadata helps to manage and protect the intellectual property rights
of data and can be used to support data governance and compliance.
6. Educational metadata: This type of metadata provides information
about the educational value and learning objectives of data, and may
include elements such as learning outcomes, educational levels, and
competencies. Educational metadata can be used to support the
discovery and use of educational resources, and to support the design
and evaluation of learning environments.
Benefits of Metadata :-
Consolidating your data storage sources into a single service as much as possible
can reduce such costs. Following this data warehousing trend is guaranteed to
optimise operations in your marketing agency, or any other large agency.
Cloud data centres operate with energy efficiencies well above industry averages.
In 2021, companies in the EU reported an 80% energy consumption
reduction following the migration of their enterprise data to SaaS storage
providers.
With trend and pattern analysis capabilities improving by leaps and bounds,
businesses that integrate AI into their data warehousing solutions can reduce their
operational costs and perform data operations more efficiently.
For these reasons, in-database analytics have become one of the latest data
warehousing trends, especially amongst marketing agencies.
Data compression reduces the number of bits (binary digits) necessary to store
data. It works by creating reference libraries for the 1s and 0s of binary data and
then replacing longer strings with shorter reference tags.
Compressing your data frees up storage capacity, accelerates data transfers, and
reduces overall storage costs. That’s why it’s been one of the most popular data
warehousing trends for a while, and it’s likely to stick around.
#9. Analytics on Demand
In a SaaS-heavy work environment, users may be extracting data from
warehouses through dozens of different applications. Analytics on demand has
thus become a trend in marketing agencies.