
Data Quality (Course: Data Management)

This document discusses data quality and provides definitions, dimensions, and processes related to data quality. It defines data quality as planning, implementing, and controlling activities to apply quality management techniques to data to ensure it is fit for use and meets consumer needs. It identifies six core data quality dimensions: completeness, uniqueness, consistency, accuracy, validity, and timeliness. It also discusses the impact of data quality on companies and organizations and outlines the key steps in a data quality process.

Uploaded by

Ranto Aja

Data Quality

Course: Data Management


Group 4
- 2011600851 : Ahmad Bardosono
- 2011600778 : Lani Asep Sutisna
- 2011600901 : Putra Tegar Nugraha
Definition
“Data Quality is the planning, implementation, and control of activities that apply quality management
techniques to data, in order to assure it is fit for consumption and meets the needs of data consumers.”
- Data Management Body of Knowledge (DMBOK)

Data Quality Presentation 01


Data Quality
is Part of a Large Enterprise Landscape.
Successful Data Quality Improvement Requires Many Inter-related Disciplines

- Master Data Management
- Business Intelligence
- Big Data
- Data Warehouse
- Data Architecture
- Data Quality

Data Quality
must be able to meet specified business requirements for:

Accuracy
The stored data value is correct, objective, and comes from a reputable source.

Reliability
Data definitions are important to data quality, and the data must be usable as a basis for decision making.

Completeness
All required data items are included.

Accessibility
Data items are easily obtainable and legal to access, with strong protections and controls built into the process.

Timeliness
A concept of data quality that involves whether the data is up to date and available within a useful time period.

Impact of Data Quality on Companies & Organizations

Economic
Revenues, Costs, Profits

Brand
Reputation, Customer Loyalty, Trust

Law
Regulation



SIX CORE DATA QUALITY DIMENSIONS

Completeness
The proportion of stored data against the potential of "100% complete".

Uniqueness
No thing will be recorded more than once based upon how that thing is identified.

Consistency
The absence of difference, when comparing two or more representations of a thing against a definition.

Timeliness
The degree to which data represent reality from the required point in time.

Accuracy
The degree to which data correctly describe the object or event being described.

Validity
Data are valid if they conform to the syntax (format, type, range) of their definition.



Completeness
The proportion of stored data against the potential of "100% complete".

Reference
Business rules which define what "100% complete" represents.

Measure
A measure of the absence of blank (null or empty string) values or the presence of non-blank values.

Related Dimension
Validity, Accuracy
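The measure above can be sketched in a few lines of Python. This is a minimal illustration, not a method given in the slides: the function name `completeness` is hypothetical, the score is assumed to be the share of non-blank values, and the sample values mirror the address column of the student table shown later in this deck.

```python
def completeness(values):
    """Return the percentage of values that are not blank.

    A value counts as blank when it is None or an empty/whitespace-only
    string, following the "null or empty string" wording of the measure.
    """
    if not values:
        return 0.0
    non_blank = [v for v in values if v is not None and str(v).strip() != ""]
    return 100.0 * len(non_blank) / len(values)


# Address column of the ten sample records; one entry is empty.
addresses = ["Depok", "Jakarta", "Bekasi", "Bekasi", "Jakarta",
             "Tangerang", "", "Depok", "Jakarta", "Jakarta"]

print(completeness(addresses))  # 90.0
```

Nine of the ten addresses are filled in, which agrees with the Completeness score of 90 reported in the assessment table.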



Consistency
The absence of difference, when comparing two or more representations of a thing against a definition.

Reference
Data item measured against itself or its counterpart in another data set or database.

Measure
Analysis of pattern and/or value frequency.

Related Dimension
Validity, Accuracy, Uniqueness
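One possible way to analyse pattern frequency is to reduce each value to a "shape" and count how often each shape occurs. The sketch below is only an illustration: the helper names `shape` and `pattern_frequencies` are hypothetical, and the third date is an invented value added to show an inconsistent representation.

```python
from collections import Counter


def shape(value):
    """Map digits to '9' and letters to 'A', keeping other characters."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A")
        else:
            out.append(ch)
    return "".join(out)


def pattern_frequencies(values):
    """Count how many values share each shape."""
    return Counter(shape(v) for v in values)


# The first two dates follow dd-mm-yyyy; the last one is a made-up outlier.
dates = ["07-06-1987", "12-03-1990", "1990/03/12"]
freq = pattern_frequencies(dates)

print(freq["99-99-9999"])  # 2
print(freq["9999/99/99"])  # 1
```

A shape that occurs rarely compared with the dominant one is a candidate consistency issue to review.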



Accuracy
The degree to which data correctly describe the object or event being described.

Reference
Use 3rd party reference data from sources which are deemed trustworthy and of the same chronology.

Measure
Any objects that may be characterised or described by data, held as data item, record, data set or database.

Related Dimension
Validity
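Comparing stored values with trusted reference data could look like the sketch below. It rests on assumptions: the function name `accuracy` is hypothetical, and the `reference` dictionary simply reuses the corrected birth dates from the issue-resolution table later in this deck as a stand-in for a trustworthy third-party source.

```python
def accuracy(stored, reference):
    """Percentage of stored values that equal the reference value.

    Only keys present in both mappings are checked, because a value
    without a reference counterpart cannot be judged either way.
    """
    checked = [key for key in stored if key in reference]
    if not checked:
        return 0.0
    matches = sum(1 for key in checked if stored[key] == reference[key])
    return 100.0 * matches / len(checked)


# Birth dates keyed by student ID (dd-mm-yyyy).
stored = {"2011600851": "30-02-1980", "2011600893": "22-01-1991"}
reference = {"2011600851": "30-03-1980", "2011600893": "22-01-1991"}

print(accuracy(stored, reference))  # 50.0
```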



Validity
Data are valid if they conform to the syntax (format, type, range) of their definition.

Reference
Database, metadata or documentation rules as to the allowable types, format, and range.

Measure
All data can typically be measured for Validity. Validity applies at the data item level and record level.

Related Dimension
Completeness, Consistency, Accuracy, Uniqueness
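Format, type, and range checks can be sketched with the Python standard library. The rules below are assumptions chosen to match the sample table in this deck (dates in dd-mm-yyyy, gender coded as P or L); the function names and the semester bounds of 1 to 8 are hypothetical.

```python
import re
from datetime import datetime

DATE_PATTERN = re.compile(r"\d{2}-\d{2}-\d{4}")


def is_valid_date(text):
    """Format check (dd-mm-yyyy) plus a calendar check on the date."""
    if not isinstance(text, str) or not DATE_PATTERN.fullmatch(text):
        return False
    try:
        datetime.strptime(text, "%d-%m-%Y")
    except ValueError:
        return False  # e.g. 30 February does not exist
    return True


def is_valid_gender(code):
    """Allowed-values check: P (female) or L (male)."""
    return code in {"P", "L"}


def is_valid_semester(text, low=1, high=8):
    """Type and range check; the bounds are illustrative assumptions."""
    try:
        number = int(text)
    except (TypeError, ValueError):
        return False
    return low <= number <= high


print(is_valid_date("07-06-1987"))  # True
print(is_valid_date("30-02-1980"))  # False
print(is_valid_gender("L"))         # True
```

The second date appears in the sample table and fails because February never has 30 days, so a syntax rule alone already flags it.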



Timeliness
The degree to which data represent reality from the required point in time.

Reference
The time at which the event being recorded occurred.

Measure
Time difference

Related Dimension
Accuracy
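The "time difference" measure can be illustrated as the lag between when an event happened and when it was recorded. The function names, the sample dates, and the seven-day limit below are all hypothetical; the slides do not state a threshold.

```python
from datetime import date


def lag_in_days(event_date, recorded_date):
    """Days elapsed between the event and the moment it was recorded."""
    return (recorded_date - event_date).days


def is_timely(event_date, recorded_date, max_lag_days):
    """True when the record was captured within the allowed lag.

    A negative lag (recorded before the event) is treated as not timely.
    """
    lag = lag_in_days(event_date, recorded_date)
    return 0 <= lag <= max_lag_days


# Hypothetical example: registration happened on 18 January 2021 and
# was entered into the system two days later.
event = date(2021, 1, 18)
recorded = date(2021, 1, 20)

print(lag_in_days(event, recorded))   # 2
print(is_timely(event, recorded, 7))  # True
```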



Uniqueness
No thing will be recorded more than once based upon how that thing is identified.

Reference
Data item measured against itself or its counterpart in another data set or database.

Measure
Measured against all records within a single data set.

Related Dimension
Consistency
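A uniqueness check against all records in a single data set can be sketched as follows. The function names are hypothetical, and the identifier is assumed to be the student ID; the values are the ID column of the sample table shown later in this deck.

```python
from collections import Counter


def duplicated_keys(keys):
    """Return the identifiers that occur more than once, sorted."""
    counts = Counter(keys)
    return sorted(key for key, n in counts.items() if n > 1)


def uniqueness(keys):
    """Percentage of records that carry a distinct identifier."""
    if not keys:
        return 0.0
    return 100.0 * len(set(keys)) / len(keys)


ids = ["2011600621", "2011600778", "2011600794", "2011600851",
       "2011600893", "2011600901", "2011600968", "2011601008",
       "2011601016", "2011600778"]

print(duplicated_keys(ids))  # ['2011600778']
print(uniqueness(ids))       # 90.0
```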



DATA QUALITY PROCESS

Define DQ Requirements
- Perform data profiling to help discover value frequencies or formats of the data.
- Data profiling can be performed using a specialized tool or query languages supported by the data source.
- Although some data quality problems can be discovered during data profiling, its purpose is to give insight for the data quality assessment.

Conduct DQ Assessment
- Define data quality rules and quality thresholds.
- Perform the data quality assessment by enforcing the data quality rules on the existing data set.
- Identify data quality issues and update the issue log.

Resolve DQ Issues
- For data quality issues identified during the assessment, conduct "root cause analysis" to determine each issue's root cause.
- Resolve issues by eliminating the root cause.
- Review data policies and procedures if necessary.

Monitor and Control
- Define and populate Data Quality .
- Monitor Data Quality



Data Profiling
ID          Full Name                      Gender (P/L)  Date of Birth  Address    Semester  Registration Date
2011600621  Erian Tasa                     L             07-06-1987     Depok      2         13-02-2021
2011600778  Lani Asep Sutisna              L             12-03-1990     Jakarta    1         18-01-2021
2011600794  Ahmad Haris Kurniawan          L             19-02-1989     Bekasi     1         02-02-2021
2011600851  A Bardosono                    L             30-02-1980     Bekasi     1         28-01-2021
2011600893  Hambali Achmad                 L             22-01-1991     Jakarta    1         05-02-2021
2011600901  Putra Tegar N                  P             28-11-1992     Tangerang  1         08-01-2021
2011600968  Desti Destiansari Istinabiyah  P             04-09-1994     (blank)    1         22-01-2021
2011601008  Adi Wirawan                    L             12-12-1973     Depok      1         06-03-2021
2011601016  Dinda Claudia                  P             16-05-1993     Jakarta    1         15-01-2021
2011600778  Lani Asep Sutisna              L             03-12-1990     Jakarta    1         18-01-2021

Gender codes: P = female (perempuan), L = male (laki-laki). Dates are in dd-mm-yyyy format.

Category                 Results
Number of records        10
Number of unique values  9
Number of blanks         1
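The three profiling figures above can be reproduced with a short script. This is a sketch under assumptions: the slide does not say which column "unique values" refers to, so the ID column is assumed here (it gives 9), and the names `ROWS` and `profile` are hypothetical.

```python
from collections import Counter

# Rows copied from the sample table. Columns: id, full name, gender,
# birth date, address, semester, registration date.
ROWS = [
    ("2011600621", "Erian Tasa", "L", "07-06-1987", "Depok", "2", "13-02-2021"),
    ("2011600778", "Lani Asep Sutisna", "L", "12-03-1990", "Jakarta", "1", "18-01-2021"),
    ("2011600794", "Ahmad Haris Kurniawan", "L", "19-02-1989", "Bekasi", "1", "02-02-2021"),
    ("2011600851", "A Bardosono", "L", "30-02-1980", "Bekasi", "1", "28-01-2021"),
    ("2011600893", "Hambali Achmad", "L", "22-01-1991", "Jakarta", "1", "05-02-2021"),
    ("2011600901", "Putra Tegar N", "P", "28-11-1992", "Tangerang", "1", "08-01-2021"),
    ("2011600968", "Desti Destiansari Istinabiyah", "P", "04-09-1994", "", "1", "22-01-2021"),
    ("2011601008", "Adi Wirawan", "L", "12-12-1973", "Depok", "1", "06-03-2021"),
    ("2011601016", "Dinda Claudia", "P", "16-05-1993", "Jakarta", "1", "15-01-2021"),
    ("2011600778", "Lani Asep Sutisna", "L", "03-12-1990", "Jakarta", "1", "18-01-2021"),
]


def profile(rows, key_index=0):
    """Count records, distinct key values, and blank fields."""
    blanks = sum(1 for row in rows for field in row if field.strip() == "")
    return {
        "records": len(rows),
        "unique_keys": len({row[key_index] for row in rows}),
        "blanks": blanks,
    }


summary = profile(ROWS)
print(summary)  # {'records': 10, 'unique_keys': 9, 'blanks': 1}

# Value frequency of the address column, another common profiling output.
address_frequency = Counter(row[4] for row in ROWS)
print(address_frequency["Jakarta"])  # 4
```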



Data Assessment
ID          Full Name                      Gender (P/L)  Date of Birth  Address    Semester  Registration Date
2011600621  Erian Tasa                     L             07-06-1987     Depok      2         13-02-2021
2011600778  Lani Asep Sutisna              L             12-03-1990     Jakarta    1         18-01-2021
2011600794  Ahmad Haris Kurniawan          L             19-02-1989     Bekasi     1         02-02-2021
2011600851  A Bardosono                    L             30-02-1980     Bekasi     1         28-01-2021
2011600893  Hambali Achmad                 L             22-01-1991     Jakarta    1         05-02-2021
2011600901  Putra Tegar N                  P             28-11-1992     Tangerang  1         08-01-2021
2011600968  Desti Destiansari Istinabiyah  P             04-09-1994     (blank)    1         22-01-2021
2011601008  Adi Wirawan                    L             12-12-1973     Depok      1         06-03-2021
2011601016  Dinda Claudia                  P             16-05-1993     Jakarta    1         15-01-2021
2011600778  Lani Asep Sutisna              L             03-12-1990     Jakarta    1         18-01-2021

Gender codes: P = female (perempuan), L = male (laki-laki). Dates are in dd-mm-yyyy format.

DQ Dimension  Rule                                                        Passed  Failed  Score
Completeness  All data must be entered; no blanks allowed                 9       1       90
Consistency   All data must be represented consistently                   9       1       90
Accuracy      All data accurately represent the "real world" values       8       2       80
Validity      All data conform to the syntax of their definitions         8       2       80
Timeliness    All data represent reality from the required point in time  9       1       90
Uniqueness    All data are properly identified and recorded only once     9       1       90
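The scores in the table are consistent with dividing the passed count by the total number of records and multiplying by 100, although the slide does not state the formula. The sketch below applies that assumed formula to the table's own counts; the threshold of 90 is a hypothetical value added only to show how a quality threshold could be enforced.

```python
def score(passed, failed):
    """Percentage of checked records that passed a rule."""
    total = passed + failed
    if total == 0:
        return 0.0
    return 100.0 * passed / total


# (passed, failed) counts per dimension, taken from the assessment table.
results = {
    "Completeness": (9, 1),
    "Consistency": (9, 1),
    "Accuracy": (8, 2),
    "Validity": (8, 2),
    "Timeliness": (9, 1),
    "Uniqueness": (9, 1),
}

scores = {dim: score(p, f) for dim, (p, f) in results.items()}

THRESHOLD = 90.0  # hypothetical quality threshold, not from the slides
below_threshold = sorted(dim for dim, s in scores.items() if s < THRESHOLD)

print(scores["Completeness"])  # 90.0
print(scores["Accuracy"])      # 80.0
print(below_threshold)         # ['Accuracy', 'Validity']
```

Dimensions that fall below the chosen threshold are the natural candidates for the issue log and for root cause analysis in the next step of the process.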



Data Assessment Results
ID          Full Name                      Gender (P/L)  Date of Birth  Address    Semester  Registration Date
2011600621  Erian Tasa                     L             07-06-1987     Depok      2         13-02-2021
2011600778  Lani Asep Sutisna              L             12-03-1990     Jakarta    1         18-01-2021
2011600794  Ahmad Haris Kurniawan          L             19-02-1989     Bekasi     1         02-02-2021
2011600851  A Bardosono                    L             30-02-1980     Bekasi     1         28-01-2021
2011600893  Hambali Achmad                 L             22-01-1991     Jakarta    1         05-02-2021
2011600901  Putra Tegar N                  P             28-11-1992     Tangerang  1         08-01-2021
2011600968  Desti Destiansari Istinabiyah  P             04-09-1994     (blank)    1         22-01-2021
2011601008  Adi Wirawan                    L             12-12-1973     Depok      1         06-03-2021
2011601016  Dinda Claudia                  P             16-05-1993     Jakarta    1         15-01-2021
2011600778  Lani Asep Sutisna              L             03-12-1990     Jakarta    1         18-01-2021

Gender codes: P = female (perempuan), L = male (laki-laki). Dates are in dd-mm-yyyy format.

DQ Dimension  Rule                                                        Passed  Failed  Score
Completeness  All data must be entered; no blanks allowed                 9       1       90
Consistency   All data must be represented consistently                   9       1       90
Accuracy      All data accurately represent the "real world" values       8       2       80
Validity      All data conform to the syntax of their definitions         8       2       80
Timeliness    All data represent reality from the required point in time  9       1       90
Uniqueness    All data are properly identified and recorded only once     9       1       90



Data Issue Resolution
ID          Full Name                      Gender (P/L)  Date of Birth  Address    Semester  Registration Date
2011600621  Erian Tasa                     L             07-06-1987     Depok      1         13-02-2021
2011600778  Lani Asep Sutisna              L             12-03-1990     Jakarta    1         18-01-2021
2011600794  Ahmad Haris Kurniawan          L             19-02-1989     Bekasi     1         02-02-2021
2011600851  Ahmad Bardosono                L             30-03-1980     Bekasi     1         28-01-2021
2011600893  Hambali Achmad                 L             22-01-1991     Jakarta    1         05-02-2021
2011600901  Putra Tegar Nugraha            L             28-11-1992     Tangerang  1         08-01-2021
2011600968  Desti Destiansari Istinabiyah  P             04-09-1994     Jakarta    1         22-01-2021
2011601008  Adi Wirawan                    L             12-12-1973     Depok      1         06-02-2021
2011601016  Dinda Claudia                  P             16-05-1993     Jakarta    1         15-01-2021

Gender codes: P = female (perempuan), L = male (laki-laki). Dates are in dd-mm-yyyy format.

DQ Dimension  Rule                                                        Passed  Failed  Score  Corrective Measure
Completeness  All data must be entered; no blanks allowed                 9       0       100    Fill in the data that is still empty
Consistency   All data must be represented consistently                   9       0       100    All data must be consistent
Accuracy      All data accurately represent the "real world" values       9       0       100    All data must match real-world values
Validity      All data conform to the syntax of their definitions         9       0       100    All data must match their definition and syntax
Timeliness    All data represent reality from the required point in time  9       0       100    All data must match the specified time standards
Uniqueness    All data are properly identified and recorded only once     9       0       100    All data must be recorded only once



The Advantages of Data Quality Management

- Reliable Strategic Decision Making
- Reduced Costs and Risks
- Timely Information Analysis
- Increased Productivity
- New Opportunities
- Better Customer Service



Conclusion
When Data Quality is included as part of corporate data management, supported by a corporate data
governance program with active metadata management efforts, and carried out according to the practices
recommended by data quality experts, Data Quality Management can provide lasting benefits to any
organization. The most important benefit that data quality offers is "offering the right data for the right
purpose".



Thank You
