Data Quality (Course: Data Management)
Data Quality
Data must be able to meet specified business requirements for:

Accuracy
The stored data value is correct and objective, and comes from a reputable source.

Reliability
Data definitions are important to data quality; the data must be usable as a basis for decision making.

Completeness
All required data items are included.

Accessibility
Data items are easily obtainable and legal to access, with strong protections and controls built into the process.

Timeliness
The data is up-to-date and available within a useful time period.
Impact of Data Quality on Companies & Organizations
Economic
Revenues, Costs, Profits
Brand
Reputation, Customer Loyalty, Trust
Law
Regulation
Completeness
The proportion of stored data against the potential of "100% complete".

Uniqueness
No thing will be recorded more than once, based upon how that thing is identified.

Consistency
The absence of difference when comparing two or more representations of a thing against a definition.

Timeliness
The degree to which data represent reality from the required point in time.

Accuracy
The degree to which data correctly describe the object or event being described.

Validity
Data are valid if they conform to the syntax of their definition.
Completeness
Related dimensions: Validity, Accuracy
Measured against business rules which define what "100% complete" represents.
A measure of the absence of blank (null or empty string) values or the presence of non-blank values.
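The measure above (absence of blank values) can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name `completeness`, the sample column, and the choice to treat whitespace-only strings as blank are all assumptions.

```python
def completeness(values):
    """Return the percentage of values that are not blank.

    Assumption: "blank" means None, an empty string, or whitespace only.
    """
    if not values:
        return 0.0
    non_blank = sum(1 for v in values if v is not None and str(v).strip() != "")
    return 100.0 * non_blank / len(values)


# Hypothetical column with 10 entries, one of them blank
emails = ["a@x.id", "b@x.id", "", "c@x.id", "d@x.id",
          "e@x.id", "f@x.id", "g@x.id", "h@x.id", "i@x.id"]
print(completeness(emails))  # 90.0
```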
Consistency
Related dimensions: Validity, Accuracy, Uniqueness
A data item is measured against itself or its counterpart in another data set or database.
Analysis of pattern and/or value frequency.
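One way to carry out the pattern and value-frequency analysis mentioned above is to reduce each value to a coarse pattern and count how often each pattern occurs. The sketch below is only an illustration: the helper names `pattern_of` and `pattern_frequency`, the `9`/`A` pattern alphabet, and the sample dates are assumptions.

```python
import re
from collections import Counter


def pattern_of(value):
    """Map a value to a coarse pattern: digits become 9, letters become A."""
    digits_masked = re.sub(r"\d", "9", value)
    return re.sub(r"[A-Za-z]", "A", digits_masked)


def pattern_frequency(values):
    """Count how many values share each pattern."""
    return Counter(pattern_of(v) for v in values)


# Hypothetical date column with one value in a different format
dates = ["2023-01-15", "2023-02-01", "15/03/2023", "2023-04-20"]
freq = pattern_frequency(dates)
print(freq.most_common())  # [('9999-99-99', 3), ('99/99/9999', 1)]
```

Values that fall outside the dominant pattern are candidates for a consistency problem and can then be checked by hand.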
Accuracy
Related dimension: Validity
Use 3rd-party reference data from sources which are deemed trustworthy and of the same chronology.
Applies to any objects that may be characterised or described by data, held as data item, record, data set or database.
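Comparing stored values against trusted reference data, as described above, might look like the following sketch. The function name `accuracy`, the record keys, and the city names are hypothetical; records with no counterpart in the reference data are simply skipped here, which is one possible design choice among several.

```python
def accuracy(records, reference):
    """Percentage of records whose value matches the trusted reference value.

    Records without a counterpart in the reference data are not scored.
    """
    checked = [key for key in records if key in reference]
    if not checked:
        return 0.0
    matches = sum(1 for key in checked if records[key] == reference[key])
    return 100.0 * matches / len(checked)


# Hypothetical stored data and trusted 3rd-party reference data
stored = {"ID001": "Jakarta", "ID002": "Bandung", "ID003": "Surabya"}
trusted = {"ID001": "Jakarta", "ID002": "Bandung", "ID003": "Surabaya"}
print(round(accuracy(stored, trusted), 1))  # 66.7
```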
Validity
Related dimensions: Accuracy, Completeness, Consistency, Uniqueness
Measured against database, metadata or documentation rules as to the allowable types, format, and range.
All data can typically be measured for Validity. Validity applies at the data item level and record level.
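The rules on allowable types, format, and range can be expressed as a small rule table and checked at both the data item level and the record level. The sketch below is an assumption-laden example: the field names `student_id` and `score`, the 8-digit format, and the 0 to 100 range are invented for illustration.

```python
import re

# Hypothetical rules: allowable type, format, and range per data item
RULES = {
    "student_id": {"type": str, "format": r"\d{8}"},
    "score": {"type": int, "range": (0, 100)},
}


def is_valid(field, value):
    """Check one data item against its type, format, and range rules."""
    rule = RULES[field]
    if not isinstance(value, rule["type"]):
        return False
    if "format" in rule and not re.fullmatch(rule["format"], value):
        return False
    if "range" in rule:
        low, high = rule["range"]
        if not low <= value <= high:
            return False
    return True


def record_is_valid(record):
    """A record is valid only if every ruled data item in it is valid."""
    return all(is_valid(field, record.get(field)) for field in RULES)


print(record_is_valid({"student_id": "20231234", "score": 85}))  # True
print(record_is_valid({"student_id": "2023", "score": 85}))      # False
```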
Timeliness
Related dimension: Accuracy
Measured against the time at which the event being recorded occurred.
Measured as a time difference.
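The time difference above can be computed directly with Python's standard `datetime` module. In this sketch the function names `recording_delay` and `is_timely`, the sample timestamps, and the 24-hour threshold are assumptions chosen only for illustration.

```python
from datetime import datetime, timedelta


def recording_delay(event_time, recorded_time):
    """Time difference between when something occurred and when it was recorded."""
    return recorded_time - event_time


def is_timely(event_time, recorded_time, max_delay):
    """True if the data was recorded within the allowed delay."""
    return recording_delay(event_time, recorded_time) <= max_delay


# Hypothetical event recorded six and a half hours after it occurred
occurred = datetime(2024, 3, 1, 9, 0)
recorded = datetime(2024, 3, 1, 15, 30)
print(recording_delay(occurred, recorded))                 # 6:30:00
print(is_timely(occurred, recorded, timedelta(hours=24)))  # True
```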
Uniqueness
Related dimension: Consistency
A data item is measured against itself or its counterpart in another data set or database.
Measured against all records within a single data set.
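Measuring against all records within a single data set, as described above, usually comes down to counting how often each identifier occurs. The following is a minimal sketch under stated assumptions: the identifier field `nim` and the student records are hypothetical, and uniqueness is taken to be the share of distinct identifiers among all records.

```python
from collections import Counter


def duplicate_keys(records, key):
    """Return the identifiers that appear in more than one record."""
    counts = Counter(r[key] for r in records)
    return sorted(k for k, n in counts.items() if n > 1)


def uniqueness(records, key):
    """Percentage of distinct identifiers among all records."""
    if not records:
        return 0.0
    distinct = len({r[key] for r in records})
    return 100.0 * distinct / len(records)


# Hypothetical student records; "002" was recorded twice
students = [
    {"nim": "001", "name": "Ani"},
    {"nim": "002", "name": "Budi"},
    {"nim": "002", "name": "Budi S."},
    {"nim": "003", "name": "Citra"},
]
print(duplicate_keys(students, "nim"))  # ['002']
print(uniqueness(students, "nim"))      # 75.0
```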
Category | Results
Number of records | 10
Number of unique values | 9
Number of blanks | 1

Dimension | Rule | Passed | Failed | Score (%)
Uniqueness | All data are properly identified and recorded only once | 9 | 1 | 90

Dimension | Rule | Passed | Failed | Score (%) | Action
Completeness | All data must be entered, no blanks allowed | 9 | 0 | 100 | Fill in the data that is still empty
Consistency | All data must be represented consistently | 9 | 0 | 100 | All data must be consistent
Accuracy | All data accurately represent the "real world" values | 9 | 0 | 100 | All data must match real world values
Validity | All data conforms to the syntax of its definitions | 9 | 0 | 100 | All data must match definition & syntax
Timeliness | All data represents reality from the required point of time | 9 | 0 | 100 | All data must match the specified time standards
Uniqueness | All data are properly identified and recorded only once | 9 | 0 | 100 | All data must be recorded once
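The percentages in the rows above are consistent with a simple pass rate, assuming the three numbers in each row are the count of records that satisfy the rule, the count that do not, and the resulting score. That reading is an inference from the arithmetic (9 of 10 gives 90, 9 of 9 gives 100), not something the slides state, and the function name `score` is hypothetical.

```python
def score(passed, failed):
    """Pass rate in percent: passed / (passed + failed) * 100."""
    total = passed + failed
    return 100.0 * passed / total if total else 0.0


print(score(9, 1))  # 90.0
print(score(9, 0))  # 100.0
```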