DAMA - Navigating-The-Labyrinth 5db75d9f03100 e
DAMA - Navigating-The-Labyrinth 5db75d9f03100 e
Navigating the
Labyrinth
By Laura Sebastian-Coleman
Technics Publications
2 Lindsley Road
Basking Ridge, NJ 07920 USA
https://www.TechnicsPub.com
Foreword ______________________________________ 1
Introduction ____________________________________ 3
Sue Geuens
President, DAMA International
You’ve sensed it, you’ve read it, you’ve seen it. Reliable, well-
managed data is critical to organizational success in the twenty-
first century. Whatever sector you work in – financial services,
health care, insurance, manufacturing, technology, retail,
education, and beyond – your organization requires data to
transact business and serve customers. This data not only fuels
your business processes; it also provides you the business
intelligence needed to measure organizational success. As
importantly, the data your organization produces can be mined
for insights about current operations—insights you can apply to
improve your processes and advance your organization’s
strategy.
The Importance of
Managing Data
DATA IS EVERYWHERE
Organizations have always needed to manage their data, but
advances in technology have expanded the scope of this
management need. Data is pervasive across organizations.
Almost every business process – from setting up customers, to
transacting purchases, to contacting customers for feedback and
services – uses data as input and produces data as output. Most
of this data is in electronic form, which means that it is
malleable: it can be stored in large quantities, manipulated,
integrated, and aggregated for different uses, including business
intelligence and predictive analytics. It also provides evidence of
an organization’s compliance (or lack of compliance) with laws
and regulations.
DATA AS AN ASSET
An asset is an economic resource, that can be owned or
controlled, and that holds or produces value. Assets are often
thought of as property, but with the strong implication that they
can be converted to money. Data is widely recognized as an
enterprise asset, although many organizations still struggle to
manage data as an asset. For example, data is not yet accounted
for in most organizations’ balance sheets.
that very few organizations treat their data as an asset. 1 For many
it can even be a liability. Failure to manage data is similar to
failure to manage capital. It results in waste and lost opportunity.
Poorly-managed data presents ethical as well as security risks.
2
The Data Governance Institute. https://bit.ly/1ef0tnb.
o Data Architecture
o Data Modeling
o Building and managing data warehouses and
marts
o Integrating data for use by business intelligence
analysts and data scientists
o Managing the lifecycle of highly critical shared
data, like Reference Data and Master Data
Data Data
Metadata
Governance Security
Data Management
Challenges
19
3
This section derives from Redman, Thomas. Data Quality for the Information Age
(1996) pp. 41-42, 232-36; and Data Driven (2008), Chapter One, “The Wondrous
and Perilous Properties of Data and Information.”
The fact that data can be easily copied and replicated means it
can be breached without being ‘gone’ from its rightful owners.
Moreover, because data represents people, products, and money,
legislators and regulators have recognized the potential uses and
abuses of information and have put in place laws intended to
mitigate obvious risks. For example:
Consumers are also more aware of how their data might be used.
For example, when making purchases on a website, they expect
not only smoother and more efficient operation of processes, but
also protection of their information and respect for their privacy.
Organizations that do not protect their customers’ data may not
have those customers for long.
Many of the costs of poor quality data are hidden and indirect
and therefore hard to measure. Others, like fines, are direct and
easy to calculate. Costs come from:
Data asset valuation must also recognize that the value of data is
contextual (i.e., what is of value to one organization may not be
5
For case studies and examples, see Aiken, Peter and Billings, Juanita. Monetizing
Data Management (2014) and Laney, Douglas, Infonomics: How to Monetize,
Manage, and Measure Information as an Asset for Competitive Advantage
(2018).
6
McGilvray, 2008; English, 1999. DAMA’s lifecycle depiction is based on
McGilvray’s formulation, POSMAD – Plan, Obtain, Store & Share, Maintain,
Apply, Dispose – which has proved a very valuable model, especially in the data
quality space.
7
McGilvray (2008) points out that there are costs at all points in the data lifecycle,
but value is created only when data is used. See also Redman (2008).
Store /
Enhance Use
Maintain
Dispose of
8
Figures differ, but a quick Google search of “What percentage of data is unused?”
produces three that are somewhat shocking: 97% according to Gartner (25
January 2018), 85% according to Veritas (15 March 2016), and 73% according to
Inc.com (12 April 2018).
9
Aiken, 2014.
DAMA’s Data
Management Principles
• Data is valuable
33
DATA IS VALUABLE
• Data is an asset with unique properties: Data is an
asset, but it differs from other assets in important ways
that impact how it is managed. The most obvious of
these properties is that data is not consumed when it is
used, as are financial and physical assets.
11
See Paulk, Mark C. “A history of the Capability Maturity Model for Software.”
https://bit.ly/2HTuIK6.
• The organization
measures itself
• Managed knowledge against data
of data management management
• Formal knowledge principles principles
of data management • Principles are • Principles drive data
principles centrally owned and management
• Emerging knowledge • Principles become articulated process
of data management organizational • Principles are used improvements
• Little or no principles enablers, making to ensure reliability
knowledge of data • Some principles are processes that of data management Level 5
management applied in more than follow them more processes
principles one area of the Optimized
• Individuals follow reliable
data management
organization Level 4
principles out of Managed
common sense but Level 3
not conscious
knowledge of them Level 2 Defined
Repeatable
Level 1
Initial / Ad Hoc
Whatever the focus, a DMMA can help bridge the gap between
business and IT perspectives on the health and effectiveness of
data management practices. A DMMA provides a common
language for depicting what progress looks like across data
management functions and offers a stage-based path to
improvement which can be tailored to an organization’s strategic
priorities. Thus, it can be used both to set and to measure
organizational goals, as well as to compare one’s organization
against other organizations or industry benchmarks.
Data Ethics
Handling data – not only managing it, but using it and sharing it
with other entities – in an ethical manner is necessary to the
long-term success of any organization that wants to get value
from its data. Unethical data handling can result in the loss of
reputation and customers, because it puts at risk people whose
data is exposed. In some cases, unethical practices are also
illegal. 12 Given the connection between the right to privacy and
12
HIPAA (Health Insurance Portability and Accountability Act) in the US, PIPEDA
(Personal Information Protection and Electronic Documents Act) in Canada, the
EU General Data Protection Regulation (GDPR) and other data protection /
41
13
The issue of data ethics has been examined not only in regard to privacy, but also
in regard to the use of data to influence outcomes of political processes. See
Nicholas Confessore and Danny Hakim, March 6, 2017. “Data Firm says ‘Secret
Sauce’ Aided Trump; Many Scoff.” New York Times. March 6, 2017.
https://nyti.ms/2J2aDx2; and Barb Darrow, “Is Big Data Killing Democracy?”
Fortune Magazine, September 15, 2017. Or, simply google: Is big data killing
democracy?
14
The accepted tenets of bioethics, which focus on preserving human dignity,
provide a good general starting point for principles of data ethics. For example,
the Belmont Principles for medical research may be adapted in Information
Public policy and law try to codify right and wrong based on
ethical principles. But they cannot codify every circumstance.
For example, privacy laws in the European Union, Canada, and
the United States show different approaches to codifying data
ethics. These principles can also provide a framework for
organizational policy. The principles underlying the General
Data Protection Regulation (GDPR) of the EU include:
15
Numerous recent books describe the degree to which data science techniques have
been used to influence political and economic processes in potentially unethical
ways. See, for example, Stephens-Davidowitz, Seth. Everybody Lies: Big Data,
New Data, and What the Internet Can Tell Us About Who We Really Are. (Harper
Collins, 2017.) O’Neil, Cathy. Weapons of Math Destruction: How Big Data
Increases Inequality and Threatens Democracy. (Random House, 2016.) And
Schneier, Bruce. Data and Goliath: The Hidden Battles to Collect Your Data and
Control Your World. (Norton, 2015.) The potential to misuse data has always
existed. See Darell Huff’s 1954 classic How to Lie with Statistics. But in today’s
world, the capacity to collect and analyze data has significantly increased the risk
of misuse with significant social implications.
16
https://bit.ly/2tNM53c.
17
Hasselbalch & Tranberg, 2016.
1
Results Behavior capture
• Content required
• Privileges granted or
• Capture method
2
denied Ethical Risks
4
• Activities
• Further engagement or not
• Relationship removal
in a Sampling • Sentiment
Project • Location
• Benefit or sanction
• Date/Time
• Trust or lack of trust
• Combination datasets
• Biased treatment
• Legal and ethical review
3
BI/Analytics/Data Science
• Profiling Prospects
• Actual and forecast activities
Data Governance
53
19
Ladley, 2012.
Data,
Information, Data Management
Data Governance
And Managing data to
Ensuring data is managed
Content achieve goals
Lifecycles
Oversight Execution
STORE/MAINTAIN:
Ensure data storage
Store / follows policy and
Enhance Use
Maintain regulatory
requirements
Figure 9: Data Governance and the Data Lifecycle (Adapted from DMBOK2,
p. 29)
20
The Data Governance Institute. https://bit.ly/1ef0tnb.
IT Program
Data Governance Steering Organizations Manage-
Committee Data Management
Executives
ment
Data Data
Data Governance Office Management
Divisions &
Program
Programs
Subject Area
Centralized
Data Governance
Business Unit
Replicated
Federated
Data Governance
21
Adapted from Ladley (2012).
Escalation
Rotating chairs Enterprise Data Governance Oversight Resolution
Program Oversight
Policy Development
Projects & Other Initiatives
Stewardship
Policy Issues
Domain Data
Accountability
Governance Consumer Partner Product Data Lifecycles
Data Responsibility
Forums ERP Analytics
Warehouse
DATA STEWARDSHIP
Data Stewardship is one of those concepts that people don’t
always understand. A steward is a person whose job it is to
manage the property of another person. Data Stewards manage
data assets on behalf of others and in the best interests of the
organization. 22 This concept grew out of the recognition that,
within any organization, there have always been people who
have expertise in the data and genuinely care about how an
organization maintains data and makes it available for use. As
the importance of data has grown, so too has formal recognition
of this stewardship function.
22
McGilvray, 2008.
• Defining policies
• Underwriting data improvement projects
• Engaging with change management to educate staff and
drive the adoption of desired behaviors
• Managing issues and conflicts that may arise during the
process of implementation
23
See Randy Bean, “The Chief Data Officer Dilemna”. Forbes.com, January 29,
2018. https://bit.ly/2J8ahVZ.
24
Dataversity.com
71
ENTERPRISE ARCHITECTURE
Architecture refers to an organized arrangement of component
elements intended to optimize the function, performance,
feasibility, cost, and aesthetics of an overall structure or system.
The term architecture has been adopted to describe several
facets of information systems design. Even in small
organizations, information technology is complicated.
Architectural artifacts and documentation which depict systems
and data flows show people how systems, processes, and data
work together. A strategic approach to architecture allows an
organization to make better decisions about its systems and data.
DATA ARCHITECTURE
Data architecture is fundamental to data management. Because
most organizations have more data than individual people can
comprehend, it is necessary to represent organizational data at
different levels of abstraction so that management can
understand it and make decisions about it.
All levels are part of the Enterprise Data Model, and linkages
create paths to trace an entity from top to bottom and between
models in the same level.
The Enterprise Data Model and the Data Flow Design need to fit
well together. As mentioned, both need to be reflected in current
state and target state (architecture perspective), and also, the in
transition state (project perspective).
Business Processes
Product
Product Part
Manufacturing
Plant
Customer
Major Entities
Sales Item
Assembly
Structure
Sales Order
Production
Order
Individual
Product
Shipping
Customer’s
Invoice
Create Read/use
CRM
Service
Customer data & repair
Customer statistics
agreement
Manufacturing Manufacturing
Routing System Planning Serial numbers
Manufacturing BOM System Individual product BOM
Production line routing
Lead times
DATA MODELING
A model is a representation of something that exists or a pattern
for something to be made. Maps, organization charts, and
building blueprints are examples of models in use every day.
Model diagrams make use of standard symbols that allow one to
understand content.
25
Simsion, 2007.
26
Hoberman, 2009.
Entity
Outside of data modeling, the definition of entity is a thing that
exists separate from other things. Within data modeling, an
entity is a thing about which an organization collects
information. Entities are sometimes referred to as ‘the nouns of
an organization.’ In relational data models, entities are the boxes
that identify the concept being modeled.
Relationship
A relationship is an association between entities. 28 A relationship
captures the high-level interactions between conceptual entities,
the detailed interactions between logical entities, and the
constraints between physical entities. Relationships are shown as
lines on the data modeling diagram.
27
Hoberman, 2009.
28
Chen, 76.
Organization Job
Employ Be assigned to
Employee
Support
Dependent
Attribute
An attribute is a property which identifies, describes, or
measures an entity. The physical correspondent of an attribute in
an entity is a column, field, tag, or node in a table, view,
document, graph, or file. In the example in Figure 18, the entity
Organization has the attributes of Organization Tax ID,
Organization Phone Number, and Organization Name. Employee
has the attributes of Employee Number, Employee First Name,
Employee Last Name, and Employee Birth Date. Dependent and
Job details have attributes that describe their characteristics.
Organization Job
Organization Tax ID Job Requisition Number
Employ Be assigned to
Employee
Employee Number
Organization Tax ID (FK)
Support
Dependent
Dependent Number
Employee Number (FK)
Organization Tax ID (FK)
Domain
In data modeling, a domain is the complete set of possible values
that an attribute can be assigned. A domain provides a means of
standardizing the characteristics of the attributes and constrains
the data that can be populated in the field. For example, the
domain Date, which contains all possible valid dates, can be
assigned to any date attribute in a logical data model or date
columns/fields in a physical data model such as:
• EmployeeHireDate
• OrderEntryDate
• ClaimSubmitDate
• CourseStartDate
93
While DBAs can seem far removed from the data governance
function, their knowledge of the technical environment is
essential to implement data governance directives related to such
things as access control, data privacy, and data security.
Experienced DBAs are also instrumental in enabling
organizations to adopt and leverage new technologies.
• Loading data
• Replicating data
• Tracking usage patterns
• Planning for business continuity
• Managing backup and recovery
• Managing database performance and availability
• Managing alternate environments (e.g., for development
and test)
• Managing data migration
• Tracking data assets
• Enabling data audits and validation
In short, DBAs make sure the engines are running. They are also
first on the scene when databases become unavailable.
• Archiving data
• Managing data interfaces
• Obtaining and ingesting external data
• Integrating structured and unstructured data
• Providing operational intelligence and management
decision support
The main driver for DII is to ensure that data moves efficiently
to and from different data stores, both within the organization
and between organizations. It is very important to design with an
eye toward reducing complexity. Most enterprises have
hundreds, sometimes thousands, of databases. If DII is not
DATA WAREHOUSING
Data warehouses allow organizations to integrate data from
disparate systems into a common data model in order to support
operational functions, compliance requirements, and Business
Intelligence (BI) activities. Warehouse technology emerged in
the 1980s and organizations began building warehouses in
earnest in the 1990s. Warehouses promised to enable
organizations to use their data more effectively by reducing data
redundancy and bringing about more consistency.
The term data warehouse implies all the data is in one place, as
in a physical warehouse. But data warehouses are more
complicated than that. They consist of multiple parts through
which data moves. During its movement, the structure and
format of data may be changed, so that it can be brought together
in common tables, from which it can be accessed. It may be used
directly for reporting or as input for downstream applications.
29
https://bit.ly/2J6ve3u.
Historical
Reference Data
Reference Data
DM DM DM Analysis*
Raw Detailed Data
App Exploratory
DW Analysis*
App
Op DM Operational
Analysis
App
ODS
30
Adapted from figures in Inmon, W., Claudia Imhoff, and Ryan Sousa. The
Corporate Information Factory. 2nd ed. Wiley 2001.
Conformed Dimensions
DATA STORE:
DW BUS
Flat Files
Relational Tables ANALYTICAL
Extract XML Datasets Load Access APPLICATIONS
MODELS:
PROCESSING:
Extract Sorting Load Data Mart #N Access Forecasting
Sequencing Scoring
Data Mining
31
Adapted from figures in The Data Warehouse Toolkit, 2nd Edition, Ralph
Kimball and Margy Ross, John Wiley & Sons, 2002.
• One size does not fit all: Use the right tools and
products for each group of data consumers.
32
Chisholm, 2008.
34
Laney, 2001.
Big Data can be used for a range of activities, from data mining
to machine learning and predictive analytics. But to get there, an
organization must have a starting point and a strategy. An
organization’s Big Data strategy needs to be aligned with and
support its overall business strategy. It should evaluate:
Report Interact
Clean Geospatial and
Operational Integrate ODS Demographic Analytics
Systems Enrich Central Warehouse Performance
Standardize Management
Data Mart
Subject-Oriented
Non-Volatile
Compare Evaluate
Time-Variant Data Visualization
DaaS Reference &
Big Data Atomic
Master Data Data & Text Mining
MDM Historical Data
Results Conformed Cubes
Dimensions Unstructured
Big Data Analytics
Email
Predict Learn
Multimedia
© DATALEADERS.ORG
Sensors Predictive Analytics
Evaluate
IoT Ingest Data Lake Integrate Explore
Social Network Model
Web DaaS Machine Learning
DW
Figure 21: Conceptual DW/BI and Big Data Architecture (DMBOK2, p. 391)
35
Aiken & Billings, 2014.
113
• Analytics
• Data Visualization
• Data Monetization
BUSINESS INTELLIGENCE
The development of Business Intelligence reporting is another
activity where the use of data results in the creation of new data
that requires a level of ongoing management.
DATA SCIENCE
Data science has existed for a long time. It used to be called
applied statistics. But the capability to explore data patterns has
quickly evolved in the twenty-first century with the advent of
Big Data collection and storage technologies.
36
Data Science models, which contain logic (algorithms) to process data and make
predictions from it, are not the same thing as data models described in Chapter 6,
which document the structure of data and the relationships between data entities
and attributes.
DATA VISUALIZATION
Visualization is the process of interpreting concepts, ideas, and
facts by using pictures or graphical representations. Data
visualization facilitates understanding of the underlying data by
summarizing it in a visual form, such as a chart or graph. Data
visualizations condense and encapsulate characteristics data,
making them easier to see. Doing so, they can surface
opportunities, identify risks, or highlight messages. 37
37
Data visualization is an evolving field but the principles that guide it are based on
design principles. See Tufte, (2001) and McCandless (2012). Numerous web-
based resources exist with examples and counter examples. See the Periodic
Table of Visualization Methods at Visual Literacy.Org https://bit.ly/IX1bvI.
0,21 UCL=0,21061
Percentage of defaults per load
0,20
0,19 ͞x=0.1885
0,18
0,17
LCL=0,16639
0,16
TGT_TBL_ETL_DT
38
Data for Business Performance, Southekal, Prashanth, Technics Publications,
2017.
DATA MONETIZATION
Any organization engaged in Data Science or other forms of
analytics is likely to gain valuable insight about its own
customers, products, services, and processes. Advanced analytics
can generate insight about external entities as well. Such an
organization is also likely to develop techniques that might be
valuable to others. If these insights and techniques can be
packaged and sold, then an organization would be leveraging its
data not only as an asset, but as a product. In some circles, direct
data monetization is perceived as the holy grail of data
management. Some companies (Dun & Bradstreet, Google,
Amazon) have made a business of monetizing their data. But
selling data and information is not the only way to get value
from data assets.
39
Aiken and Billings, 2014.
getting value from their data. For some, as Aiken and Billings’s
and Laney’s case studies show and other research confirms, low
quality data is a significant liability. Others, however, have been
able to break through, with operational improvements as well as
direct monetization. Case studies show that innovation uses of
data require reliable data management. While not every
organization will want to sell its data, all organizations want to
have confidence in the decisions they make based on their data.
The first step in this direction is to manage the data well.
125
STAKEHOLDER GOVERNMENT
CONCERNS REGULATION
NECESSARY LEGITIMATE
BUSINESS BUSINESS
ACCESS NEEDS CONCERNS
• Data security must be • Trade secrets
appropriate • Research & other IP
• Data security must not be • Knowledge of customer
too onerous to prevent needs
users from doing their • Business partner
jobs relationships and
• Goldilocks principle impending deals
40
Ray, 2012.
Store / STORE/MAINTAIN:
Enhance Use Ensure data storage
Maintain follows policy and
regulatory
requirements
Figure 25: Data Security and the Data Lifecycle (Adapted from DMBOK2, p.
29)
41
Kark, 2009.
Metadata Management
137
even a specific topic. The card catalog not only provides the
necessary information (which books and materials the library
owns and where they are shelved) it also enables patrons to find
materials using different starting points (subject area, author, or
title). Without the catalog, finding a specific book would be
difficult if not impossible. An organization without Metadata is
like a library without a card catalog.
Organizations get more value out of their data assets if their data
is of high quality. Quality data depends on governance. Because
it explains the data and processes which enable organizations to
function, Metadata is critical to data governance. If Metadata is a
guide to the data in an organization, then it must be well-
managed. Poorly managed Metadata leads to:
TYPES OF METADATA
Metadata is generally categorized into three types: business,
technical, or operational.
METADATA IS DATA
While Metadata can be understood through its uses and the
categories, it is important to remember that Metadata is data.
Like other data, it has a lifecycle (see Figure 26). We must
manage it in relation to its lifecycle.
STORE/MAINTAIN:
Store / Ensure metadata
Enhance Use remains current and
Maintain continues to meet
requirements
METADATA STRATEGY
As noted, the types of information that can be used as Metadata
are wide-ranging. Metadata is created in various places
throughout an enterprise. The challenges come with bringing
Metadata together so that people and processes can use it.
METADATA ARCHITECTURE
Like other forms of data, Metadata has a lifecycle. While there
are different ways to architect a Metadata solution, conceptually,
all Metadata management solutions include architectural layers
that correspond to points in the Metadata lifecycle
METADATA QUALITY
When managing the quality of Metadata, it is important to
recognize that a lot of Metadata originates through existing
processes. For example, the data modeling process produces
table and column definitions and other Metadata essential to
creating data models. To get high-quality Metadata, Metadata
should be seen as a product of these processes, rather than as a
byproduct of them.
All processes, systems, and data have a need for some level of
meta-information; that is, some description of their component
pieces and how they work. It is best to plan how to create or
collect this information. In addition, as the process, system or
data is used, this meta-information grows and changes. It needs
to be maintained and enhanced. Use of Metadata often results in
recognition of requirements for additional Metadata. For
example, sales people using customer data from two different
systems may need to know where the data originated in order to
better understand their customers.
METADATA GOVERNANCE
Moving from an unmanaged to a managed Metadata
environment takes work and discipline. It is not easy to do, even
if most people recognize the value of reliable Metadata.
Organizational readiness is a major concern, as are methods for
governance and control. A comprehensive Metadata approach
requires that business and technology staff be able to work
closely together in a cross-functional manner.
Many organizations simply fail to define what makes data fit for
purpose in the first place and therefore lack commitment to data
quality.
151
This chapter will define key concepts related to data quality and
discuss data quality management in relation to overall data
management.
DATA QUALITY
The term data quality is used to refer both to the characteristics
associated with high-quality data and to the processes used to
measure or improve the quality of data. This dual usage can be
confusing, so it helps to look at both meanings, starting with
high-quality data. Later in the chapter we will look at the
definition of data quality management.
The team is also responsible for working with those who need
data to do their jobs to ensure the data meets their needs, and
working with those who create, update, or delete data in the
course of their jobs to ensure they are properly handling the data.
Data quality depends on all who interact with the data, not just
data management professionals.
STORE/MAINTAIN:
Monitor quality of data
Store / within systems and
Enhance Use
Maintain processes to ensure it
continues to meet
expectations
Figure 27: Data Quality Management and the Data Lifecycle (Adapted from
DMBOK2, p. 29)
43
For the full text of The Leader’s Data Manifesto, see: https://bit.ly/2sQhcy7.
• Setting priorities
44 See Wang (1998), English (1999), Redman (2001), Loshin (2001), and
McGilvray (2008). See Pierce (2004) for an overview of literature related to the
concept of data as a product.
PLAN DO
ACT CHECK
CHECK: actively
ACT: Address and
monitoring the
resolve emerging
quality of data as
data quality issues
measured against
and continue the
requirements
cycle
Figure 28: A Data Quality Management Cycle based on the Shewhart Cycle
(DMBOK2, p. 263)
In the Plan stage, the data quality team assesses the scope,
impact, and priority of known issues, and evaluates alternatives
to address them. This plan should be based on a solid foundation
of analysis of the root causes of issues. From knowledge of the
causes and the impact of the issues, cost / benefit can be
understood, priority can be determined, and a basic plan can be
formulated to address them.
46
Evans & Price, 2012.
47
Adapted from The Leader’s Data Manifesto. https://dataleaders.org/.
No Revenue
tertiary Information
Lack of
education Assets Not Competitive
Awareness
Properly Advantage
• Executive No on-
• Practitioner the-job Managed
Productivity
education Don’t know how to put
information to work Information management Costs
Lack the ability to do the work tools are not understood
Fail to invest in quality, adding cost Language is imprecise Risk
and complicating efforts to use data • Continuity
Inappropriate culture (e.g. intuition Software seen as a panacea/ • Compliance
valued over the “facts”, information not confusion about IT vs. data • Discovery
valued as asset) • Security
Inappropriate structure (e.g. silos Accounting principles don’t
impede sharing) allow capitalisation of
Confused about “who does what” information assets
Pro-active leadership missing.
Lack the equivalent of GAAP
Lack:
• Vision
• Strategy
• Policy
• Guiding Principles
Lack of • Management System Inappropriate or
Leadership and
ineffective
Management
Instruments
© 2017-2018 dataleaders.org
Used with permission
Work based on research by Dr. Nina Evans and James Price, see
“Barriers to the Effective Deployment of Information Assets” at
www.dataleaders.org
48
Diagram developed by Danette McGilvray, James Price and Tom Redman. Used
by permission. https://dataleaders.org/.
• The cost of getting data right the first time is lower than
the cost of getting data wrong and fixing it.
What to do Now
This chapter will discuss steps that are critical steps to initiate
improvements to organizational maturity around data
management. These include:
169
49
Adopting an appropriate maturity model is a key to success. See the DMBOK2
and: Alan McSweeney, Review of Data Management Maturity Models,
SlideShare.net, published 2013-10-23. https://bit.ly/2spTCY9. Jeff Gorball,
Introduction to Data Management Maturity Models, SlideShare.net, published
2016-08-01. McSweeney includes the DAMA-DMBOK as one of his maturity
models, although the DMBOK is not structured as such.
3
Metadata 2 Modeling
0
DW&BI Storage & Ops
R&MD Security
D&C DII
The specifics of the action plans will depend on the results of the
current state assessment, but an example will show how the
process works.
183
Adelman, Sid, Larissa Moss, and Majid Abai. Data Strategy. Addison-
Wesley Professional, 2005. Print.
Aiken, Peter and Michael M. Gorman. The Case for the Chief Data
Officer: Recasting the C-Suite to Leverage Your Most Valuable Asset.
Morgan Kaufmann, 2013. Print.
Aiken, Peter and Todd Harbour. Data Strategy and the Enterprise
Executive. Technics Publishing, LLC, 2017. Print.
185
Boiko, Bob. Content Management Bible. 2nd ed., Wiley, 2004. Print.
Cokins, Gary et al. CIO Best Practices: Enabling Strategic Value with
Information Technology. 2nd ed., Wiley, 2010. Print.
Dean, Jared. Big Data, Data Mining, and Machine Learning: Value
Creation for Business Leaders and Practitioners. Wiley, 2014. Print.
EMC Education Services, ed. Data Science and Big Data Analytics:
Discovering, Analyzing, Visualizing and Presenting Data. Wiley, 2015.
Print.
Fisher, Tony. The Data Asset: How Smart Companies Govern Their
Data for Business Success. Wiley, 2009. Print.
Inmon, W. Building the Data Warehouse. 4th ed., Wiley, 2005. Print.
Inmon, W.H., and Dan Linstedt. Data Architecture: A Primer for the
Data Scientist: Big Data, Data Warehouse and Data Vault. 1st ed.,
Morgan Kaufmann, 2014.
Jensen, David. “Data Snooping, Dredging and Fishing: The Dark Side
of Data Mining A SIGKDD99 Panel Report.” ACM SIGKDD, Vol. 1,
Issue 2. January 2000. Retrieved from http://ftp.bstu.by/ai/Data-
mining/Stock-market/expl99.pdf.
Kimball, Ralph, and Margy Ross. The Data Warehouse Toolkit: The
Definitive Guide to Dimensional Modeling. 3d ed., Wiley, 2013. Print.
Kitchin, Rob. The Data Revolution: Big Data, Open Data, Data
Infrastructures and Their Consequences. SAGE Publications Ltd.,
2014. Print.
Lee, Yang W., Leo L. Pipino, James D. Funk, and Richard Y. Wang.
Journey to Data Quality. The MIT Press, 2006. Print.
Marz, Nathan and James Warren. Big Data: Principles and best
practices of scalable realtime data systems. Manning Publications,
2015. Print.
Provost, Foster and Tom Fawcett. Data Science for Business: What you
need to know about data mining and data-analytic thinking. O’Reilly
Media, 2013. Print.
Quinn, Michael J. Ethics for the Information Age. 6th ed., Pearson,
2014. Print.
Redman, Thomas. “Bad Data Costs U.S. $3 Trillion per Year.” Harvard
Business Review, 22 September 2016.
Redman, Thomas. Data Quality: The Field Guide. Digital Press, 2001.
Print.
Schmarzo, Bill. Big Data MBA: Driving Business Strategies with Data
Science. Wiley, 2015. Print.
Soares, Sunil. The Chief Data Officer Handbook for Data Governance.
MC Press, 2015. Print.
Verhoef, Peter C., Edwin Kooge, and Natasha Walk. Creating Value
with Big Data Analytics: Making Smarter Marketing Decisions.
Routledge, 2016. Print.
Zeng, Marcia Lei and Jian Qin. Metadata. 2nd ed., ALA Neal-
Schuman, 2015. Print.
199