Data Governance Best Practices
Data Governance Best Practices
Prepared for
Prepared by
Version Control
Version Date Description Editor / Reviewer
i
Data Governance Best Practices
Table of Contents
1. INTRODUCTION ........................................................................................................................................................ 4
1.1. SCOPE ............................................................................................................................................................................ 4
1.2. BACKGROUND................................................................................................................................................................ 4
1.3. DOCUMENT ORGANIZATION ......................................................................................................................................... 4
2. ARC CHALLENGES AND DATA GOVERNANCE BENEFITS ......................................................................... 5
3. DATA GOVERNANCE OVERVIEW ........................................................................................................................ 8
3.1. DATA GOVERNANCE DEFINED...................................................................................................................................... 8
3.2. DATA GOVERNANCE FRAMEWORKS ............................................................................................................................ 9
4. DATA GOVERNANCE FRAMEWORK: BUSINESS STRATEGIES AND ORGANIZATION ..................... 12
4.1. OVERVIEW.................................................................................................................................................................... 12
4.2. DATA GOVERNANCE GOALS AND OBJECTIVES ........................................................................................................ 12
4.3. POLICIES ...................................................................................................................................................................... 14
4.4. ORGANIZATION ............................................................................................................................................................ 16
4.5. PERFORMANCE AND MATURITY MODELS ................................................................................................................. 17
5. DATA LIFECYCLE MANAGEMENT..................................................................................................................... 20
6. CHANGING NEEDS IN TRANSFORMATIVE TRANSPORTATION ENVIRONMENTS ............................. 22
6.1. GENERAL IMPLICATIONS ............................................................................................................................................. 22
6.2. SPECIFIC IMPLICATIONS FOR TRANSPORTATION DATA ............................................................................................ 23
6.2.1. Integrated Transportation Management Systems Impacts .................................................................... 23
6.2.2. Mobility on Demand and Accessible Travel Impacts ............................................................................... 23
6.2.3. Automated Vehicle Impacts........................................................................................................................... 24
7. GETTING STARTED WITH DATA GOVERNANCE .......................................................................................... 25
7.1. FHWA APPROACH...................................................................................................................................................... 25
7.2. MNDOT APPROACH ................................................................................................................................................... 26
7.3. FHWA CHALLENGES AND LESSONS LEARNED ....................................................................................................... 28
8. ARC’S ROLE IN A REGIONAL DATA GOVERNANCE FRAMEWORK ....................................................... 31
8.1. RECOMMENDATIONS FOR ESTABLISHING A REGIONAL DATA GOVERNANCE FRAMEWORK ................................ 31
8.1.1. Data Set Catalog.............................................................................................................................................. 32
8.1.2. Agency Roles and Responsibilities ............................................................................................................. 34
8.2. RECOMMENDATIONS FOR ESTABLISHING A LONG TERM DATA GOVERNANCE FRAMEWORK ............................. 34
9. ENDNOTES............................................................................................................................................................... 37
10. APPENDICES ....................................................................................................................................................... 39
ii
Data Governance Best Practices
List of Figures
Figure 1: FDOT ROADS Project Data Governance Overview ............................................................. 9
Figure 2: Data Governance and Stewardship Context Diagram from DMBOK2 ............................... 10
Figure 3: MnDOT Data Governance Roles ......................................................................................... 16
Figure 4: Extended Data Governance Roles for a Distributed Enterprise .......................................... 17
Figure 5: IBM Maturity Model .............................................................................................................. 18
Figure 6: IBM Model to Assess Effective Data Governance ............................................................... 19
Figure 7: DCC Curation Lifecycle Model ............................................................................................. 20
Figure 8: Digital Maps for AV Supply Chain (source: USDOT) ........................................................... 24
Figure 9: MnDOT Data Governance Framework ................................................................................ 27
Figure 10: Data Business Plan Development Process ....................................................................... 31
Figure 11: Federal Data Catalog (https://catalog.data.gov/dataset) ................................................... 32
Figure 12: Snapshot of the Maryland Transit Data Catalog
(https://data.imap.maryland.gov/items?page=3&tags=CME)..................................................... 33
Figure 13: Longer Term Data Governance Framework Process ........................................................ 34
List of Tables
Table 1: FHWA Data Governance Goals and Objectives ................................................................... 12
Table 2: Data Governance Principle Focus ......................................................................................... 13
Table 3: FHWA Data Governance Policies [source: FHWA, 2015].................................................... 14
Table 4: MnDOT Recommendations and Suggested Strategies for Data Governance..................... 27
Table 5. Experiences, Benefits, Challenges and Lessons Learned from Implementing Data
Governance. ................................................................................................................................ 29
Table 6: Data Catalog Attribute List ..................................................................................................... 33
iii
Data Governance Best Practices
1. Introduction
1.1. Scope
This white paper on Best Practices for Data Governance (DG) explores industry recommendations on the
purpose, benefits, and strategies related to applying data governance to the role of the Atlanta Regional
Commission (ARC) with respect to “data”. The area of data governance is extensive, with every enterprise
architecture and information technology business analysis methodology promoting a framework for people,
processes, and technologies to manage their data.
This paper will introduce the various topics related to data governance best practices, providing an extensive
bibliography, however, the paper attempts to focus the reader on the role of ARC in fostering good data
governance best practices rather than implementing regional data management processes.
1.2. Background
Effective transportation systems management and operations (TSMO) at a regional scale involves
coordination among a wide array of partners -- including agencies involved in operating highways, transit
services, and emergency response services, as well as the private sector – to optimize system performance.
Data sharing for enhanced situational awareness is key to implementing many TSMO strategies; moreover,
sharing real-time data and predictive analytics with the public and private sectors plays an important role in
influencing travel decisions. High-quality, consistent data that is managed over time and throughout its life is
the foundational element of multi-agency, regional approaches to TSMO.
At the same time, public agencies operate with limited and shrinking resources, changing technological
landscapes, and shifting roles and expectations. Data is becoming a major asset and investment. To manage
these assets, government needs to become
• responsive with the ability to transform data into information and decisions;
• support interoperability in order to share information;
• effective and efficient data custodians to manage data discovery and access; and
• a trusted source to manage data quality and privacy.
To that end, key data should be planned and managed to support the enterprise rather than just a project,
which is currently what is often done. Data governance, therefore, is becoming increasingly important for
organizations and overall systems of organizations that work together. Many organizations that adopt
data governance practices recognize the need to undergo a cultural transformation, changing the ways
individuals and systems handle and process data. TSMO strategies also require cultural change, thus,
there may be no better time for ARC and its stakeholders to adopt data governance practices then now.
Section 2 ARC Challenges and Data Governance Benefits. Section 2 introduces the benefits of data
governance given current challenges in data access and exchange in the ARC region.
4
Data Governance Best Practices
Section 3 Data Governance Overview. This section describes data governance, its definition and framework
components. Many organizations claim to be the authority over the best practice. This section summarizes
and identifies the commonalities of the various methodologies.
Section 4 Data Governance Framework: Business Strategies and Organization. Section 4 describes the
various data governance framework components and how transportation organizations implement them. The
components include goals and objectives, policies, organizational models and maturity models.
Section 5 Data Lifecycle Management. Section 5 describes an overview of the data lifecycle and categories
of plans and procedures that are included to curate data over if life.
Section 6 Changing Needs in Transformative Transportation Environments. This section describes future
challenges with changing and emerging transportation technologies and strategies. Detailed topics include
impacts due to integrated transportation management systems, mobility on demand and accessible travel, and
automated vehicles.
Section 7 Lessons Learned from Government Initiatives. Section 7 describes the approach used by
transportation agencies to get started – what drives them to adopt data governance frameworks, how they get
started, and steps recommended by the USDOT to set up a data governance framework.
Section 8 ARC’s Role in a Regional Data Governance Framework. Section 8 provides a set of
recommendations for ARC to initiate data governance for regional constituents and stakeholders (both internal
and external). The section includes recommendations for ARC’s role and responsibilities, as well as plans and
artifacts needed to promote good data management practices for the region.
Section 9 End Notes. This section provides notes and references for citations contained in the report.
Appendices. The appendices section includes Appendix A Acronym Table and Appendix B -- the results from
the data exercises conducted during Workshop #2 (2019 March 18).
Bibliography. The Bibliography, though providing the full references for the end notes, provides example
documents that ARC can use to model framework components. In particular, the Data Business Plan
(Hillsborough…) describes
5
Data Governance Best Practices
oriented and technology-related: What data is available? From who? Where is it stored? How often is it
updated? And how can it be accessed without major technology issues?
• DG provides “rules of engagement” which describe user- and owner-roles, access procedures,
and methods of exposing and describing data sets irrespective of technology or platform. The
rules of engagement promulgated by a data governance framework helps improve sharing
efficiencies through the adoption of: (1) data discovery services to support searching for data
across multiple organizations; and (2) technology-agnostic and role-based data access methods.
Challenge #2: Inconsistent structures, formats, and semantics. Current systems have incompatible data
descriptions that makes it difficult to understand data in detail (e.g., type, meaning, scale, temporal, coverage,
estimated vs. observed) and to integrate it into their systems.
• DG framework provides rules and guidelines on how to describe, organize, and share data. This
ensures that all data is collected, named, defined, and grouped consistently and according to
standards across all stakeholders, including vendors. This standardization also helps with any
future system/software integration, as agencies now operate using a consistent data organization
and structure. Finally, this standardization facilitates aggregating data to provide key performance
metrics in an efficient manner.
Challenge #3: Unclear data responsibility. Currently, there is limited organizational structures that specify
management, accountability, and audit responsibilities for data. These details include accountability for
upkeep, quality, description, and dissemination to downstream users, including sharing information to other
stakeholders. Data curation is often overlooked once data is collected or an application is deployed.
Maintaining information about data quality, lineage, point of contact, or storage location may not be maintained
because there is no one assigned to manage the data.
• DG framework identifies the need to define the roles and responsibilities of data owners,
stewards, and users of data over its lifecycle. A common theme of data governance is the
relationship of people to data: who is responsible for data curation, who is responsible for
ensuring the data serves enterprise needs, who is accountable for the quality and access to the
data, who owns the data, how is the data used, specifically as it relates to privacy issues. As
such, a DG framework designates roles such as data stewards, data custodians, data policy
committees, and data champion as well as the responsibilities assigned to each.
Challenge #4: Data restrictions. The lack of defined data ownership and rights to data also leads to unclear
data distribution and use, that is, what can or cannot be shared due to contracting agreements or licensing
restrictions?
• By articulating data policies for sharing and use, a DG framework clarifies the distribution and
privacy rules for requests made by internal and external stakeholders. As such, DG helps in
setting clear relationships to manage shared data and information exchanges among internal and
external stakeholders, addressing any policy or legal limitations for sharing data.
Challenge #5: Limited and costly resources to manage data. There is a vast amount of data being
collected and processed for static and real time consumption. Collecting, managing, and distributing this data
requires resources, including human, that may exceed the financial capabilities of the stakeholders.
• A common feature of most DG frameworks involves the development of a data business plan that
o prioritizes critical data needs,
o identifies redundancies in data collection, processing and storage,
o develops strategies for migrating manual collection and quality control to automated
processes, and frames organizational responsibilities for data stewards who role is to
manage data for the enterprise
6
Data Governance Best Practices
A complete set of the challenges as well as current and future data sharing needs are included in Appendix B:
Workshop #2 Data Discussion .
7
Data Governance Best Practices
8
Data Governance Best Practices
9
Data Governance Best Practices
10
Data Governance Best Practices
Each objective has its own context diagram much like the one shown in Figure 2.
Generally, IT Governance and enterprise architecture methodologies and tailored DG frameworks incorporate
different aspects of data governance; however, there are consistent recurring themes throughout these various
models, including the following characteristics:
• Accountability and leadership roles in organization
• Planning and rules for data handling – quality, integrity, access
• Strategic enterprise perspective
• Cultural change to a data-centric organization
11
Data Governance Best Practices
In its DG Primer, the Federal Highway Administration (FHWA) identifies a sample set the goals and objectives
for data governance (FHWA, 2015). The goals and objectives are listed in Table 1.
12
Data Governance Best Practices
Prioritization – Prioritize efforts to • Establish clear priorities to address data gaps and needs.
address data gaps and needs. • Communicate priorities to FHWA business units.
Cooperation – Facilitate cross- • Increase opportunities for data sharing.
organizational collaboration, data • Eliminate data silos and other barriers.
sharing, and integration. • Ensure business units know the identity of Data Stewards.
• Ensure Data Stewards know the identity of Data Users.
Flexibility – Encourage creative • Identify innovative data solutions throughout FHWA.
and innovative solutions to data • Communicate innovative solutions to Data Stewards and Data
needs. Users.
Utilization – Improve data • Promote appropriate data usage throughout FHWA.
utilization and ease of access. • Provide staff the means to determine the extent and availability of
FHWA data.
Other organizations, such as Colorado and MnDOT address objectives such as:
• Build a culture of data cooperation by involving all organization members in data collaboration
(knowledge, access, accountability, use)
• Promote knowledge of data and reduce risk
• Develop guidelines that incorporates managing information value and reducing risk
• Understand and measure benefits of data management practices
• Involve business and IT in procurement decisions that incorporate data value
13
Data Governance Best Practices
4.3. Policies
Policies support oversight and compliance to standard operating practices (SOP). Some organizations tend to
develop broad-based policies while, some are detailed and structured, such as a SOP. FDOT developed
broad-based policies that incorporated their data governance objectives9. The FHWA Data Governance
Primer (FHWA, 2015) shows a detailed set of policies that cover every aspect of the data governance
organization, processes, and procedures; these policies are similar to data needs and corresponding data
management requirements (as shown in Table 3).
Table 3: FHWA Data Governance Policies [source: FHWA, 2015]
Policy Description
FHWA data are an Data, structured and unstructured, and the corresponding metadata, are business
enterprise asset. and technical resources owned in whole or in part by FHWA. FHWA data include
shared data about managed entities, interests, finances, employees, resources,
customers, providers, business affiliates, best practices, operating procedures,
experimental results, etc. All employees must recognize that the proper management
of strategic enterprise data is critical to the success of the organization.
FHWA data FHWA data programs or data related activities within IT projects require Investment
programs and Review Board (IRB) approval prior to and during an ongoing effort. This process is
activities must typically initiated, liaised, communicated to IT project managers, or executed by the
undergo IT Data Stewards. They are ultimately responsible for following the FHWA Information
investment Technology Investment Process in order to gain IRB approval prior to and during all
process. planned/ongoing data activities.
FHWA data must All strategic FHWA data shall be modeled, named, and defined consistently,
be consistent according to standards, across the organization. Efforts must be made by
management to share data and not maintain redundant data without justification.
Originating business stewards of data must recognize the informational needs of
downstream processes and business units that may require FHWA data.
FHWA data must Quality data are critical to ensuring FHWA mission success. Data Stewards are
be of acceptable responsible for ensuring that FHWA data are accurate and correct for the intended
quality purpose and use, and that data providers follow all reporting requirements regarding
the collection, processing, and reporting of FHWA data, and meet all requirements of
14
Data Governance Best Practices
the Data Quality Act. Data quality standards shall be managed and applied actively to
the approved reliability levels of FHWA data as defined by the business owners.
FHWA data must All enterprise data (structured and unstructured) must conform to a common set of
be interoperable standards and schemas across all data sharing parties. Data sharing must also be
with dependent accounted for and facilitated through a designated authority.
systems
FHWA data must All FHWA data must be maintained as close to the source as feasible, to reduce the
be maintained at collection and storage of redundant data.
the source
Enterprise data FHWA data, in all electronic formats, shall be safeguarded and secured based on
must be safe and recorded and approved requirements and compliance guidelines. These
secured requirements are to be determined by the OITS. Appropriate backups and disaster
recovery measures shall be administered and deployed for all FHWA data. The
enterprise data must adhere to the privacy rules and requests made by each
respective business steward both internal and external to FHWA.
FHWA data must FHWA data, information, and meta-data shall be readily accessible to all, except
be accessible where determined to be restricted. When restrictions are made, business stewards of
the data are accountable for defining specific individuals and levels of access
privileges that are to be enabled. The OITS will be responsible for the implementation
of proper security controls.
Meta-data will be All FHWA information system development and integration projects will utilize the
recorded and defined meta-data program for data naming, data modeling, and logical and physical
utilized database design purposes. The DGAC is responsible for developing plans to capture
and record specific data administration-focused meta-data consistent with the defined
meta-data program.
Data stewards will Individuals designated as stewards will have specific enterprise data accountabilities
be accountable by incorporated into their job descriptions.
job description
Timeliness of data Data must be obtained, processed and be made available in a timeframe consistent
with its intended use.
MnDOT, in their business plan, describes policies to improve data management through stewardship, curation,
data security, database recovery, and data retention, as well as policies needed to implement data
governance. The business plan identifies the executive group as responsible for developing data governance
policies; the scope of their responsibility is to advance policies which includes:
• Charging a Data Stewardship Steering Committee to assess current policies relating to data to
determine their efficacy
• Revising any policies that are obsolete, confusing or inaccurate
• Developing new policies that need to be implemented
• Developing an implementation plan to include a process for accountability, maintenance,
communications and training
Policy needs, cited in Oregon’s Public Transportation Plan (ODOT, 2018), acknowledges that ongoing changes
in technology need to be met “with common system and data exchange standards”, methods, and/or guidance
for data collection, governance, sharing, and use. They identify several data exchange standards such as
GTFS and a regional trip planner, but do not detail specific policies that emerge from these needs. ODOT is
only beginning to initiate a data governance framework.
15
Data Governance Best Practices
4.4. Organization
A typical Data Governance organization will be composed of
• A Governing Board of key stakeholders who review, promulgate, and oversee compliance with the
“rules of engagement”
• Data Stewards who oversee the functional integrity and quality of specific data sets based on their
subject matter expertise
• Data Custodians who perform the operational tasks of collecting, ingesting, validating, storing, and
implementing tools to disseminate data sets
The MnDOT Data Governance roles (see Figure 3) include a data set domain steward (sometimes called data
custodian) who is responsible for the operational efforts of collecting, storing, syntax, and validating the data,
while the data domain steward (sometimes called the business data steward or data steward) is responsible
for the quality, meaning, and appropriate use of data. The data governance board is typically in charge of
advancing policies and procedures, change management, and championing the business and operational
stewards. An organization may have multiple stewards responsible for different data sets.
16
Data Governance Best Practices
Data Custodian
•Operational management of data from each organization
•Responsible for data capture, maintenance, and disseminateion and following data
governance policies/ procedures and participating in data working groups
The Enterprise Data Steward’s role is to coordinate the activities of data stewards and custodians for each
domain or data set. The coordination efforts include change management for data definitions and needs,
interface specifications, data models and transformation, as well as developing requirements and
specifications for verifying and validating existing or new data sets and interfaces. For example, there may be
an enterprise data steward for transit data and another for work zone data.
The MnDOT Data Business Plan includes a comprehensive list of roles and responsibilities that can be used
as reference for additional positions11.
17
Data Governance Best Practices
integrated with other business and technical areas; embedded in the culture and continuously assessed; and
adapted to improvements and changing environments.
18
Data Governance Best Practices
Many models create a rating system to score maturity based on a set of questions or based on measuring
against goals, key performance indicators (KPI), or against a set of questions in a scorecard. An example of
setting a Value Creation element measure will include the high-level element, objective(s), and related KPI(s)
as follows:
Element: measure the progress of providing discovery and access to data sets
Objective: migrate local (workstation) data to online access
KPI: record the number of data sets that were migrated and listed in the data catalog with respect to
the total number of local data sets to identify the percent increase of accessible data
The various maturity methodologies identify different self-assessment tools, scorecards, and KPIs that
may be used. Agencies can develop their own maturity models by capturing their goals and objectives
from their framework and describing their objectives around the key DG components: people (roles),
processes, and rules of engagement (quality, metadata, standards, etc.).
A continuously improving organization will collect and score their compliance with the objective through
the KPI on a periodic basis – monthly, quarterly or annual reviews. The Data Governance Board will set
the performance metrics to measure priorities and effectiveness of their policies.
19
Data Governance Best Practices
20
Data Governance Best Practices
The cost of the software and hardware to store and process data is only a small fraction of the cost to collect,
update, preserve and access the data. Manual collecting, transforming, updating, validating, managing and
disseminating costs account for much of the investment in data; tools, particularly automated tools, to manage
steps in the curation lifecycle will increase efficiency, reduce costs and enhance data quality. In particular,
automated tools to collect, process and disseminate data in real-time are paid for as the value of the data
becomes more critical to sustain and adapt operations of critical business strategies.
The “artifacts” that compose the lifecycle include the following types of document:
• Data catalogs
• Data dictionaries, models, and interface and feed specifications (e.g., API)
• Guidelines for applying and provisioning API and data feed specifications including procurement
language and requirements to support regional interface standards
• Standard Operating Procedures for managing, ingesting, preserving, storing, and disseminating
data
• Metadata requirements, including naming conventions, quality, lineage, configuration, and version
control procedures
• Data improvement plans (at the data steward and enterprise steward levels)
The MnDOT Data Business Plan advocates that each data steward develop a 5-year plan to propose and
prioritize their activities for the near future. A plan supports other near-term projects because the plan
can leverage data or support the implementation of an activity. A typical initial priority for a data plan is to
migrate spreadsheet data to a web-based tool so the data is listed in an enterprise catalog and accessible
to multiple users.
The reappraise and migration processes included in the curation lifecycle are essential to ensure the data
addresses changing usage; in a regional environment when downstream data users rely on regional data
specifications, impacts to data meaning, formats and structure may be significant. These changes may
be due to a new application or system coming on-line that produces new information or consumes data
that is not currently available. The change process to agree on updates is typically assigned to an
Enterprise Data Steward working group. Working group members identify impacts to upstream and
downstream data systems and agree to a timeline for changes that incorporate regional concerns.
21
Data Governance Best Practices
22
Data Governance Best Practices
o Personal information and data collection strategies were not anticipated by regulations
and policy makers, and authorities have not accounted for the new knowledge that
emerges from data fusion. As these continue to evolve, outdated rules will continue to
govern the collection and use of personal data.
Furthermore, agencies that regulate emerging mobility services, such as bikesharing and e-scooter
services, also need to distribute and collect data that is not currently available:
• Public access zones where restrictions exist (e.g., no free bikeshare or e-scooter storage)
• Applications to audit micromobility device compliance with regulations
23
Data Governance Best Practices
These emerging trends elicit questions about the role of government with respect to collecting and managing
the information:
• What role does government have in collecting and updating infrastructure data, communicating
condition and status information, and offering applications for travelers to locate these information
sets?
• In the future, if emerging multimodal and mobility on demand services are regulated, then what
role, facility, and applications will government possess to audit and ensure compliance with the
regulations?
24
Data Governance Best Practices
25
Data Governance Best Practices
4. Develop data governance charter – The data governance charter will then set the purpose,
mission, vision, goals and objectives, and data management policies for data governance within a
U.S. DOT stakeholder office.
5. Develop data catalog – This catalog provides a centralized location for information about the
data used by stakeholders involved with roadway travel mobility data programs. Involvement
means the office is performing one or several of the following functions related to data: collection,
analysis, reporting, dissemination, or providing guidance to other stakeholders related to those
functions. The catalog should be reviewed and revised by data business owners within each U.S.
DOT stakeholder office to ensure that all data systems, data standards, roles, and
responsibilities, etc., are correctly identified. It should also be revised at least on an annual basis,
or monthly if changes occur that require updating the information listed in the catalog.
6. Assess data governance maturity – Implement a data management maturity model to assess
where the organization stands with respect to implementing certain data governance processes.
The maturity model also can be used to benchmark for comparison or assist an agency in
understanding common concepts related to an issue or process. A typical maturity model
identifies levels and characteristics of those levels. The model can be used to assess an agency’s
status and assist in identifying next steps to achieve success toward an ultimate goal state.
26
Data Governance Best Practices
27
Data Governance Best Practices
standards needed to mature data D. Develop an implementation plan to include a process for
governance at MnDOT accountability, maintenance, communications and training
Recommendation 4: Form a Data A. Develop a staffing plan to identify positions and/or
Governance Board to replace the persons who take on the Data Governance Board role
BIC with members representing the B. Review and adopt the Data Governance Board
divisions, the CIO and the Data responsibilities as the board charter
Management Coordinator (see C. Develop a work plan for implementing policies, standards
and processes for data governance
Figure 3)
Recommendation 5: Create the A. Determine the purpose or charge for each data
Data Stewardship Steering stewardship steering committee
Committee role as part of the larger B. Identify data domain coordinators to serve on each
data governance program committee
Recommendation 6: Formalize A. Integrate the notion of data stewardship into policies,
the Data Steward role as part of the standards and processes
data governance program B. Define data domains, data sets and stewards needed to
represent all the data used by MnDOT’s products and
services
C. Formally identify data stewards for core or department-
wide data domains and sets
Recommendation 7: Assign the A. Develop a staffing plan to fill the Data Management
Data Management Coordinator role Coordinator role.
within MnDOT.
Recommendation 8: Develop a A. Incorporate data projects into the Division Director’s IT
process to integrate or create touch Development Investment Plan
points between data governance
and Division Directors’ investment
management
Recommendation 9: Initiate a A. Initiate an IT project to implement a Business Data
project to implement a Business Catalog using the recommendations made by the
Data Catalog independent consultant
B. Develop the catalog concurrently with the business
intelligence project in order to eliminate duplication of
effort during the development of the catalog iterations
C. The project will implement multiple deliverables and
activities, including a method to validate the data and
implement a data management plan, maintenance plan
and security procedures. In addition, the project will
identify a tool to implement the Business Data Catalog.
The data will need to be organized and cataloged based
on the data domains/sets with responsibilities assigned to
corresponding Data Stewards
28
Data Governance Best Practices
FHWA surveyed four State DOTs (Arizona, Arkansas, Ohio and Texas) regarding their efforts in data
governance and data management within the context of their Geospatial Information Systems programs.22 The
States described noticeable benefits and impacts they experienced through their efforts to promote/implement
data governance strategies/programs, as well as challenges and key lessons learned. These insights are
summarized in Table 5—note that GIS-specific are not included here.
Table 5. Experiences, Benefits, Challenges and Lessons Learned from Implementing Data
Governance.
Experiences and Benefits
Communicating value Executive management respond positively to strategies that can help the agency
to executive reduce cost or make processes more efficient with the same budget. As such,
management. engaging the executive management’s desire to save the agency time and
money is a strategic method of gaining traction and support for data governance.
Some agencies reported that they were able to make headway in their efforts by
making it clear that the policies will save the agency resources.
Facilitating Data governance and data management policies facilitate the ability of agencies
collaboration between to work with, and learn from, other State agencies. Cross-compatibilities between
DOT staff. State agencies is a huge benefit.
Organizational Structures such as steering committees or designated oversight roles have been
structures are highly extremely helpful for agencies in their data governance and data management
effective projects. These organizational structures fill a need for a feedback loop that can
implementation tools assess the progress of the agency and see the direction it is heading. In some
instances, data governance committees have members that are from top-level
management.
Positive impacts are Even at early stages, staff begin to realize change is necessary to remain
quickly recognized and relevant, do meaningful work, and spread institutional knowledge across the
appreciated by staff. agency. Staff also begin to realize that data is a powerful tool and has great
intrinsic value, and so are more willing to put the effort into properly maintaining it.
Challenges
Agency culture can be It can be difficult for some staff to view data as an asset with a monetary value. In
difficult to overcome. this sense, some staff may hesitate to volunteer to do tasks that are not directed
by administrators as they might not understand why change is needed.
Additionally, the lack of ownership of the data process among staff and business
owners means that many times parties assume that other staff will be taking care
of data maintenance for them.
Bureaucracy between The value of data governance and data management to upper management can
executives and the be diluted by levels of bureaucracy. When executives are not in touch with the
29
Data Governance Best Practices
agency can result in day-to-day work going on within their agency, it is difficult for them to set policy
miscommunication. that will help their staff achieve their goals.
Personnel turnover is a When key personnel leave an agency, it has a great detrimental effect on the
serious obstacle to continuity of effort, results in the loss of institutional knowledge, and hinders
implementation. follow-through on existing projects. New staff or a new administration will then
need to be convinced of the value, which may not be successful. All participants
made note of this issue.
Internal departments The sub-groups within a DOT oftentimes have their own mission and goals, and
and teams can have the best way of achieving those goals might not align with the rest of the agency.
different missions. This makes compromising and moving in a direction that benefits the agency as a
whole complicated and can result in territorial disputes for resources.
Administrators’ focus Administrators can be intensely focused on operations, maintenance and
on engineering can construction. Because of this, they can have difficulty in understanding how data
prevent them from governance and management fits into the agency’s business operations.
understanding the
value of data
governance and data
management.
Lessons Learned
Definitions of these The creation of official definitions can provide DOT staff with a point of reference
concepts may differ in when communicating with upper management, and shows it is a concept valued
language, but they are by FHWA.
functionally the same.
Without a governing Steering Committees and similar governing bodies are an essential part of an
body, implementing effective implementation process as they provide a voice of authority and vision.
data governance is Implementation becomes hard to perform without this governing body.
very difficult.
Data governance and Performing data governance without the day-to-day practices of data
data management have management results in little to no progress being made towards the goals and
a symbiotic policies set forth in a data governance policy. Likewise, data management
relationship. practices that are performed without the guiding framework provided by a data
governance policy will result in haphazard or unorganized activity that is subject
to individual workflow preferences, and data collection and maintenance
standards.
30
Data Governance Best Practices
•As a stakeholder group, meet to confirm major challenges and short term
action plan
Step 2 Data Scope •Scope initial data sets by business or assessment areas
•Develop Data Catalog (initially as Google Doc)
Step 3 Steward •Develop data, metadata and quality priorities and standards for each
data set topic
Strategies •Augment and publish Data Catalog on line for major stakeholders
Step #4 Establish
Data Governance •Develop DG Charter with organizational structure, roles, responsibilities
Framework
31
Data Governance Best Practices
Governance is an iterative process which can start by applying small steps and developing into a mature
framework over time. This approach, agreed to at the FHWA Data Business Plan Workshop (held May 14),
uses an existing working group and an initial data inventory to document current, available data sets.
Another example, the Maryland GIS Data Catalog, shown in Figure 12, posts a short description of the data,
available file formats, source data, publication date and link to the data file.
32
Data Governance Best Practices
These on-line, public facing catalogs are simple. Both examples attach tags to the data sets which classify the
type, owner, topics and sometimes geographic coverage. The details associated with the data catalog can be
extended and grow over time. Table 6 shows a list of a minimum and detailed set of attributes that may be
included in the catalog and additional attributes that can extend available data set information.
Table 6: Data Catalog Attribute List
Minimum Set of Attributes Detailed Attributes
Data set name Geographic coverage
Topic (from a selected list) Data source and creation process
Data set description / abstract Location referencing method(s), if geographic
Date of collection / publication Expiration date
Owner Update frequency of data set
File format(s) Data steward contact information
Data dictionary (with a list of data definitions Quality, accuracy and validation process
and formats) or standard that describes data.
Extended data dictionary (with examples and
Link to data file by format(s) guidance on how data was applied to standard
format)
33
Data Governance Best Practices
These roles and responsibilities will be assigned to each organization to update and augment the data catalog
as additional information is accessible.
Step 2 Data and Gap •Assess level of maturity for each data topic
•Develop gap assessment
Assessment •Establish priorities for data cataloging and improvement
34
Data Governance Best Practices
The Data Governance charter establishes a formal structure to govern data quality, access and reporting.
Using the recommended organizational model described in Figure 4, ARC can play a significant role to
facilitate, lead or be a pivotal player in the Data Governance Board. Additionally, ARC can convene and
facilitate the Enterprise Data Steward working groups as chair or secretary. Recommendations for ARC
responsibilities related to the Board and EDS include:
• Coordinate business data steward plans and working groups at the enterprise level
• Facilitate each EDS working group to
o Develop regional standards and guidelines for implementing including processes, and
metadata for each business area;
o Support each business data steward in developing a 5-year plan
o Meet periodically to report on progress of 5-year plan and add to plan based on new
programs
• Generate catalog of data sets available for sharing
Business Data Stewards (BDS) and Data Custodians are stakeholder organization responsibilities. Data
custodians (DC) are typically IT staff or project staff who are responsible for implementing 5-year Data
Steward Plan including the standards set by the enterprise data stewards (e.g., FHWA data feed or ITS
standard message specification). ARC will need to define internal BDS and DC roles to formally manage
and participate within the Data Governance Board to encourage adherence to the Data Business Plan.
The typical documents that are initially required to kick off a regional Data Governance Framework
includes the following:
35
Data Governance Best Practices
In addition to leading the development of formal data governance processes and plans, ARC has the
opportunity to encourage data governance best practices through their processes and guidance
resources. ARC is currently developing Local Agency Deployment Guidelines which will support data
governance best practices and resources. Once more established regional data governance plans and
processes are in place, they should be integrated into other MPO processes such as Comprehensive
Transportation Plans and Transportation Improvement Program (TIP) solicitations. For example,
stakeholders could be asked to submit data sets during other regular cycle updates. Another example
would be to ask for data management information during project development such that data governance
practices are being considered at the initial stages of project inception.
36
Data Governance Best Practices
9. Endnotes
37
Data Governance Best Practices
38
Appendices
10. Appendices
Appendix A: Acronym Table
ARC Atlanta Regional Council
DAMA Data Management Association International
DCC Digital Curation Centre
DG Data Governance
FHWA Federal Highway Administration
IRB Investment Review Board
IT Information Technology
ITF International Transportation Forum
ITS Intelligent Transportation Systems
KPI Key Performance Indicators
OITS Office of Information Technology Services
PII Personal Identifiable Information
SOP Standard Operating Practices
TMC Transportation Management Center
TSMO Transportation Systems Management and Operations
• What are 3 major challenges to your organization sharing data with other organizations today?
Topic Details Count
39
Appendices
40
Appendices
• Legal issues
• Need to record/retrieve camera footage from ratesign cams, but
GDOTs cameras aren’t recorded due to policy
Access / This category is related to “Inconsistent Access / Challenges to Access 8
discovery
(platform)” category. The Inconsistent access category is technology
oriented.
• Availability
• Access to data
• Lack of information about what data exists and which organization
or department has it
• Access to data for analytical purposes in an easy to use format
• Not knowing which organization have what
• Knowing who to share with…
• Institutional bottlenecks (IT – “need to know security, proprietary,
restrictions)
• Internal – provisioning quick information aggregating key
performance metrics
Too much • Hosting large amounts of data 6
data! • Resources (people) to manage and maintain data connectivity,
storage, etc.
Resources • Understanding the cost / resource requirements to get clean,
/costs useable, consistent, complete operational data
• Workload
• Time availability of staff to create data sets to share
• Cost of integration and cost of maintenance
Metadata / • Data accuracy 6
quality • Various and conflictions uses of the data
• Accurate data
• Differing needs / efforts across organization
• Up to date data (regarding project implementation)
Privacy • Privacy / privacy data public data 3
• Privacy
• Privacy issues
Too many • Interface: with what interface we should go? Per security, 3
choices (How performance, legal…
do we • Formatting data properly for external consumption
choose the • I don’t know what I don’t know! Understanding the data landscape. I
best data need on inventory, schema, and dictionary that stays up-to-data and
management accessible
approach?)
Why collect • No one knows what to actually do with data or why they even need 2
it/want it
41
Appendices
• List the 3 most important data sets that you need from other organizations today.
42
Appendices
43
Appendices
• Given projects under deployment, list the 3 most important data sets you expect to share in
the future?
• GTFS, GTFS-realtime
• GTFS-realtime
• Real time transit data
• Transit
Transit Performance
44
Appendices
45
Appendices
46
11. Bibliography
Ahmed, M., Gopalakrishna, D., Garcia, V., Ragan, A., English, T., Zumpf, S., . . . Hsu, E. (2016). Connected
Vehicle Pilot Deployment Program Phase 1, Human Use Approval Summary. Washington, DC:
USDOT.
Ahmed, M., Gopalakrishna, D., Garcia, V., Ragan, A., English, T., Zumpf, S., . . . Hsu, E. (2016). Connected
Vehicle Pilot Deployment Program Phase 1, Participant Training and Education Plan. Washington,
DC: USDOT.
Atlanta Regional Transportation System Management and Operations (TSMO) Visioning Workshop Summary.
(2019, January 23).
CTA. (2019). Chicago Transit Authority. Retrieved from How we manage delays:
https://www.transitchicago.com/performance/wtgw/#addressingbunches
Cupoli, P., Earley, S., & Henderson, D. (2014). DAMA- DMBOK2 Framework. DAMA International. Retrieved
March 28, 2019, from https://dama.org/sites/default/files/download/DAMA-DMBOK2-Framework-V2-
20140317-FINAL.pdf
Digital Curation Center Data Curation Lifecycle Model. (n.d.). Retrieved April 4, 2019, from
http://www.dcc.ac.uk/resources/curation-lifecycle-model
Earley, S. (2015). Presentation on DAMA-DMBOK2 and CDMP. Retrieved March 28, 2019, from Dama
International: http://dama-phoenix.org/wp-content/uploads/2015/09/DAMA-Phoenix-DMBOK2.pdf
FDOT. (2017, June 14). Florida Data Governance Policy. Tallahassee, Florida.
FHWA. (2015). Data Governance Plan, Volume 1 Data Governance Primer. Washington, DC: USDOT.
Firican, G. (2018, October 3). Data Governance Maturity Models. Retrieved April 4, 2019, from
LIGHTSONDATA: http://www.lightsondata.com/category/data-governance/
Gidley, S., & Rausch, N. (n.d.). Best Practices in Enterprise Data Governance - White Paper. SAS.
Goetz, M. (2015, September 15). Data Governance and Data Management Are Not Interchangeable.
Retrieved March 28, 2019, from Forrester: https://go.forrester.com/blogs/15-09-11-
data_governance_and_data_management_are_not_interchangeable/
Gopalakrishna, D., Garcia, V., Ragan, A., English, T., Zumpf, S., Young, R., . . . Hsu, E. (2016). Connected
Vehicle Pilot Deployment Program Phase 1, Outreach Plan . Washington, DC: USDOT.
Green, M., & Lucivero, A. (July 2018). Data Governance & Data Management: Case Studies of Select
Transportation Agencies. Washington, D.C.: US DOT. Retrieved from
https://www.gis.fhwa.dot.gov/documents/GIS_Data_Governance_and_Data_Management_Case_Stu
dies.pdf
Kitchener, F., Gopalakrishna, D., Garcia, V., Ragan, A., English, T., Zumpf, S., . . . Hsu, E. (2016). Connected
Vehicle Pilot Deployment Program Phase 1, Performance Measurement and Evaluation Support Plan.
Washington, DC: USDOT.
MnDOT. (2011). Data Business Plan. Minnesota Department of Transportation. Retrieved from
https://www.dot.state.mn.us/tda/databusinessplan.docx
MnDOT. (2019). MnDOT Policies - Data Data Stewardship. Retrieved from Minnesota Department of
Transportation: https://www.dot.state.mn.us/policy/dm/dm002.html
47
NASCIO. (April 2008, April). Data Governance - Managing Information As An Enterprise Asset - An
Introduction. Retrieved from NASCIO Governance Series:
https://www.nascio.org/Portals/0/Publications/Documents/NASCIO-DataGovernance-Part1.pdf
National Academies of Sciences, Engineering, and Medicine. (2010). Target-Setting Methods and Data
Management to Support Performance-Based Resource Allocation by Transportation Agencies -
Volume I: Research Report, and Volume II: Guide for Target Setting and Data Management.
Washington, DC: The National Academies Press. doi:. https://doi.org/10.17226/14429.
ODOT. (2011). Asset Management Strategic Plan. Oregon Department of Transportation. Retrieved from
https://www.oregon.gov/ODOT/Programs/Documents/AMI/04-amsp-10-111711_final.pdf
ODOT. (2018). Oregon Public Transportation Plan. Oregon Department of Transportation. Retrieved from
https://www.oregon.gov/ODOT/Planning/Documents/OPTP_FINALDRAFT.pdf
OECD/ITF. (2015). Big Data and Transport: Understanding and assessing options. International Transport
Forum at the OECD.
Thomas, G. (2019, March 28). Defining Data Governance. Retrieved from The Data Governance Institute:
http://www.datagovernance.com/defining-data-governance/
USDOT. (Januaray 2013). U.S. DOT Roadway Transportation Data Business Plan (Phase 1): Data Business
Plan, FHWA-JPO-13-084. Washington D.C.: US DOT. Retrieved April 4, 2019
Vandervalk, A., Snyder, D., & Hajek, J. (2013). U.S. DOT Roadway Transportation Data Business Plan (Phase
1) - Data Business Plan. Washington, DC: U.S. Department of Transportation.
What is Data Governance? (2015-6). Retrieved March 28, 2019, from The MDM Institute:
http://www.tcdii.com/whatIsDataGovernance.html
48