Enterprise Data Management (Midterm Reviewer)
LESSON 1
Data is a corporate asset that must be managed to maximize its value, and its management must be enabled as a
common capability to ensure that this asset is used properly.
Data are raw facts that describe the characteristics of an event or object.
Structured data has a defined length, type, and format and includes numbers, dates, or strings such as a customer address.
1. Machine-generated data is created by a machine without human intervention. Machine-generated structured data
includes sensor data, point-of-sale data, and web log data.
2. Human-generated data is data that humans generate in interaction with computers. Human-generated structured
data includes input data, clickstream data, or gaming data.
Unstructured data has no defined structure, does not follow a specified format, and is typically free-form text such as
emails, Twitter tweets, and text messages.
1. Machine-generated unstructured data, including satellite images, scientific atmospheric data, and radar data.
2. Human-generated unstructured data, including text messages, social media data, and emails.
Big data is a collection of large, complex data sets, including structured and unstructured data, which cannot be analyzed
using traditional database methods and tools.
The simple difference between data and information is that computers or machines need data and humans need
information.
Data is the raw building block that has not been shaped, processed, or analyzed, and it frequently appears disorganized
and unfriendly.
Information gives meaning and context to analyzed data, making it insightful for humans by providing context and
structure that are extremely valuable when making informed business decisions.
A report is a document containing data organized in a table, matrix, or graphical format allowing users to easily
comprehend and understand information.
A static report is created once based on data that does not change.
A variable is a data characteristic that stands for a value that changes or varies over time.
Business intelligence (BI) is information collected from multiple sources, such as suppliers, customers, competitors,
partners, and industries, and analyzed to reveal patterns, trends, and relationships for strategic decision making.
Business analytics is the scientific process of transforming data into insight for making better decisions.
Analytics
1. Descriptive analytics use techniques that describe past performance and history.
2. Predictive analytics use techniques that extract information from data and use it to predict future trends and identify
behavioral patterns.
3. Prescriptive analytics use techniques that create models indicating the best decision to make or course of action to
take.
Knowledge includes the skills, experience, and expertise, coupled with information and intelligence, that create a
person's intellectual resources.
Knowledge assets, also called intellectual capital, are the human, structural, and recorded resources available to the
organization.
What is Enterprise Data Management? The ability of an organization to effectively create, integrate, disseminate, and
manage data for all applications, processes, and entities of the enterprise that require accurate and timely delivery of data.
Enterprise data managers are most often database administrators, IT administrators, or IT project managers. They are in
charge of the process of managing your business’s entire data life cycle.
Benefits of EDM
• Improved access to organized, properly defined data through data governance and metadata management.
• Improved quality of data for decision making and operations – faster operations and faster, more accurate decisions.
• Improved reporting and analytic capabilities on both enterprise and local scales, for accurate results.
• Improved data security and privacy access according to standards and procedures applied consistently.
• Integration of data across sources according to standards and using a consistent architecture framework, for ease of
integration and access.
Components of EDM
• Data Governance – planning, oversight, and control over management of data and the use of data and data-related
resources; development and implementation of policies and decision rights over the use of data.
• Data Security and Privacy – ensuring privacy, confidentiality and appropriate access to data.
• Data Integration & Development – acquisition, extraction, transformation, movement, delivery, replication, federation,
virtualization and operational support.
• Data Analytics & Business Intelligence – managing analytical data processing and enabling access to decision support
data for reporting and analysis.
• Data Quality – defining, monitoring, and maintaining data integrity, and improving the accuracy, completeness, validity,
timeliness, and consistency of data.
LESSON 2
1. Perform Assessment – Businesses need a clear understanding of their data flows and the types of data they have in
order to craft an effective data management strategy.
2. Define Deliverables – It is important for an organization to outline what they hope to accomplish by implementing an enterprise data management strategy.
3. Determine Standards, Policies and Procedures - Standards, policies, and procedures are invaluable guideposts,
keeping data where it needs to be and helping to prevent corruption, security breaches, and loss of data.
4. Educate the stakeholders - Enterprise data management is sure to fail if the standards, policies, and procedures
surrounding it are not properly disseminated and emphasized. Additionally, EDM strategies are better positioned for
success if all of those who deal with data are on board with the project.
5. Emphasize Quality - Bad data is actually worse than no data at all. Adopting a culture of data quality will help protect
your data’s security and integrity and ultimately preserve its worth.
6. Invest in the Right People and Technology - Understanding the art of managing data isn’t everyone’s forte. It’s best to
have an in-house or consultative expert with experience establishing enterprise data management systems. Their
knowledge can help identify the right technologies to use.
DAMA-DMBOK (the DAMA Data Management Body of Knowledge) is a detailed guidebook that provides best practices and standards for specific data management functions.
• Data Governance – Ensuring all data management activities align with organizational policies, standards, and
regulations. It provides direction and control over data management processes.
• Data Architecture - Designing and managing the structure of an organization's data, ensuring it aligns with business
goals and supports data flow across systems.
• Data Modeling and Design – Creating models that define data elements, their relationships, and rules for data storage,
which support effective database design.
• Data Storage and Operations - Managing how data is stored, accessed, and maintained. This includes the physical and
technical aspects of data storage.
• Data Security – Protecting data from unauthorized access and breaches, ensuring confidentiality, integrity, and
availability of data.
• Data Integration and Interoperability - Ensuring data can be shared and used across different systems, applications,
and platforms without losing consistency.
• Document and Content Management - Managing unstructured data, including documents, images, and multimedia,
ensuring they are properly stored and retrievable.
• Reference & Master Data Management - Managing critical data that is used across various systems and ensuring
consistency, accuracy, and reliability.
• Data Warehousing & Business Intelligence - Collecting, storing, and analyzing large volumes of data to support
decision-making and business intelligence activities.
• Metadata - Managing data about data, which provides context and meaning, making it easier to manage and
understand other data assets.
• Data Quality - Ensuring that data is accurate, complete, consistent, and reliable, which is critical for effective decision-
making.
LESSON 3
A repository is a central location in which data is stored and managed. A data warehouse is a collection of information,
gathered from many different operational databases, that supports business analysis activities and decision-making tasks.
The primary purpose of a data warehouse is to combine information, more specifically strategic information, from
throughout an organization into a single repository in such a way that the people who need that information can make
decisions and undertake business analysis.
Standardization of data elements allows for greater accuracy, completeness, and consistency and increases the quality of
information in making strategic business decisions.
Data aggregation is the collection of data from various sources for the purpose of data processing. Businesses collect a
tremendous amount of transactional information as part of their routine operations.
Extraction, transformation, and loading (ETL) is a process that extracts information from internal and external
databases, transforms it using a common set of enterprise definitions, and loads it into a data warehouse. The data
warehouse then sends portions (or subsets) of the information to data marts.
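As a rough illustration, the ETL steps can be sketched in a few lines of pandas. The source file names, the column names, and the SQLite file standing in for a warehouse are all hypothetical; this is a minimal sketch rather than a real pipeline:

    import sqlite3
    import pandas as pd

    # Extract: pull raw records from two hypothetical source files.
    sales = pd.read_csv("sales_system.csv")       # internal database export (assumed)
    partners = pd.read_json("partner_feed.json")  # external feed (assumed)

    # Transform: apply a common set of enterprise definitions,
    # e.g., a shared column name and a standard date type.
    sales = sales.rename(columns={"cust": "customer_id"})
    sales["order_date"] = pd.to_datetime(sales["order_date"])
    combined = pd.concat([sales, partners], ignore_index=True)

    # Load: write the conformed rows into a warehouse table
    # (a local SQLite database stands in for the warehouse here).
    conn = sqlite3.connect("warehouse.db")
    combined.to_sql("fact_sales", conn, if_exists="append", index=False)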
Information cleansing or scrubbing is a process that weeds out and fixes or discards inconsistent, incorrect, or incomplete
information.
In a data warehouse, information cleansing occurs first during the ETL process and again once the information is in the
data warehouse.
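For instance, typical cleansing steps map directly onto pandas operations; the customer records below are made up to show duplicated, incomplete, and inconsistent entries:

    import pandas as pd

    # Hypothetical customer extract with the usual defects.
    df = pd.DataFrame({
        "customer_id": [101, 101, 102, 103],
        "email": ["a@x.com", "a@x.com", None, "c@x.com"],
        "city": ["Manila", "Manila", "Cebu", "cebu"],
    })

    df = df.drop_duplicates()             # weed out repeated records
    df = df.dropna(subset=["email"])      # discard incomplete records
    df["city"] = df["city"].str.title()   # fix inconsistent capitalization
    print(df)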
LESSON 4
Python is an easy to learn, powerful programming language. It has efficient high-level data structures and a simple but
effective approach to object-oriented programming.
Python has gone from a bleeding-edge or “at your own risk” scientific computing language to one of the most important
languages for data science, machine learning, and general software development in academia and industry.
NumPy, short for Numerical Python, has long been a cornerstone of numerical computing in Python. It provides the data
structures, algorithms, and library glue needed for most scientific applications involving numerical data in Python.
• Functions for performing element-wise computations with arrays or mathematical operations between arrays (see the sketch after this list).
• A mature C API to enable Python extensions and native C or C++ code to access NumPy’s data structures and
computational facilities.
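As a quick illustration of the first point, NumPy's element-wise functions apply to every element of an array at once, and its arithmetic functions operate between whole arrays:

    import numpy as np

    a = np.array([1.0, 4.0, 9.0])
    b = np.array([10.0, 20.0, 30.0])

    print(np.sqrt(a))    # element-wise computation: [1. 2. 3.]
    print(np.add(a, b))  # operation between arrays: [11. 24. 39.]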
Pandas, which emerged in 2010, provides high-level data structures and functions designed to make working with
structured or tabular data fast, easy, and expressive.
The primary objects in pandas that will be used are the DataFrame, a tabular, column-oriented data structure with both
row and column labels, and the Series, a one-dimensional labeled array object.
Pandas blends the high-performance, array-computing ideas of NumPy with the flexible data manipulation capabilities of
spreadsheets and relational databases (such as SQL).
Matplotlib is the most popular Python library for producing plots and other two-dimensional data visualizations. It was
originally created by John D. Hunter and is now maintained by a large team of developers. It is designed for creating plots
suitable for publication.
IPython and Jupyter are designed from the ground up to maximize your productivity in both interactive computing and
software development. They encourage an execute-explore workflow instead of the typical edit-compile-run workflow of
many other programming languages.
In 2014, Fernando Pérez and the IPython team announced the Jupyter project, a broader initiative to design
language-agnostic interactive computing tools.
The IPython web notebook became the Jupyter notebook, with support now for over 40 programming languages.
The IPython system can now be used as a kernel (a programming language mode) for using Python with Jupyter.
Anaconda is a Python distribution (prebuilt and preconfigured collection of packages) that is commonly used for data
science.
NumPy, short for Numerical Python, is one of the most important foundational packages for numerical computing in
Python.
One of the key features of NumPy is its N-dimensional array object, or ndarray, which is a fast, flexible container for large
datasets in Python.
Arrays enable you to perform mathematical operations on whole blocks of data using similar syntax to the equivalent
operations between scalar elements.
An ndarray is a generic multidimensional container for homogeneous data; that is, all of the elements must be the same
type.
Every array has a shape, a tuple indicating the size of each dimension, and a dtype, an object describing the data type of
the array.
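A small example makes both attributes concrete:

    import numpy as np

    data = np.array([[1.5, -0.1, 3.0],
                     [0.0, -3.0, 6.5]])

    print(data.shape)  # (2, 3): two rows, three columns
    print(data.dtype)  # float64: every element shares this one type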
You can also slice NumPy arrays. Slicing is used to extract a portion of the data from an array.
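For example:

    import numpy as np

    arr = np.arange(10)   # [0 1 2 3 4 5 6 7 8 9]
    print(arr[2:5])       # [2 3 4], index 2 up to but not including 5
    print(arr[:3])        # [0 1 2], from the start
    print(arr[5:])        # [5 6 7 8 9], to the end

One caveat worth remembering: basic NumPy slices are views, not copies, so assigning into a slice changes the original array.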
Arrays are important because they enable you to express batch operations on data without writing any for loops. NumPy
users call this vectorization.
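A short contrast shows the idea; the loop version is included only to show what the vectorized expressions replace:

    import numpy as np

    arr = np.array([1.0, 2.0, 3.0, 4.0])

    # Vectorized: one expression operates on the whole block of data.
    doubled = arr * 2   # [2. 4. 6. 8.]
    squared = arr ** 2  # [ 1.  4.  9. 16.]

    # The explicit for loop the expressions above replace.
    doubled_loop = np.array([x * 2 for x in arr])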
A Series is a one-dimensional array-like object containing a sequence of values and an associated array of data labels,
called its index.
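For example, a Series built with an explicit index can be queried by label:

    import pandas as pd

    # Values paired with an index of data labels.
    s = pd.Series([4, 7, -5, 3], index=["d", "b", "a", "c"])

    print(s["b"])    # 7, looked up by label
    print(s.index)   # Index(['d', 'b', 'a', 'c'], dtype='object')
    print(s.values)  # [ 4  7 -5  3]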
DataFrame represents a rectangular table of data and contains an ordered collection of columns, each of which can be a
different value type (numeric, string, boolean, etc.).
The DataFrame has both a row and column index; it can be thought of as a dictionary of Series all sharing the same
index.
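The example below (with made-up data) shows columns of three different value types, and how selecting a single column returns a Series:

    import pandas as pd

    frame = pd.DataFrame({
        "state": ["Ohio", "Ohio", "Nevada"],  # strings
        "year": [2000, 2001, 2001],           # integers
        "pop": [1.5, 1.7, 2.4],               # floats
    })

    print(frame["state"])  # a single column is a Series
    print(frame.columns)   # Index(['state', 'year', 'pop'], dtype='object')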
An aggregation function returns a single aggregated value for each group. Once the groupby object is created, several
aggregation operations can be performed on the grouped data.
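A small sketch with made-up salary data shows one aggregation per group, and then several at once via agg():

    import pandas as pd

    df = pd.DataFrame({
        "dept": ["IT", "IT", "HR", "HR"],
        "salary": [50000, 60000, 45000, 55000],
    })

    grouped = df.groupby("dept")  # create the groupby object
    print(grouped["salary"].mean())                      # one value per group
    print(grouped["salary"].agg(["min", "max", "sum"]))  # several aggregations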
The pandas DataFrame merge() function is used to merge two DataFrame objects with a database-style join
operation. The joining is performed on columns or indexes.
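A minimal example with two made-up tables, joined on their shared key column:

    import pandas as pd

    customers = pd.DataFrame({"customer_id": [1, 2, 3],
                              "name": ["Ana", "Ben", "Cara"]})
    orders = pd.DataFrame({"customer_id": [1, 1, 3],
                           "amount": [250, 90, 430]})

    # Database-style inner join on the customer_id column.
    merged = pd.merge(customers, orders, on="customer_id", how="inner")
    print(merged)  # Ana appears twice (two orders); Ben has none and drops out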