CHP 6 Slides For Share
CHP 6 Slides For Share
Chapter 6
Foundations of Business
Intelligence: Databases and
Information Management
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Learning Objectives
6.1 What are the problems of managing data resources in a
traditional file environment?
6.2 What are the major capabilities of database management
systems (DBMS), and why is a relational DBMS so powerful?
6.3 What are the principal tools and technologies for accessing
information from databases to improve business performance
and decision making?
6.4 Why are data governance and data quality assurance
essential for managing the firm’s data resources?
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
File Organization Terms and Concepts
• Database: Group of related files
• File: Group of records of same type
• Record: Group of related fields
• Field: Group of characters as word(s) or number(s)
• Entity: Person, place, thing on which we store information
• Attribute: Each characteristic, or quality, describing entity
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.1 The Data Hierarchy
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.2 Traditional File Processing
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Problems with the Traditional File
Environment
• Files maintained separately by different departments
• Data redundancy
• Data inconsistency
• Program-data dependence
• Lack of flexibility
• Poor security
• Lack of data sharing and availability
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Database Management Systems
• Database
– Serves many applications by centralizing data and controlling
redundant data
• Database management system (DBM S)
– Interfaces between applications and physical data files
– Separates logical and physical views of data
– Solves problems of traditional file environment
Controls redundancy
Eliminates inconsistency
Uncouples programs and data
Enables organization to centrally manage data and data
security
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.3 Human Resources Database
with Multiple Views
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Relational DBMS
• Represent data as two-dimensional tables
• Each table contains data on entity and attributes
• Table: grid of columns and rows
– Rows (tuples): Records for different entities
– Fields (columns): Represents attribute for entity
– Key field: Field used to uniquely identify each record
– Primary key: Field in table used for key fields
– Foreign key: Primary key used in second table as look-
up field to identify records from original table
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.4 Relational Database Tables
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Operations of a Relational DBMS
• Three basic operations used to develop useful sets of data
– SELECT
Creates subset of data of all records that meet stated
criteria
– JOIN
Combines relational tables to provide user with more
information than available in individual tables
– PROJECT
Creates subset of columns in table, creating tables with
only the information specified
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.5 The Three Basic
Operations of a Relational DBMS
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Capabilities of Database Management
Systems
• Data definition
• Data dictionary
• Querying and reporting
– Data manipulation language
Structured Query Language (SQ L)
• Many DBM S have report generation capabilities for
creating polished reports (Microsoft Access)
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.6 Access Data Dictionary
Features
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.7 Example of an SQL Query
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.8 An Access Query
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Designing Databases
• Conceptual design vs. physical design
• Normalization
– Streamlining complex groupings of data to minimize redundant
data elements and awkward many-to-many relationships
• Referential integrity
– Rules used by RDBM S to ensure relationships between tables
remain consistent
• Entity-relationship diagram
• A correct data model is essential for a system serving the
business well
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.9 An Unnormalized Relation
for Order
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.10 Normalized Tables
Created from Order
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.11 An Entity-Relationship
Diagram
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Non-relational Databases, Cloud
Databases, and Blockchain (Slide 3 of 3)
• Blockchain
– Distributed ledgers in a peer-to-peer distributed database
– Maintains a growing list of records and transactions shared by all
– Encryption used to identify participants and transactions
– Used for financial transactions, supply chain, and medical records
– Foundation of Bitcoin, and other crypto currencies
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
The Challenge of Big Data
• Big data
– Massive sets of unstructured/semi-structured data from
web traffic, social media, sensors, and so on
• Volumes too great for typical DBM S
– Petabytes, exabytes of data
• Can reveal more patterns, relationships and anomalies
• Requires new tools and technologies to manage and
analyze
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Analytical Tools: Relationships,
Patterns, Trends
• Tools for consolidating, analyzing, and providing access to
vast amounts of data to help users make better business
decisions
– Data mining
– Text mining
– Web mining
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Data Mining
• Finds hidden patterns, relationships in datasets
– Example: customer buying patterns
• Infers rules to predict future behavior
• Types of information obtainable from data mining:
– Associations
– Sequences
– Classification
– Clustering
– Forecasting
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Figure 6.15 Linking Internal Databases to
the Web
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Data Quality Assurance
• More than 25 percent of critical data in Fortune 1000
company databases are inaccurate or incomplete
• Before new database is in place, a firm must:
– Identify and correct faulty data
– Establish better routines for editing data once database
in operation
• Data quality audit
• Data cleansing
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved
Copyright
This work is protected by United States copyright laws and is
provided solely for the use of instructors in teaching their
courses and assessing student learning. Dissemination or sale of
any part of this work (including on the World Wide Web) will
destroy the integrity of the work and is not permitted. The work
and materials from it should never be made available to students
except by instructors using the accompanying text in their
classes. All recipients of this work are expected to abide by these
restrictions and to honor the intended pedagogical purposes and
the needs of other instructors who rely on these materials.
Copyright © 2022, 2020, 2018 Pearson Education, Ltd. All Rights Reserved