0% found this document useful (0 votes)
237 views13 pages

Management Information Systems Unit - 3 Notes-1

The document discusses key concepts in data management including challenges of data management, data independence, data redundancy, data consistency, and data administration. It also discusses database management systems, including the concepts of fields, records, tables, views, reports, and queries. Finally, it discusses data warehousing and data mining techniques as well as business intelligence.

Uploaded by

Vikas Thakur
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
237 views13 pages

Management Information Systems Unit - 3 Notes-1

The document discusses key concepts in data management including challenges of data management, data independence, data redundancy, data consistency, and data administration. It also discusses database management systems, including the concepts of fields, records, tables, views, reports, and queries. Finally, it discusses data warehousing and data mining techniques as well as business intelligence.

Uploaded by

Vikas Thakur
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 13

MANAGEMENT INFORMATION SYSTEMS

Managing Data Resources-


The need for data management, Challenges of data management, Data independence,
Data redundancy, Data consistency, Data administration.
Database Management System – Concepts and types of DBMS, Fields, Records, Table,
View, Reports and Queries.
Data warehouse and Data mining – Characteristics and uses of Data warehouse,
Techniques of Data Mining, Business Intelligence
Database Management System (Lab): Creation of Table, View and Reports. Basics of
SQL and running queries

The need for data management


Data management is the practice of collecting, organizing, protecting, and storing an
organization’s data so it can be analyzed for business decisions.

Data management is a crucial first step to employing effective data analysis.

With effective data management, people across an organization can find and access trusted data
for their queries.

Some benefits of an effective data management solution include:

Visibility

Data management can increase the visibility of your organization’s data assets, making it easier
for people to quickly and confidently find the right data for their analysis.

Reliability

Data management helps minimize potential errors by establishing processes and policies for
usage and building trust in the data being used to make decisions across your organization.
With reliable, up-to-date data, companies can respond more efficiently to market changes and
customer needs.

Security

Data management protects your organization and its employees from data losses, thefts, and
breaches with authentication and encryption tools.

Scalability

Data management allows organizations to effectively scale data and usage occasions with
repeatable processes to keep data and metadata up to date.
Challenges of data management

Data management addresses several challenges some of them are as follows

1- Large volume of data

Every day, it is estimated that 2.5 quintillion bytes of data are created. As a result, the
organization now faces new challenges in terms of obtaining, maintaining, and generating
value from data. A skilled team equipped with the appropriate tools and technology may assist
in controlling and managing the exponential expansion of data created or gathered.

2- Data storage on many places

This is one of the most significant challenges that businesses confront. Data redundancy occurs
when the same piece of data is stored in two or more separate places and is a common
occurrence in many businesses.

3- Data quality

Data quality is one of the most obstacles confronting many companies today. Most businesses
utilize a database to update information, however maintaining data quality becomes difficult
while processing or recording information.

4- Lack of processes and systems

When data is gathered from many sources, inconsistency in the data is unavoidable. Inadequate
DM processes and systems contribute to inaccurate data. As a result of the insufficient amount
of data, the data is of poor quality and does not fulfill the criteria.

5- Data Integrity

Data integrity is a concept and process that ensures the accuracy, completeness, consistency,
and validity of an organization's data. It is biggest challenge to maintain the integrity of data.

6- Lack of skilled resources

There is a severe lack of experienced specialists available for immediate recruitment. In reality,
these skilled professionals often have larger pay packages since they are required in any firm
that has to maintain strong control and management of their data.

7- Data security
It is a significant challenge that causes concern among organizations. Data security threats can
come from a variety of sources, including hackers, insider threats, natural disasters and human
error.
DATA INDEPENDENCE
Data Independence

o Data independence can be explained using the three-schema architecture.


o Data independence refers characteristic of being able to modify the schema at one level
of the database system without altering the schema at the next higher level.

There are two types of data independence:

1. Logical Data Independence

o Logical data independence refers characteristic of being able to change the conceptual
schema without having to change the external schema.
o If we do any changes in the conceptual view of the data, then the user view of the data
would not be affected.

2. Physical Data Independence

o Physical data independence can be defined as the capacity to change the internal schema
without having to change the conceptual schema and external schema as well.
o If we do any changes in the storage size of the database system server, then the
Conceptual structure and external level of the database will not be affected.
Data redundancy

It is also called data duplicity. In DBMS, when the same data is stored in different tables, it
causes data redundancy.

Sometimes, it is done on purpose for recovery or backup of data, faster access of data, or
updating data easily. Redundant data costs extra money, demands higher storage capacity, and
requires extra effort to keep all the files up to date.

Sometimes, unintentional duplicity of data causes a problem for the database to work properly,
or it may become harder for the end user to access data. Redundant data unnecessarily occupy
space in the database to save identical copies, which leads to space constraints, which is one of
the major problems.

Let us understand redundancy in DBMS properly with the help of an example.

Student_id Name Course Marks

101 Ram MBA 89

102 Sonu MBA 60

103 Varun MBA 90

104 Satish MBA 87

102 Sonu MBA 60

In the above example, there is a "Student" table that contains duplicate data of student Sonu.

Data consistency:

Consistency also implies that any changes made to a single object in one table must be mirrored
in all other tables in which that object appears.

Example, if the student’s home address changes, the change must be shown in all tables where
the previous address previously existed. Data inconsistency occurs when one table has the old
address, and the others have the updated address.

Data administration

Data administration is the process by which data is monitored, maintained and managed by a
data administrator and/or an organization. Data administration allows an organization to
control its data assets, as well as their processing and interactions with different applications
and business processes.
A database management system (DBMS)

A database management system (DBMS) is system software for creating and managing
databases.
A database management system is a group of interrelated data and a set of programs to access
the data.
Operations that can be performed on the database are:
• Creation of table
• Insertion of data in the table
• Protection of data
• Updation of data
• Deletion of data
• Alteration of data definition.
Example:
STUDENT
Student_rollno Name Course Marks

101 Abhinav Gupta MBA 89

102 Akshita MBA 60

103 Riya MBA 90

104 Saloni MBA 87

102 Yash MBA 60

Types of Database System


Many different types of database systems exist based on how they manage the database
structure.

Two types of database structure


Databases typically have one of two basic forms:

1. Flat file database 2. Relational or structured database


1. A flat file database stores data in a plain text file, with each line of text typically
holding one record. Delimiters such as commas or tabs separate fields. A flat file
database uses a simple structure and, unlike a relational database, cannot contain
multiple tables and relations.

2. A relational database contains multiple tables of data with rows and columns that
relate to each other through special key fields. These databases are more flexible than
flat file structures, and provide functionality for reading, creating, updating, and
deleting data. Relational databases use Structured Query Language (SQL).

EXAMPLE :
COLLEGE DATABASE includes the following relations(tables) like

ROLL_NO NAME CLASS MARKS ADDRESS

FID FACULTY_NAME SALARY ADDRESS

ROLL_NO BOOK_NO BOOK_NAME ISSUE_DATE

Four types of database management systems


There are four types of database management system:

• Hierarchical database systems


• Network database systems
• Object-oriented database systems
• Relational database management System
A hierarchical database model resembles a tree structure. The relationships between records
are in one to one manner, between 'parent and child' nodes. They require the user to pass a
hierarchy in order to access needed data.

Department
t
Student Faculty

Network database models also have a hierarchical structure. However, instead of using a
single-parent tree hierarchy, this model supports many to many relationships, as child tables
can have more than one parent.

College

Library
MBA Department

Student
An Object-oriented databases, the information is represented as objects, with different types
of relationships possible between two or more objects. Such databases use an object-oriented
programming language for development.

Relational databases, contains multiple tables of data with rows and columns that relate to
each other.

Terms related to Relational databases


1.Tables
Table is collection of rows and coloumn. Table is also called relation in Database management
system.
Tables are essential objects in a database because they hold all the information or data.
Before creating a tables, consider your requirements and determine all the tables that you might
need.
For example, a database for a College include student table ,Faculty table, Library table and so
on.
SQL query to create a table:
Create table student (rollno int,name char(10),address varchar(20),class
char(5),date_of_birth date);

2.Fields: Fields are also called attributes that defines a table.It have different types of
data, such as text, numbers, date etc.For example creating a student table we have to include
attributed related to student like rollno.,name,class,marks,address all these fields are related to
student.

3.A field value: Each record has a field value. For example, 1, Abhinav,89, MBA.

4.A record: Contains specific data, like information about a particular employee or students.In
below student table there are 7 records.

View

Views are considered as virtual table.

View can be defined as a subset of a database.

View contains the row and coloumn of selected database or table.

SQL query to create view

Create view d_marks as select ROLLNO,MARKS from student;


OUTPUT:

ROLLNO MARKS
1 89
2 78
3 80
4 90
5 87
6 79
7 85

A database Report

A database report is the formatted result of database queries and contains useful data
for decision-making and analysis.

Most good business applications contain a built-in reporting tool.

Example 1, a banking software application may contain specifically defined reports on


all customers with large deposits or reports on monthly loan summaries for all
customers.

For example, when you want to see the details of students who are doing an MBA.

Select * from student where course = ’MBA’;

QUERY:

The query can be defined as a request for data from the database. We use SQL queries
to create, select, and update data in the database.

Data warehouse
Data warehousing is the process of gathering, storing, and managing data from various sources
into one convenient repository.
Different departments of an organization can use it for analysis, reporting, and decision-
making.
It is designed to support the decision-making process by providing a centralized location for
all of an organization's data.
Bill Inmon's definition of a data warehouse is that it is a “subject-oriented, nonvolatile,
integrated, time-variant collection of data in support of management's decisions.”
Characteristics of a Data Warehouse
1. Integrated Data
One of the key characteristics of a data warehouse is that it contains integrated data. This
means that the data is collected from various sources, such as transactional systems, and then
cleaned, transformed, and consolidated into a single, unified view. This allows for easy access
and analysis of the data, as well as the ability to track data over time.
2. Subject-Oriented
A data warehouse is also subject-oriented, which means that the data is organized around
specific subjects, such as customers, products, sales. This allows for easy access to the data
relevant to a specific subject, as well as the ability to track the data over time.
3. Non-Volatile
Another characteristic of a data warehouse is that it is non-volatile. This means that the data
in the warehouse is never updated or deleted, only added to. This is important because it allows
for the preservation of historical data, making it possible to track trends and patterns over time.
4. Time-Variant
A data warehouse is also time-variant, which means that the data is stored with a time
dimension. This allows for easy access to data for specific time periods, such as the last quarter
or last year. This makes it possible to track trends and patterns over time.
Data Warehousing Tools
ETL (Extract, Transform, Load) Tools
One of the key tools used in data warehousing is ETL (Extract, Transform, Load) tools. These
tools are used to extract data from various sources, transform the data to fit the data warehouse
schema, and then load the data into the warehouse.
Data Mining Techniques

Data mining includes the utilization of refined data analysis tools to find previously unknown,
valid patterns and relationships in huge data sets. These tools can incorporate statistical models,
machine learning techniques, and mathematical algorithms, such as neural networks or decision
trees. Thus, data mining incorporates analysis and prediction.

In recent data mining projects, various major data mining techniques have been developed and
used, including association, classification, clustering, prediction, sequential patterns, and
regression.
1. Classification:

This technique is used to obtain important and relevant information about data and metadata.
This data mining technique helps to classify data into different classes.

Example:
This classification is as per the type of data handled. For example, multimedia, spatial data,
text data, time-series data, World Wide Web, and so on.

Another example is: Classification of spam emails and general emails.

2. Clustering: Clustering is a division of information into groups of connected objects.

we can say that Clustering analysis is a data mining technique to identify similar data. The
process of making a group of abstract objects into classes of similar objects is known as
clustering. One group is treated as a cluster of data objects

For example, a business may collect the following information about consumers:

• Percentage of emails opened


• Number of clicks per email
• Time spent viewing email

Another example It helps marketers to find the distinct groups in their customer base and they can
characterize their customer groups by using purchasing patterns.

3. Regression:
Regression in data mining is a tool that helps predict numerical values in a given data set,
such as predicting temperature, cost or such values. Hence, regression techniques in data
mining are widely popular in business settings, most popularly in marketing, trend analysis
and varied kinds of financial forecasting.

4. Association Rules:

This data mining technique helps to discover a link between two or more items.

Association rules are if-then statements that support showing the probability of interactions
between data items within large data sets in different types of databases. Association rule
mining has several applications and is commonly used to help sales correlations in data or
medical data sets.

The way the algorithm works is that you have various data, For example, a list of grocery items
that you have been buying for the last six months. It calculates the percentage of items being
purchased together.

5. Outlier detection:
This type of data mining technique relates to the observation of data items in the data set, which
do not match an expected pattern or expected behavior. It is also known as Outlier Analysis or
Outlier Mining. The outlier is a data point that diverges too much from the rest of the dataset.

For Example, a dataset of credit card fraud detection contains transactional data of a bank’s
customer who holds a credit card. If we consider the daily transaction amount by the customer
as one of the attributes, then a transaction with a very high amount as compared to the normal
range of the individual’s expenditure will be considered an outlier.

6. Sequential Patterns:

The sequential pattern is a data mining technique specialized for evaluating sequential data to
discover sequential patterns.

In other words, this technique of data mining helps to discover or recognize similar patterns in
transaction data over some time.

Example: Customer shopping sequences: • First buy Computer, then CD-ROM, and then digital
camera, within 12 months.

7. Prediction:

Prediction used a combination of other data mining techniques such as trends, clustering,
classification, etc. It analyzes past events or instances in the right sequence to predict a future
event.

An example of this is, Any retailer can look through a customer database and predict future
transactions by looking at previous transactions. In other words, previous data may allow the
shopkeeper to forecast what will happen in the future, allowing businesspeople to plan
accordingly.

Business Intelligence

Business Intelligence is one of the most powerful tools many organizations use to know their
customer base and market better. It describes the business methodology in which the raw data
is transformed into useful information which helps in decision-making.

Business Intelligence (BI) is a set of ideas, methodologies, processes, architectures, and


technologies that change raw data into significant and useful data for business purposes.
Business Intelligence can handle large amounts of data to help identify and evolve new
opportunities for the business.

Benefits of Business Intelligence

Business intelligence has broad applications, and if talking about the benefits of business
intelligence in the retail sector, nowadays business intelligence tools enable organizations to
take benefit of data not only to assume current sales but also to estimate future potential,
patterns, trends and know the demand of the customer on a deeper level.

Database Management System (Lab):

CREATION OF TABLE

create table student(rollno int ,name char(20),class char(5),marks int);

INSERTION OF DATA

insert into student values(1,'Aman','MBA',75);


insert into student values(2,'Tushar','MBA',85);
insert into student values(3,'Shamit','MBA',95);
insert into student values(5,'Sonam’,'MBA',95);
insert into student values(7,'Akash','MBA',95);
DISPLAY ALL THE CONTENT OF THE TABLE

select * from student;

DISPLAY SELECTED FIELDS OF THE TABLE

select rollno,class from student;

CREATION OF VIEWS
create view student_marks as select rollno,marks from student;

CREATION OF REPROTS
Create a report to show rollno and class of all students.
select rollno,class from student;
create a report that includes all the students who are doing MBA.
select * from student where class=’MBA’;

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy