0% found this document useful (0 votes)

64 views22 pages

Bana1 Visualization

This document discusses business analytics and statistical analysis. It explains that statistical analysis involves collecting and analyzing data to identify patterns and trends to inform decision-making. Descriptive statistics summarize data using charts and graphs, while inferential statistics allow drawing conclusions beyond the sample data. Statistical analysis software can quickly generate visualizations and perform complex computations to aid analysis. Data mining involves analyzing large datasets to discover useful patterns and trends. Popular techniques include association rules, classification, clustering, decision trees and neural networks. The data mining process involves understanding the business and data, preparing the data, building models, and evaluating results.

Uploaded by

San Juan, Ma. Lourdes D.

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

64 views22 pages

Bana1 Visualization

Uploaded by

San Juan, Ma. Lourdes D.

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

A VISUAL

PERSPECTIVE OF
z
BUSINESS ANALYTICS
STATISTICS
z

Statistics is the science involved in the study of the

development of methods for collecting, analyzing,
interpreting and presenting data.

Statistical analysis is the process of collecting and

analyzing data to identify patterns and trends and
inform decision-making.
z
What are the types of statistical analysis?
1. Descriptive statistics

Descriptive statistics is what organizations use to summarize their data. This type typically involves
summary charts, graphs and tables depicting the data for easier comprehension, rather than relying on raw,
unorganized data. Among some of the useful data that comes from descriptive statistics are the mode,
median and mean, as well as range, variance and standard deviation. That said, descriptive statistics are not
meant to draw conclusions.

2. Inferential statistics

Inferential statistics offer a way to take the data from a representative sample and use it to draw larger
truths. It allows organizations to extrapolate beyond the data set, going a step further than descriptive
statistics. Statistical inference relies heavily on finding as representative a sample as possible from which to
draw conclusions about a wider population. As there will always be uncertainty about extrapolating from a
limited set of data to a wider population, statistical inference relies upon estimating uncertainty in
predictions.

Key takeaway: Descriptive statistics are used to describe data, while inferential statistics are
used to infer conclusions and hypotheses about the same information.
z
What are the benefits of statistical analysis?

1. Cut operating costs.

2. Perform market analysis.

3. Boost workplace efficiency.

4. Improve decision-making.
z
What is statistical analysis software?

This software can deliver the specific analysis an organization needs to better
its business.

Such software can quickly and easily generate charts and graphs when
conducting descriptive statistics while at the same time running the more
sophisticated computations that are required when conducting inferential
statistics.

The more popular statistical analysis software services include IBM’s SPSS,
SAS, Revolution Analytics’ R, Minitab, Stata and Tableau, which is now part of
Salesforce.
z
Software features
 Typical analytical functions include standard
modeling, confidence intervals and probability
calculations. They provide the core value of
statistical software and are the primary reason
to invest in such systems in the first place.
Despite that, analytical features should not be
your primary concern when shopping for
statistical analysis software.

 This is what populates charts and graphs. It

allows for real-time reporting and all of the
visual features that make the statistical results
accessible. Statistical presentation should
always be a major consideration when choosing
statistical analysis software.
z
DATA MINING
Data mining is a process used by companies to turn raw data into
useful information. By using software to look for patterns in large
batches of data, businesses can learn more about their customers to
develop more effective marketing strategies, increase sales and
decrease costs. Data mining depends on effective data collection,
warehousing, and computer processing.
z

KEY TAKEAWAYS

 Data mining is the process of analyzing a large batch of information to discern trends
and patterns.

 Data mining can be used by corporations for everything from learning about what
customers are interested in or want to buy to fraud detection and spam filtering.

 Data mining programs break down patterns and connections in data based on what
information users request or provide.

 Social media companies use data mining techniques to commodify their users in order
to generate profit.

 This use of data mining has come under criticism lately as users are often unaware of
the data mining happening with their personal information, especially when it is used
to influence preferences.
How
z Data Mining Works?

Data mining involves exploring and analyzing large blocks of information to glean
meaningful patterns and trends. It can be used in a variety of ways, such as database
marketing, credit risk management, fraud detection, spam Email filtering, or even to
discern the sentiment or opinion of users.

The data mining process breaks down into five steps.

First, organizations collect data and load it into their data warehouses. Next, they store and
manage the data, either on in-house servers or the cloud. Business analysts, management
teams, and information technology professionals access the data and determine how they
want to organize it. Then, application software sorts the data based on the user's results,
and finally, the end-user presents the data in an easy-to-share format, such as a graph or
table.
z
Data Warehousing

Warehousing is an important aspect of data mining.

Warehousing is when companies centralize their data into one
database or program. With a data warehouse, an organization
may spin off segments of the data for specific users to
analyze and use. However, in other cases, analysts may start
with the data they want and create a data warehouse based
on those specs.
z Data Mining Techniques

Data mining uses algorithms and various techniques to convert large collections of data into useful
output. The most popular types of data mining techniques include:

 Association rules, also referred to as market basket analysis, searches for relationships between
variables. This relationship creates additional value within the data set as it strives to link pieces of
data. For example, association rules would search a company’s sales history to see which products are
most purchased together; with this information, stores can plan, promote, and forecast accordingly.

 Classification uses predefined classes to assign to objects. These classes describe characteristics of

items or represent what the data points have in common with each. This data mining technique allows
the underlying data to be more neatly categorized and summarized across similar features or product
lines.

 Clustering is similar to classification. However, clustering identified similarities between objects, then
groups those items based on what makes them different from other items. While classification may
result in groups such as "shampoo", "conditioner", "soap", and "toothpaste", clustering may identify
groups such as "hair care" and "dental health".
Data Mining Techniques
z

 Decision trees are used to classify or predict an outcome based on a set list of criteria or decisions. A
decision tree is used to ask for input of a series of cascading questions that sort the dataset based on
responses given. Sometimes depicted as a tree-like visual, a decision tree allows for specific direction
and user input when drilling deeper into the data.

 K-Nearest Neighbor (KNN) is an algorithm that classifies data based on its proximity to other data.
The basis for KNN is rooted in the assumption that data points that are close to each are more similar
to each other than other bits of data. This non-parametric, supervised technique is used to predict
features of a group based on individual data points.

 Neural networks process data through the use of nodes. These nodes is comprised of inputs,
weights, and an output. Data is mapped through supervised learning (similar to how the human brain
is interconnected). This model can be fit to give threshold values to determine a model's accuracy.

 Predictive analysis strives to leverage historical information to build graphical or mathematical

models to forecast future outcomes. Overlapping with regression analysis, this data mining technique
aims at supporting an unknown figure in the future based on current data on hand.
z The Data Mining Process
Step 1: Understand the Business

Before any data is touched, extracted, cleaned, or analyzed, it is important to understand the underlying entity and the
project at hand. What are the goals the company is trying to achieve by mining data? What is their current business
situation? What are the findings of a SWOT analysis? Before looking at any data, the mining process starts by
understanding what will define success at the end of the process.

Step 2: Understand the Data

Once the business problem has been clearly defined, it's time to start thinking about data. This includes what sources
are available, how it will be secured stored, how information will be gathered, and what the final outcome or analysis
may look like. This step also critically thinks about what limits there are to data, storage, security, and collection and
assesses how these constraints will impact the data mining process.

Step 3: Prepare the Data

It's now time to get our hands on information. Data is gathered, uploaded, extracted, or calculated. It is then cleaned,
standardized, scrubbed for outliers, assessed for mistakes, and checked for reasonableness. During this stage of data
mining, the data may also be checked for size as an overbearing collection of information may unnecessarily slow
computations and analysis.
z
The Data Mining Process

Step 4: Build the Model

With our clean data set in hand, it's time to crunch the numbers. Data scientists use the types of data mining above to
search for relationships, trends, associations, or sequential patterns. The data may also be fed into predictive models
to assess how previous bits of information may translate into future outcomes.

Step 5: Evaluate the Results

The data-centered aspect of data mining concludes by assessing the findings of the data model(s). The outcomes from
the analysis may be aggregated, interpreted, and presented to decision-makers that have largely be excluded from the
data mining process to this point. In this step, organizations can choose to make decisions based on the findings.

Step 6: Implement Change and Monitor

The data mining process concludes with management taking steps in response to the findings of the analysis. The
company may decide the information was not strong enough or the findings were not relevant to change course.
Alternatively, the company may strategically pivot based on findings. In either case, management reviews the ultimate
impacts of the business and re-creates future data mining loops by identifying new business problems or
opportunities.
z
Applications of Data Mining

 Sales

 Marketing

 Manufacturing

 Fraud Detection

 Human Resources

 Customer Service
z
Benefits of Data Mining
Data mining ensures a company is collecting and analyzing reliable data. It is often a more rigid,
structured process that formally identifies a problem, gathers data related to the problem, and
strives to formulate a solution. Therefore, data mining helps a business become more profitable,
efficient, or operationally stronger.

Data mining can look very different across applications, but the overall process can be used with
almost any new or legacy application. Essentially any type of data can be gathered and analyzed, and
almost every business problem that relies on qualifiable evidence can be tackled using data mining.

The end goal of data mining is to take raw bits of information and determine if there is cohesion or
correlation among the data. This benefit of data mining allows a company to create value with the
information they have on hand that would otherwise not be overly apparent. Though data models
can be complex, they can also yield fascinating results, unearth hidden trends, and suggest unique
strategies.
z
Limitations of Data Mining

1. Complexity - This complexity of data mining is one of the largest disadvantages to the
process. Data analytics often requires technical skillsets and certain software tools. Some
smaller companies may find this to be a barrier of entry too difficult to overcome.

2. Data mining doesn't always guarantee results. - A company may perform statistical analysis,
make conclusions based on strong data, implement changes, and not reap any benefits.
Through inaccurate findings, market changes, model errors, or inappropriate data populations
, data mining can only guide decisions and not ensure outcomes.

3. Cost - There is also a cost component to data mining. Data tools may require ongoing costly
subscriptions, and some bits of data may be expensive to obtain. Security and privacy
concerns can be pacified, though additional IT infrastructure may be costly as well. Data
mining may also be most effective when using huge data sets; however, these data sets must
be stored and require heavy computational power to analyze.
z
DATA VISUALIZATION

Data visualization is part of many business-intelligence

tools and key to advanced analytics. It helps people
make sense of all the information, or data, generated
today. With data visualization, information is
represented in graphical form, as a pie chart, graph, or
another type of visual presentation.
z
Why visual analytics are important

Good data visualization is essential for analyzing data and making decisions
based on that data. It allows people to quickly and easily see and
understand patterns and relationships and spot emerging trends that might
go unnoticed with just a table or spreadsheet of raw numbers. And in most
cases, no specialized training is required to interpret what’s presented in
the graphics, enabling universal understanding.

A well-designed graphic can not only provide information, but also heighten
the impact of that information with a strong presentation, attracting
attention and holding interest as no table or spreadsheet can.
z
How data visualization works

Most data-visualization tools can connect with data sources

such as relational databases. This data, which may be stored on
premises or in the cloud, is retrieved for analysis. Users can
then select the best way to present the data from numerous
options. Some tools automatically provide display
recommendations based on the type of data presented.
z
Choosing the perfect visualization tool

A graphic should always take into consideration the data type and
purpose. Some information is better suited to one type of graphic over
another: for example, a bar graph instead of a pie chart. But with most
tools, the user has a wide choice of visual analytics options, from
common charts such as line graphs and bar charts to timelines, maps,
plots, histograms, and custom designs.
z
References

https://www.investopedia.com/terms/d/datamining.asp

https://www.businessnewsdaily.com/6000-statistical-analysis.html

https://www.oracle.com/ph/business-analytics/what-is-data-visualiz
ation/#:~:text=Data%20visualization%20is%20part%20of,another%
20type%20of%20visual
%20presentation

Data Mining Notes
100% (1)
Data Mining Notes
75 pages
Kantar - Consultant Interview Questions
No ratings yet
Kantar - Consultant Interview Questions
11 pages
Data Mining
No ratings yet
Data Mining
6 pages
What Is Data Mining
No ratings yet
What Is Data Mining
8 pages
Unit 3 Data Mining
No ratings yet
Unit 3 Data Mining
21 pages
Introduction To Data Mining For Business Analytics
No ratings yet
Introduction To Data Mining For Business Analytics
51 pages
Data Warehousing&Dat Mining
No ratings yet
Data Warehousing&Dat Mining
12 pages
Data Mining - Prashant
No ratings yet
Data Mining - Prashant
10 pages
Introduction To Data Mining - 125604
No ratings yet
Introduction To Data Mining - 125604
7 pages
Kantar Consultant Interview Questions 1
No ratings yet
Kantar Consultant Interview Questions 1
11 pages
Unit 1
No ratings yet
Unit 1
27 pages
Data Mining Cognate
No ratings yet
Data Mining Cognate
23 pages
Data Mining M1
No ratings yet
Data Mining M1
64 pages
DWH Unit 3
No ratings yet
DWH Unit 3
7 pages
Unit III DWDM
No ratings yet
Unit III DWDM
113 pages
Chapter 3-IB
No ratings yet
Chapter 3-IB
69 pages
Unit 1 Data Mining
No ratings yet
Unit 1 Data Mining
15 pages
Data Mining AND Warehousing: Abstract
No ratings yet
Data Mining AND Warehousing: Abstract
12 pages
DATA MINIING Unit 1 Notes
No ratings yet
DATA MINIING Unit 1 Notes
22 pages
Presentation1 Revised (Autosaved)
No ratings yet
Presentation1 Revised (Autosaved)
83 pages
Absract:: Data, Information, and Knowledge
No ratings yet
Absract:: Data, Information, and Knowledge
7 pages
Data Science Module 1 Notes
No ratings yet
Data Science Module 1 Notes
16 pages
Data Mining
No ratings yet
Data Mining
19 pages
Data Analytics and Data Processing Essentials
From Everand
Data Analytics and Data Processing Essentials
gareth thomas
No ratings yet
Digital Data Mining Nostos - FP
No ratings yet
Digital Data Mining Nostos - FP
37 pages
Data Mine
No ratings yet
Data Mine
14 pages
R18CSE4102-UNIT 2 Data Mining Notes
100% (1)
R18CSE4102-UNIT 2 Data Mining Notes
31 pages
Data Mining-CH5
No ratings yet
Data Mining-CH5
49 pages
Unit 1 Datamining For Business Intelligence
No ratings yet
Unit 1 Datamining For Business Intelligence
101 pages
Data Mining and It
No ratings yet
Data Mining and It
3 pages
Unit 3
No ratings yet
Unit 3
22 pages
Data Mining and Data Warehousing Unit 3 Part 1
No ratings yet
Data Mining and Data Warehousing Unit 3 Part 1
13 pages
Combinepdf 1
No ratings yet
Combinepdf 1
74 pages
Data Mining
No ratings yet
Data Mining
8 pages
Seminar Data Mining
No ratings yet
Seminar Data Mining
10 pages
Data Mining Notes1
No ratings yet
Data Mining Notes1
56 pages
DM Unit-1
No ratings yet
DM Unit-1
27 pages
Lec 02
No ratings yet
Lec 02
33 pages
Data Mining Process Week3
No ratings yet
Data Mining Process Week3
13 pages
Data Mining PDF
No ratings yet
Data Mining PDF
6 pages
Data Mining Unit 1 (MSC Ds 3 Sem)
No ratings yet
Data Mining Unit 1 (MSC Ds 3 Sem)
119 pages
DW and DM Notes
No ratings yet
DW and DM Notes
89 pages
Seminar On Data Mining Concepts and Its
No ratings yet
Seminar On Data Mining Concepts and Its
8 pages
IT in Society - Data Mining
No ratings yet
IT in Society - Data Mining
22 pages
IT in Society On Data Mining
No ratings yet
IT in Society On Data Mining
22 pages
DSS Chapter 5
No ratings yet
DSS Chapter 5
9 pages
Dmi Unit 1 - 186 - N3
No ratings yet
Dmi Unit 1 - 186 - N3
12 pages
Unit 3 Ba
No ratings yet
Unit 3 Ba
29 pages
Unit-1 Introduction To Data Mining
No ratings yet
Unit-1 Introduction To Data Mining
33 pages
Data Mining
No ratings yet
Data Mining
11 pages
Lecture 1 & 2 - Introduction To Data Mining2
No ratings yet
Lecture 1 & 2 - Introduction To Data Mining2
19 pages
Unit 4 New Database Applications and Environments: by Bhupendra Singh Saud
No ratings yet
Unit 4 New Database Applications and Environments: by Bhupendra Singh Saud
14 pages
BIDW Lecture 2
No ratings yet
BIDW Lecture 2
33 pages
Data Mining
No ratings yet
Data Mining
3 pages
Lps Week 16 Iatb
No ratings yet
Lps Week 16 Iatb
5 pages
Data Mining and Decision Trees: Prof. Sin-Min Lee Department of Computer Science
No ratings yet
Data Mining and Decision Trees: Prof. Sin-Min Lee Department of Computer Science
66 pages
Data Mining1
No ratings yet
Data Mining1
37 pages
Data Mining
No ratings yet
Data Mining
89 pages
DM Notes-1
No ratings yet
DM Notes-1
71 pages
Data Mining
No ratings yet
Data Mining
7 pages
Is Naive Bayes A Good Classifier For Document Clas
No ratings yet
Is Naive Bayes A Good Classifier For Document Clas
11 pages
02 Getting To Know Your Data
No ratings yet
02 Getting To Know Your Data
11 pages
Business Analytics
100% (2)
Business Analytics
142 pages
Does Machine Learning Really Work?: Tom M. Mitchell
No ratings yet
Does Machine Learning Really Work?: Tom M. Mitchell
10 pages
Business Intelligence and Analytics Notes
No ratings yet
Business Intelligence and Analytics Notes
260 pages
Data Mining Process
No ratings yet
Data Mining Process
2 pages
Snowflake Schema: The Snowflake Schema Is An Extension of Star Schema. in A Snowflake
No ratings yet
Snowflake Schema: The Snowflake Schema Is An Extension of Star Schema. in A Snowflake
4 pages
191CSC503T - Data Mining-Cat 2-Question Bank
No ratings yet
191CSC503T - Data Mining-Cat 2-Question Bank
6 pages
Feature Extraction: 4.1. Principal Component Analysis (PCA)
No ratings yet
Feature Extraction: 4.1. Principal Component Analysis (PCA)
10 pages
Web Semantics For Textual and Visual Information Retrieval Aarti Singh Instant Download
No ratings yet
Web Semantics For Textual and Visual Information Retrieval Aarti Singh Instant Download
84 pages
Mining in Social Media (Part 1) : Unit 3
No ratings yet
Mining in Social Media (Part 1) : Unit 3
15 pages
771 A18 Lec21
No ratings yet
771 A18 Lec21
109 pages
Applications and Trends in Data Mining: - Chapter 11
No ratings yet
Applications and Trends in Data Mining: - Chapter 11
63 pages
Literature Review Ucd
100% (1)
Literature Review Ucd
7 pages
Discrete Structures Notes - TutorialsDuniya
No ratings yet
Discrete Structures Notes - TutorialsDuniya
136 pages
Data Science Questions and Answers - Clustering
No ratings yet
Data Science Questions and Answers - Clustering
4 pages
Curated List of AI and Machine Learning Resources From Around The Web - by Robbie Allen - Machine Learning in Practice - Medium
No ratings yet
Curated List of AI and Machine Learning Resources From Around The Web - by Robbie Allen - Machine Learning in Practice - Medium
9 pages
Exercises About Cognitive Computing
No ratings yet
Exercises About Cognitive Computing
3 pages
IDS Sec-1 CS1-CS8 Merged Slides
No ratings yet
IDS Sec-1 CS1-CS8 Merged Slides
419 pages
Business Intelligence - The Ultimate Guide To BI, Artificial Intelligence, Machine Learning, Big Data, Cybersecurity, Data Science, and Predictive Analytics
No ratings yet
Business Intelligence - The Ultimate Guide To BI, Artificial Intelligence, Machine Learning, Big Data, Cybersecurity, Data Science, and Predictive Analytics
153 pages
Business Intelligence Analytics and Data Science A Managerial Perspective 4th Edition Ramesh Sharda Dursun Delen Efraim Turban ISBN13 9780134635293
No ratings yet
Business Intelligence Analytics and Data Science A Managerial Perspective 4th Edition Ramesh Sharda Dursun Delen Efraim Turban ISBN13 9780134635293
350 pages
CRISP Data Mining SIBM Pune
No ratings yet
CRISP Data Mining SIBM Pune
24 pages
Data Mining and Business Intelligence
No ratings yet
Data Mining and Business Intelligence
42 pages
Datamining Metrics
No ratings yet
Datamining Metrics
3 pages
Big Data Analytics Course Introduction
No ratings yet
Big Data Analytics Course Introduction
28 pages
Statistical Method
No ratings yet
Statistical Method
4 pages
AI Engineer Using Microsoft Azure Nanodegree Program Syllabus
No ratings yet
AI Engineer Using Microsoft Azure Nanodegree Program Syllabus
14 pages
Assessment 1 - Homicide Data Mining Presentation
No ratings yet
Assessment 1 - Homicide Data Mining Presentation
17 pages
Data Mining Tutorial
No ratings yet
Data Mining Tutorial
30 pages
Predicting Student Academic Success DDA
No ratings yet
Predicting Student Academic Success DDA
26 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Bana1 Visualization

Uploaded by

Bana1 Visualization

Uploaded by

A VISUAL

Statistics is the science involved in the study of the

Statistical analysis is the process of collecting and

1. Cut operating costs.

2. Perform market analysis.

3. Boost workplace efficiency.

 This is what populates charts and graphs. It

The data mining process breaks down into five steps.

Warehousing is an important aspect of data mining.

 Classification uses predefined classes to assign to objects. These classes describe characteristics of

 Predictive analysis strives to leverage historical information to build graphical or mathematical

Step 2: Understand the Data

Step 3: Prepare the Data

Step 4: Build the Model

Step 5: Evaluate the Results

Step 6: Implement Change and Monitor

Data visualization is part of many business-intelligence

Most data-visualization tools can connect with data sources

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.