Unit 2
Unit 2
Overall, descriptive analytics is a valuable tool for gaining insights into historical data
and identifying areas for further investigation. However, it is important to use
descriptive analytics in conjunction with other types of data analysis, such as diagnostic,
predictive, and prescriptive analytics, to gain a more complete understanding of the
data and make informed decisions.
Descriptive Analytics Example - Suppose a company wants to analyze its sales data for
the past year to gain insights into its performance. The company collects data on sales
revenue, sales volume, and customer demographics, among other variables.
Using descriptive analytics, the company can summarize the data and gain insights such
as:
Total revenue: The company generated $10 million in revenue over the past year.
Sales volume: The company sold 100,000 units of product A, 50,000 units of product B,
and 25,000 units of product C.
Customer demographics: The majority of customers were between the ages of 25-45 and
lived in urban areas.
Sales by region: The company generated the highest revenue in the Northeast region,
followed by the West and South regions.
The company can then use this information to make informed decisions, such as:
Increasing marketing efforts in urban areas to target its primary customer demographic.
Developing new products to expand its product line and increase sales volume.
Investing in the Northeast region to further grow sales in that area.
Overall, descriptive analytics provides a starting point for analyzing data and gaining
insights into past performance. By summarizing and visualizing data, companies can
identify trends and patterns that can help inform decision-making and improve business
outcomes.
Descriptive analytics involves several methods and techniques for analyzing and
summarizing data. Here are some common methods used in descriptive analytics:
1. Data visualization: This involves representing data in a visual format such as graphs,
charts, and tables to help identify patterns and relationships. Examples of data
visualization techniques include scatter plots, bar charts, and pie charts.
2. Measures of central tendency: These are statistical measures used to describe the center
of a dataset. Common measures of central tendency include mean, median, and mode.
3. Measures of variability: These are statistical measures used to describe the spread or
variability of a dataset. Common measures of variability include range, standard
deviation, and variance.
4. Frequency distribution: This involves organizing data into categories and determining
the frequency of each category. This can be helpful in identifying patterns and trends in
data.
5. Percentiles: This involves dividing a dataset into 100 equal parts and determining the
position of a specific value within the dataset. This can be helpful in understanding the
distribution of data and identifying outliers.
6. Cross-tabulation: This involves analyzing the relationship between two or more
variables by creating a table that shows the frequency of each combination of values.
1. Microsoft Excel: This is a popular spreadsheet program that can be used for analyzing
and summarizing data. Excel has built-in functions for calculating measures of central
tendency and variability, as well as tools for creating charts and tables.
2. Tableau: This is a business intelligence software that can be used for data visualization
and analysis. Tableau allows users to create interactive dashboards and visualizations that
can help identify patterns and trends in data.
3. Python: This is a programming language that can be used for data analysis and
visualization. Python has many libraries and packages, such as NumPy, Pandas, and
Matplotlib, that can be used for performing descriptive analytics.
4. SPSS: This is a statistical analysis software that can be used for descriptive and
inferential analytics. SPSS has a range of tools for calculating measures of central
tendency and variability, creating frequency distributions, and cross-tabulating data.
5. R Programming: This is a programming language and software environment for
statistical computing and graphics. R has many packages and libraries, such as ggplot2
and dplyr, that can be used for performing descriptive analytics.
Data visualization can be utilized for a variety of purposes, and it’s important
to note that is not only reserved for use by data teams. Management also
leverages it to convey organizational structure and hierarchy while data
analysts and data scientists use it to discover and explain patterns and
trends.
Tables: This consists of rows and columns used to compare variables. Tables
can show a great deal of information in a structured way, but they can also
overwhelm users that are simply looking for high-level trends.
Pie charts and stacked bar charts: These graphs are divided into sections that
represent parts of a whole. They provide a simple way to organize data and
compare the size of each component to one other.
Line charts and area charts: These visuals show change in one or more
quantities by plotting a series of data points over time and are frequently used
within predictive analytics. Line graphs utilize lines to demonstrate these
changes while area charts connect data points with line segments, stacking
variables on top of one another and using color to distinguish between
variables.
Histograms: This graph plots a distribution of numbers using a bar chart (with
no spaces between the bars), representing the quantity of data that falls within a
particular range. This visual makes it easy for an end user to identify outliers
within a given dataset.
Scatter plots: These visuals are beneficial in reveling the relationship between
two variables, and they are commonly used within regression data analysis.
However, these can sometimes be confused with bubble charts, which are used
to visualize three variables via the x-axis, the y-axis, and the size of the bubble.
Heat maps: These graphical representation displays are helpful in visualizing
behavioral data by location. This can be a location on a map, or even a
webpage.
Tree maps, which display hierarchical data as a set of nested shapes, typically
rectangles. Treemaps are great for comparing the proportions between
categories via their area size.
ECM Framework
Enterprise content management (ECM) is a set of
capabilities for capturing, storing, activating, analyzing and
automating business content, in order to provide new value
from data that was previously unstructured and unavailable.
Why enterprise content management is important to your business
Content is the currency that fuels and funds digital transformation. Content possesses
useful information about customers—their behaviors, sentiments and value to the
organization—but only if you can harness it. Collectively, content buried in
repositories, file shares and cloud folders across the enterprise represents the
knowledge of the organization.
Now, more than ever, ECM software is an essential element on the journey to
becoming a digital business.
Analyze: With all of these documents and contact types now entered
into the system, you need to be able to explore these documents in a
sophisticated manner. ECM should be able to extract and infer
metadata and create bridges between unstructured content and search
capabilities.