Business Analytics Summary (Units 1.2 - 1.8)
Business Analytics Summary (Units 1.2 - 1.8)
8)
1.2 Introduction
● What is Data?
○ Data is like raw information. It's the basic stuff we collect, like numbers, text, or
observations.
○ Think of it as the ingredients before you cook something.
○ Data itself has meaning and value, but it's hard to understand until we process it.
● Structured vs. Unstructured Data:
○ Structured Data: This data is organized in a clear way, like in tables, lists, or
spreadsheets. It's easy to work with directly.
■ Example: An Excel sheet with names and ages in columns.
○ Unstructured Data: This data is not organized. It's like a messy pile of information.
It needs to be processed first to become useful.
■ Example: A paragraph of text, or a satellite image.
● Why understand Data Types?
○ Knowing data types helps us choose the right way to analyze data and present
results.
● Main Categories of Data:
○ Data is broadly divided into two main categories: Qualitative and Quantitative.
● Qualitative (Categorical) Data:
○ This type describes characteristics or qualities. You can't do math with it easily.
○ Example: Gender, languages spoken, types of diseases, clothing sizes.
○ Nominal Data: These are categories with no specific order.
■ Example: Eye color (blue, brown, green), types of pets. You can count how
many of each, but there's no "higher" or "lower."
○ Ordinal Data: These are categories that can be ordered logically.
■ Example: Clothing sizes (Small, Medium, Large). There's a clear order, but
the difference between "Small" and "Medium" might not be the same as
"Medium" and "Large."
● Quantitative (Numerical) Data:
○ This type of data consists of numbers. You can perform calculations with it.
○ Discrete Data: These are distinct, separate values that can only be counted. They
usually involve whole numbers.
■ Example: Number of cars, population.
○ Continuous Data: These represent measurements and can take any value within a
range.
■ Example: Height, weight, distance.
● Data Quality - Important points:
○ Relevance: Data should be useful and meet the needs of the user.
○ Accuracy: Data must be true and correct. Inaccurate data is useless for decisions.
○ Timeliness: Data needs to be available when it's meaningful, like stock information
updating instantly after an order.
○ Interpretability: Data should be easy to understand. Sometimes, extra information
(metadata) helps explain it.
○ Coherence: Data should be consistent over time and comparable across different
programs. This helps in tracking changes.
○ Accessibility: Data should be easy for people to find, get, use, and understand.
● Big Data is used in almost every field now, like banking, healthcare, and entertainment.
● Healthcare:
○ Big Data helps predict health risks and personalize treatments.
○ Doctors can predict diseases based on a patient's medical history, genetic data,
and lifestyle.
○ Example: IBM Watson Health uses Big Data to help doctors diagnose and treat
diseases like cancer by processing huge medical datasets.
● Retail and E-commerce:
○ Retailers use Big Data to understand customer behavior and preferences.
○ This helps them give personalized shopping experiences, increasing customer
satisfaction and sales.
○ Example: Amazon recommends products based on your past purchases and
browsing history.
● Finance and Banking:
○ Financial institutions use Big Data to detect fraud by watching transaction patterns
and finding unusual activities in real-time.
○ Example: PayPal uses Big Data algorithms to flag unusual transactions, like
purchases from unfamiliar locations or devices.
● Transportation:
○ Transportation systems use Big Data to analyze traffic, find better routes, and
reduce traffic jams.
○ Example: Google Maps collects data from GPS and sensors to give real-time traffic
updates and suggest alternative routes.
● Media and Entertainment:
○ Streaming platforms use Big Data to analyze viewing habits and recommend
personalized content.
○ Example: Netflix studies what shows you watch or skip to suggest similar content,
keeping you engaged.
● Education:
○ Educational platforms use Big Data to track student performance, find learning
gaps, and customize study materials.
○ Example: Khan Academy analyzes user progress to recommend specific exercises
or videos if a student is struggling.
● Manufacturing:
○ Big Data helps manufacturers monitor equipment performance and predict potential
failures.
○ This reduces downtime and maintenance costs.
○ Example: General Electric monitors its engines using Big Data to schedule
maintenance before breakdowns.
● Government:
○ Governments use Big Data for urban planning and to optimize resources, making
cities smarter.
○ Example: Singapore's Smart Nation initiative uses IoT data to manage traffic and
monitor air quality.
● Energy and Utilities:
○ Energy providers use Big Data to analyze consumption patterns, forecast demand,
and improve energy efficiency.
○ Example: Smart meters collect electricity usage data, helping companies adjust
power distribution and avoid outages.
● Social Media:
○ Social media sites scan user-generated content (posts, comments) to understand
public opinion and trends.
○ Example: Twitter uses Big Data for sentiment analysis during elections to
understand public opinion about candidates.
● Conclusion:
○ These examples show how Big Data is changing industries by offering new
solutions to complex problems.
○ Understanding and using Big Data's power is very important today.
● Data analytics is very powerful, but it also comes with some challenges that organizations
need to overcome to get good results.
● Poor Data Quality:
○ Getting bad quality data, like old or inconsistent data, is a big problem.
○ Such data can lead to incorrect analysis and wrong decisions.
○ Example: If a company uses wrong customer details, it might send irrelevant
marketing messages, making customers lose trust.
● Data Integration Complexity:
○ Combining data from many different sources (like databases, IoT devices, social
media) can be very complex.
○ This is because data often comes in varying formats and structures.
○ This incompatibility can cause delays and incomplete analysis.
● High Resource Needs (for Big Data):
○ The huge size of Big Data makes storing, processing, and analyzing it very
demanding on resources.
○ Handling petabytes of data requires a lot of computing power, which can strain a
company's infrastructure.
● Real-time Processing Difficulty:
○ Getting immediate insights from data (real-time processing) is hard, especially with
very fast-moving data streams.
○ Delayed insights can lead to missed opportunities in situations where time is critical,
like fraud detection.
● Data Security and Privacy:
○ Protecting sensitive data from security breaches and making sure rules like GDPR
or HIPAA are followed is very important.
○ Any data leaks can harm a company's reputation and lead to legal penalties.
● Shortage of Skilled Professionals:
○ There aren't enough qualified data scientists and analysts who know how to use
advanced tools and techniques.
○ This shortage means organizations might struggle to get useful insights from
complex datasets, slowing down decision-making.
● Complexity of Advanced Techniques:
○ Techniques like machine learning and predictive analytics need advanced
knowledge and powerful computing resources.
○ Smaller organizations might find it hard to adopt these methods effectively.
● Conclusion:
○ To get all the benefits of data analytics, organizations must overcome these
challenges.
○ Investing in technology, skilled people, and ethical practices can turn these
obstacles into opportunities for growth.