Predictive Analytics

The document provides a comprehensive overview of analytics, including its definition, importance, and applications in decision-making and predictive analytics. It covers various statistical methods such as linear and multiple regression, logistic regression, decision trees, and unstructured data analysis, along with their applications in different fields. Additionally, it discusses forecasting techniques, time series analysis, and accuracy metrics for predictions.

Module 1

1. Overview & Definition of Analytics

• Analytics refers to the use of data, statistical analysis, and modeling to solve problems, gain
insights, and make decisions.

• It involves collecting, processing, and interpreting data to find meaningful patterns.

2. Need for Analytics

• Organizations generate huge amounts of data; analytics helps convert this data into
actionable insights.

• Helps improve efficiency, reduce costs, and predict trends.

• Necessary for staying competitive in today’s data-driven world.

3. Analytics in Decision Making

• Analytics supports evidence-based decision making.

• Reduces guesswork and increases the accuracy of business decisions.

• Used in areas like marketing, finance, operations, and HR to make better decisions.

4. Analytics as a Game Changer and Innovator

• Transforms how businesses operate by providing real-time insights.

• Drives innovation by identifying new opportunities.

• Examples: Personalized recommendations by Amazon or Netflix.

5. Power of Analytics

• Enables data-driven strategies.

• Helps in understanding customer behavior, market trends, and operational performance.

• Supports automation and optimization of processes.

6. Predictive Analytics

• A branch of analytics that uses historical data and statistical models to predict future
outcomes.

• Common tools: Regression analysis, machine learning, forecasting models.

• Example: Predicting customer churn or future sales.


Module 2

1. Types of Predictive Analytics

• These are the main approaches to predicting future outcomes:

• Classification Models
➤ Predict categories (e.g., spam or not spam)
➤ Used in email filters, fraud detection

• Regression Models
➤ Predict numeric values (e.g., sales, revenue)
➤ Used in forecasting

• Time Series Analysis


➤ Predict future trends based on time-based data
➤ Used in stock price or demand forecasting

• Clustering
➤ Group similar data (not direct prediction but used for segmentation)
➤ Used in market research

2. Techniques of Predictive Analytics

• These are the tools and methods used:

• Linear & Logistic Regression


➤ Predict continuous and binary outcomes

• Decision Trees
➤ Easy-to-understand visual models for decision-making

• Neural Networks & Deep Learning


➤ Powerful for complex patterns like image/speech recognition

• Random Forest, XGBoost


➤ Advanced ensemble techniques for higher accuracy

• Machine Learning (ML)


➤ Used for self-learning predictive models
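
As an illustration of how these techniques are applied in practice, here is a minimal Python sketch (using scikit-learn, with a synthetic dataset invented for the example) that trains a random forest, one of the ensemble techniques listed above:

# Illustrative sketch: random forest classifier on synthetic data (all values invented)
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Synthetic classification data: 1,000 rows, 10 features, 2 classes
X, y = make_classification(n_samples=1000, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Ensemble of 200 decision trees; averaging many trees reduces overfitting of a single tree
model = RandomForestClassifier(n_estimators=200, random_state=42)
model.fit(X_train, y_train)

print("Test accuracy:", accuracy_score(y_test, model.predict(X_test)))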

3. Applications of Predictive Analytics

• Manufacturing

• Healthcare
o Personalized treatment plans

• Telecommunication
o Network optimization
o Fraud detection

• Supply Chain
o Inventory forecasting
o Optimize delivery routes
o Supplier risk assessment

• Information Technology
o Cyber threat prediction
o Server failure prediction
o IT resource planning

4. Digital Analytics (Simplified)

• Definition:

• Digital Analytics is the analysis of digital data from websites, mobile apps, social media, etc.,
to optimize user experience and business outcomes.

• Key Focus Areas:

• Website traffic (page views, bounce rate)

• User behavior (clicks, time spent)

• Conversion tracking (sales, signups)

• Social media metrics (likes, shares, comments)

• Tools Used:

• Google Analytics

• Adobe Analytics

• Social media insights (Meta, X, LinkedIn)

• Heatmaps (Hotjar, Crazy Egg)

MODULE 3

1. Simple Linear Regression (SLR): Introduction

• Simple Linear Regression (SLR) is a statistical method to predict the value of one dependent variable (Y) using one independent variable (X).

• It fits a straight line to the data:


Y = β₀ + β₁X + ε
Where:

o Y = dependent variable (outcome)


o X = independent variable (predictor)

o β₀ = intercept

o β₁ = slope

o ε = error term

2. Importance of SLR

• Helps in predicting outcomes (e.g., predicting sales based on advertising spend)

• Useful for understanding relationships between variables

• Forms the foundation for more advanced regression models

3. Types of Regression (Basic Mention)

• Simple Linear Regression – one independent variable

• Multiple Linear Regression – more than one independent variable

4. SLR Model Building Steps

1. Identify variables – Define X and Y

2. Collect data – Numerical, continuous values

3. Plot the data – Scatter plot to check linearity

4. Fit the line – Use regression to find β₀ and β₁

5. Evaluate model – Check goodness of fit

5. OLS Estimation (Ordinary Least Squares)

• A method to find the best-fitting line by minimizing the sum of squared errors (the differences between actual and predicted Y values)

• Gives estimates of β₀ (intercept) and β₁ (slope)
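
• Minimizing the squared errors gives the standard closed-form estimates (X̄ and Ȳ are the sample means of X and Y):

β₁ = Σ (Xᵢ − X̄)(Yᵢ − Ȳ) / Σ (Xᵢ − X̄)²

β₀ = Ȳ − β₁X̄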

6. Model Interpretation

• Intercept (β₀): Value of Y when X = 0

• Slope (β₁): Change in Y for a one-unit change in X

• R² (R-squared): Proportion of variation in Y explained by the model (ranges from 0 to 1)

7. Model Validation

• Residual analysis: Errors should be randomly scattered (no pattern)

• Check assumptions:
o Linearity
o Independence
o Homoscedasticity (constant variance)
o Normality of residuals

• Use metrics:

o R²

o RMSE (Root Mean Square Error)

o p-value (for statistical significance of β₁)
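
A minimal Python sketch of fitting and evaluating an SLR model with OLS (the statsmodels library is assumed; the advertising-vs-sales numbers are invented for illustration):

# Illustrative sketch: simple linear regression with OLS (invented data)
import numpy as np
import statsmodels.api as sm

ad_spend = np.array([10, 15, 20, 25, 30, 35, 40], dtype=float)   # X
sales    = np.array([25, 33, 41, 47, 58, 62, 71], dtype=float)   # Y

X = sm.add_constant(ad_spend)          # adds the intercept term β₀
model = sm.OLS(sales, X).fit()         # OLS minimizes the sum of squared errors

print(model.params)                    # β₀ (intercept) and β₁ (slope)
print("R-squared:", model.rsquared)    # goodness of fit (0 to 1)
print("p-value for slope:", model.pvalues[1])

residuals = model.resid                # residuals for validation checks
rmse = np.sqrt(np.mean(residuals ** 2))
print("RMSE:", rmse)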

MODULE 4

1. Multiple Linear Regression (MLR): Introduction

• MLR is used to predict the value of a dependent variable (Y) using two or more
independent variables (X₁, X₂, ..., Xn).

• General equation:
Y = β₀ + β₁X₁ + β₂X₂ + ... + βnXn + ε
Where:

o Y = Dependent variable

o X₁, X₂, ... Xn = Independent variables

o β₀ = Intercept, β₁...βn = Coefficients

o ε = Error term

2. Estimation of Regression Parameters

• Done using Ordinary Least Squares (OLS) method

• Objective: Minimize sum of squared residuals (errors)

• Output: Coefficients (β-values) that best fit the data

3. Model Diagnostics

Helps check if the model is valid and reliable:

• R²: How well independent variables explain variation in Y

• Adjusted R²: Better for MLR; adjusts for number of variables

• Residual plots: Should be randomly scattered


• p-values: Check if variables are statistically significant (p < 0.05)

• F-test: Tests overall significance of the model

4. Dummy, Derived & Interaction Variables

Dummy Variables

• Used to represent categorical data (e.g., Male = 0, Female = 1)

• Needed because regression requires numeric input

Derived Variables

• Created by transforming existing variables

• Example: log(salary), age²

Interaction Variables

• Show combined effect of two variables

• Example: X₁ * X₂ — effect of X₁ depends on X₂
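
A small pandas sketch of creating these three kinds of variables (the column names and values are hypothetical):

# Illustrative sketch: dummy, derived, and interaction variables with pandas (hypothetical data)
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "gender": ["Male", "Female", "Female", "Male"],
    "age":    [25, 32, 47, 51],
    "salary": [30000, 45000, 80000, 95000],
})

# Dummy variable: categorical -> numeric (drop_first avoids a redundant column)
df = pd.get_dummies(df, columns=["gender"], drop_first=True)   # creates gender_Male

# Derived variables: transformations of existing columns
df["log_salary"] = np.log(df["salary"])
df["age_sq"] = df["age"] ** 2

# Interaction variable: combined effect of two predictors
df["age_x_male"] = df["age"] * df["gender_Male"]

print(df)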

5. Multicollinearity

• When independent variables are highly correlated with each other

• It distorts regression results

• Detected using:

o VIF (Variance Inflation Factor): VIF > 10 indicates a problem

• Solution:

o Drop one of the correlated variables

o Use Principal Component Analysis (PCA)
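
A short Python sketch of checking VIF with statsmodels (the predictors below are simulated so that two of them are highly correlated):

# Illustrative sketch: detecting multicollinearity with VIF (simulated predictors)
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(0)
x1 = rng.normal(size=100)
x2 = x1 + rng.normal(scale=0.05, size=100)   # nearly a copy of x1 -> high correlation
x3 = rng.normal(size=100)
X = pd.DataFrame({"x1": x1, "x2": x2, "x3": x3})

X_const = sm.add_constant(X)                 # include the intercept when computing VIF
vif = pd.DataFrame({
    "variable": X_const.columns,
    "VIF": [variance_inflation_factor(X_const.values, i) for i in range(X_const.shape[1])],
})
print(vif)   # VIF > 10 for x1 and x2 signals multicollinearity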

6. Model Deployment

• Deploying a model means using it in real-world applications

• Steps:

1. Finalize model

2. Convert into software/script

3. Integrate into systems (e.g., websites, apps)

4. Monitor performance
7. Demo Using Software

• Tools like Excel, Python, R, SPSS, or SAS are used for MLR

• Common steps in software:


➤ Import data → Fit model → Interpret output (coefficients, R², p-values) → Plot diagnostics
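
A minimal Python version of that workflow using statsmodels' formula interface (the dataset is a tiny invented stand-in for real data):

# Illustrative sketch of the MLR workflow in Python
import pandas as pd
import statsmodels.formula.api as smf

# In practice the data would be imported, e.g. with pd.read_csv();
# here a tiny invented DataFrame stands in for it.
data = pd.DataFrame({
    "sales":    [120, 150, 170, 200, 230, 260],
    "ad_spend": [10, 15, 18, 25, 30, 34],
    "price":    [9.5, 9.0, 9.2, 8.8, 8.5, 8.3],
})

model = smf.ols("sales ~ ad_spend + price", data=data).fit()   # Fit model
print(model.summary())    # Interpret output: coefficients, R², p-values
print(model.resid)        # residuals for diagnostic plots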

MODULE 5

Logistic regression is a supervised machine learning algorithm used for classification tasks, where the goal is to predict the probability that an instance belongs to a given class. It is a statistical method that models the relationship between one or more predictor variables and a binary outcome. This module covers the fundamentals of logistic regression, its types, and its implementation.

• The outcome can be Yes or No, 0 or 1, True or False, etc., but instead of returning the exact values 0 and 1, the model gives probabilities that lie between 0 and 1.

• In logistic regression, instead of fitting a straight regression line, we fit an "S"-shaped logistic (sigmoid) function whose output is bounded between 0 and 1.
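
• The S-shaped curve is the sigmoid function: p = 1 / (1 + e^(−(β₀ + β₁X)))

A minimal scikit-learn sketch with invented study-hours data (for illustration only):

# Illustrative sketch: logistic regression for a binary outcome (invented data)
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hours studied (X) vs. pass/fail (y = 1/0)
X = np.array([[1], [2], [3], [4], [5], [6], [7], [8]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

clf = LogisticRegression()
clf.fit(X, y)

print(clf.predict_proba([[4.5]]))   # probabilities for class 0 and class 1
print(clf.predict([[4.5]]))         # predicted class (0 or 1)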

MODULE 6

1. Overview of Decision Trees

• A decision tree is a flowchart-like model used for classification and regression tasks.

• It splits data into branches based on conditions to make predictions.

• Simple to visualize, interpret, and explain.

2. Applications of Decision Trees

Used in many fields like:

• Marketing → Predict customer behavior

• Finance → Credit risk scoring

• Healthcare → Disease diagnosis

• HR → Predict employee attrition

5. Introduction to CHAID

• CHAID = Chi-Square Automatic Interaction Detector

• Used for categorical target variables

• Splits based on Chi-square tests (checks statistical significance)

• Creates multi-way splits (unlike binary splits in CART)

6. Classification and Regression Tree (CART)

• Common decision tree algorithm


• Uses binary splits only (Yes/No)

• Two types:

o Classification Tree: For categorical output (e.g., yes/no)

o Regression Tree: For numeric output (e.g., price, salary)

• Splits based on:

o Gini Index (for classification)

o Mean Squared Error (MSE) (for regression)
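
A small sketch of a CART-style classification tree in scikit-learn, using Gini splits on the built-in iris sample data (for illustration only):

# Illustrative sketch: CART-style classification tree with Gini splits
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

tree = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=42)
tree.fit(X_train, y_train)

print("Test accuracy:", tree.score(X_test, y_test))
print(export_text(tree))   # text view of the binary (Yes/No) splits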

MODULE 7
1. Introduction to Unstructured Data Analysis

What is Unstructured Data?

• Data that doesn’t follow a fixed format

• Examples:

o Text (emails, reviews, tweets)

o Images, videos, audio

Why is it important?

• An estimated 80–90% of the data generated today is unstructured

• Analyzing it helps discover hidden insights (e.g., customer opinions, trends)

2. Sentiment Analysis

Definition:

• A technique to determine whether text expresses positive, negative, or neutral emotion.

Example:

• “This product is amazing!” → Positive

• “Worst service ever.” → Negative

Applications:

• Customer feedback analysis

• Social media monitoring

• Brand reputation tracking


Techniques:

• Lexicon-based: Uses predefined word lists

• Machine Learning-based: Uses models like Naïve Bayes or SVM

3. Naïve Bayes Algorithm

Overview:

• A supervised machine learning algorithm based on Bayes' Theorem

• Called "naïve" because it assumes all features (words) are independent of each other

Formula:

• P(A|B) = [P(B|A) * P(A)] / P(B)

o A = class (e.g., positive/negative)

o B = input data (e.g., words in review)

How it works (for text):

1. Learn probabilities of words in each class

2. For new text, calculate probability for each class

3. Choose the class with the highest probability
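
A minimal sketch of these steps for sentiment classification with scikit-learn (the tiny training set is invented):

# Illustrative sketch: Naïve Bayes sentiment classification (invented dataset)
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

train_texts = ["this product is amazing", "great quality and fast delivery",
               "worst service ever", "terrible product, total waste"]
train_labels = ["positive", "positive", "negative", "negative"]

vectorizer = CountVectorizer()
X_train = vectorizer.fit_transform(train_texts)   # word counts per review

clf = MultinomialNB()
clf.fit(X_train, train_labels)                    # 1. learn word probabilities per class

X_new = vectorizer.transform(["amazing quality, great service"])
print(clf.predict_proba(X_new))                   # 2. probability for each class
print(clf.predict(X_new))                         # 3. class with the highest probability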

Applications:

• Spam detection

• Email filtering

• Sentiment classification

• Text categorization

MODULE 8
1. Forecasting

• Forecasting is predicting future values based on past data.

• Used in sales, finance, inventory, weather prediction, etc.

• Helps in planning and decision making.

2. Time Series Analysis

• Deals with data collected over time at regular intervals (daily, monthly, yearly).

• Goal: Identify patterns/trends to forecast future points.


• Components of time series:

o Trend: Long-term upward or downward movement

o Seasonality: Repeating patterns at fixed periods (e.g., sales peak in December)

o Cyclic: Irregular, long-term fluctuations

o Random (noise): Unpredictable variations

3. Additive & Multiplicative Models

• Additive Model:
Time series = Trend + Seasonality + Noise
Use when seasonal variations are constant over time

• Multiplicative Model:
Time series = Trend × Seasonality × Noise
Use when seasonal variations increase/decrease proportionally with trend

4. Forecasting Accuracy

• Measures how close forecasted values are to actual values.

• Common metrics:

o MAD (Mean Absolute Deviation)

o MSE (Mean Squared Error)

o MAPE (Mean Absolute Percentage Error)
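
• With Aₜ = actual value, Fₜ = forecast, and n = number of periods:

o MAD = (1/n) Σ |Aₜ − Fₜ|

o MSE = (1/n) Σ (Aₜ − Fₜ)²

o MAPE = (100/n) Σ |(Aₜ − Fₜ) / Aₜ|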

5. Moving Average Models

• Smooths the time series by averaging data points over a fixed window.

• Types:

o Simple Moving Average (SMA): Equal weights to all points

o Weighted Moving Average (WMA): Different weights, recent data given more
importance
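
A short pandas sketch of both types on an invented demand series (window size and weights chosen only for illustration):

# Illustrative sketch: simple vs. weighted moving average with pandas (invented data)
import numpy as np
import pandas as pd

demand = pd.Series([120, 130, 128, 140, 150, 145, 160, 170])

# Simple Moving Average: equal weights over a 3-period window
sma = demand.rolling(window=3).mean()

# Weighted Moving Average: most recent point weighted highest (illustrative weights)
weights = np.array([0.2, 0.3, 0.5])
wma = demand.rolling(window=3).apply(lambda x: np.dot(x, weights), raw=True)

print(pd.DataFrame({"demand": demand, "SMA": sma, "WMA": wma}))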

6. Exponential Smoothing Techniques

• Give more weight to recent observations using a smoothing factor (α, 0 < α < 1).

• Types:

o Simple Exponential Smoothing: For data without trend or seasonality


o Holt’s Linear Trend Model: For data with trend

o Holt-Winters Model: For data with trend and seasonality
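
• The basic update rule for simple exponential smoothing is Fₜ₊₁ = α·Yₜ + (1 − α)·Fₜ, where Yₜ is the latest actual value and Fₜ the previous forecast.

A short statsmodels sketch (invented data, α = 0.3 chosen only for illustration):

# Illustrative sketch: simple exponential smoothing with statsmodels (invented data)
import pandas as pd
from statsmodels.tsa.holtwinters import SimpleExpSmoothing

sales = pd.Series([200, 215, 230, 225, 240, 255, 260, 275])

fit = SimpleExpSmoothing(sales).fit(smoothing_level=0.3, optimized=False)
print(fit.fittedvalues)      # smoothed (one-step-ahead) values
print(fit.forecast(3))       # forecast for the next 3 periods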

Quick Exam Summary:

• Forecasting: Predict future values using past data

• Time Series Components: Trend, Seasonality, Cyclic, Random

• Additive vs Multiplicative: Additive = add the components; Multiplicative = multiply the components

• Accuracy Metrics: MAD, MSE, MAPE

• Moving Average: Smooth data by averaging over a window

• Exponential Smoothing: Recent data weighted more heavily
