0% found this document useful (0 votes)

300 views18 pages

The Cricket Winner Prediction With Applications of ML and Data Analytics

This document describes a project to predict the winner of cricket matches using machine learning models and historical match data. It discusses factors that influence match outcomes like batting, bowling, player and team performances. It presents several machine learning models - Naive Bayes, Decision Tree, SVM, Random Forest - that were implemented and evaluated. The research methodology involved data preparation, feature selection, model building and evaluation. Decision Tree achieved the best accuracy of 94.87% in predicting cricket match winners. The goal of the project was to analyze cricket data and predict IPL match winners from 2008-2017 using data science techniques.

Uploaded by

Muhammad Swalih

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

300 views18 pages

The Cricket Winner Prediction With Applications of ML and Data Analytics

Uploaded by

Muhammad Swalih

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 18

The Cricket Winner Prediction With Applications

Of ML and Data Analytics

Name:Nasheel Rehman Guide: Seminar Coordinator:

USN: 4BP17CS024 Prof. Afsar Bhaig Prof. Sinan Marikattay
Year: 2020-21 Dept. of CSE Dept. of CSE
INDEX
 ABSTRACT
 INTRODUCTION
 FACTORS TO ANTICIPATE CRICKET WINNER
 CRICKET WINNER PREDICTION MODEL
 RESEARCH METHODOLOGY
 MODEL IMPLEMENTATION
 CONCLUSION
 REFRENCE
Abstract
o With the evolution in the field of Data Sciences, every business firm is adapting latest
technologies to grow their business.

o There are competitions in delivering better management, better quality of evaluations and better
services in the market.

o The only possible way to meet all these qualities is to conduct analysis of data with purity and
more accurately.

o Machine learning is the emerging field to predict future outcomes with existing data and based
on these predictions better decisions can be made.

o Cricket is a well-known game that played and watched around a globe in 104 countries. Many
of these cricket fans want their team to perform good and declare as a winner.

o In this research various features have been analyzed to predict the match winner of the game
Introduction
o SPORTS statistical analysis use in sports has been growing quickly year by year.
o Due to which the ways in which game strategies are formed or the player’s evaluation criteria
has been changed but also has the got the more interest of audience towards cricket.
o Today, there are three major formats in which cricket is being played internationally, One Day
Internationals (ODIs) and the T20 cricket and Test Matches.
o Besides these international cricket matches, T20 League cricket is getting attention in the fans
due to its shortest format and the most exciting format of the game.
o Indian Premiere League is one of most popular t20 cricket league in the world.
o Every team’s performance based on the key performances of players, team conditions and
other important aspects which decides the team’s performances in a cricket match
o The model will be built on all the possible factors affecting the outcome of cricket match.
Ground impacts, team quality and home field advantage were observed.
Factors to Anticipate Cricket winner
o Winning a cricket match depends on multiple factors like
• Batting
• Bowling
• Fielding
• Team performances
• Player performances

oTo predict the winner of a cricket match is never an easy task.

oBut there are always some kind of unique aspects or match conditions that may favor to
some team and sometime does not such as home advantage, Key Players, Pitch Conditions
and weather condition
Cricket winner prediction model
1. Naïve Bayes

2. Decision Tree Regressor

3. Support Vector Machine (SVM)

4. Random Forest Classifier

1.Naïve Bayes

o Naïve Bayes works on the Bayes probability theorem.

o Works with the assumption that all the features are independent of class label (predicted
variable) which may be a wrong assumption.

o Naive Bayes model used in conjunction with recursive feature elimination

2.Decision Tree Regressor

• Decision Tree Regressor has been used to check the overfit by learning from the noise
of data using tree node system.
• If max depth of tree is high, decision tree regressor take details from training data’s
noise
• Decision Trees classification works on tree node principal in which instances are
sorted into tree node system
• By this hierarchy complex decision-making system are break-down into smaller
simpler decisions which provides a simple solution that is easy to implemen
3. Support Vector Machine (SVM)
• Support Vector Machine has been proven to be most used component classifier of Ada Boosting
for different prediction techniques like image recognition, medical health diagnosis and facial
recognition
• SVM classifier on given Training data, outputs an optimal hyperplane by which new example
• Hyperplane is a plane that divides line into two parts where in each class lay in either side.
SVM’s optimization measured by Regularization parameters. s can be categorized
• Regularization parameter tells about the SVM Optimization.SVM is a category of supervised
machine learning algorithms which has to be trained with pre-defined output class.
• The SVM classifier on given Training data, outputs an optimal hyperplane by which new
examples can be categorized. Hyperplane is a plane that divides line into two parts where in
each class lay in either side .
4. Random Forest Classifier

o Random Forest classifier is a method used for regression and classification techniques
o In the Random Forest Classifiers, to classify a new instance, there are number of trees in
working randomly in a forest putting input vector down
o duty of every tree is to give a class label or target variable as a vote for the class
o And which node has highest votes will be chosen by Random Forest Classifier.
Research methodology
 Methodology is a process in which data is selected, transformed and prepared for
the calculations needed to generate useful insights [.
 For this research methodology is SEMMA modeling.

Semma modeling
SEMMA Process

• The SEMMA process was developed by the SAS Institute that considers a cycle
with 5 stages for the process.
• Sample, Explore, Modify, Model, and Assess.
• Data mining is the process of discovering predictive information from the analysis
of large databases
• Python is used for the data mining of the following steps:
• There should be one informational dataset which contains enough information to
fulfill the purpose of data mining and should be able to do calculations on it to
generate useful insights.
• If the model, is not appropriate and not giving the best results then try different
techniques to make it appropriate.
Model implementation
Decision Tree Classifier
o Decision Tree works on flow chart tree like structure having nodes, branches and leafs.
o Node represents attributes of dataset; branches are represented by decision rules and
outcome of the model is represented by trees.
o The node on the top is called as root node and partitioning is done by it in recursive
manner.
o With the structure of tree like flow chart it helps to make decisions
o In machine learning decision trees are like white box which take a part in logics of
internal- decision making which cannot be find in the black box type of algorithms like
neural networks.
Contd..

o Decision tree’s time complexity can be found by number of observations and

number of features in the dataset.
o Decision trees are non- parametric and high dimensional data can easily handle by
the Decision Trees
o The splitting of records in Decision trees are done by Attribute Selection Method
o splitting the data into smaller portions of data and recursively tree building
process continue and end when every record plotted successfully.
Confusion Matrix

• The above confusion matrix of Decision Tree model has successfully

predicted the values of ‘winner’ by 76.9% accuracy
Conclusion

o The objective of this research was to predict the match winner of IPL using
historical data of IPL from season 2008 to 2017.
o To conduct the analysis and predicting the winner of IPL various branches of Data
Science has been converged including Pre-Processing of data, Visualizations of
data, preparation pf data, feature selection and implementing different machine
learning models for the predictions.
o Decision Tree model was applied which predicted the match winner with good
accuracy 94.87%.
Reference

 Jhanwar, G. M., 2017. Quantitative Assessment of Player Performance and Winner

Prediction in ODI Cricket. International Institute of Information Technology
Hyderabad - 500032, INDIA.

 Ahmed, W. & Nazir, K., 201. A Multivariate Data Mining Approach to Predict Match
Outcome in One-Day International Cricket. 10.13140/RG.2.2.30683.4688

 Asare-Frempong, J. and Jayabalan, M., 2017. Predicting customer response to bank

direct telemarketing campaign. In 2017 International Conference on Engineering
Technology and Technopreneurship (ICE2T) (pp. 1-4). IEEE. [
THANK YOU

Complete Bundle Discovering Psychology The Science of Mind 4th Edition Cacioppo HQ File
100% (1)
Complete Bundle Discovering Psychology The Science of Mind 4th Edition Cacioppo HQ File
408 pages
ML-2 Guided Project Report
No ratings yet
ML-2 Guided Project Report
63 pages
SMDM Project Report - Shubham Bakshi - 07.05.2023
0% (1)
SMDM Project Report - Shubham Bakshi - 07.05.2023
23 pages
SMDM Project
100% (1)
SMDM Project
19 pages
SQL Project Questions
0% (1)
SQL Project Questions
3 pages
As 1831-2007 Ductile Cast Iron
No ratings yet
As 1831-2007 Ductile Cast Iron
10 pages
Thera Bank-Project
100% (12)
Thera Bank-Project
26 pages
Factor-Hair RV PDF
No ratings yet
Factor-Hair RV PDF
23 pages
Project SQL
No ratings yet
Project SQL
2 pages
FASA - Federation Ship Recognition Manual 2385
100% (4)
FASA - Federation Ship Recognition Manual 2385
204 pages
Lead Scoring Group Case Study Presentation
100% (2)
Lead Scoring Group Case Study Presentation
19 pages
Saipem at A Glance PDF
No ratings yet
Saipem at A Glance PDF
9 pages
Data Mining Project Report
100% (1)
Data Mining Project Report
98 pages
Project - Finance and Risk Assessment: Submitted By: Navendu Mishra
No ratings yet
Project - Finance and Risk Assessment: Submitted By: Navendu Mishra
18 pages
Capstone Project Final Report Rupesh Kumar PGP-DSBA APR 21C
No ratings yet
Capstone Project Final Report Rupesh Kumar PGP-DSBA APR 21C
77 pages
Answer Report (Preditive Modelling)
100% (1)
Answer Report (Preditive Modelling)
29 pages
Opening Range Trading Strategy
100% (1)
Opening Range Trading Strategy
20 pages
Project 3 - Build A Logistic Regression Model To Predict Custo Mer Churn in Telecom IndustryV1.0 PDF
100% (1)
Project 3 - Build A Logistic Regression Model To Predict Custo Mer Churn in Telecom IndustryV1.0 PDF
38 pages
PM ProjectJune - 2021
100% (1)
PM ProjectJune - 2021
33 pages
Data Mini Proj
100% (2)
Data Mini Proj
44 pages
Predictive Modelling Project Gloria Susan Raju 11 APR 2021 PDF
No ratings yet
Predictive Modelling Project Gloria Susan Raju 11 APR 2021 PDF
56 pages
Facebook Comment Volume Prediction
No ratings yet
Facebook Comment Volume Prediction
20 pages
Random Forest - US - Heart - Patients - Class
100% (1)
Random Forest - US - Heart - Patients - Class
24 pages
Machine Learning - Nabeel Khan - Final Project Report - Problem 2
100% (1)
Machine Learning - Nabeel Khan - Final Project Report - Problem 2
24 pages
SQL - Basics
No ratings yet
SQL - Basics
25 pages
Data Mining Business Report
No ratings yet
Data Mining Business Report
38 pages
Boston Condo Info-Case Study: Click Here
No ratings yet
Boston Condo Info-Case Study: Click Here
3 pages
Cars Project PDF
No ratings yet
Cars Project PDF
9 pages
Business Analytics Report: Submitted To
No ratings yet
Business Analytics Report: Submitted To
32 pages
Capstone-2 Market Basket Analysis Vinothkumar R
No ratings yet
Capstone-2 Market Basket Analysis Vinothkumar R
18 pages
Great Learning DVT Final Project - Car Claims For Insurance
100% (1)
Great Learning DVT Final Project - Car Claims For Insurance
113 pages
SMDM Project Report
100% (1)
SMDM Project Report
9 pages
Business Report DSBA Data Mining Project - Part 2 Segmentation Using K-Means Clustering
No ratings yet
Business Report DSBA Data Mining Project - Part 2 Segmentation Using K-Means Clustering
28 pages
SMDM Project Report
100% (1)
SMDM Project Report
19 pages
Problem 2 Businessreport ML
No ratings yet
Problem 2 Businessreport ML
9 pages
Palash Bhai - Machine Learning Assignment
100% (2)
Palash Bhai - Machine Learning Assignment
18 pages
Marketing & Retail Analytics - Report - Part A
100% (2)
Marketing & Retail Analytics - Report - Part A
18 pages
Capstone Project Report
No ratings yet
Capstone Project Report
187 pages
SMDM Project
100% (1)
SMDM Project
22 pages
SMDM Report
No ratings yet
SMDM Report
12 pages
Sandhya Assignment SQL
No ratings yet
Sandhya Assignment SQL
16 pages
MySQL - Week 5 Quiz
100% (1)
MySQL - Week 5 Quiz
6 pages
NIrupam Agarwal Business Report-ML
100% (1)
NIrupam Agarwal Business Report-ML
23 pages
Fra Project Report-Bajaj Auto Ltd. Vs Hero Motocorp Ltd. (Group-X)
100% (1)
Fra Project Report-Bajaj Auto Ltd. Vs Hero Motocorp Ltd. (Group-X)
10 pages
Quantifying and Analyzing The Performance of Cricket Player Using Machine Learning
No ratings yet
Quantifying and Analyzing The Performance of Cricket Player Using Machine Learning
7 pages
Education - Post 12th Standard - CSV
No ratings yet
Education - Post 12th Standard - CSV
11 pages
SMDM Project
No ratings yet
SMDM Project
17 pages
SQL Quiz Results
No ratings yet
SQL Quiz Results
17 pages
Project - Data Mining: Bank - Marketing - Part1 - Data - CSV
No ratings yet
Project - Data Mining: Bank - Marketing - Part1 - Data - CSV
4 pages
Fembot
No ratings yet
Fembot
7 pages
Rahulsharma - 03 12 23
No ratings yet
Rahulsharma - 03 12 23
25 pages
Data Mining Project - PCA - Hair Salon
No ratings yet
Data Mining Project - PCA - Hair Salon
8 pages
UNIT 5 MCQs
No ratings yet
UNIT 5 MCQs
12 pages
Meivalvole SerieCD ENG 20150212
No ratings yet
Meivalvole SerieCD ENG 20150212
4 pages
Data Mining - Project
100% (2)
Data Mining - Project
25 pages
FINANCE & RISK ANALYTICS – PROJECT - YARESH VIJAYASUNDARAM
No ratings yet
FINANCE & RISK ANALYTICS – PROJECT - YARESH VIJAYASUNDARAM
48 pages
TSF - Project
100% (1)
TSF - Project
5 pages
RACHIT MITTAL Capstone Project. Notes 2 PDF
No ratings yet
RACHIT MITTAL Capstone Project. Notes 2 PDF
39 pages
Strategi Pengembangan Bisnis Tambak Ikan Bandeng Di Desa Mengare Watuagung Gresik
No ratings yet
Strategi Pengembangan Bisnis Tambak Ikan Bandeng Di Desa Mengare Watuagung Gresik
8 pages
Extended Project
No ratings yet
Extended Project
1 page
End Term Quiz1 - Attempt Review
No ratings yet
End Term Quiz1 - Attempt Review
5 pages
Listening Skills Practice: Living Online - Exercises: Preparation
100% (1)
Listening Skills Practice: Living Online - Exercises: Preparation
2 pages
Application of Metering Process in Oil and Gas Production in Niger Delta Fields
No ratings yet
Application of Metering Process in Oil and Gas Production in Niger Delta Fields
7 pages
CH 6 Synchronization
No ratings yet
CH 6 Synchronization
69 pages
Electronics Today 1981 01
No ratings yet
Electronics Today 1981 01
124 pages
DS 14NHG28
No ratings yet
DS 14NHG28
2 pages
Tableau Questions
No ratings yet
Tableau Questions
2 pages
Payroll System
No ratings yet
Payroll System
6 pages
Social Media Tourism - Capstone Project
No ratings yet
Social Media Tourism - Capstone Project
13 pages
FRA Project Report Milestone 1 PDF
No ratings yet
FRA Project Report Milestone 1 PDF
29 pages
Documents
No ratings yet
Documents
8 pages
HiperLAN - Wikipedia
No ratings yet
HiperLAN - Wikipedia
3 pages
Wiring Mikrohidro
No ratings yet
Wiring Mikrohidro
6 pages
Aedt 04 02 2023
No ratings yet
Aedt 04 02 2023
15 pages
Capstone Notes-1
No ratings yet
Capstone Notes-1
18 pages
SQL Quiz
No ratings yet
SQL Quiz
4 pages
FRA Main Project Part B Guided
No ratings yet
FRA Main Project Part B Guided
23 pages
EBAB
No ratings yet
EBAB
2 pages
TED (21) 4281 QP
No ratings yet
TED (21) 4281 QP
2 pages
Great Lakes Extraa - Learn Project Business Report - 2-Kavish-Rathod
No ratings yet
Great Lakes Extraa - Learn Project Business Report - 2-Kavish-Rathod
22 pages
Chapter 5a - Concrete & Formwork
No ratings yet
Chapter 5a - Concrete & Formwork
28 pages
Cepsa Atf Avant Diii
No ratings yet
Cepsa Atf Avant Diii
1 page
Printout For Record - CS3591 CN Lab
No ratings yet
Printout For Record - CS3591 CN Lab
50 pages
1Z0 116 Demo
No ratings yet
1Z0 116 Demo
5 pages
ML 2 - Problem Statements and Rubirics
No ratings yet
ML 2 - Problem Statements and Rubirics
3 pages
ABC-CLIO Victorian Technology, Invention Innovation and The Rise of The Machine (2009)
No ratings yet
ABC-CLIO Victorian Technology, Invention Innovation and The Rise of The Machine (2009)
190 pages
Seismic Reference Datums
No ratings yet
Seismic Reference Datums
12 pages
Steps in Setting Up Business On Internet
No ratings yet
Steps in Setting Up Business On Internet
7 pages
Sss Log 04 04 2025 21 16 28
No ratings yet
Sss Log 04 04 2025 21 16 28
2 pages
TechCrunch - The Rise of AI 'Reasoning' Models Is Making Benchmarking More Expensive
No ratings yet
TechCrunch - The Rise of AI 'Reasoning' Models Is Making Benchmarking More Expensive
4 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

The Cricket Winner Prediction With Applications of ML and Data Analytics

Uploaded by

The Cricket Winner Prediction With Applications of ML and Data Analytics

Uploaded by

The Cricket Winner Prediction With Applications

Of ML and Data Analytics

Name:Nasheel Rehman Guide: Seminar Coordinator:

oTo predict the winner of a cricket match is never an easy task.

2. Decision Tree Regressor

3. Support Vector Machine (SVM)

4. Random Forest Classifier

o Naïve Bayes works on the Bayes probability theorem.

o Naive Bayes model used in conjunction with recursive feature elimination

o Decision tree’s time complexity can be found by number of observations and

• The above confusion matrix of Decision Tree model has successfully

 Jhanwar, G. M., 2017. Quantitative Assessment of Player Performance and Winner

 Asare-Frempong, J. and Jayabalan, M., 2017. Predicting customer response to bank

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.