
Intra College Datathon Competition 2.0

Problem Statement:

Predicting the Winner of the ICC Champions Trophy 2025

In this datathon, participants are tasked with analyzing past data of ICC Champions Trophy
tournaments to predict the winner of the upcoming ICC Champions Trophy 2025. A machine
learning model should be created and deployed to predict the outcome based on the given dataset.

Dataset Overview:

The dataset contains records from the ICC Champions Trophy, spanning tournaments held in 1998,
2000, 2004, 2006, 2009, 2013, and 2017. These records consist of 56 rows, with each row
representing a team’s performance in a specific tournament year. The data includes the performance
metrics for eight teams that usually participate in the ICC Champions Trophy.

The dataset has columns representing the teams’ key performance statistics such as matches played,
matches won, average runs per match, strike rate, top scorers, number of centuries and fifties,
wickets taken, top wicket-takers, bowling average, economy rate, fielding metrics, and many others.
These statistics will be used to train the model.

For the competition, participants are also provided with data for the 2025 ICC Champions Trophy.
This data is used to test the machine learning model and to generate predictions for the outcome
of the 2025 tournament.

Dataset Columns:

1. Year: Year of the tournament.

2. Team: The team participating in the tournament (e.g., India, Pakistan).

3. Group: Group classification (A or B).

4. Matches Played: Total matches played by the team.

5. Matches Won: Total matches won by the team.

6. Avg Runs Per Match: Average runs scored by the team per match.

7. Strike Rate: Team’s batting strike rate.

8. Team’s Top Scorer: The top run scorer from the team.

9. Number of Centuries: Total centuries scored by the team.

10. Number of Fifties: Total fifties scored by the team.

11. Highest Team Total: Highest total scored by the team.

12. Wickets Taken: Total wickets taken by the team.


13. Top Wicket-Taker: The top wicket-taker from the team.

14. Bowling Average: Average runs conceded per wicket by the team.

15. Bowling Economy Rate: Team’s bowling economy rate.

16. Five-Wicket Hauls: Total five-wicket hauls by the team.

17. Catches Taken: Total catches taken by the team.

18. Run Outs: Total run-outs by the team.

19. Stumpings: Total stumpings by the team.

20. Maiden Overs: Total maiden overs bowled by the team.

21. Net Run Rate (NRR): The team’s net run rate.

22. Total Fours: Total number of boundaries (fours) hit by the team.

23. Total Sixes: Total number of sixes hit by the team.

24. Host Advantage: Whether the team is the host nation (1 for yes, 0 for no).

25. Outcome: The target variable, indicating whether the team was the winner (1 for
winner, 0 otherwise).
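
As an illustration, the sketch below loads and sanity-checks the data with pandas. The
filenames are assumptions, so adjust them to the files actually provided for the competition.

    import pandas as pd

    # Filenames are placeholders; use the files shared for the competition.
    train_df = pd.read_csv("champions_trophy_1998_2017.csv")
    test_df = pd.read_csv("champions_trophy_2025.csv")

    # Sanity checks: 56 rows expected (7 tournaments x 8 teams).
    print(train_df.shape)
    print(train_df.dtypes)
    print(train_df["Outcome"].value_counts())  # 7 winners vs. 49 others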

Expected Outcomes:

Participants are expected to:

1. Model Creation:

o Use the given historical data (1998-2017) to train a machine learning model that can
predict the outcome (winner or otherwise) of each team based on the team’s
statistics.

o Identify and engineer relevant features from the dataset that contribute to the
model's accuracy.

o Evaluate different machine learning algorithms (e.g., logistic regression, random
forest, XGBoost, etc.) and select the best-performing model based on metrics like
accuracy, precision, recall, and F1 score.

2. Model Deployment:

o Deploy the model so that it can be tested with the 2025 dataset. Participants
should create a system where, upon entering the performance statistics of the
teams from the 2025 dataset, the model predicts whether a particular team will
win or not (1 for winner, 0 otherwise).

o Ensure the deployed model is user-friendly and can easily take in new data for
prediction.

Steps for Participation:

1. Data Preprocessing:

o Clean the data: Handle missing values and ensure all numeric data types are
correctly formatted.

o Normalize or standardize data if necessary to improve model performance.

o Perform feature selection based on correlation or importance metrics.
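
A minimal preprocessing sketch for this step, assuming pandas and scikit-learn and the
column names listed in the dataset description (adjust names to the actual headers):

    import pandas as pd
    from sklearn.preprocessing import StandardScaler

    df = pd.read_csv("champions_trophy_1998_2017.csv")  # filename is an assumption

    # Separate the target and drop identifier/text columns the model cannot use directly.
    text_cols = ["Team", "Group", "Team's Top Scorer", "Top Wicket-Taker"]
    y = df["Outcome"]
    X = df.drop(columns=["Outcome"] + text_cols)

    # Handle missing values; median imputation is a simple, robust default.
    X = X.fillna(X.median(numeric_only=True))

    # Standardize so scale-sensitive models (e.g., logistic regression) are not skewed.
    scaler = StandardScaler()
    X_scaled = scaler.fit_transform(X)

    # Feature selection: rank features by absolute correlation with the target.
    print(X.corrwith(y).abs().sort_values(ascending=False).head(10))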

2. Feature Engineering:

o Use domain knowledge to create new features, such as combinations of batting and
bowling statistics, or derived metrics like win ratio, run differential, etc.
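
For example, the derived metrics mentioned above might look like this, continuing from the
preprocessing sketch (column names are assumed from the dataset description):

    # Derived features suggested in the brief, engineered from existing columns.
    df["Win Ratio"] = df["Matches Won"] / df["Matches Played"]
    df["Boundary Rate"] = (df["Total Fours"] + df["Total Sixes"]) / df["Matches Played"]
    df["Wickets Per Match"] = df["Wickets Taken"] / df["Matches Played"]
    # A crude batting-vs-bowling balance: runs scored per match relative to economy.
    df["Bat Bowl Balance"] = df["Avg Runs Per Match"] / df["Bowling Economy Rate"]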

3. Model Training:

o Train various machine learning models (e.g., logistic regression, decision trees,
random forest, gradient boosting, or deep learning) to classify whether a team will
win or not.

o Tune hyperparameters using techniques such as cross-validation and grid search to
optimize model performance.
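
A sketch of this step with scikit-learn, reusing X_scaled and y from the preprocessing
sketch above; the candidate models and parameter grids are illustrative, not prescribed:

    from sklearn.ensemble import RandomForestClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import GridSearchCV, StratifiedKFold

    # Stratified folds keep the rare winner class represented in every split.
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)

    candidates = {
        "logistic_regression": (LogisticRegression(max_iter=1000),
                                {"C": [0.01, 0.1, 1, 10]}),
        "random_forest": (RandomForestClassifier(random_state=42),
                          {"n_estimators": [100, 300], "max_depth": [3, 5, None]}),
    }

    searches = {}
    for name, (model, grid) in candidates.items():
        search = GridSearchCV(model, grid, cv=cv, scoring="f1")
        search.fit(X_scaled, y)
        searches[name] = search
        print(name, search.best_params_, round(search.best_score_, 3))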

4. Model Evaluation:

o Evaluate the model using classification metrics such as accuracy, precision, recall, F1
score, and AUC-ROC curve.

o Compare models and select the one that performs the best on the validation set.
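
One way to compute these metrics on a held-out validation split, continuing from the
sketches above (the 80/20 split and the chosen model are assumptions):

    from sklearn.metrics import (accuracy_score, f1_score, precision_score,
                                 recall_score, roc_auc_score)
    from sklearn.model_selection import train_test_split

    # Stratify so the small winner class appears in both splits.
    X_tr, X_val, y_tr, y_val = train_test_split(
        X_scaled, y, test_size=0.2, stratify=y, random_state=42)

    model = searches["random_forest"].best_estimator_.fit(X_tr, y_tr)
    pred = model.predict(X_val)
    proba = model.predict_proba(X_val)[:, 1]

    print("accuracy :", accuracy_score(y_val, pred))
    print("precision:", precision_score(y_val, pred, zero_division=0))
    print("recall   :", recall_score(y_val, pred, zero_division=0))
    print("F1       :", f1_score(y_val, pred, zero_division=0))
    print("AUC-ROC  :", roc_auc_score(y_val, proba))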

5. Model Deployment:

o Deploy the trained model using a web-based interface or a command-line tool.

o Ensure the deployed system can predict the outcome for new data provided (i.e., the
2025 data).

o Implement error handling and ensure the deployment is robust.
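
As one possible command-line deployment, the sketch below assumes the final model was saved
with joblib as a scikit-learn Pipeline that bundles the scaler and classifier; file and
column names are hypothetical:

    import argparse

    import joblib
    import pandas as pd

    TEXT_COLS = ["Team", "Group", "Team's Top Scorer", "Top Wicket-Taker"]

    def main():
        parser = argparse.ArgumentParser(
            description="Predict the 2025 Champions Trophy outcome per team")
        parser.add_argument("csv", help="path to the 2025 dataset")
        parser.add_argument("--model", default="final_model.joblib")
        args = parser.parse_args()

        # Basic error handling so bad paths or malformed files fail with a clear message.
        try:
            model = joblib.load(args.model)
            data = pd.read_csv(args.csv)
        except (FileNotFoundError, pd.errors.ParserError) as exc:
            raise SystemExit(f"Could not load inputs: {exc}")

        features = data.drop(columns=TEXT_COLS, errors="ignore")
        data["Predicted Winner"] = model.predict(features)  # 1 = winner, 0 = otherwise
        print(data[["Team", "Predicted Winner"]].to_string(index=False))

    if __name__ == "__main__":
        main()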

Submission Guidelines:

 Code: Submit the code used for data preprocessing, model training, and deployment as an
HTML file that includes the following:

a. Model: Provide a brief description of the model and the reasoning behind the
selection of the final model.
b. Documentation: Include a report that explains the methodology, feature selection,
model evaluation, and deployment process.
c. Deployment: Describe how the model will be deployed (e.g., an Excel file or Python
code for deployment).
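
If the work is done in a Jupyter notebook, one convenient way to produce the required HTML
file is the command "jupyter nbconvert --to html your_notebook.ipynb" (notebook name is a
placeholder); nbconvert preserves both the code and its output in the exported file.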

Additional Notes:

 You are free to explore advanced techniques such as ensemble methods, stacking models, or
neural networks if deemed appropriate.

 Domain knowledge of cricket and key factors that affect match outcomes (e.g., home
advantage, player form, etc.) will be helpful in improving model accuracy.

 Feel free to add more columns to the data if you feel any other columns are relevant.

Evaluation Criteria:

The results of the ICC Champions Trophy will be announced on March 9. For evaluation, only those
teams that have accurately predicted the winning team will be considered. This accuracy must be
achieved through rigorous, data-driven analysis. Predictions that fail to identify the winning team will
not be reviewed, and any attempt to rely on speculation, guesswork, or personal bias instead of
analytical methods will disqualify the team from further evaluation. It is essential that the prediction
process is grounded solely in data and statistical insights.

1. Accuracy of Predictions: How well the model performs in predicting the winners. (15%)

2. Innovative Feature Engineering: How well new features are derived and contribute to the
model’s performance. (20%)

3. Model Selection and Justification: The rationale behind selecting a particular model and the
performance metrics achieved. (20%)

4. Model Deployment: The usability and functionality of the deployed model. (20%)

5. Documentation: Clarity and completeness of the documentation provided. (25%)

This datathon is a great opportunity for participants to showcase their skills in predictive modeling,
feature engineering, and deployment. Good luck!

Attractive Prizes
Attractive cash prizes and trophies will be awarded to the winners and runners-up of the
competition. We will recognize the top three teams, with prizes for first, second, and third positions.
Make sure your predictions are data-driven for a chance to win these prestigious rewards!
Rules for Team Formation
1. Team Formation: Each team can have 2 to 5 members.

2. Eligibility: Team members must be current students of TSM, enrolled in MBA, PGDM, or
PGDDSBA programs.

3. Outsourcing: Teams must complete all work internally. Outsourcing any part of the work to
external agencies or individuals is strictly prohibited.

4. Collaboration: Collaboration between teams is not allowed. Each team must work
independently.

5. Originality: All submissions must be original and based on the team’s work. Plagiarism will
result in disqualification.
