MoneyBall - Data Mining on Cricket Dataset








Abstract

Cricket is one of the most popular sports in the whole world, and also one of the most popular sports in India. Cricketing events such as the Indian Premier League (IPL) are thoroughly enjoyed by fans all across the country. Fans of the game love predicting the ongoing match results, and this is something that has ended up being a hobby for several people who follow the game. This is a sport with abundant amount of data and using this data, we can make an evaluation on whether a team can win an ongoing IPL match or not. This prediction is implemented by using machine learning algorithms such as Gaussian Naive Bayes, Support Vector Machine, K-Nearest Neighbor and Random Forest. The required dataset is obtained by collecting using a website and consolidated. As a result, the output is obtained which lists whether the home team has won the match or not. The accuracies obtained are 75%, 80%, 55%, 75%, 80%, 80%, 75% and 84% for the teams CSK, RR, DD, RCB, MI, SRH, KXIP and KKR respectively.


Modules


Algorithms

Data Mining algorithms


Software And Hardware

• Hardware: Processor: i3 ,i5 RAM: 4GB Hard disk: 16 GB • Software: operating System : Windws2000/XP/7/8/10 Anaconda,jupyter,spyder,flask Frontend :-python Backend:- MYSQL