Estimating the win probability in a hockey game

dc.contributor.authorYang, Shudan
dc.date.accessioned2016-05-06T14:50:41Z
dc.date.available2016-05-06T14:50:41Z
dc.date.issued2016-04-15
dc.description.abstractWhen a hockey game is being played, its data comes continuously. Therefore, it is possible to use the stream mining method to estimate the win probability (WP) of a team once the game begins. Based on 8 seasons’ data of NHL from 2003-2014, we provide three methods to estimate the win probability in a hockey game. Win probability calculation method based on statistics is the first model, which is built based on the summary of the historical data. Win probability calculation method based on data mining classification technique is the second model. In this model, we implemented some data classification algorithms on our data and compared the results, then chose the best algorithm to build the win probability model. Naive Bayes, SVM, VFDT, and Random Tree data classification methods have been compared in this thesis on the hockey dataset. We used stream mining technique in our last model, which is a real time prediction model, which can be interpreted as a trainingupdate- training model. Every 20 events in a hockey game are split as a window. We use the last window as the training data set to get decision tree rules used for classifying the current window. Then a parameter can be calculated by the rules trained by these two windows. This parameter can tell us which rule is better than another to train the next window. In our models the variables time, leadsize, number of shots, number of misses, number of penalties are combined to calculate the win probability. Our WP estimates can provide useful evaluations of plays, prediction of game result and in some cases, guidance for coach decisions.en_CA
dc.description.degreeMaster of Science (M.Sc.) in Computational Sciences
dc.identifier.urihttps://laurentian.scholaris.ca/handle/10219/2569
dc.language.isoenen_CA
dc.publisher.grantorLaurentian University of Sudbury
dc.subjecthockeyen_CA
dc.subjectNHLen_CA
dc.subjectStream miningen_CA
dc.subjectNaive Bayesen_CA
dc.subjectSVMen_CA
dc.subjectVFDTen_CA
dc.subjectRandom Tree,en_CA
dc.subjectWin Probabilityen_CA
dc.titleEstimating the win probability in a hockey gameen_CA
dc.typeThesisen_CA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Thesis-Shudan final version.pdf
Size:
2.8 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.92 KB
Format:
Item-specific license agreed upon to submission
Description: