sfu-cl-lab / yeti-thesis-project Goto Github PK
View Code? Open in Web Editor NEWfor my thesis--Yejia Liu
for my thesis--Yejia Liu
Want to make sure we have valid raw data. Can use birthdate to identify players.
@chaostewart I see on nhl.com more stats for each player, like shots, shots percentage, face-off win percentage. Shouldn't we be using that data? @liuyejia
Evaluate using Shucker's metric (i.e. correlation between predicted ranking and actual ranking according to number of games).
Can we
Hi
Please bind the ~/.ssh/id_rsa to your github configuration so that you don't get publickey error
using linear model: which components contribute most to predictive difference from average member of cluster?
Some notes for the conference paper
For basketball,
many clustering approaches focus on defining appropriate roles or positions for a
player.
meet with Max and Galen about learning model trees
explain why plus-minus is important in the tree
submit to KDD workshop?
Hi Yeti,
thanks for doing this work. It looks like an interesting result, especially that SVM does so much better. A few suggestions.
how about neural net? Deep neural net?
In your report, please include a brief description of the dataset, maybe some sample lines from the data file. Also a description of what you want to predict.
How does this relate to Wilson's result?
How does this relate to Shuckers' work? I guess we should get a script for computing the correlations with TOI performance as he did.
let's find out which players form the AHL set also appeared in the NHL
https://github.com/sfu-cl-lab/Yeti-Thesis-Project/blob/master/AHL_seasons_players/AHL%20Data%20-%20Full%20Set.csv
plus learning tree to support linear equations (see R, Quinlan M5)
link to Shucker's paper: https://github.com/sfu-cl-lab/Yeti-Thesis-Project/blob/master/papers/1559-Draft-by-Numbers.pdf
Hi @oschulte Oliver,
Connecting to our datebase in WEKA is non-trivial. I uploaded an instruction file for this particular task. Please check out: https://github.com/sfu-cl-lab/Yeti-Thesis-Project/blob/master/How%20to%20connect%20to%20MySql%20database%20in%20WEKA.md
I can't make the connection work on my loptop for now due two multi-hops. Neither can I export databases I need (chao_draft or ckm_and_exception_mining) to my local. Thus, (if i can't come up with a better solution in the next 24 hours) I suggest, to be able to use WEKA in our next meeting, the easiest way is to have you install it on your desktop in your office. OR come to my desktop in the lab.
Thank you for your understanding!
@chaostewart : I've added instructions for how to transfer databases to Bugagoo
https://github.com/sfu-cl-lab/group-sharing/wiki/Database-Servers
We tested these once, not 100% sure about them. Galen has worked with Bugaboo too.
Could try using a discrete prediction. Probably with three classes:
Problem 1 seems especially natural, compared to the somewhat artificial threshold of 160.
If we implement Shucker's ranking metric #4 then we can evaluate both regression trees and decision trees in terms of how they rank players. For that we may have to combine decision trees with logistic regression - I wonder if that exists? In R perhaps?
Player stats crawled from eliteprospects.com are saved as table chao_draft
.elite_prospects_skaters_stats_1998_2008_original
. There are ~150 of them have the position as 'W' or 'F'. How do we determine if they are 'L', 'R' or 'C'? Note that even their 'Shoots" information is also know, left or right-handed shoots does not determine a player's position as 'L' or 'R'.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.