duolingo / halflife-regression
License: MIT License
If I understood correctly, pp in the result is the predicted probability of forgetting a word. So if pp is low, I should recommend the word associated with that pp, since the user may have forgotten it already. But in the result I don't see which word each pp belongs to. How, then, can I recommend words to users?
And again, is my understanding correct? (Should words with low pp values be recommended?)
Finally, I skimmed the results and compared p (ground truth) against pp (prediction), and they often disagree when they should match. For example, when p is 0.0001, pp is 0.9999, and when p is 0.9999, pp is 0.0001. I saw this case quite often while skimming. Yet the MAE is very close to the one in your paper, which suggests training is not broken. Is it normal for p and pp to disagree this often? Should I still recommend repeating lexemes according to the pp values?
I have tried to reproduce your reported MAE of 0.175 for the fixed baseline with p-bar of 0.859. I cannot obtain this value on either the test set or the training set. Could you detail how you calculated it?
For context, I am seeing whether I can reproduce some of your results, and then try various different models on the same data. Firstly, I would like to reproduce your results as a baseline, but this one has eluded me.
Thanks.
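For what it's worth, one plausible reading of a "fixed p-bar" baseline is that it predicts the dataset's mean recall rate for every instance, so its MAE is the mean absolute deviation from that mean. A small sketch of that interpretation (whether the paper computes p-bar on train or test is exactly the open question here):

```python
# Sketch of a constant-baseline MAE, assuming the baseline predicts the
# dataset's mean recall rate p-bar for every single instance.

def constant_baseline_mae(p_values):
    p_bar = sum(p_values) / len(p_values)
    return sum(abs(p - p_bar) for p in p_values) / len(p_values)

ps = [1.0, 1.0, 0.5, 0.0]
print(constant_baseline_mae(ps))  # mean |p - 0.625|
```

If p-bar is instead taken from the training set and evaluated on the test set, the numbers will differ slightly, which could account for a mismatch.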
Hi. First of all, thanks for making this code public. While running the code on the data on my laptop, I found the experiment results a little strange and counterintuitive. Though the recall error rate is quite similar to the results in the paper, some of the feature values really confused me:
    else:
        # fv.append((intern('right'), right))
        # fv.append((intern('wrong'), wrong))
        fv.append((intern('right'), math.sqrt(1+right)))
        fv.append((intern('wrong'), math.sqrt(1+wrong)))
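The active branch in that snippet feeds sqrt(1 + count) into the model instead of the raw right/wrong counts, a common dampening transform that compresses large practice histories. A quick look at what the transform does to a few counts:

```python
import math

# The sqrt(1 + count) transform from the snippet above: it grows much
# more slowly than the raw count, so heavily practiced words do not
# dominate the linear model.
for count in [0, 1, 3, 8, 99]:
    print(count, round(math.sqrt(1 + count), 3))
```

This compression is one candidate explanation for feature values looking "strange" relative to the raw counts in the dataset.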
Thanks for making the great app even better and for uploading the code!
I am working on extending the model to allow each lexeme to have more features and have some minor doubts about the SGD implementation in the code:
The learning rate for the hlr case seems to differ from the method given in the appendix of the paper by a factor of 1 / (1 + inst.p). Is there any particular reason for that?
Usually, SGD iterations are repeated until some convergence criterion is met. In the code, however, it seems that only one pass is made over the complete data set. Was that because you observed empirically that the results converge by then?
Thanks!
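To make the "one pass" question concrete, the pattern being described is a single sweep of online SGD over the training instances. Here is a generic sketch of that pattern for a linear model with a squared-error update; this is an illustration of single-pass SGD, not the exact HLR gradient or learning-rate schedule from the paper:

```python
# Generic single-pass SGD over sparse feature dicts. One update per
# training instance, one sweep over the data, no convergence loop.

def sgd_one_pass(data, lr=0.01):
    weights = {}
    for features, target in data:
        pred = sum(weights.get(k, 0.0) * v for k, v in features.items())
        err = pred - target
        for k, v in features.items():
            weights[k] = weights.get(k, 0.0) - lr * err * v
    return weights

data = [({"bias": 1.0, "x": 1.0}, 1.0), ({"bias": 1.0, "x": 2.0}, 2.0)]
w = sgd_one_pass(data, lr=0.1)
print(sorted(w.items()))
```

With millions of instances (the dataset has ~13M traces), a single pass often moves the weights close to convergence, which may be the empirical justification being asked about.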
I have noticed the evaluation results for the half-life regression model are slightly different each time I run the code. So I was wondering how the evaluation results presented in the paper are determined.
Are the results given in Table 2 of the paper based on a single run, or on multiple runs (reporting the best result, the average, or something else)?
Thanks!
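Run-to-run variance like this usually comes from a randomized train/test split or from nondeterministic ordering, and the standard remedy is to fix seeds and report a mean (and spread) over several runs. A sketch of that reporting pattern, where `run_experiment` is a stand-in for whatever produces the MAE (the 0.128-ish values below are fake, for illustration only):

```python
import random
import statistics

# Stand-in for a full experiment run; returns a fake MAE that varies
# deterministically with the seed, purely to illustrate the pattern.
def run_experiment(seed):
    rng = random.Random(seed)
    return 0.128 + rng.uniform(-0.002, 0.002)

maes = [run_experiment(seed) for seed in range(5)]
print(round(statistics.mean(maes), 4), round(statistics.stdev(maes), 4))
```

If the paper's numbers are from a single run, a fixed seed in the split would be needed to reproduce them exactly.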
@sueyhan previously opened a ticket for this, but then closed it without getting a response.
These are the model weights I get for training without lexical features (python experiment.py -l settles.acl16.learning_traces.13m.csv.gz):
wrong -0.2245
right -0.0125
bias 7.5365
I do not see how it can be correct that the right feature has a negative weight. This causes the half-life to get shorter as a user gives more correct answers, so the model will predict a lower and lower probability of the user answering correctly.
How can this be correct?
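The concern can be checked numerically. In the paper, the estimated half-life is h = 2^(Θ · x), and the code feeds sqrt(1 + count) features, so with the weights reported above the half-life really does shrink as correct answers accumulate. A sketch using those reported weights (assuming only the bias/right/wrong features are active):

```python
import math

# Half-life under HLR: h = 2^(theta . x), using the weights reported in
# this issue and the sqrt(1 + count) feature transform from the code.
weights = {"bias": 7.5365, "right": -0.0125, "wrong": -0.2245}

def half_life(right, wrong):
    dot = (weights["bias"]
           + weights["right"] * math.sqrt(1 + right)
           + weights["wrong"] * math.sqrt(1 + wrong))
    return 2.0 ** dot

# More correct answers -> smaller half-life with a negative 'right' weight.
print(round(half_life(1, 0), 1), round(half_life(25, 0), 1))
```

Since the right weight is tiny relative to the bias, the shrinkage is slow, but the direction is indeed counterintuitive, which is presumably the point of the question.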
I am trying to run the experiment file, but I get the following error with the dataset that you have provided:
iterator should return strings, not bytes (did you open the file in text mode?) at line 217.
I am using Python 3.7.6 and executing experiment.py with the following arguments:
python3 experiment.py data/settles.acl16.learning_traces.13m.csv.gz
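That error typically means the gzip file was opened in binary mode: in Python 3, gzip.open defaults to 'rb', but csv readers need text. A likely fix, sketched here against an in-memory gzip file so it is self-contained (whether this matches line 217 of the repo's experiment.py exactly is an assumption):

```python
import csv
import gzip
import io

# "iterator should return strings, not bytes" usually means csv.reader
# received a binary file handle. Opening the gzip file with mode="rt"
# yields text lines instead.

# Tiny in-memory gzip CSV so the sketch runs without the real dataset.
raw = gzip.compress(b"p_recall,lexeme_id\n0.5,foo\n")

with gzip.open(io.BytesIO(raw), mode="rt", encoding="utf-8") as fh:
    rows = list(csv.DictReader(fh))

print(rows[0]["p_recall"])
```

The same change applies when opening the real file by path: gzip.open(path, "rt") instead of gzip.open(path).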