keisks / jfleg Goto Github PK
View Code? Open in Web Editor NEWJFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation
JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation
Most times I run this script (though not every time), I get:
.../python2.7/site-packages/scipy/stats/_distn_infrastructure.py:1910: RuntimeWarning: invalid value encountered in multiply lower_bound = self.a * scale + loc
.../python2.7/site-packages/scipy/stats/_distn_infrastructure.py:1911: RuntimeWarning: invalid value encountered in multiply upper_bound = self.b * scale + loc
[['0.584945', '0.000000', '(nan,nan)']]
due to std passed to scipy.stats.norm.interval(0.95, loc=mean, scale=std)
in get_gleu_stats
being 0.0 (because the scores are either all the same, or a list of size 1).
I think you should add this paper to the leaderboard: https://arxiv.org/pdf/1807.01270.pdf
It only reports JFLEG test GLEU, not dev, but it is beating the reference score.
Just so I understand better, I see that this repo includes dev and test data, but not train data. Where can I find train data? or, do I need to train on a different source and only evaluate on this data?
Thanks
I tested the gleu.py script and got these 4 values. I know that the last 2 values (0.647,0.649) are confidence values. However, I'm not sure about the first 2 values which is (0.668878 and 0.010937). Upon checking the script, i know that one of them is the average but i don't know which is which. I've been reading the docs and ReadME files and I can't find any information on how to decipher the 4 output value. It would be better if you can add a label besides the values or update the docs regarding the 4 output values of GLEU.py script. Thank you
A cased version of test.spellchecked.src would be helpful (like it is in dev), since many papers any way use a spellchecker.
Thanks
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.