Datasets like FCE have been standardised into m2 formats using ERRANT. But some models

There is a on the BEA 2019 shared task website to apply the edits

Is there a functionality for convering m2 file back to their parallel corpuses? about errant HOT 4 CLOSED

chrisjbryant commented on May 28, 2024

Is there a functionality for convering m2 file back to their parallel corpuses?

from errant.

Comments (4)

chrisjbryant commented on May 28, 2024 1

No, that's correct. The reason is because the reference edits in Codalab were annotated by humans, while the hypothesis edits were extracted automatically with ERRANT. Consequently, even though the corrected sentences are the same, there is a mismatch between the spans of the gold and automatic edits.

You can read a bit more about this in Section 4.1 of the ERRANT paper and also Appendix A.7 in the BEA-19 shared task paper.

from errant.

sappy5678 commented on May 28, 2024 1

Thank you for your detailed explanation.

from errant.

chrisjbryant commented on May 28, 2024

There is a script on the BEA 2019 shared task website to apply the edits to get the corrected text: link

As for the original text, I just run:
grep ^S m2_file | cut -c 3- > output.orig

from errant.

sappy5678 commented on May 28, 2024

There is a script on the BEA 2019 shared task website to apply the edits to get the corrected text: link

As for the original text, I just run:
grep ^S m2_file | cut -c 3- > output.orig

I run the code to get correct sentences from BEA2019 dev m2. But the F0.5 score is only 86.45 when I upload the correct sentences to Codalab of BEA2019. Did I do something wrong?

The command that I run:
python office_correct.py ~/Corpus/BEA2019/wi+locness/m2/ABCN.dev.gold.bea19.m2 -out office.gold

from errant.

Recommend Projects

Is there a functionality for convering m2 file back to their parallel corpuses? about errant HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent