Comments (4)
I found this one :
errant_parallel -orig <orig_file> -cor <cor_file1> [<cor_file2> ...] -out <out_m2>
I think this mean that in [<cor_file2> ...] I should write other correct ones for that sentence . So for other sentences that have just one correction, what should we write in other files ? the correct sentence again or just an empty line?
from errant.
Yeah this works.
Also I tried the CLI and it worked, just it repeats the same lines of edits for sentences with one correction but with a little change it fixed.
Thank you
from errant.
Check out the readme! There are instructions on how to do that using both the CLI and the API.
from errant.
Ah, so you have a different number of suggestions for each sentence?
In that case, you could repeat the correction from <cor_file1>
in the other files and it won't affect anything, but I'd instead recommend using the API. Something like:
import errant
annotator = errant.load('en')
with open("output.m2", "w") as out:
for orig in orig_sents:
out.write("S "+orig) # Write the tokenised orig sentence in an m2 file (preceded by S)
orig = annotator.parse(orig)
for i, hyp in enumerate(hyp_sents):
hyp = annotator.parse(hyp)
edits = annotator.annotate(orig, hyp)
# Write the edits for each hypothesis in M2 format
for e in edits:
out.write(e.to_m2(i)+"\n")
# Empty line after each orig sentence in an m2 block
out.write("\n")
from errant.
Related Issues (20)
- Expose errant_compare functionality via the API HOT 3
- Merge Casing Issue HOT 2
- Handling Missing Annotations on certain sentence HOT 5
- Edits missed for a substitute -> Delete -> Substitute sequence. HOT 3
- OSError: [E053] Could not read meta.json from en\meta.json HOT 3
- Implementation issue HOT 6
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 2490: ordinal not in range(128) HOT 2
- Parallel_to_m2 is not working HOT 1
- Licensing concerns HOT 6
- Errant parse method not working HOT 5
- Wrong format for incorr_sentences.txt HOT 4
- ‘’AttributeError: 'English' object has no attribute 'tagger'” when running the "Quick Start" code in API given in README.md HOT 4
- Ignore temporary files generated by installation HOT 1
- cancelling
- Edit indices HOT 3
- Simulate Errors HOT 1
- API Quickstart script not working - Please update with fix provided HOT 2
- Is there any way to further improve the method of summarizing error types? HOT 1
- Questions about evaluating duplicate corrections HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from errant.