Comments (4)
/home/yyy/working/train/corpus/zh.vcb
ERROR: Execution of: /home/yyy/mosesdecoder-master/tools/GIZA++ -CoocurrenceFile /home/yyy/working/train/giza.zh-en/zh-en.cooc -c /home/yyy/working/train/corpus/zh-en-int-train.snt -m1 5 -m2 0 -m3 3 -m4 3 -model1dumpfrequency 1 -model4smoothfactor 0.4 -nodumps 1 -nsmooth 4 -o /home/yyy/working/train/giza.zh-en/zh-en -onlyaldumps 1 -p0 0.999 -s /home/yyy/working/train/corpus/en.vcb -t /home/yyy/working/train/corpus/zh.vcb
died with signal 11, with coredump
from mosesdecoder.
it's usually an issue with the data. eg. the data isn't encoded in utf8, there's non-printing characters, there's a double space separating words.
from mosesdecoder.
I have checked that the codes of my parallel corpus are ALL UTF-8, but I still reported the following errors. Could you please help me to check again./home/yyy/working/train/corpus/zh-en-int-train.snt > /home/yyy/working/train/giza.zh-en/zh-en.cooc
Segmentation fault (core dumped)
Exit code: 139
ERROR at /home/yyy/mosesdecoder-master/scripts/training/train-model.perl line 1351.
from mosesdecoder.
I found another set of data, which failed to call the kenLM provided by Moses, but could be generated successfully with IRSTLM.The file ./ build_binary can be called successfully.
Reading /home/yyy/corpus/train.seq.mx.clean.zh
----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
Segmentation fault (core dumped)
from mosesdecoder.
Related Issues (20)
- Lexical reordering scoring failed at /home/ubuntu/Moses/mosesdecoder/scripts/training/train-model.perl line 1924. HOT 2
- Please don't create new issues HOT 4
- No abbreviation Files Found HOT 1
- Evaluation with multi-bleu.perl or multi-bleu-detok.perl HOT 16
- Placeholders should be separated by comma HOT 4
- tiny weights after tuning HOT 2
- sentence-splitter HOT 10
- Question:Related translation models.
- How to increase BLEU? HOT 1
- PROBLEM: alignment is 0. HOT 2
- normalize-punctuation.perl Change the Chinese punctuation marks in English sentences into English. HOT 1
- symal crashes on Linux after latest update HOT 3
- It seems the home page is not working HOT 1
- Tunning translation model failed with this error
- Replace non-breaking space with regular space HOT 1
- tokenizer.perl supported language HOT 2
- symal: permission denied HOT 4
- Looking for Arabic/English demo
- webshell exists in the project HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mosesdecoder.