Comments (3)
Genesis 4:3, BTW. :)
from esperanta-analizisto.
This will connect with the parsing of kia/kiam, kio/kiom, tia/tiam, tio/tiom. Right now the shorter root is in the database, and I can add "m" as a separate root, but it doesn't actually have its own meaning. However, with tiam in the list, tiamaniere (in Genesis 6:15), which should parse to tia-manier-e (in such a manner), instead parses as tiam-a-ni-er-e, which isn't even sort of right (it would mean at-that-time + adjective + we + part-of-the-whole + adverb).
I'm beginning to think the greedy algorithm is correct, but needs to select the biggest root that fits ANYWHERE in the word first (and then recursively parse the remaining pieces as two separate words, if that piece is in the middle of the word). Or perhaps try all possible parsings and pick the one with fewest roots. Or something like that. :/
from esperanta-analizisto.
Another interesting example: aliris in Genesis 19:9. It should parse as al-ir-is, but since ali is a root (meaning other), it instead parses as ali-{ris} (where ris doesn't parse). This one would be fixed not by being greedy (ali is the longest root that fits in the word), but by backtracking until you find at least one collection of roots that allows parsing to complete.
from esperanta-analizisto.
Related Issues (19)
- Algorithm for dealing with text HOT 2
- Investigate Polymer and Firebase HOT 3
- Split words HOT 1
- Select ES6 testing framework HOT 2
- Translate from (and to?) alternate "encodings" (ux, ch, j^, others?) HOT 2
- Polymer component to highlight extracted roots
- Alternate methods to input Esperanto special characters HOT 1
- Don't allow addition of duplicate roots
- Trim leading & trailing whitespace when adding roots
- Toggle to indicate what *isn't* parsed yet instead of what *is*
- Root list should scroll independently of input & results
- Output list of unique roots that appear in the passage HOT 1
- Light blue color looks like selection highlighting HOT 1
- -o-n vs -on- (e.g., dekonaĵon) HOT 6
- Parsing challenge: deven- HOT 3
- Parsing challenge: enuiĝis HOT 2
- Keep track of properly parsed words (maybe click)
- Try longest suffix first (instead of longest prefix first)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from esperanta-analizisto.