Comments (3)
Hey there!
- What are the exceptions?
- If the goal is to support different languages, are exceptions enough? Would more extensions be needed and possible? Couldn’t a `parse-french` be made?
from parse-latin.
For example, academic papers might use "Fig." when referring to tables, charts, etc. Corporate content often contains abbreviations like "dept." or "govt.", and other edge cases can come up that might not be part of a formal spec.
I intended to add some of these exceptions and tests in a PR in `parse-english`, but I quickly found a solution using plugins. I also wasn't sure if "fig" should be handled in `parse-latin` or `-english`... but I'd be happy to make the PR(s) if you think it's worthwhile.
Ultimately, I agree that language-specific parsers would be the most semantic way to handle i18n - since exceptions are only a part of the equation (I may have overestimated their significance).
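To make the idea of abbreviation exceptions concrete, here is a minimal, self-contained sketch. It does not use the actual `parse-latin` or `parse-english` APIs: the flat token shape, the `ABBREVIATIONS` list, and the `mergeAbbreviations` helper are all hypothetical, chosen only to illustrate how a plugin-style step could fold a known abbreviation and its trailing period into one word before sentences are split.

```javascript
// Hypothetical exception list; not taken from parse-latin or parse-english.
const ABBREVIATIONS = new Set(['fig', 'dept', 'govt']);

// Merge "<abbreviation>" followed by "." into a single word token, so a
// later sentence splitter would not see the period as a sentence terminator.
// The {type, value} token shape is an assumption made for this sketch.
function mergeAbbreviations(tokens, abbreviations = ABBREVIATIONS) {
  const result = [];
  for (let index = 0; index < tokens.length; index++) {
    const token = tokens[index];
    const next = tokens[index + 1];
    const isException =
      token.type === 'word' &&
      abbreviations.has(token.value.toLowerCase()) &&
      next !== undefined &&
      next.type === 'punctuation' &&
      next.value === '.';
    if (isException) {
      result.push({type: 'word', value: token.value + '.'});
      index++; // Skip the period that was just merged.
    } else {
      result.push(token);
    }
  }
  return result;
}

// Usage: "See Fig. 2." as a flat token list.
const tokens = [
  {type: 'word', value: 'See'},
  {type: 'whitespace', value: ' '},
  {type: 'word', value: 'Fig'},
  {type: 'punctuation', value: '.'},
  {type: 'whitespace', value: ' '},
  {type: 'word', value: '2'},
  {type: 'punctuation', value: '.'}
];
const merged = mergeAbbreviations(tokens);
console.log(merged.map((token) => token.value));
```

Passing the list as a parameter is what would let a language-specific parser, or a user with domain-specific abbreviations, supply its own exceptions without touching the shared logic.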
Thanks for your patience.
I am open to such an API. Preferably, as discussed, a clean, shared API that works with the different projects.
I am closing this though, as I think it’s a nice-to-have that I personally am not currently interested in working on!
But let me know if you (or someone else?) are interested in working on this!