Comments (2)
This is actually pretty hard (''no 💩, sherlock', he said', while smiling at 'er cats' toys, is an 'example' of hard to detect quotes).
Especially when the same sign is used for everything in different languages:
- Starting a quote in Bulgarian, Finnish, Norwegian, Swedish;
- the aforementioned reasons (not sure why ending elision is not mentioned in that post);
I think the best way to detect the apostrophes use, is by looking at “enclosed content” (not sure what to call it) to detect pairs of punctuation marks (which other quotes near-never cross).
from parse-latin.
Closing for now. Would be cool, but hard. Maybe later.
from parse-latin.
Related Issues (20)
- Typo in unit tests for `cp.`
- error instalar vía npm HOT 3
- Should expose tokenizeWord, tokenizeWhiteSpace, and tokenizePunctuation HOT 1
- Should expose tokenizeText, and tokenizeSource HOT 1
- Should add a mergeEtceteraAbbreviation sentence modifier HOT 1
- Deny comma as first token in a sentence
- Ignore sentence terminal markers meant as literals HOT 2
- Should allow single closing quote as initial punctuation
- Mistakenly categorises :email: as SymbolNode + WordNode HOT 1
- Please publish @types/parse-latin HOT 1
- Please publish @types/parse-latin
- Using custom prefix exceptions HOT 3
- Importing non-default export of "toString" as default in some plugins causes webpack errors HOT 10
- mergeNonWordSentences should give precedence to preceding, rather than following, children
- Throws an incorrect error
- Should have a list depicting how the parser works HOT 1
- (maybe) Should add the slash to inner word punctuation
- Typo in API makes apostrophes not work as inter-word punctuation
- Should add Location and Position to "TextNode" and "SourceNode" HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from parse-latin.