Comments (6)
It only affects some of the metrics. I think we should handle this the same way we handle #218, for instance by raising a warning and setting the column to NaN.
from textdescriptives.
The error is correct. There is no lexeme_prob for Croatian. See the full list here. Thus this is intended behaviour. It might not be ideal, though; might it be more robust to display a warning?
So should we remove Croatian from the list of languages (in the app)? Or does it only affect some of the metrics?
If the language stays, we should make the error/warning more helpful.
A warning and NaN seems reasonable, yes!
We might as well batch the two issues - up for grabs if anyone is interested ;)
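For anyone picking this up, here is a rough sketch of the "warn and set to NaN" idea discussed above. It is not how textdescriptives implements its components: `safe_metrics`, `token_count`, and `missing_lexeme_prob` are hypothetical names made up for illustration, and the failing metric is simulated with a plain `ValueError` rather than a real spaCy lookup.

```python
import math
import warnings
from typing import Callable, Dict


def safe_metrics(
    extractors: Dict[str, Callable[[str], float]], text: str
) -> Dict[str, float]:
    """Run each extractor; on failure, warn and record NaN instead of raising."""
    results: Dict[str, float] = {}
    for name, extract in extractors.items():
        try:
            results[name] = extract(text)
        except (ValueError, KeyError) as err:
            # One unsupported metric should not abort the whole extraction.
            warnings.warn(
                f"Metric '{name}' is unavailable ({err}); setting it to NaN.",
                stacklevel=2,
            )
            results[name] = math.nan
    return results


def token_count(text: str) -> float:
    # A metric that works for any language: whitespace-separated token count.
    return float(len(text.split()))


def missing_lexeme_prob(text: str) -> float:
    # Hypothetical stand-in for a metric whose lookup table is missing
    # for the current language (the situation reported in this issue).
    raise ValueError("no lexeme_prob table for this language")


if __name__ == "__main__":
    metrics = {"n_tokens": token_count, "perplexity": missing_lexeme_prob}
    print(safe_metrics(metrics, "Dobar dan svima"))
```

With this shape, the metrics that do work still come back, and the unsupported column shows up as NaN with a warning explaining why, which is roughly the behaviour proposed for both this issue and #218.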
Related Issues (20)
- Avoid calculating metrics multiple times HOT 3
- Allow threshold for the quality pipeline to be changed after data thresholds have been set. HOT 1
- Make output of doc._.quality specify if each filter was passed HOT 3
- New Metric: hertz component HOT 2
- New Metric: Approximate entropy HOT 2
- Add proportion of words in vocabulary
- Add Unigram entropy
- How is `doc_length` different from other length measures such as `n_characters`? HOT 3
- Ensure component consistency HOT 4
- Quick start not working as expected HOT 4
- Quickstart broke on Spacy example HOT 1
- Issue with tests HOT 1
- Failing test HOT 5
- Demos / Browser-Based Usage HOT 34
- Fails for empty strings HOT 4
- Support transformer models
- Pyphen does not support all languages that spaCy does HOT 2
- Listed metrics deviate between extraction functions in docs HOT 7
- References for readability metrics HOT 4