Comments (5)
This seems to be the same (or similar) kind of error I ran into (#43). We didn't really manage to pinpoint the exact source of the problem at that time.
One workaround that worked for me was replacing -np.inf
by a large negative number. But if you can help trace down why this sometimes happens and sometimes doesn't, it would be great, too!
from skweak.
Could you send a minimal data example, so I can have a look at it?
Typically, when this happens, it means that the HMM has reached a position where no state is possible (all values have a probability of zero), given the current transition and emission probabilities.
from skweak.
Could you send a minimal data example, so I can have a look at it? Typically, when this happens, it means that the HMM has reached a position where no state is possible (all values have a probability of zero), given the current transition and emission probabilities.
example:
Sentence: Eastern Equine Encephalitis Virus (EEEV) is a dangerous disease
we have predicted labels before from 2 models.
model1: ["negative", "not safe"]
model2: ["positive", "not safe"]
how to generate a labelling function for this kind of multi labelling.
from skweak.
same error occured to me while working on an NER Task with the train set from the CoNLL dataset.
from skweak.
Ok, I found what was going on (and sorry for the answering delay): even though the aggregation model allows for multi-label predictions, skweak
still assumes that each labelling source only provides a single label for a given text. In other words, the aggregator may predict multiple labels, but each labelling source must stick to one single label at a time.
Fortunately, there is an easy way to change your code: just create a separate labelling source for each possible label outputted by a given model, like model1_negative
, model1_positive
, model1_not_safe
, etc.
from skweak.
Related Issues (20)
- matcher annotator HOT 1
- Functionality to construct the detected span from start and end index HOT 1
- Converting .spacy files to conll format to train other models on it. HOT 5
- skweak.utils.docbin_reader always loads 'en_core_web_md' regardless which model was saved? HOT 2
- Support for loading any pre-trained model inside the 'Model Annotator' HOT 2
- SpanCategorizer HOT 1
- Custom NER model training HOT 2
- Support options in displacy.render
- minimal example not working HOT 3
- Does skweak use POS tags and lemma information to aggregate labels? HOT 1
- How to use the already available Label Matrix to train Skweak? HOT 1
- Step by step NER alternative 2 HOT 1
- Annotating whole sentences (without using regex) HOT 2
- Adding to the gazetteer annotator constrains HOT 1
- Is skweak being actively maintained and will be maintained? HOT 1
- How to import annotator in the annotator(doc)
- hmmlearn 0.3.0 HOT 1
- Update examples stepbystep
- How to use prefix ner tags with skweak aggregation.HMM HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from skweak.