winkjs / wink-ner Goto Github PK
View Code? Open in Web Editor NEWLanguage agnostic named entity recognizer
Home Page: http://winkjs.org/wink-ner/
License: MIT License
Language agnostic named entity recognizer
Home Page: http://winkjs.org/wink-ner/
License: MIT License
var ner = require( 'wink-ner' );
var myNER = ner();
myNER .defineConfig({ignoreDiacritics: true, tagsToIgnore: ['punctuation'], valuesToIgnore: []});
var trainingData = [
{ text: 'T-Mobile', entityType: 'company', symbol: 'TMUS' }
];
myNER.learn( trainingData );
var winkTokenizer = require( 'wink-tokenizer' );
var tokenize = winkTokenizer().tokenize;
var tokens = tokenize( 'I work for T-Mobile and I like it' );
tokens = myNER.recognize( tokens );
console.log( tokens );
'T-Mobile' is not recognized although punctuation is specified in defineConfig
.
Examples are:
UK
or U.K.
or U K
Kg
or K.g.
IBM
or I.B.M.
or I. B. M.
For example, if I want to identify a price, is there a way to define a learning entry composed of a currency token followed by a number token?
Hello,
I play around with the library. My training set leads to a lot of wrong entityType assignments, but I want to test it a bit more.
With my trained ner model. Is it possible to use that with wink-nlp? I haven't seen any docs for that. I would like to use the visualization features showcased in the wink-nlp with a custom annoted ner set.
Hello, nice package! Do you have a default training data file, so that I could train the model on some default settings to get it up and running? Rather than having to create the training data file myself? Thanks!
The examples shows how NER can identify and distinguish between Manchester (the city) vs Manchester United (the club).
However, I can't find a way to do the opposite.
Consider USA and United States of America. Both should be detected as the same entity.
Is this possible?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.