Comments (4)
Well, the SQuAD dataset contains more than 100K question-answer pairs. NLP.js targets more modest datasets. I didn't run the test, but training the logistic regression classifier with 100K intents and the number of features (distinct words) in SQuAD is surely too much for training on a CPU, and the matrix of features and labels alone can be too large.
Perhaps TensorFlow or NLTK with GPUs would be a better approach. Who knows, perhaps in the future we will go down the path of tensorflow.js and NLTK, but right now it is not on the roadmap.
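To put rough numbers on the matrix size, here is a back-of-the-envelope sketch in TypeScript. It assumes dense storage with 8-byte floats, which may not match how NLP.js stores features internally, and the vocabulary size of 50,000 distinct words is a made-up figure for illustration, not a measured property of SQuAD. The function names are hypothetical.

```typescript
// Size in bytes of a dense numeric matrix.
// bytesPerValue = 8 assumes 64-bit floats (the JavaScript number type).
function denseMatrixBytes(rows: number, cols: number, bytesPerValue: number = 8): number {
  return rows * cols * bytesPerValue;
}

// Convert bytes to GiB (1 GiB = 1024^3 bytes).
function toGiB(bytes: number): number {
  return bytes / (1024 * 1024 * 1024);
}

const samples = 100000;    // ~100K utterances, as mentioned in the thread
const intents = 100000;    // one intent per question, as in the thread
const vocabulary = 50000;  // ASSUMPTION: illustrative number of distinct words

// One row per training utterance, one column per word feature.
const featureMatrixGiB = toGiB(denseMatrixBytes(samples, vocabulary));
// One weight per (intent, feature) pair for a multiclass logistic regression.
const weightMatrixGiB = toGiB(denseMatrixBytes(intents, vocabulary));
```

Under these assumptions each of the two matrices comes to roughly 37 GiB, so either one alone would exceed a 32 GB machine. A sparse representation would shrink the feature matrix considerably, but the weight matrix still grows with intents times features.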
from nlp.js.
I tried training on ~100K Q&A pairs on an AWS instance (32 GB memory, 8 vCPUs, 75 GB SSD), but it didn't create the model.nlp file and didn't show any error. No error, no model.nlp :/ I don't know what the problem is. While training on the dataset, memory usage is at around 16 GB.
from nlp.js.
We have an example supporting 10,000 intents, extracted from SQuAD v2.
Open Question with BERT has been added, but it needs a Python app to work.
Right now I'm working on making OpenQuestion fully JavaScript. The first problem is that tokenizers only works in Node 11 or 13 and is a bridge to Rust; I solved that by building a BERT WordPiece tokenizer myself. Now I'm working on the TensorFlow open question model and runtime.
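For readers unfamiliar with WordPiece, here is a minimal TypeScript sketch of the greedy longest-match-first splitting that BERT tokenizers apply to a single word. It is not the tokenizer from NLP.js: the function name, the toy vocabulary, and the example word are all hypothetical, and a real tokenizer would also handle lowercasing, punctuation splitting, and a maximum word length.

```typescript
// Split one word into WordPiece tokens using greedy longest-match-first.
// Pieces that do not start the word carry the "##" continuation prefix.
// If any part of the word cannot be matched, the whole word becomes unkToken.
function wordPieceTokenize(
  word: string,
  vocab: Set<string>,
  unkToken: string = "[UNK]"
): string[] {
  const pieces: string[] = [];
  let start = 0;
  while (start < word.length) {
    let end = word.length;
    let match: string | null = null;
    // Try the longest remaining substring first, then shrink from the right.
    while (end > start) {
      const candidate = (start > 0 ? "##" : "") + word.slice(start, end);
      if (vocab.has(candidate)) {
        match = candidate;
        break;
      }
      end -= 1;
    }
    if (match === null) {
      return [unkToken];
    }
    pieces.push(match);
    start = end;
  }
  return pieces;
}

// Toy vocabulary for illustration only; a real BERT vocabulary has ~30K entries.
const toyVocab = new Set<string>(["un", "##aff", "##able", "play", "##ing"]);
const examplePieces = wordPieceTokenize("unaffable", toyVocab);
// examplePieces is ["un", "##aff", "##able"]
```

Because the algorithm only needs a vocabulary lookup and string slicing, it can run in plain JavaScript without the Rust bridge mentioned above.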
from nlp.js.
Closing, as integration with the BERT API is provided.
from nlp.js.