Comments (5)
Sorry, no we haven't released the training code.
from bilm-tf.
In order to create ELMO embeddings for my own dataset, do I need to use the trained weights provided by the following link : https://s3-us-west-2.amazonaws.com/allennlp/models/elmo/2x4096_512_2048cnn_2xhighway/elmo_2x4096_512_2048cnn_2xhighway_weights.hdf5
Thank you
from bilm-tf.
Yes, that is the weight file for the pre-trained biLM. You will need the options file that specifies the hyperparameters. These both linked from README.md.
from bilm-tf.
Thank you.
That is the weight file for the pre-trained biLM. I will use it with its options file like you mentioned in README.md. So in order to create ELMO embeddings for questions and paragraphs separately, which of your example python files I need to follow? (usage_cached.py or usage_token.py) .
Can you please correct me if I am wrong? Per my understanding, usage_token is like Glove weights right? It provides weights for each token in voc?
from bilm-tf.
Can you provide more detailed instructions to train my own elmo embedding in your README? I want to train elmo on NYT and compare against word2vec or glove.
from bilm-tf.
Related Issues (20)
- UnicodeError when run example: python -m unittest discover tests/ HOT 2
- how many time it takes if I training a 5GB corpus on 1080ti
- How to use ELMo in Keras version? HOT 1
- why should vocabulary file be sorted in descending order by token count in our training data?
- bug during data loading (and tests) HOT 2
- Training ELMO on TPU for generating embedding from custom dataset
- how to freeze some layers during training HOT 1
- Import Error: No module named bilm.training when I try to incrementally train ELMo HOT 1
- drastic impact of Changing the vocabulary on perplexity HOT 1
- incremental training on less than 2000 sentences HOT 2
- elmo sentence embeddings using usage_token.py
- The PyTorch version replication of the "Deep contextualized word representations" is missing
- Does any code about ELMO+ESIM? HOT 1
- Exporting finetuned model to SavedModel format for Tensorflow Serving HOT 6
- Compute training time
- Preparing training data for a domain with many multi keyword token HOT 1
- Running into error on tests HOT 3
- Saving the custom trained ELMO embedding (hdf5) in the format of tensorhub models
- 2021-10-29 14:13:30.313623: E C:\users\nwani\_bazel_nwani\mmtm6wb6\execroot\org_tensorflow\tensorflow\stream_executor\cuda\cuda_event.cc:49] Error polling for event status: failed to query event: CUDA_ERROR_OUT_OF_MEMORY
- Pre-trained links do not work HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bilm-tf.