Comments (5)
can you try changing the flags to:
--gin_param="decode_from_file.input_filename='$DATA_DIR/testdata_pred.tsv'"
--gin_param="decode_from_file.output_filename='$DATA_DIR/testdata_outputs.txt'"
from text-to-text-transfer-transformer.
It worked and produced an output. But at the end after the outputs were produced the script stalled at:
INFO:tensorflow:Waiting for new checkpoint at $MODEL_DIR
I can use it to produce what I want, but it's kinda odd that the script just stalls since I just told it to predict on that one checkpoint.
Also, FYI, the readme has another error in the instructions for invoking the decoder:
--gin_param == "utils.run.mode='infer'"
Should be changed to:
--gin_param="utils.run.mode='infer'"
This was a relatively easy error to find and fix though.
from text-to-text-transfer-transformer.
Thanks, I'll fix the README.
It actually should exit after outputting since you specified a checkpoint number. Can you paste the final command you used?
from text-to-text-transfer-transformer.
Can you try
--gin_param="utils.run.eval_checkpoint_step = 1005000"
and see if that fixes the stall?
from text-to-text-transfer-transformer.
Can you try
--gin_param="utils.run.eval_checkpoint_step = 1005000"
and see if that fixes the stall?
Yes, that worked, thanks! Looks like the argparse didn't put the arguments into the right places and it went with a default option?
from text-to-text-transfer-transformer.
Related Issues (20)
- ValueError when evaluating tuning model using Mtf library
- using A100(40G)*8 gpus server to train T5-3b,it reports OOM resource is exhausted problem HOT 2
- How should I speed up T5 exported saved_model by using TF-TRT ?
- model.finetune(...) does not show the loss of the model HOT 6
- CUDA OOM with HF Model
- Predictions are inconsistent unless model is reloaded for each prediction HOT 1
- how to change teacher forcing fashion to autogressive fashion in training stage?
- ERROR:root:Path not found: gs://t5-data/pretrained_models/large/operative_config.gin HOT 6
- Fine tuning t5 without TPU
- About "seqio" in "hf_model.py"
- Question about the metric reported in the paper?
- All attempts to get a Google authentication bearer token failed, returning an empty token. HOT 2
- How to fine-tune T5 with a Casual Language Modeling object?
- cmd vs entrypoint youtube video suggestion HOT 1
- Question about cross-node(multi-node) data parallelism on GPU HOT 1
- Dependencies in `setup.py` have module conflicts.
- How can I get the best checkpoint in Squad?
- Custom Model
- Columns and DataType Not Explicitly Set on line 163 of eval_utils_test.py
- Clarification on T5 Model Pre-training Objective and Denoising Process
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from text-to-text-transfer-transformer.