Comments (2)
Hi @hanjiale ,
The hyperparameters you have used looks fine to me. Maybe I can suggest you to try 1e-5 as learning rate and increase the epochs to 5. You can also use the load_best_model_at_end
with accuracy (or defining the entailment f1_score) as metric_for_best_model
to improve the checkpoint selection. But in any case, those hyperparameters should be fine. I am concerned about the optimal_threshold
value... Did you use the development set, at least on a small portion of it, to optimize the threshold? Maybe that explains the huge gap between the precision and recall.
One of the weakness points of the approach is the need tof tunning the threshold, at least on a very small portion of data. We are working on it and have very promising results, but the improvements are still work in progress.
Let me know,
Oscar
from ask2transformers.
I see. The problem is that I remove the "dev_path"
in the tacred.relation.config.json
. Thank you for your reply and looking forward to your new work!
from ask2transformers.
Related Issues (12)
- Please update the README? HOT 3
- Tutorial or examples HOT 2
- Typo in apostrophes HOT 1
- Zero-shot Tacred Relation Classification HOT 2
- How to reproduce the EAE task result? HOT 3
- verbalization HOT 4
- Few-Shot RE HOT 2
- Run GLUE for fine-tuning Few-Shot Relation Classification HOT 2
- Positive (isNext) output for Next Sentence Prediction might be 0 HOT 3
- Incomplete documentation
- Fewshot checkpoints for TACRED HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ask2transformers.