
Comments (2)

osainz59 commented on May 24, 2024

Hi @hanjiale ,

The hyperparameters you used look fine to me. You could try 1e-5 as the learning rate and increase the number of epochs to 5. You can also use load_best_model_at_end with accuracy (or a custom entailment f1_score) as metric_for_best_model to improve checkpoint selection. In any case, those hyperparameters should be fine. I am more concerned about the optimal_threshold value... Did you use the development set, or at least a small portion of it, to optimize the threshold? If not, that may explain the huge gap between precision and recall.
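A rough sketch of the settings suggested above, assuming the fine-tuning script builds a Hugging Face `TrainingArguments` and `Trainer` (the thread does not show the script itself). The dictionary name `training_kwargs` and the function `compute_metrics` are illustrative, and the metric is written in plain Python so the snippet runs without any dependencies.

```python
# Keyword arguments intended for transformers.TrainingArguments (assumed setup).
# The evaluation strategy must also be set to match "save_strategy"; its
# argument name differs between transformers versions, so it is left out here.
training_kwargs = {
    "learning_rate": 1e-5,               # suggested learning rate
    "num_train_epochs": 5,               # suggested number of epochs
    "load_best_model_at_end": True,      # reload the best checkpoint after training
    "metric_for_best_model": "accuracy", # key returned by compute_metrics below
    "greater_is_better": True,           # higher accuracy means a better checkpoint
    "save_strategy": "epoch",            # save once per epoch
}


def compute_metrics(eval_pred):
    """Return accuracy from a (logits, labels) pair.

    `logits` is a sequence of per-class score rows and `labels` a sequence
    of gold class indices, which is the shape a Trainer passes at evaluation.
    """
    logits, labels = eval_pred
    # Arg-max over each row of class scores.
    predictions = [max(range(len(row)), key=row.__getitem__) for row in logits]
    correct = sum(int(p == y) for p, y in zip(predictions, labels))
    return {"accuracy": correct / len(labels) if len(labels) else 0.0}
```

To select checkpoints by entailment F1 instead, the same function could return an extra key and `metric_for_best_model` would be set to that key.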

One of the weak points of the approach is the need to tune the threshold, at least on a very small portion of data. We are working on it and have very promising results, but the improvements are still a work in progress.
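For illustration, threshold tuning on a small development sample could look roughly like the sketch below. It is not the library's own implementation: the function names, the `no_relation` negative label, and the toy data are assumptions made for the example. Each prediction is a candidate relation with an entailment score, and anything scoring below the threshold falls back to the negative label.

```python
def precision_recall_f1(gold, predicted, negative_label="no_relation"):
    """Micro precision, recall and F1 over the positive (non-negative) labels."""
    correct = sum(1 for g, p in zip(gold, predicted)
                  if p == g and p != negative_label)
    n_pred = sum(1 for p in predicted if p != negative_label)
    n_gold = sum(1 for g in gold if g != negative_label)
    precision = correct / n_pred if n_pred else 0.0
    recall = correct / n_gold if n_gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def apply_threshold(labels, scores, threshold, negative_label="no_relation"):
    """Keep a candidate label only if its score reaches the threshold."""
    return [label if score >= threshold else negative_label
            for label, score in zip(labels, scores)]


def find_optimal_threshold(gold, labels, scores, steps=100,
                           negative_label="no_relation"):
    """Sweep thresholds in [0, 1] and return the first one with the best F1."""
    best_threshold, best_f1 = 0.0, -1.0
    for i in range(steps + 1):
        threshold = i / steps
        predicted = apply_threshold(labels, scores, threshold, negative_label)
        _, _, f1 = precision_recall_f1(gold, predicted, negative_label)
        if f1 > best_f1:
            best_threshold, best_f1 = threshold, f1
    return best_threshold, best_f1


# Toy development sample (made up for the example).
dev_gold = ["per:title", "no_relation", "org:founded_by",
            "no_relation", "per:title", "no_relation"]
dev_labels = ["per:title", "per:title", "org:founded_by",
              "org:founded_by", "per:title", "per:title"]
dev_scores = [0.95, 0.40, 0.90, 0.55, 0.85, 0.30]

threshold, f1 = find_optimal_threshold(dev_gold, dev_labels, dev_scores)
```

With a threshold of 0.0, every candidate in this toy sample is kept, so recall is high while precision is low. That is the kind of gap an untuned threshold can produce.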

Let me know,
Oscar

from ask2transformers.

hanjiale commented on May 24, 2024

I see. The problem was that I had removed the "dev_path" entry from tacred.relation.config.json. Thank you for your reply; I look forward to your new work!

