Git Product home page Git Product logo

jointre's Introduction

Dat Quoc Nguyen is a Senior Research Scientist and the Head of the Natural Language Processing department at VinAI Research, Vietnam. He was an Honorary Fellow in the School of Computing and Information Systems at the University of Melbourne, Australia, where previously he was a Research Fellow. Before that, he received his Ph.D. in Computer Science from Macquarie University, Australia.

Dat Quoc Nguyen is the author of 70 peer-reviewed publications covering core NLP problems, ML methods for NLP and their applications for low-resource languages and specific domains, with over 5500 citations and an h-index of 34 (Google Scholar). He released many ML/NLP toolkits and datasets, which are widely used in both academia and industry. He also created large language models and other foundation models, including PhoGPT, RecGPT, PhoBERT, BARTpho, XPhoneBERT and BERTweet, with millions of downloads.

jointre's People

Contributors

datquocnguyen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

jointre's Issues

How do you load the saved model ?

Im working on something similar so curios how you load a saved model to be used as inference.I understand we'd have to load the parameters file from what I have seen params file is not getting created.

No documentation or scripting to download GloVe pretrained embeddings

Not a big issue, but it's probably worth a quick update to include either documentation in the readme to download the GloVe pretrained 100B word embeddings, or a curl command to the scripts, otherwise following procedure causes an assertion error that requires traceback to the missing file.

Traceback (most recent call last):
  File "jNERE.py", line 77, in <module>
    parser = learner.jNeRE(words, nertags, rels, w2i, c2i, options)
  File ".../jointRE/jNERE/learner.py", line 58, in __init__
    assert (ext_emb_dim == self.wdims)
AssertionError

shortest path dependency

Im trying to modify this code to include shortest path dependency to this. Dependency parsing and pos tags would be additional features. Do you think we can collaborate if you have the badnwidth

questions about train

Thanks for your hard word,
1.I have learned the paper,found that
'although using Biaffine increases training time over using Linear by 35%, relatively,',I want to know the speed of train and the machine configuration
2.whether to support batch_size?
I want to use it in my Chinese dataset

Looking forward to your reply

Invalid gradient occurs during training at later epochs

I'm noticing that the training routine can fail due to NaN gradient values occasionally.

Starting epoch 91
Processing sentence number: 100 , Loss: 0.0293 , Time: 12.89
Processing sentence number: 200 , Loss: 0.0100 , Time: 14.91
Traceback (most recent call last):
  File "jNERE.py", line 83, in <module>
    parser.Train(train_data, train_id2nerBILOU, id2arg2rel)
  File ".../jointRE/jNERE/learner.py", line 449, in Train
    self.trainer.update()
  File "_dynet.pyx", line 5728, in _dynet.Trainer.update
  File "_dynet.pyx", line 5733, in _dynet.Trainer.update
RuntimeError: Magnitude of gradient is bad: -nan

Problems in metrics

Hello,

I think there are two issues with your metrics that prevent comparison with state-of-the-art:

  1. In your paper, you write about macro-averaged F1 which indeed seems to be what you use in your code. Almost every work in NER or RE use micro-averaged F1 score.

  2. You discard the "Other" entity type. As far as I understand in (Bekoulis 2018), this is only done with Entity Classification (i.e. your Setup 2) and not joint NER and RE.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.