Comments (5)
We trained and evaluated our models on English datasets, and our released models can only handle English.
However, you can definitely train your own models on other datasets or in other languages using our code. To do so, besides converting the dataset into the format described in the repo, you need to add the dataset's labels in shared/const.py and add the dataset to the --task argument in run_entity.py and run_relation.py.
If your data is in Chinese, you may want to use a pre-trained language model that supports Chinese (e.g., bert-base-multilingual-uncased).
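As a rough illustration of the two code changes described above, here is a minimal, self-contained sketch. It does not reproduce the repository's files. It assumes that shared/const.py keeps label lists in dictionaries keyed by task name and that the scripts restrict --task through argparse choices. The dictionary names, the --model flag, the dataset name my_zh_dataset, and all labels are hypothetical placeholders; check the actual files before copying anything.

```python
import argparse

# Hypothetical dataset name, used only for this illustration.
NEW_TASK = "my_zh_dataset"

# Assumed layout of the label tables in shared/const.py:
# task name -> list of label strings. Existing entries would stay untouched;
# only the new dataset's entry is added. The labels below are placeholders.
task_ner_labels = {
    NEW_TASK: ["PER", "ORG", "LOC"],
}
task_rel_labels = {
    NEW_TASK: ["works_for", "located_in"],
}


def build_parser(known_tasks):
    """Build a parser whose --task only accepts registered dataset names."""
    parser = argparse.ArgumentParser()
    # Assumption: the scripts validate --task against a fixed list of names,
    # so the new dataset has to be added to that list.
    parser.add_argument("--task", type=str, required=True,
                        choices=sorted(known_tasks))
    # Assumption: the encoder is selected by name through a flag like this.
    parser.add_argument("--model", type=str,
                        default="bert-base-multilingual-uncased")
    return parser


# Parse an explicit argument list instead of sys.argv to keep the demo runnable.
args = build_parser(task_ner_labels).parse_args(["--task", NEW_TASK])
ner_labels = task_ner_labels[args.task]
rel_labels = task_rel_labels[args.task]
```

The same dataset name has to appear in both label tables and in the accepted --task values of both scripts; otherwise the label lookup or the argument check fails before training starts.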
Hope this helps!
from pure.
thanks a lot!
from pure.
Great job. So, if I make these changes, will I be able to use your code on any dataset for the same task?
from pure.
@YaoXinZhi Yes, the code should work on other datasets of the entity-relation extraction task.
from pure.
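Can this code handle Chinese text?
Hello, I am running comparison experiments. Could you share a version of the code that runs on a Chinese dataset, along with a few data examples (so that I can use ChatGPT to quickly convert the data format)? My email is [email protected]. Many thanks!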
from pure.