DECENT

Decoupled Encoding and Cross-Attention for Efficient Named Entity Typing

Environment

Requirements

The code has been tested with Python 3.9.2 and the following requirements.

$ pip install -r requirements.txt --extra-index-url https://download.pytorch.org/whl/cu113

Our GPU requires CUDA 11.3, whose PyTorch wheels are indexed under https://download.pytorch.org/whl/cu113. Depending on your hardware, you may be able to use a different CUDA version or a standard PyTorch build instead. In that case, however, we cannot guarantee that the environment setup will succeed.

Dotenv

To override certain environment variables (e.g. the wandb key), copy .env_template to .env and adapt it as needed.
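As a rough illustration of what a .env override does, the sketch below reads `KEY=VALUE` lines from a file and writes them into the process environment. It is not the loader used by this repository. The helper `load_env_file` is hypothetical, and the entry `WANDB_API_KEY` is only an assumed example of what .env_template might contain.

```python
import os
import tempfile
from pathlib import Path


def load_env_file(path, override=True):
    """Read KEY=VALUE lines from `path` into os.environ (hypothetical helper)."""
    loaded = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("'\"")
        # With override=False, variables that are already set are left alone.
        if override or key not in os.environ:
            os.environ[key] = value
        loaded[key] = value
    return loaded


# Demo with a temporary file; the variable name and value are assumptions.
with tempfile.TemporaryDirectory() as tmp:
    env_path = Path(tmp) / ".env"
    env_path.write_text("# wandb key\nWANDB_API_KEY=your-key-here\n", encoding="utf-8")
    loaded = load_env_file(env_path)

print(sorted(loaded))
```

Values from the file take precedence over variables that are already set unless `override=False` is passed.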

Data

We use the data and the respective format provided by Onoe et al.

Distantly Supervised Data

The distantly supervised data is from Choi et al.: https://www.cs.utexas.edu/~eunsol/html_pages/open_entity.html. It is only needed for pretraining.

Use the following script for formatting:

$ python scripts/format.py --file el_train.json --output el_train_processed.json

Checkpoints

Models

You can download the model checkpoints of DECENT trained on UFET, OntoNotes or FIGER.

Model Ids:

UFET: otkl66o1
FIGER: 1965bsiq
OntoNotes: 1oxzic0i

Output

DECENT: link (contains model output + best predictions of UFET, OntoNotes and FIGER for dev and test)
MLMET/Lite: link (contains model output + best predictions of UFET for dev and test)

Overview of best Loose Macro-F1 scores for the different training configurations. The respective threshold was identified using the validation dataset and is shown in parentheses.

Model	UFET (Dev)	UFET (Test)	FIGER (Dev)	FIGER (Test)	OntoNotes (Dev)	OntoNotes (Test)
DECENT (UFET)	50.84 (0.985)	49.74 (0.985)	60.37 (0.97)	69.12 (0.97)	82.41 (0.955)	83.10 (0.955)
MLMET	49.06 (0.5)	49.08 (0.5)	-	-	-	-
Lite	50.44 (0.93)	50.61 (0.93)	-	-	-	-
DECENT (FIGER)	-	-	90.87 (0.96)	83.81 (0.96)	-	-
DECENT (OntoNotes)	-	-	-	-	76.86 (0.99)	77.01 (0.99)

Training

To train a model:

$ python src/train.py --config config/train.yaml

Overview of Training Configurations:

Config	Description
`train.yaml`	Training DECENT on UFET
`figer.yaml`	Training DECENT on FIGER
`onto.yaml`	Training DECENT on OntoNotes
`pretrain.yaml`	Pretraining DECENT on distantly supervised UFET
`fine_tune.yaml`	Fine-tuning the pretrained model on UFET
`unist.yaml`	Training configuration of UniST on UFET

Important Flags:

Flag	Description
`--wandb.offline`	`True` to turn off wandb; default: `False`
`--result-dir`	Result directory

The full list of available parameters and their default values can be viewed in the respective file.

The parameters are given the following priority (highest to lowest):

Command line arguments, e.g. --model.optimizer_params.classifier.lr 0.005
Configuration file
Default values
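The priority order above can be sketched as a layered merge: defaults are overridden by the configuration file, which is in turn overridden by dotted command line arguments. This is a minimal illustration, not the repository's actual argument handling. The function names and the example values (including `result_dir`) are hypothetical.

```python
from copy import deepcopy


def deep_merge(base, override):
    """Return a copy of `base` with `override` merged in recursively."""
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def set_dotted(target, dotted_key, value):
    """Set target["a"]["b"]["c"] = value for the dotted key "a.b.c"."""
    *parents, leaf = dotted_key.split(".")
    node = target
    for name in parents:
        node = node.setdefault(name, {})
    node[leaf] = value


def resolve(defaults, config_file, cli_args):
    """Apply the three layers from lowest to highest priority."""
    resolved = deep_merge(defaults, config_file)
    for dotted_key, value in cli_args.items():
        set_dotted(resolved, dotted_key, value)
    return resolved


# Hypothetical values for illustration only.
defaults = {
    "model": {"optimizer_params": {"classifier": {"lr": 0.001}}},
    "result_dir": "results",
}
config_file = {"model": {"optimizer_params": {"classifier": {"lr": 0.01}}}}
cli_args = {"model.optimizer_params.classifier.lr": 0.005}

resolved = resolve(defaults, config_file, cli_args)
print(resolved["model"]["optimizer_params"]["classifier"]["lr"])
print(resolved["result_dir"])
```

Here the command line value wins for the learning rate, while `result_dir` falls back to its default because neither higher-priority layer sets it.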

Evaluation

1. Get Model Output

Use this script to get the predictions and output of a model for a specific dataset.

Please refer to the documentation in the respective file.

Example:

$ python src/predict.py predict --checkpoint BEST_MODEL.ckpt --dataset data/ufet/ufet_dev.json --labels data/ontology/ufet_types.txt --output OUTPUT_FOLDER --save-model-output CACHE_FOLDER --model-id 123 --batch-size 128

Note: --save-model-output needs to be defined to find the best threshold.

2. Best Threshold

Use this script to find the best threshold for a model given its output.

Please refer to the documentation in the respective file.

Example:

$ python src/predict.py reuse --cache MODEL_OUTPUT_CACHE.pkl --output OUTPUT_FOLDER --threshold-step 0.005
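To illustrate what a threshold search with a given step size involves, the sketch below sweeps candidate thresholds over cached per-type scores and keeps the one with the highest Loose Macro-F1. It uses one common formulation of Loose Macro-F1 (precision averaged over examples with at least one predicted type, recall averaged over examples with at least one gold type) and is not the code in src/predict.py. The function names, the data layout, and the toy scores are assumptions.

```python
def loose_macro_f1(gold, pred):
    """Loose Macro-F1 over parallel lists of gold and predicted type sets."""
    precisions = [len(g & p) / len(p) for g, p in zip(gold, pred) if p]
    recalls = [len(g & p) / len(g) for g, p in zip(gold, pred) if g]
    precision = sum(precisions) / len(precisions) if precisions else 0.0
    recall = sum(recalls) / len(recalls) if recalls else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def best_threshold(scores, gold, step=0.005):
    """Return (threshold, f1) maximizing Loose Macro-F1 on the given data.

    `scores` is a list of dicts mapping each type label to a score in [0, 1].
    Ties are resolved in favor of the lowest threshold.
    """
    best = (0.0, -1.0)
    n_steps = round(1 / step)
    for i in range(1, n_steps):
        threshold = i * step
        pred = [
            {label for label, score in example.items() if score >= threshold}
            for example in scores
        ]
        f1 = loose_macro_f1(gold, pred)
        if f1 > best[1]:
            best = (threshold, f1)
    return best


# Toy validation data (hypothetical scores and labels).
scores = [
    {"person": 0.991, "artist": 0.972, "place": 0.403},
    {"organization": 0.983, "company": 0.601, "person": 0.962},
]
gold = [{"person", "artist"}, {"organization"}]

threshold, f1 = best_threshold(scores, gold, step=0.005)
print(round(threshold, 3), round(f1, 4))
```

The threshold found this way on the validation set would then be reused unchanged on the test set.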

3. Additional Metrics

Use this script to get additional metrics regarding granularity and type of mention.

Please refer to the documentation in the respective file.

Example:

$ python eval.py --results PREDICTIONS.json --labels data/ontology/ufet_types.txt

Miscellaneous

UniST

We reimplemented parts of UniST for our evaluation. For more details, see the paper by Huang et al. You can train a model with our framework using the unist.yaml training configuration. Prediction and evaluation are the same as with DECENT.

OntoNotes & FIGER

To validate our approach, we used the fine-grained datasets for OntoNotes and FIGER. Training and prediction are the same as for UFET. We provide results and predictions both for a model that has been trained on the respective dataset and for the model that has been trained solely on UFET.
