Git Product home page Git Product logo

cmrc-information-extraction-and-knowledge-injection's Introduction

CMRC-Information-Extraction-and-Knowledge-Injection

Multi-turn MRC Implementation for "A Study on the Information Extraction and Knowledge Injection for Machine Reading Comprehension"

基於 BERTBERT-HAEBERT-HAM 以及 ExCorD 的資源修改。 適用於英文多輪機器閱讀理解 QuAC

本程式碼為論文 A Study on the Information Extraction and Knowledge Injection for Machine Reading Comprehension 於多輪機器閱讀理解的實作部分。實作分為四個部分。

(1) Baseline: BERT, BERT-HAE, ExCorD

(2) Information Extraction(資訊擷取)

(3) Knowledge Graph(知識注入)

(4) Ensemble(N-best 答案進行 Reranking)

Clutering Strategies Clustering Strategies

Information Extraction 架構 IE

Knowledge Injection - WortNet 架構 KI1

Knowledge Injection - PLSA/PRGC 架構 KI2

Answer Re-ranking 示意圖 reranking

Conda 環境設定檔

  • tf.yml 模型訓練環境 (BERT-HAE, BERT-HAM)
  • ex.yml 模型訓練環境 (ExCorD-BERT, ExCorD-RoBERTa, PRGC, K-BERT)
  • ex2.yml 模型訓練環境 (ExCorD-DeBERTa)
  • prep.yml 資料預處理環境 (Clustering, Ensemble, WordNet, PLSA)

主程式位置

  • BERT-HAE: dialog/hea_clustering/bert_hae-master
  • BERT-HAE + IE: dialog/hea_clustering/hae_kg
  • BERT-HAE + KI: dialog/hea_clustering/hae_kg
  • ExCorD: dialog/excord/excord-main
  • ExCorD + IE: dialog/excord/excord-clus
  • ExCorD + KI: dialog/excord/kbert-ex
  • Ensemble: dialog/ensemble
  • BERT-HAM + IE: dialog/attentive_cls
  • PRGC-based KI: dialog/PRGC-main
  • WordNet based KG: dialog/PRGC-main
  • PLSA-based KG: plsa
  • PRGC based KG: wordnet

使用

  • Information Extraction
bash cls/run_cls.sh
  • Train & Prediction
bash run.sh
  • Evaluate
bash eval.sh

Information Extraction 結果 Result-IE

Knowledge Injection 結果 Result-KI

cmrc-information-extraction-and-knowledge-injection's People

Contributors

kamelain avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.