Git Product home page Git Product logo

joint-bootstrapping-machines's Introduction

"Joint Bootstrapping Machines for High Confidence Relation Extraction" in conference proceedings of NAACL-HLT 2018.
	
	
Dependecies:

- Python 2.7, Numpy, NLTK, Gensim, jellyfish, whoosh, etc.

	
Directory Structure:

Joint-Bootstrapping-Machines

  resources
    freebase-easy-14-04-14
	  freebase_facts.txt 
	  - Download from url http://freebase-easy.cs.uni-freiburg.de/dump/
	  
  data
    input
      sentences.txt 
      - Download corpus from url: https://drive.google.com/file/d/0B0CbnDgKi0PyM1FEQXJRTlZtSTg/view)
    output
      BREE
        REL_ACQUIRED_ORG_ORG
          relationships_baseline.txt 
          - The output file containing a list of the relationships extracted from BREE system)
      BRET
        REL_ACQUIRED_ORG_ORG
          relationships_config5.txt
          - The output file containing a list of the relationships extracted from BRET system)
      BREJ
        REL_ACQUIRED_ORG_ORG
          relationships_config9.txt
          - The output file containing a list of the relationships extracted from BREJ system)
		  
    code
      automatic_evaluation
        index_dir
        - Directory of corpus-index
		
        index_whoosh.py
        - To create corpus-index in directory index_dir.
		
        Sentence.py
        - To extract entities infomation, clean and filter.
		
        easy_freebase_clean.py
        - To collect relationships facts from Freebase and prepare databases.
		
        large_scale_evaluation_freebase.py
        - To automatically evaluate relation extraction systems on large-scale 
          (https://akbcwekex2012.files.wordpress.com/2012/05/8_paper.pdf).
			

Usage (Evaluation):
 
    $ python large_scale_evaluation_freebase.py threshold system_output rel_type database root_dir corpus-index
	
	
Example:

    $ cd Joint-Bootstrapping-Machines/code/automatic_evaluation
 
    $ python large_scale_evaluation_freebase.py 0.5 ../../data/output/BREE/REL_ACQUIRED_ORG_ORG/relationships_baseline.txt acquired ../../resources/freebase-easy-14-04-14/freebase_facts.txt ../../data ../../data/input/sentences.txt ./index_dir
	
	
	
	
	
	
	
	

joint-bootstrapping-machines's People

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.