Fine-Grained Entity Recognizer (FIGER)

This distribution contains the source code for the experiments presented in the following research publication (PDF):

Xiao Ling and Daniel S. Weld (2012).
"Fine-Grained Entity Recognition",
in Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI), 2012.

Download the model file

The trained model can be tested on the evaluation data or on new data.

Run ./downloadModel.sh to download the model and save it in the root directory.

A better model has since been trained and can be fetched with ./downloadModel.sh new. Change the corresponding config value (e.g. modelFile in config/figer.conf) accordingly.

Requirement

sbt >= 0.13.0

Replicate the experiments

To run the experiments in the AAAI-12 paper, you can proceed as follows:

$ ./run.sh "aaai/exp.conf" &> aaai/exp.log

Run FIGER on new data

To make predictions on new data, see the class edu.washington.cs.figer.FigerSystem for example code, or run:

$ sbt "runMain edu.washington.cs.figer.FigerSystem <text_file>"

Alternatively, you can change the parameter values (e.g. the input file name) in config/figer.conf and get a more structured output by running:

$ ./run.sh "config/figer.conf"

Make a stand-alone jar

$ sbt assembly

A simple web interface

Run

$ sbt ~container:start

and go to localhost:8080/index.html for a simple web demo.

Training Data

The training data train.data.gz is a gzipped file serialized with Protocol Buffers. See entity.proto in the code package for the message definitions.

Download link

To train a new model, make the following changes in config/figer.conf:

useModel=false
modelFile=<the output model file>
# the training file has to follow the specs from `entity.proto`. See `train.data.gz` for example
trainFile=<training file>

Then run ./run.sh config/figer.conf to train a new model. Training needs over 10 GB of memory and takes about an hour.
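
The config changes above can also be scripted. The following is a minimal sketch, assuming config/figer.conf consists of plain key=value lines as in the snippet above. set_conf is a hypothetical helper, not part of the FIGER distribution, and the file names are placeholders. The demo runs against a temporary file, so the real config is left untouched.

```shell
#!/bin/sh
# set_conf FILE KEY VALUE
# Hypothetical helper: sets KEY=VALUE in a key=value config file.
# Any existing line for KEY is dropped and the new setting is appended,
# so the order of lines may change.
set_conf() {
    file=$1
    key=$2
    value=$3
    tmp=$(mktemp)
    # grep -v exits non-zero when no lines remain; that is not an error here.
    grep -v "^${key}=" "$file" > "$tmp" || true
    printf '%s=%s\n' "$key" "$value" >> "$tmp"
    # Write back through cat so the original file keeps its permissions.
    cat "$tmp" > "$file"
    rm -f "$tmp"
}

# Demo on a throwaway file with made-up starting values.
demo=$(mktemp)
printf 'useModel=true\nmodelFile=old.model\n' > "$demo"

set_conf "$demo" useModel false
set_conf "$demo" modelFile my.model           # placeholder output model file
set_conf "$demo" trainFile my.train.data.gz   # placeholder training file

cat "$demo"
rm -f "$demo"
```

To apply the same settings to the real file, call set_conf with config/figer.conf as the first argument and then run ./run.sh config/figer.conf as described above.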

Mapping between Freebase MIDs and Wikipedia Titles

A mapping file from Freebase MIDs to Wikipedia titles is available here.
