Git Product home page Git Product logo

datalogism / dbpedia-spotlight-model Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dbpedia-spotlight/dbpedia-spotlight-model

0.0 0.0 0.0 390 KB

DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. Improving Efficiency and Accuracy in Multilingual Entity Extraction approach

Home Page: http://www.dbpedia-spotlight.org

License: Apache License 2.0

Shell 0.43% Python 1.24% Java 42.12% Scala 56.00% PigLatin 0.21%

dbpedia-spotlight-model's Introduction

DBpedia Spotlight spotlight-build

Links

website - http://www.dbpedia-spotlight.org

demo service - http://demo.dbpedia-spotlight.org

General Notes

Since v1.0, DBpedia Spotlight was split into two versions, under the same API, as follow:

This important movement was the way that we found to deliver faster fixes and new releases, providing solutions for each annotation approach.

Our first achievement is related with licensing. DBpedia Spotlight Model is now full compliance with Apache 2.0. It means that you can use it without any commercial restrictions.

We are so excited because there's even more great news to come.

If you require any further information, feel free to contact us via [email protected]. We are already very excited to spend time with you on further community meetings and to publish new DBpedia releases.

Keep annotating,

All the best

Shedding Light on the Web of Documents

DBpedia Spotlight looks for ~3.5M things of unknown or ~320 known types in text and tries to link them to their global unique identifiers in DBpedia.

Compilation

The requirements to compile the DBpedia Spotlight project are as follows:

  • Maven 3.1, or later.
  • JDK 1.8

The process to compile the DBpedia Spotlight code to produce the corresponding jar files are as follows:

  1. [optional] export JAVA_HOME=/path/of/the/jdk1.8/ - only required if the default JDK is different
  2. run mvn package - this process will take some time

Each module (core, index, rest) will produce the corresponding jar file in the "target" directory, e.g., core/target or index/target. The jar file containing the DBpedia Spotlight annotation service will be in the rest module directory (rest/target/rest-1.1-jar-with-dependencies.jar).

To install the DBpedia Spotlight in the local maven repository, replace the mvn package command with the mvn install command.

Testing the DBpedia Spotlight service (the compiled jar file)

To test the produced jar file, download a language model from the DBpedia Databus, uncompress it and execute the following command:

java -Dfile.encoding=UTF-8 -Xmx10G -jar rest-1.1-jar-with-dependencies.jar /path/to/the/uncompress/language/model http://0.0.0.0:2222/rest

When the DBpedia Spotlight annotation service is ready, it could be queried through a curl command as follows:

curl http://0.0.0.0:222/rest/annotate --data-urlencode "text=Barack Obama was the president of the United States" - H "Accept: text/turtle"

Demonstration

Go to our Demonstration page, copy+paste some text and play with the parameters to see how it works.

Endpoints

https://api.dbpedia-spotlight.org/{LANGUAGE}/annotate

Call our web service

You can use our demonstration Web Service directly from your application.

curl https://api.dbpedia-spotlight.org/en/annotate  \
  --data-urlencode "text=President Obama called Wednesday on Congress to extend a tax break
  for students included in last year's economic stimulus package, arguing
  that the policy provides more generous assistance." \
  --data "confidence=0.35"

or for JSON:

curl https://api.dbpedia-spotlight.org/en/annotate  \
  --data-urlencode "text=President Obama called Wednesday on Congress to extend a tax break
  for students included in last year's economic stimulus package, arguing
  that the policy provides more generous assistance." \
  --data "confidence=0.35" \
  -H "Accept: application/json"

Run your own server

If you need service reliability and lower response times, you can run DBpedia Spotlight in your own In-House Server based on Docker.

Models and data

Models and raw data for most languages are available here.

Citation

If you use DBpedia Spotlight in your research, please cite the following paper:

@inproceedings{isem2013daiber,
  title = {Improving Efficiency and Accuracy in Multilingual Entity Extraction},
  author = {Joachim Daiber and Max Jakob and Chris Hokamp and Pablo N. Mendes},
  year = {2013},
  booktitle = {Proceedings of the 9th International Conference on Semantic Systems (I-Semantics)}
}

Licenses

All the original code produced for DBpedia Spotlight Model is licensed under Apache License, 2.0.

Documentation

More documentation is available from the DBpedia Spotlight wiki.

FAQ

Check the FAQ here

Maintainers

dbpedia-spotlight-model's People

Contributors

augusto-herrmann avatar julio-noe avatar kfitzgerald avatar m1ci avatar manonthegithub avatar ragnarok85 avatar sandroacoelho avatar skunnyk avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.