Git Product home page Git Product logo

nliwod's Introduction

NLIWOD - Natural Language Interfaces for the Web of Data

Build Status Project Stats BCH compliance Codacy Badge codecov

Collection of tools, utilities, datasets and approaches towards realizing natural language interfaces for the Web of Data. Currently, we are focusing on Question Answering (QA) utilities.

Especially, this repository contains

  • QA Systems: A set of existing online webservices of QA systems all executable via a simple Java interface.
  • QA Datasets: A collection of existing Question Answering datasets
  • QA Machine Learning: This projects aims at learning a ML-based algorithm to combine multiple QA systems into one
  • QA Commons: A collection to ease handling of QA datasets. It allows to load, store and evaluate datasets and systems.

We aim at providing a fast entrance to the field of natural language interfaces (search, question answering, ranking). Thus, we will offer here Maven dependencies and source code for using many available datasets, systems and techniques.

More interesting Question Answering and Natural Language Generation projects can be found here:

  • SemWeb2NL https://github.com/aksw/semweb2nl: This repository provides means to verbalize triples, entities, SPARQL queries and many more RDF dataset. This is especially useful to allow users understand Web of Data content.

Foreseen modules:

  • QA Features: Features calculated on a NL question to train ML algorithms.

If you are interested in standardization efforts join or W3C Commmunity Group https://www.w3.org/community/nli/ !

For developers

To deploy a new version increase the according versions in the pom.xml and execute mvn clean deploy after setting your ~.m2/settings.xml in accordance to https://wiki.aksw.org/private/infrastructure/aksw-responsibilities/maven .

Maven Dependency

This library is available as snapshot here: http://maven.aksw.org/archiva/#artifact~snapshots/org.aksw.qa/datasets

Add the following repository to your project:

Artifacts are described in the sub-modules.

Look for more interesting libraries here: http://maven.aksw.org/archiva/#browse/org.aksw.qa

Papers using NLIWOD

nliwod's People

Contributors

2start avatar cdemmler avatar earthquakesan avatar firmao avatar idreestahir avatar jonathanhuthmann avatar konradhoeffner avatar lorenzbuehmann avatar lukasbluebaum avatar michaelroeder avatar mnafshin avatar mohnish-rygbee avatar nikit91 avatar param-jot avatar perevalov avatar querela avatar renespeck avatar ricardousbeck avatar rrichajalota avatar santteegt avatar suganya31 avatar vdanielupb avatar ysu1989 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

nliwod's Issues

Merge SPARQL functionality

In qa.commons, there are 2 different ways to fire queries to an endpoint, SPARQL and SPARQLExecutor. This becomes tedious to keep clean, especially when both share the same (hardcopied) code.
Consider merging the functionality in favor of SPARQL.

org.apache.jena.riot.lang.ReaderRIOTRDFXML NoClassDefFoundError Exception when deploying qa.hawk

Hello,

I'm getting the a NoClassDefFoundError exception when trying to deploy hawk using docker. I tried adding the jena-arq dependency but then different classes appeared missing. What could be the problem ? I'm using the latest version of the source code

This is the entire stacktrace during spring startup:

Error starting ApplicationContext. To display the auto-configuration report re-run your application with 'debug' enabled.
Application startup failed
org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'gerbilQABenchmarking' defined in file [/home/ciadmin/NLIWOD/qa.hawk/target/classes/org/aksw/hawk/webservice/GerbilQABenchmarking.class]: Instantiation of bean failed; nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [org.aksw.hawk.webservice.GerbilQABenchmarking]: Constructor threw exception; nested exception is java.lang.NoClassDefFoundError: org/apache/jena/riot/lang/ReaderRIOTRDFXML
	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.instantiateBean(AbstractAutowireCapableBeanFactory.java:1110)
	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.createBeanInstance(AbstractAutowireCapableBeanFactory.java:1054)
	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.doCreateBean(AbstractAutowireCapableBeanFactory.java:512)
	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.createBean(AbstractAutowireCapableBeanFactory.java:482)
	at org.springframework.beans.factory.support.AbstractBeanFactory$1.getObject(AbstractBeanFactory.java:306)
	at org.springframework.beans.factory.support.DefaultSingletonBeanRegistry.getSingleton(DefaultSingletonBeanRegistry.java:230)
	at org.springframework.beans.factory.support.AbstractBeanFactory.doGetBean(AbstractBeanFactory.java:302)
	at org.springframework.beans.factory.support.AbstractBeanFactory.getBean(AbstractBeanFactory.java:197)
	at org.springframework.beans.factory.support.DefaultListableBeanFactory.preInstantiateSingletons(DefaultListableBeanFactory.java:754)
	at org.springframework.context.support.AbstractApplicationContext.finishBeanFactoryInitialization(AbstractApplicationContext.java:866)
	at org.springframework.context.support.AbstractApplicationContext.refresh(AbstractApplicationContext.java:542)
	at org.springframework.boot.context.embedded.EmbeddedWebApplicationContext.refresh(EmbeddedWebApplicationContext.java:122)
	at org.springframework.boot.SpringApplication.refresh(SpringApplication.java:761)
	at org.springframework.boot.SpringApplication.refreshContext(SpringApplication.java:371)
	at org.springframework.boot.SpringApplication.run(SpringApplication.java:315)
	at org.springframework.boot.SpringApplication.run(SpringApplication.java:1186)
	at org.springframework.boot.SpringApplication.run(SpringApplication.java:1175)
	at org.aksw.hawk.webservice.WebApplication.main(WebApplication.java:9)
Caused by: org.springframework.beans.BeanInstantiationException: Failed to instantiate [org.aksw.hawk.webservice.GerbilQABenchmarking]: Constructor threw exception; nested exception is java.lang.NoClassDefFoundError: org/apache/jena/riot/lang/ReaderRIOTRDFXML
	at org.springframework.beans.BeanUtils.instantiateClass(BeanUtils.java:154)
	at org.springframework.beans.factory.support.SimpleInstantiationStrategy.instantiate(SimpleInstantiationStrategy.java:89)
	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.instantiateBean(AbstractAutowireCapableBeanFactory.java:1102)
	... 17 more
Caused by: java.lang.NoClassDefFoundError: org/apache/jena/riot/lang/ReaderRIOTRDFXML
	at org.apache.jena.tdb.TDB.init(TDB.java:248)
	at org.apache.jena.tdb.sys.InitTDB.start(InitTDB.java:29)
	at org.apache.jena.system.JenaSystem.lambda$init$1(JenaSystem.java:111)
	at java.util.ArrayList.forEach(ArrayList.java:1255)
	at org.apache.jena.system.JenaSystem.forEach(JenaSystem.java:186)
	at org.apache.jena.system.JenaSystem.forEach(JenaSystem.java:163)
	at org.apache.jena.system.JenaSystem.init(JenaSystem.java:109)
	at org.apache.jena.rdf.model.ModelFactory.<clinit>(ModelFactory.java:49)
	at org.aksw.qa.annotation.index.IndexDBO_classes.index(IndexDBO_classes.java:115)
	at org.aksw.qa.annotation.index.IndexDBO_classes.<init>(IndexDBO_classes.java:63)
	at org.aksw.hawk.nlp.Annotater.<init>(Annotater.java:27)
	at org.aksw.hawk.controller.PipelineStanford.<init>(PipelineStanford.java:55)
	at org.aksw.hawk.webservice.GerbilQABenchmarking.<init>(GerbilQABenchmarking.java:28)
	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
	at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
	at org.springframework.beans.BeanUtils.instantiateClass(BeanUtils.java:142)
	... 19 more
Caused by: java.lang.ClassNotFoundException: org.apache.jena.riot.lang.ReaderRIOTRDFXML
	at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
	at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
	at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:335)
	at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
	... 37 more

Creating surface forms from redirects

My old script does not work anymore, so either we fix that script or make this project run https://github.com/dice-group/SFGeneratorTemp

if [ ! -f "redirect_labels.ttl" ];
then
	sort labels_en.ttl > labels_en.ttl.sorted 
	sort transitive_redirects_en.ttl  > transitive_redirects_en.ttl.sorted
	join -1 1 -2 1 labels_en.ttl.sorted transitive_redirects_en.ttl.sorted  > join.tsv
	cat join.tsv | awk '{sub("<http://www.w3.org/2000/01/rdf-schema#label>","",$0);sub("<http://dbpedia.org/ontology/wikiPageRedirects>","",$0); sub(" .  "," ",$0); print $0}'| awk -F">  \"" '{print "\""$2 }'| awk -F"@en <" '{print $2 " <http://www.w3.org/2000/01/rdf-schema#label> " $1}' | awk '{sub(" . "," ",$0);  print "<"$0 " . " }' > redirect_labels.ttl
fi 

Tasks Lukas

@lukas fühl dich frei hier weitere Checkboxes hinzuzufügen

False answers running HAWK and questions about Fuseki and Indexing

Hello, I'm trying to familiarize myself with HAWK and have encountered some problems.
Executing HAWK using mvn spring-boot:run worked well, but unfortunately I have yet to get back a correct answer, e.g. the answer for the query localhost:8181/simple-search?query=What is the capital of Spain ? was http://dbpedia.org/resource/Captain_Alatriste (At least this was found under the answer key) which is obviously wrong. I have attached the complete result for reference: response.txt . My question is now: Did I do something wrong or is this a known issue?

After having run index_fuseki.sh as suggested in the qa.hawk/deploy-scripts/README.md I tried to unzip the file I downloaded from ( http://139.18.2.164/rusbeck/hawk/index_2016-04.zip), which was not possible and returned the following:

  End-of-central-directory signature not found.  Either this file is not
  a zipfile, or it constitutes one disk of a multi-part archive.  In the
  latter case the central directory and zipfile comment will be found on
  the last disk(s) of this archive.
unzip:  cannot find zipfile directory in one of index_2016-04.zip or
        index_2016-04.zip.zip, and cannot find index_2016-04.zip.ZIP, period.

I don't think the file got corrupted while I downloaded it, because I downloaded it muliple times.

Building and running the docker image for the Fuseki server however was successful.
Which brings me to the last question: Does hawk automatically search for this fuseki server on port 3030 or do I have to configure this somewhere?

when I execute the index_fuseki.sh,one error occured,please help to look at about this.

Exception in thread "main" java.lang.UnsupportedClassVersionError: org/apache/jena/tdb/store/bulkloader2/CmdNodeTableBuilder : Unsupported major.minor version 52.0

above error should happen when execute:
$APACHE_JENA_BIN/tdbloader2 --loc=hawk_data_10-2015_dbpediatbd dbpedia_2015-10.owl disambiguations_en.ttl instance_types_en.ttl long_abstracts_en.ttl pnd_en.ttl labels_en.ttl mappingbased_objects_en.ttl mappingbased_literals_en.ttl persondata_en.ttl specific_mappingbased_properties_en.ttl transitive_redirects_en.ttl en_surface_forms.ttl pagerank_en_2015-10.ttl redirect_labels.ttl

java -Xmx16G -cp $JENA_FUSEKI_JAR jena.textindexer --desc=fuseki_hawk_assembler.ttl

Remove CoreNLP library

Remove the CoreNLP dependency at all and remove or replace all classes using it to decrease package size

QALD 6 test

Please implement QALD 6 test multilingual and hybrid in qa-commons

HawK resources and tools

Hello,
I'm really interested in getting into HawK
However, when tried to test the system locally, i found some differences between the published paper and the documentation on gitHub

So on Hawk online QA website, are the used tools and techniques the ones on the published paper?

If they are not could you plz tell which DBpedia version, parsing tool, free text resources, pruning techniques and ranking techniques that are used.

Any clarification would be much appreciated

do we have Q&A dataset for spanish language ?

Hi,

I am trying to build Q$A machine reading comprehension model for spanish language.

but i am not finding dataset similar to Squad . is there any dataset do we have ? please share some link.

Thanks.

Hardcoding of paths in deploy scripts

Is it possible to change to Environment variables instead of fixed hardcoded absolute path? (z.B /home/datascience/.....). WE have to manually change path to run scripts in case code is not created in that path.

broken build and maven dependency

Check for all modules whether the builds run through and the latest releases are working.

For example

  • qa.commons 0.4.6 build broken and latest release has no source files
  • qa.systems not building

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.