answer_extraction
This project trains a model for answer extraction using Microsoft Research WikiQA Corpus^1. The data is converted to nlp4j input format
add nlp4j lexica to train answer-extraction model using edu.emory.mathcs.nlp.bin.QANLPTrain. the dataset and config files are provided in resources.
Microsoft Research WikiQA Corpus
@InProceedings{YangYihMeek:EMNLP2015:WikiQA, author = {Yang, Yi and Yih, Wen-tau and Meek, Christopher}, title = {{WikiQA}: {A} Challenge Dataset for Open-Domain Question Answering}, booktitle = {Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP)}, month = {September}, year = {2015}, address = {Lisbon, Portugal}, publisher = {Association for Computational Linguistics} }