ashim95 / malayalam-newspaper-article-dataset
This project is forked from abhishekvalsan/malayalam-newspaper-article-dataset
The project scrapes articles from a Malayalam newspaper website to create a corpus. A set of queries is created, and the corresponding ground-truth answers are retrieved for each query. The result can serve as a dataset for evaluating future tools such as Malayalam stemmers, stopword removers, lemmatizers, etc.
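
To illustrate how a set of queries with ground-truth answers might be used to check a tool, here is a minimal sketch that scores retrieved articles against the ground truth using precision and recall. The description above does not specify the dataset's file format or an evaluation metric, so the identifiers (`q1`, `d1`), the dictionary layout, and the function names are all hypothetical.

```python
from typing import Dict, List, Set, Tuple


def precision_recall(retrieved: List[str], relevant: Set[str]) -> Tuple[float, float]:
    """Return (precision, recall) for a single query."""
    unique_retrieved = set(retrieved)
    if not unique_retrieved or not relevant:
        return 0.0, 0.0
    hits = len(unique_retrieved & relevant)
    return hits / len(unique_retrieved), hits / len(relevant)


def evaluate(
    results: Dict[str, List[str]],
    ground_truth: Dict[str, Set[str]],
) -> Dict[str, Tuple[float, float]]:
    """Score every query that has a ground-truth answer set."""
    return {
        query_id: precision_recall(results.get(query_id, []), relevant)
        for query_id, relevant in ground_truth.items()
    }


# Hypothetical toy data: query IDs mapped to article IDs.
ground_truth = {"q1": {"d1", "d2"}, "q2": {"d3"}}
# Hypothetical output of a search pipeline that uses the tool under test
# (for example, a stemmer applied before indexing).
results = {"q1": ["d1", "d4"], "q2": ["d3"]}

scores = evaluate(results, ground_truth)
for query_id, (precision, recall) in scores.items():
    print(f"{query_id}: precision={precision:.2f} recall={recall:.2f}")
```

Running the same evaluation with and without a given preprocessing step (stemming, stopword removal, lemmatization) would give a rough indication of whether that step helps retrieval on this corpus.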