This project gathers two data sets: (Diplomatic Documents of Switzerland)[http://dodis.ch/en/home] and Le Temps Historical Archive for year 1914. Our aim is to find occurrences appearing in both data sets and detect the events they have in common.
More details in the project's wiki
$ pip install flask gensim nltk beautifulsoup4
$ python
>>> import nltk
>>> nltk.download()
>>> l
>>> d stopwords