This project releases a pre-trained Korean Word2Vec model to the public. The goal is simply to provide a free testbed for researchers and students working on Korean NLP. It is not finished yet. Once it is complete, feel free to use it!
- Python
- KoNLPy (for Korean tokenization)
- Gensim (for Word2Vec)
- Jupyter Notebook
If you want to train Korean Word2Vec yourself, download the datasets used here from these links. They are not included in this repository. Converting the files into .txt format should not be difficult.
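
As a rough illustration of how the tools listed above fit together, here is a minimal sketch of training on a converted .txt file. It is not this project's actual training script. It assumes one sentence per line in a UTF-8 file, uses KoNLPy's `Okt` tagger as an example tokenizer (the project may use a different one), and assumes Gensim 4.x. The function names, file names, and hyperparameters are all hypothetical.

```python
from typing import Callable, Iterator, List


def iter_sentences(txt_path: str,
                   tokenize: Callable[[str], List[str]]) -> Iterator[List[str]]:
    """Yield one token list per non-empty line of a UTF-8 text file."""
    with open(txt_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            tokens = tokenize(line)
            if tokens:
                yield tokens


def train_korean_word2vec(txt_path: str, model_path: str):
    """Tokenize a corpus with KoNLPy and train a Gensim Word2Vec model.

    Hypothetical helper: the tokenizer and hyperparameters are assumptions.
    """
    # Imported lazily so iter_sentences works without these packages.
    from gensim.models import Word2Vec
    from konlpy.tag import Okt  # KoNLPy taggers need a Java runtime

    okt = Okt()
    # Word2Vec makes several passes over the data, so materialize the corpus.
    sentences = list(iter_sentences(txt_path, okt.morphs))
    model = Word2Vec(
        sentences=sentences,
        vector_size=100,  # this parameter is named `size` in Gensim 3.x
        window=5,
        min_count=5,
        workers=4,
    )
    model.save(model_path)
    return model
```

Calling `train_korean_word2vec("corpus.txt", "ko_word2vec.model")` (both paths are placeholders) would write a model that can be reloaded with `Word2Vec.load`. For a corpus too large to hold in memory, a restartable iterable could replace the `list(...)` call.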