Scrapes the language of each Japanese author from AozoraBunko. Gives a glimpse of the distribution of language / parts of speech used by each authors. Graphs it with matplotlib. Uses 100 authors in the video.
⚡ Features:
Scrapes authors from AozoraBunko(https://www.aozora.gr.jp/), a repository of popular Japanese literature.
Scrapes all of the works from each author desired.
Works with a list of authors or just one author.
Creates individual CSV files for each author and / or one merged CSV.
Each CSV has the frequency of each word within a work, along with its part of speech.