You need to install WordCloud from here. The rest of the code should run with the Anaconda distribution of Python, using any Python 3.* version. The necessary data files can be downloaded from here, but for convenience they are also included in the repository as zip files in the folders 2018/2019.
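Since the data files ship as zip archives, here is a minimal sketch of reading a CSV straight out of one without unpacking it first. It uses only the standard library; the helper name `read_zipped_csv`, the archive name `demo_survey.zip`, and the member name `survey.csv` are assumptions made for illustration and do not refer to the actual files in this repository. The notebook itself may load the data differently.

```python
import csv
import io
import tempfile
import zipfile
from pathlib import Path


def read_zipped_csv(zip_path, member=None):
    """Read one CSV file from a zip archive into a list of dict rows.

    If `member` is None, the first .csv entry in the archive is used.
    """
    with zipfile.ZipFile(zip_path) as archive:
        if member is None:
            member = next(
                name for name in archive.namelist() if name.lower().endswith(".csv")
            )
        with archive.open(member) as raw:
            # utf-8-sig also strips a byte-order mark if one is present.
            text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
            return list(csv.DictReader(text))


# Demo with a throwaway archive; the file name and rows are made up.
with tempfile.TemporaryDirectory() as tmp:
    demo_zip = Path(tmp) / "demo_survey.zip"
    with zipfile.ZipFile(demo_zip, "w") as archive:
        archive.writestr(
            "survey.csv",
            "Country,LanguageWorkedWith\nGermany,Python;SQL\nIndia,Java\n",
        )
    rows = read_zipped_csv(demo_zip)

print(rows[0]["LanguageWorkedWith"])
```

To try it on the real data, point `read_zipped_csv` at one of the zip files in the repository and, if the archive holds more than one CSV, pass the member name explicitly.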
For this project, I was interested in using Stack Overflow data from 2019 to better understand:
- Which languages show the most change between being worked with this year and being desired next year?
- Are there differences between countries?
and, with the survey data from 2018:
- Is the rise or fall of a language also seen in the survey from a year earlier?
- How good is the value LanguageDesireNextYear in comparison to the real value LanguageWorkedWith from the following year?
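The last question can be sketched roughly as follows: compute, per language, the share of respondents who listed it under LanguageDesireNextYear in one survey and the share who listed it under LanguageWorkedWith in the next, then take the difference. This is only an illustrative sketch in plain Python, not the notebook's actual method. It assumes both columns hold semicolon-separated language lists; the function names and the toy responses are invented for the example and are not survey results.

```python
from collections import Counter


def language_shares(responses):
    """Return the fraction of respondents that mention each language.

    `responses` is an iterable of raw column values, each a
    semicolon-separated string such as "Python;SQL". None or empty
    entries are treated as skipped answers and ignored.
    """
    counts = Counter()
    answered = 0
    for raw in responses:
        if not raw:
            continue
        answered += 1
        # A set avoids counting a language twice for one respondent.
        counts.update({lang.strip() for lang in raw.split(";") if lang.strip()})
    if not answered:
        return {}
    return {lang: n / answered for lang, n in counts.items()}


def desire_vs_reality(desired_prev_year, worked_next_year):
    """Worked-with share minus desired share, per language."""
    desired = language_shares(desired_prev_year)
    worked = language_shares(worked_next_year)
    return {
        lang: worked.get(lang, 0.0) - desired.get(lang, 0.0)
        for lang in sorted(set(desired) | set(worked))
    }


# Toy data, invented for illustration only (not survey results).
desired_2018 = ["Python;Rust", "Python", "Go;Python", None]
worked_2019 = ["Python;SQL", "SQL", "Python;Go", "Python"]

gaps = desire_vs_reality(desired_2018, worked_2019)
for lang, gap in gaps.items():
    print(f"{lang}: {gap:+.2f}")
```

A positive gap means the language was used more in the later year than the earlier "desired" share suggested; a negative gap means the stated desire overshot actual use. Note that the two surveys have different respondents, so this compares population shares, not individual follow-through.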
This project is part of the Data Science Nanodegree at Udacity.
There is one notebook here to answer the above questions. The notebook is well documented, and markdown cells are used to walk through the thought process. Additionally, the resulting notebook is available as HTML.
The main findings can be found in the PDF file that is also available here.
Credit goes to Stack Overflow for the data. You can find the licensing for the data and other descriptive information at the Stack Overflow site here. Otherwise, feel free to use the code here as you would like!