dinap18 / big_data Goto Github PK
View Code? Open in Web Editor NEWHomework assignments from the course, Big Data. Topics covered include: data warehousing, linear regression, NLP, KMeans, TF-IDF, PCA, decision trees, data cleaning, and recommendation systems - UBCF and IBCF. The assingments were completed with the following tools: R, RStudio, DataGrip, MySQL, and R libraries such as ggplot2, recommenderlab, quanteda, rpart, lubridate, RMySQL, and sqldf.