stefanik12 / multimodal_learning
Preprocess the Flickr dataset using an important-words extractor, a pre-trained convolutional neural network, and pre-trained Word2Vec to obtain the corresponding representations. Project these representations into a common space using different methods. Classify the data into categories (with one neural net and at least two other learning algorithms) and compare the results.
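The "project, then classify and compare" steps can be illustrated with a small, dependency-free sketch. Everything in it is an assumption made for illustration, not the repository's code: the feature vectors are synthetic stand-ins for CNN and Word2Vec outputs, the "common space" is a simple normalize-and-concatenate fusion (one possible choice among many), and the two classifiers shown (nearest centroid and k-nearest neighbours) stand in for the non-neural baselines. All function names (`fuse`, `make_toy_data`, and so on) are hypothetical.

```python
import math
import random


def l2_normalize(vec):
    """Scale a vector to unit length so both modalities share one scale."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def fuse(image_vec, text_vec):
    """Assumed fusion: normalize each modality, then concatenate."""
    return l2_normalize(image_vec) + l2_normalize(text_vec)


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_centroid_fit(X, y):
    """Compute one mean vector per class label."""
    sums, counts = {}, {}
    for vec, label in zip(X, y):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in sums[label]] for label in sums}


def nearest_centroid_predict(centroids, vec):
    """Return the label whose centroid is closest to vec."""
    return min(centroids, key=lambda label: sq_dist(centroids[label], vec))


def knn_predict(X, y, vec, k=3):
    """Majority vote among the k training points closest to vec."""
    nearest = sorted(zip(X, y), key=lambda pair: sq_dist(pair[0], vec))[:k]
    votes = {}
    for _, label in nearest:
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)


def accuracy(predictions, labels):
    """Fraction of predictions that match the true labels."""
    return sum(p == t for p, t in zip(predictions, labels)) / len(labels)


def make_toy_data(n_per_class, rng):
    """Synthetic stand-in for (image features, text features, category)."""
    X, y = [], []
    for label, center in enumerate([1.0, -1.0]):
        for _ in range(n_per_class):
            image_vec = [center + rng.gauss(0, 0.3) for _ in range(4)]
            text_vec = [-center + rng.gauss(0, 0.3) for _ in range(3)]
            X.append(fuse(image_vec, text_vec))
            y.append(label)
    return X, y


rng = random.Random(0)
X_train, y_train = make_toy_data(20, rng)
X_test, y_test = make_toy_data(10, rng)

centroids = nearest_centroid_fit(X_train, y_train)
nc_acc = accuracy([nearest_centroid_predict(centroids, v) for v in X_test], y_test)
knn_acc = accuracy([knn_predict(X_train, y_train, v) for v in X_test], y_test)
print(f"nearest centroid: {nc_acc:.2f}, 3-NN: {knn_acc:.2f}")
```

In a real run, `make_toy_data` would be replaced by features extracted from the Flickr images and their captions, and the neural-net classifier mentioned in the description would be evaluated on the same train/test split so that the accuracies are comparable.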