Git Product home page Git Product logo

multimodal-cnns's Introduction

Multimodal-CNNs

Code of Text Classification with Topic-based Word Embedding and Convolutional Neural Networks (ACM BCB 2016).

In this paper, we propose a novel neural language model, Topic-based Skip-gram, to learn topic-based word embeddings for text classification with CNNs.

Please cite our paper if it helps your research:

@inproceedings{xu2016text,
  title={Text Classification with Topic-based Word Embedding and Convolutional Neural Networks.},
  author={Xu, Haotian and Dong, Ming and Zhu, Dongxiao and Kotov, Alexander and Carcone, April Idalski and Naar-King, Sylvie},
  booktitle={The 7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB)},
  pages={88--97},
  year={2016}
}

Requirements

  • Python==2.7
  • Theano==0.7
  • Lasagne==0.1
  • Pre-trained word2vec vectors (CNN-gn in the paper) and Skip-gram model are available here.
  • Latent Dirichlet Allocation model can be downloaded here.

Preprocessing

To process raw textual data,

python process_data_channel.py /path/to/wordvec1 /path/to/wordvec2 LengthofWordvec1 LengthofWordvec2 classnum /path/to/data_class0 /path/to/data_class1  

OR

python process_data_multi.py /path/to/wordvec1 /path/to/wordvec2 LengthofWordvec1 LengthofWordvec2 classnum /path/to/data_class0 /path/to/data_class1

Running CNN-channel or CNN-concat models

python conv_channel_w2v.py epochnum batchsize

OR

python conv_multi_w2v.py epochnum batchsize

You can choose epochnum=25 and batchsize=64.

Acknowledgment

The preprocessing code is adapted from Dr. Kim's work.

multimodal-cnns's People

Contributors

shuashuashua avatar haotianmxu avatar

Stargazers

 avatar Suzy avatar  avatar Yaduvendra Singh avatar  avatar Trinadh Gupta avatar  avatar wangyizhen avatar Zli avatar

Watchers

James Cloos avatar  avatar  avatar

multimodal-cnns's Issues

About dataset

hi, i am a Machine learning enthusiasts. Would you like providing some data sets about this program

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.