Git Product home page Git Product logo

deeptextgeneration's Introduction

DeepTextGeneration

This was submitted as a project, required to be completed as part of the coursework for Machine Learning in UMass (Fall 2016).

Project Members:

  1. Aditya Narasimha Shastry - https://github.com/adityanshastry
  2. Pratik Mehta - https://github.com/pratikmehta14
  3. Shehzaad Dhuliawala - https://github.com/shehzaadzd

In this project, we explored various generative Deep Learning models, and tested their ability to generate discrete text data. We used an LSTM as a baseline measure (which turned out to be a skyline measure!!), and perplexity of the generated text as a metric to compere the models. The dataset used for training the models was the The (20) QA bAbI tasks in the Facebook's bAbI dataset (https://research.fb.com/downloads/babi/). The below models were explored:

  1. Generative Adversairal Networks - https://arxiv.org/pdf/1406.2661.pdf
  2. Variational Auto Encoders - https://arxiv.org/pdf/1312.6114.pdf

We found that though GANs generate sentences of lower perplexity, there were a lot of repititions. In our experiments, out of the 500 sentences that were generated, there were only 3 unique sentences. In case of VAEs, though there was a lot of variation in terms of the number of sentences generated, the perplexity was quite high compared to GANs. We also ran some experiments on some datasets with a higher vocaulary and sentence lengths. We chose the Europarl dataset (http://www.statmt.org/europarl/) for this task. With the same model configurations for the bAbI dataset, we didnt get good results. Unfortunately, we didnt have enough time to tune the models for the larger dataset.

Languages used:

  1. Python

Libraries Used:

  1. Keras - Deep Learning
  2. Gensim word2vec - Generating word embeddings of the text data
  3. Numpy - Matrix operations, and distance measurements

deeptextgeneration's People

Contributors

adityanshastry avatar

Stargazers

 avatar Trinadh Gupta avatar

Watchers

James Cloos avatar  avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.