Git Product home page Git Product logo

twitter-sent-dnn's People

Contributors

xiaohan2012 avatar yaolubrain avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

twitter-sent-dnn's Issues

Probabilities Greater Than 1

Hi,
Running this on some testing data that I've hand-classified, I'm getting results as large as 9.0, although most inputs seems to behave well. I'll try to find out why, perhaps a strange Unicode issue.

License

Hi guys. Thank you for this great tool. I wonder if you could put up an open source license so I can use it in my project? Thanks!

Running Sentences One at A Time Gives A Different Result than Batching Them

Running a list of tweets in self.data through sentiment_score one at a time gives different results than batching that same data in 25 or 100 at a time through sentiment_scores_of_sents.

It's not just floating point issues either. I ran a list of 9068 tweets, and I found that the largest difference was 0.9579128974724299, and that 161 tweets in total had a different score in batch than in the single run that were greater than .5!

Code for running them one at a time:

    out = []
    for tweet in self.data:
      out.append(sentiment_score(tweet))

Code for running the data in batches:

    out = []
    for batch in self.batch_data:
      out.extend(sentiment_scores_of_sents(batch))

Code for batching the data:

    temp_list = []
    for x in cls.data:
      if count >= 25:
        cls.batch_data.append(temp_list)
        temp_list = []
        count = 0
        
      temp_list.append(x)
      count += 1
      
    if len(temp_list) > 0:
      cls.batch_data.append(temp_list)

Can more details be provided?

This project is pretty good. However, I haven't found its detail description about it, such as what the training and test dataset is, and how about its performance. Could anyone provide more details?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.