Git Product home page Git Product logo

Comments (4)

WASHEDDEVELOPEUR avatar WASHEDDEVELOPEUR commented on July 17, 2024

yeah, that would be a nice feature

from twitter-scraper.

kennethreitz avatar kennethreitz commented on July 17, 2024

send a PR!

from twitter-scraper.

brachna avatar brachna commented on July 17, 2024

I was tinkering with this script and here's what i found.

For retweets you can add:

try:
    retweetId = tweet.find('.js-stream-tweet')[0].attrs['data-retweet-id']
except:
    retweetId = None

If it's None - it's not a retweet.

You can also check if a tweet is pinned or not:

try:
    pinned = tweet.find('.user-pinned')[0]
    pinned = True
except:
    pinned = False

Also it's better to have a url for a tweet to get a proper tweet id to figure out unique tweet id:

href = tweet.find('.js-permalink')[0].attrs['href']
hrefTweet = href.split( '/' )[-1]

["tweetId"] is not a unique id of a tweet in case of a retweet or a tweet that's part of a tweet chain (reply tweet?).
If retweetId is present then it's a unique id of a tweet.
If not then href holds a unique id of a tweet.

realTweetId = tweetId
if ( retweetId != None ):
    realTweetId = retweetId
else:
    realTweetId = hrefTweet

Using info from before we can determine if a tweet is a reply tweet or not:

tweetChain = False
if ( hrefTweet != tweetId ):
    tweetChain = True

Then add this information to tweet dictionary:

tweets.append( { 'realTweetId': realTweetId, 'tweetId': tweetId, 'retweetId': retweetId,
                 'href': href, 'pinned': pinned, "tweetChain": tweetChain,

Hope that helps!

from twitter-scraper.

bisguzar avatar bisguzar commented on July 17, 2024

Just added this field. Please check #58 and thanks to @brachna for detailed research

from twitter-scraper.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.