Comments (4)
yeah, that would be a nice feature
from twitter-scraper.
send a PR!
from twitter-scraper.
I was tinkering with this script and here's what i found.
For retweets you can add:
try:
retweetId = tweet.find('.js-stream-tweet')[0].attrs['data-retweet-id']
except:
retweetId = None
If it's None - it's not a retweet.
You can also check if a tweet is pinned or not:
try:
pinned = tweet.find('.user-pinned')[0]
pinned = True
except:
pinned = False
Also it's better to have a url for a tweet to get a proper tweet id to figure out unique tweet id:
href = tweet.find('.js-permalink')[0].attrs['href']
hrefTweet = href.split( '/' )[-1]
["tweetId"] is not a unique id of a tweet in case of a retweet or a tweet that's part of a tweet chain (reply tweet?).
If retweetId is present then it's a unique id of a tweet.
If not then href holds a unique id of a tweet.
realTweetId = tweetId
if ( retweetId != None ):
realTweetId = retweetId
else:
realTweetId = hrefTweet
Using info from before we can determine if a tweet is a reply tweet or not:
tweetChain = False
if ( hrefTweet != tweetId ):
tweetChain = True
Then add this information to tweet dictionary:
tweets.append( { 'realTweetId': realTweetId, 'tweetId': tweetId, 'retweetId': retweetId,
'href': href, 'pinned': pinned, "tweetChain": tweetChain,
Hope that helps!
from twitter-scraper.
Just added this field. Please check #58 and thanks to @brachna for detailed research
from twitter-scraper.
Related Issues (20)
- f'Oops! Either "{query}" does not exist or is private.' HOT 3
- UnboundLocalError: local variable 'html' referenced before assignment HOT 4
- DeprecationWarning: invalid escape sequence \g
- Getting historical tweet volume data with specific hashtags
- Not get Profile Info HOT 3
- Feature: Get User Following/Friend (and Followers)
- Documentation: Addressing shadow-bans HOT 3
- Tweet Links and Media Files HOT 2
- Doesn't scrap anything. HOT 8
- Does this work? HOT 10
- get_tweets in your example is not working HOT 1
- Twitter scrapy
- I am getting this issue : UnboundLocalError HOT 3
- json.decoder.JSONDecodeError: in get_trends() HOT 5
- json.decoder.JSONDecodeError: in get_tweets() HOT 7
- UnboundLocalError: local variable 'html' referenced before assignment
- setup.py fails
- pip3 install fails
- Enable running on Apify cloud platform
- does anyone got an 403 error ? HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from twitter-scraper.