Comments (7)

Altimis commented on June 16, 2024

Hi @DoctorDream, thank you for your feedback. In fact, I'm not sure that twint works these days. At least it didn't work for me, and that's why I worked on Scweet. What I am sure about is that API-based scrapers no longer work, because the API was changed to version 2. Did Scweet meet your requirements? What should I add to improve it?

from scweet.

DoctorDream commented on June 16, 2024

@Altimis
Thank you very much for your reply. Your program basically met my needs, but I also ran into a few problems while using it.
I use a Twitter crawler to collect conversations for academic research, but the timeline-based structure of Twitter has caused me some difficulties.
When I crawl tweets, two consecutive tweets may be replies to different tweets, which makes it impossible for me to assemble them into a dialogue.
Do you have a way to crawl tweets starting from the main tweet, just like browsing on the web?
Thank you very much for your enthusiasm!
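
To illustrate the grouping problem described above, here is a minimal sketch. It assumes each scraped tweet is a dict with hypothetical `id`, `in_reply_to`, and `text` fields; these names are only for illustration and are not Scweet's actual output format. Each tweet is followed up its reply chain to the main tweet, so consecutive timeline entries that answer different tweets end up in different dialogues.

```python
from collections import defaultdict


def build_dialogues(tweets):
    """Group tweets into dialogues keyed by the id of their main (root) tweet.

    Each tweet is a dict with hypothetical keys: "id", "in_reply_to", "text".
    """
    by_id = {t["id"]: t for t in tweets}

    def root_of(tweet):
        # Walk up the reply chain until the parent is unknown or missing.
        seen = set()
        while tweet["in_reply_to"] in by_id and tweet["id"] not in seen:
            seen.add(tweet["id"])  # guard against accidental cycles
            tweet = by_id[tweet["in_reply_to"]]
        return tweet["id"]

    dialogues = defaultdict(list)
    for tweet in tweets:
        dialogues[root_of(tweet)].append(tweet["text"])
    return dict(dialogues)


# Timeline order interleaves replies to two different main tweets.
timeline = [
    {"id": 1, "in_reply_to": None, "text": "main tweet A"},
    {"id": 2, "in_reply_to": None, "text": "main tweet B"},
    {"id": 3, "in_reply_to": 1, "text": "reply to A"},
    {"id": 4, "in_reply_to": 2, "text": "reply to B"},
    {"id": 5, "in_reply_to": 3, "text": "reply to the reply on A"},
]
dialogues = build_dialogues(timeline)
```

This only works if the scraper can record which tweet each reply answers, which is the information the timeline view does not give directly.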

Altimis commented on June 16, 2024

@DoctorDream If I understood correctly, you want to scrape the replies to every tweet, is that right? For example, for this tweet:
[image]
You want to click on the comments and gather all the replies (1.7k replies). If so, it may be a real challenge for Scweet. First, you may be required to sign in to view the replies to a given tweet. Second, the process may take a long time, since the script needs to open the replies (click) and scroll to scrape all of them.

DoctorDream commented on June 16, 2024

@Altimis
Yes, that's what I mean.
For a given tweet, I don't have to collect all the responses. I just need the most-liked ones, because those replies tend to get attention from more people.
I expect to spend weeks collecting data, so the time it takes won't have a big impact on me.
So, would it be feasible for you to implement this feature?
Thank you very much!
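
As a rough sketch of the selection described above, assuming each scraped reply is a dict with a hypothetical `likes` count and `text` field (illustrative names, not Scweet's actual output), the most-liked replies could be kept like this:

```python
def top_replies(replies, min_likes=0, limit=10):
    """Return up to `limit` replies with at least `min_likes`, most liked first.

    Each reply is a dict with hypothetical keys: "text" and "likes".
    """
    popular = [r for r in replies if r["likes"] >= min_likes]
    popular.sort(key=lambda r: r["likes"], reverse=True)
    return popular[:limit]


replies = [
    {"text": "first reply", "likes": 3},
    {"text": "popular reply", "likes": 250},
    {"text": "ignored reply", "likes": 0},
    {"text": "another popular reply", "likes": 90},
]
best = top_replies(replies, min_likes=50, limit=2)
best_texts = [r["text"] for r in best]
```

The `min_likes` threshold and `limit` are arbitrary here and would need tuning for the research data.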

Altimis commented on June 16, 2024

@DoctorDream I think it is possible. I'll work on that.

Altimis commented on June 16, 2024

@DoctorDream I have a question for you. Do you already have the tweet_id of each tweet whose replies you want to scrape, or do you want to crawl all tweets and get their replies?

DoctorDream commented on June 16, 2024

@Altimis
Thank you very much!
Actually, I just need to crawl tweets together with their replies to form dialogues, so I don't need to crawl tweets with a specific tweet_id.
