Comments (12)

PikioopSo commented on July 20, 2024

@TyJK

What's your take on extremist/radical sites? Are those counted too?

Kip/PiReel

from echoburst.

TyJK commented on July 20, 2024

@PiReel

I think that encompassing the full range of stances on each side is important. That said, it's also important to get primarily the middle 90% of perspectives, and only have a few samples from the fringes.

We want the corpus to reflect all views under the broad umbrella of 'for' and 'against', and the extremists are likely to have some of the most distinctive speaking patterns which will be beneficial. But they also tend to be the most vitriolic, and so we definitely want to keep those in the minority. They'll be the easiest to find, so really they'll most likely have to be avoided once a few decently sized examples are found.

PikioopSo commented on July 20, 2024

Thanks for the info, @TyJK.

I also wanted to let you know about an add-on for Firefox called Pocket. It lets you share bookmarked pages with a group of collaborators and assign labels to the bookmarked pages.

The things I like about it are:
Good-looking interface.
Shareable web pages make it easier to explain the concepts of complicated subjects.
Connectable to third-party accounts.

I think it would work really well for a project like this, where we need to organize a bunch of links and share them.

TyJK commented on July 20, 2024

@PiReel
I wasn't aware of that feature. That sounds like exactly what we need, thank you! I'll play around with it to get an idea of how we could use it in a systematic way and then maybe update it so that everyone who's contributing is on the same page.

TyJK commented on July 20, 2024

W.r.t. Pocket, unfortunately there doesn't seem to be any easy bulk sharing feature. For now I'm going to leave the recommendation to just share with a text file, but I'll keep tinkering with it to see how it can be used.

PikioopSo commented on July 20, 2024

@TyJK, sorry for the late response; my email is packed with mozsprint stuff. I was wondering if you were trying to find other people to share Pocket stuff with. I guess you were.

I am going to try to do a search for you on Pocket. What should I search for?

PikioopSo commented on July 20, 2024

I believe you can use tags, so that people contributing can do tag searches for an echoburst tag or something similar.

TyJK commented on July 20, 2024

@PiReel Tyler JK is what I have the name set up as. Hopefully that's unique enough, but if not let me know. I'll see if I can set that up, but I've come across a second advantage to a .txt file, which is that I can read it into a scraper I write. Is there a way to download Pocket links into something like that? Thanks :)
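
For anyone wondering what "read it into a scraper" might look like, here is a minimal sketch in Python. It assumes the shared .txt file holds one URL per line; the `read_links` name and the `#` comment convention are made up for illustration and are not part of the project.

```python
from pathlib import Path


def read_links(path):
    """Return the unique http(s) URLs found in a text file, in file order."""
    seen = set()
    links = []
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        url = raw.strip()
        # Skip blank lines and '#' comment lines (the comment convention is an assumption).
        if not url or url.startswith("#"):
            continue
        # Keep only lines that look like web links, and drop repeats.
        if url.startswith(("http://", "https://")) and url not in seen:
            seen.add(url)
            links.append(url)
    return links
```

A scraper could then loop over `read_links("links.txt")`, where `links.txt` is a placeholder file name.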

commented on July 20, 2024

I'm writing an extension that turns the browser's file system into an "Adobe Bridge"-type application that works with Pi Reel.

For your case, though, we would have to write your scraper as a browser extension that works with Pocket.

That would be a nice piece of bookmarking software to have, but it is more elaborate.

TyJK commented on July 20, 2024

That seems like one of those things that might be an interesting project on its own, but not necessarily something we can get up and running right now. I'm already going through the various features you could incorporate with this. From what I can tell, you need 3 things to efficiently scrape a site: the domain, the sub domains it should follow (I've found that with most blogs, the archive has a longer URL that's consistent for all blog posts and not shared by non-post content), and the webpage elements you want to scrape. Once you have that for each site, you should be able to put it into a dictionary or list of lists and then run everything. Still doing research, but hopefully I'll have something more detailed by tonight.
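
As a rough illustration of the dictionary idea, here is a sketch in Python, not the actual scraper. The domains, the path prefixes, the CSS-style selectors, and the `should_follow` / `elements_for` helpers are all hypothetical, and the "URLs to follow" are modelled as a shared path prefix, which is one reading of the archive observation above.

```python
from urllib.parse import urlparse

# Hypothetical per-site entries: the domain, the URL path prefix that post
# pages share, and the page elements (written here as CSS selectors) to extract.
SITES = {
    "blog.example.org": {
        "follow_prefix": "/archive/",
        "elements": ["h1.post-title", "div.post-body"],
    },
    "example.com": {
        "follow_prefix": "/blog/posts/",
        "elements": ["article"],
    },
}


def should_follow(url, sites=SITES):
    """Return True if url is on a configured domain and under its post prefix."""
    parts = urlparse(url)
    site = sites.get(parts.netloc.lower())
    if site is None:
        return False
    return parts.path.startswith(site["follow_prefix"])


def elements_for(url, sites=SITES):
    """Return the selectors to scrape for url, or an empty list if it should be skipped."""
    if not should_follow(url, sites):
        return []
    return sites[urlparse(url).netloc.lower()]["elements"]
```

The sketch only decides which pages to visit and which elements to look for; fetching and parsing the HTML would still need a separate library.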

TyJK commented on July 20, 2024

I'm taking on Climate Change, both sides, atm. Will upload within the next few hours and then people can add to that if they wish.

TyJK commented on July 20, 2024

Climate Change is up, will be working on Drug Policy - Criminalization/Decriminalization next.
