Git Product home page Git Product logo

readerisdead's Introduction

readerisdead.com

A collection of tools to help with the impending Google Reader shutdown.

reader_archive

Comprehensive archive of a Google Reader account.

Unlike Google Reader's Takeout support, provides a complete archive of a Reader account's data. This includes:

  • All your read items
  • All your starred items
  • All your tagged items
  • All your shared items
  • All the shared items from the people you were following.
  • All the comments on shared items
  • All your liked items
  • All items you've kept unread, emailed, read on your phone, clicked on or otherwise interacted with.
  • All items that have appeared in one of your subscriptions
  • All items that were recommended to you
  • All items in the (English) "Explore" section
  • All the profiles of the people you were following before the sharepocalypse.
  • All your preferences.

To use it:

bin/reader_archive --output_directory=~/Downloads/reader_archive

A browser window will appear asking you to authorize the app. Once you do, you'll be given a code to paste back into the terminal (you can also use the --use_client_login flag to instead be prompted for your Google Account username and password). See this wiki page for an explanation of the archive format. The intent is to be comprehensive, such that other tools that use the archive data may be created.

The archiving process can take a while, depending on the size of your account and your internet connection. For an account with 300,000 read items, the process took about 10 minutes and generated 1 GB of data.

reader_browser

Browse an archived Google Reader account.

Takes an archive generated by reader_archive and provides a browsing UI for it.

To use it:

bin/reader_browser ~/Downloads/reader_archive

Then you can load http://localhost:8071/ in your web browser to see the contents of the archive.

item_lookup

Look up an item in an archived Google Reader account.

Takes an archive generated by reader_arhive and provides a command-line UI for examining items within it.

To use it:

bin/item_lookup --archive_directory=~/Downloads/reader_archive 0306277b9d275db1

The tool will then list all of the streams that the item appears in, the item body, and any comments made on that item. You can provide multiple item IDs (as additional command line arguments).

feed_archive

Saves public feed data from Google Reader's feed archive.

Google Reader has (for the most part) a copy of all blog posts and other feed items published since its launch in late 2005 (assuming that at least one Reader user subscribed to the feed). This makes it an invaluable resource for sites that disappear, can serve as a backup mechanism and enables tools to be created.

Presumably access to this data is also going away come July 2013, and thus this tool can be used to get one last shot at archiving feeds you might want to refer to later.

The easiest way to use it is get the OPML file with all your Reader subscriptions, and run it like so:

bin/feed_archive \
    --opml_file=~/Downloads/feeds.opml \
    --output_directory=~/Downloads/feed_archive

The destination specified by --output_directory will be populated with one file per feed, named after its URL. The file contains all items that Reader ever saw in that feed, in the Atom format. Google Reader normally omits unknown (namespaced) elements in its API output, but in the script makes an attempt to use high-fidelity mode to reconstruct the original data as much as possible.

If you have specific feeds you'd like to save the archive for, instead of --opml_file you can also pass in feed URLs as command line arguments:

bin/feed_archive \
    --output_directory=~/Downloads/feed_archive \
    http://googlereader.blogspot.com/atom.xml \
    http://persistent.info/atom.xml \
    ...

The tool supports additional arguments for controlling how many items are fetched, see bin/feed_archive --help for more information.

readerisdead's People

Contributors

citizentools avatar dolapo avatar lichenbo avatar mihaip avatar mtstickney avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.