Git Product home page Git Product logo

waybackproxy-extra's Introduction

WaybackProxy

WaybackProxy is a retro-friendly HTTP proxy which retrieves pages from the Internet Archive Wayback Machine or OoCities and delivers them in their original form, without toolbars, scripts and other extraneous content that may confuse retro browsers.

1999 Google viewed on Internet Explorer 4.0 on Windows 95

Setup

  1. Edit config.py to your liking
  2. Start waybackproxy.py (Python 3 is required)
  3. Set up your retro browser:
    • If your browser supports proxy auto-configuration, set the auto-configuration URL to http://ip:port/proxy.pac where ip is the IP of the system running WaybackProxy and port is the proxy's port (8888 by default).
    • If proxy auto-configuration is not supported or fails to work, set the browser to use an HTTP proxy at that IP and port instead.
    • Transparent proxying is also supported for advanced users, with no configuration to WaybackProxy itself required.
      • The easiest way to set up a transparent WaybackProxy is to run it on port 80 (this cannot be done on Linux without security implications), set up a fake DNS server - such as dnsmasq -A "/#/ip" where ip is the IP of the system running WaybackProxy - to redirect all requests to the proxy, and point client machines at that DNS server.
  4. Try it out! You can edit most settings that are in config.py by browsing to http://web.archive.org while on the proxy, although you must edit config.py to make them permanent.
  5. Press Ctrl+C to stop the proxy

Known issues and limitations

  • The Wayback Machine itself is not 100% reliable. Known issues include:
    • Pages newer than the specified date (setting a specific YYYYMMDD date instead of a wider YYYYMM or YYYY helps with that);
    • Random broken images;
    • Strange 404 errors caused by bad server responses or incorrect URL capitalization at archival time;
    • Infinite redirect loops;
    • Server errors when it's having a bad day.
  • WaybackProxy will work around some redirection scripts (example: http://example.com/redirect?to=http://...) which are not archived by the Wayback Machine, but the destination URLs are sometimes not archived either.
  • WaybackProxy is not a generic proxy. The POST and CONNECT methods are not implemented.
  • Transparent proxying mode requires HTTP/1.1 and therefore cannot be used with some really old (pre-1996) browsers. Use standard mode with such browsers.

Other links

  • Donate to the Internet Archive, they need your help to keep the Wayback Machine and its petabytes upon petabytes of data available to everyone for free with no ads.
  • Check out 86Box, the emulator I use for testing WaybackProxy on older browsers.

waybackproxy-extra's People

Contributors

richardg867 avatar klanausse avatar

Stargazers

drelius avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.