reimagined-wildbow-scraper's Introduction

reimagined-wildbow-scraper

Now deprecated in favor of rust-wildbow-scraper. Go use that instead!

Scrapes wildbow's Twig, Pact and Worm into epubs with some extra ease-of-use features. Run node app.js with -t, -w, -g, -a or -p to scrape Twig, Worm, Glow-worm, Ward or Pact respectively. Run it with --help for more info. New: now scrapes Ward (Worm 2)!

Installation

All you'll need is npm and node. Run:

git clone https://github.com/nicohman/reimagined-wildbow-scraper.git

cd reimagined-wildbow-scraper

npm install

node app.js [options]

You might want to add an alias such as alias riws='node ~/reimagined-wildbow-scraper/app.js' to make it easier to use.

Usage

Run node app.js --help for a list of options:

    -V, --version         output the version number
    -r , --resume <json>  Resume from JSON file
    -a, --ward            Scrape Ward
    -w, --worm            Scrape Worm
    -t, --twig            Scrape Twig
    -p, --pact            Scrape Pact
    -g, --glow_worm       Scrape Worm 2 Prologue
    -h, --help            output usage information

It'll output an epub and a JSON file in your current directory. If you want to, you can save the JSON file and use it to resume later, which saves you from re-downloading the chapters you already have of whatever book you're following along with.

reimagined-wildbow-scraper's People

Contributors

hollowaytape, nicohman, wassname

reimagined-wildbow-scraper's Issues

Errors when inputting any command relating to scraping

For example, say I input "node app.js -a" to download Ward. This is what happens:

"C:\Users\rober\reimagined-wildbow-scraper>node app.js -a
Command Prompt - node app.js -a[6668]: src\node_contextify.cc:628: Assertion `args[1]->IsString()' failed.
1: node::DecodeWrite
2: node::DecodeWrite
3: uv_loop_fork
4: v8::internal::interpreter::BytecodeDecoder::Decode
5: v8::internal::RegExpImpl::Exec
6: v8::internal::RegExpImpl::Exec
7: v8::internal::RegExpImpl::Exec
8: 000003E4B2284281
"

But if I input "node app.js -h" to check the help menu, it comes out fine:

"C:\Users\rober\reimagined-wildbow-scraper>node app.js -h

Usage: app [options]

Options:

-V, --version         output the version number
-r , --resume <json>  Resume from JSON file
-a, --ward            Scrape Ward
-w, --worm            Scrape Worm
-t, --twig            Scrape Twig
-p, --pact            Scrape Pact
-g, --glow_worm       Scrape Worm 2 Prologue
-h, --help            output usage information"

I have double and triple re-cloned the source, and made sure I had all npm dependencies the program seemed to need. I've tried both GitBash and Windows Command prompt (all run as administrator). But I have no idea in heck what is going on. Do you have any input?

Getting "URLCon is not a constructor" error

$ node app.js -t
X:\...\reimagined-wildbow-scraper-master\app.js:57
        var nexturl = new URLCon(next);
                      ^

TypeError: URLCon is not a constructor
    at IncomingMessage.<anonymous> (X:\...\reimagined-wildbow-scraper-master\app.js:57:23)
    at emitNone (events.js:91:20)
    at IncomingMessage.emit (events.js:185:7)
    at endReadableNT (_stream_readable.js:974:12)
    at _combinedTickCallback (internal/process/next_tick.js:80:11)
    at process._tickCallback (internal/process/next_tick.js:104:9)

I fixed it by replacing const URLCon = require('url').URL; with const URLCon = require('url'); and then replacing all instances of var nexturl = new URLCon(next); with var nexturl = URLCon.parse(next);.
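
The workaround above can be generalized so the same code runs on Node versions with and without the WHATWG URL class. The following is a minimal sketch, not the project's actual code: parseUrl is a hypothetical helper name and the example.com address is a placeholder. It assumes the error arises because require('url').URL is undefined on the installed Node version.

```javascript
const url = require('url');

// Hypothetical helper (not part of app.js): use the WHATWG URL class when
// this Node version provides it, otherwise fall back to the legacy url.parse.
function parseUrl(href) {
  if (typeof url.URL === 'function') {
    return new url.URL(href);
  }
  return url.parse(href);
}

// Placeholder address, used only for illustration.
const parsed = parseUrl('https://example.com/some/chapter/');
console.log(parsed.hostname, parsed.pathname);
```

Both branches return an object with hostname, pathname and href properties, so code that only reads those fields should not need further changes.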

Cannot find module 'follow-redirects'

This is my first time here, so please correct me if I'm doing something wrong. Thanks for revising the Wildbow scraper. However, I can't get any of the books to scrape; I get the following error instead. I'm running it from the directory where app.js is located.

module.js:529
    throw err;
    ^

Error: Cannot find module 'follow-redirects'
    at Function.Module._resolveFilename (module.js:527:15)
    at Function.Module._load (module.js:476:23)
    at Module.require (module.js:568:17)
    at require (internal/module.js:11:18)
    at Object.<anonymous> (/Users/mow/Downloads/reimagined-wildbow-scraper-master/reimagined-wildbow-scraper-master/app.js:2:13)
    at Module._compile (module.js:624:30)
    at Object.Module._extensions..js (module.js:635:10)
    at Module.load (module.js:545:32)
    at tryModuleLoad (module.js:508:12)
    at Function.Module._load (module.js:500:3)

(Also, Worm 2 is now known as Ward, perhaps -r?)
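
An error like this usually indicates that the dependencies were never installed, for example because npm install was not run in the directory containing app.js. As a rough diagnostic sketch (isInstalled is a hypothetical helper, not part of the scraper), require.resolve can confirm whether Node can find a module before app.js is started. It assumes the script is saved next to app.js, since modules are resolved relative to the script's own location.

```javascript
// Hypothetical helper: returns true if Node can resolve the named module
// from this script's location, false if the module cannot be found.
function isInstalled(name) {
  try {
    require.resolve(name);
    return true;
  } catch (err) {
    // MODULE_NOT_FOUND is the error code behind "Cannot find module '...'".
    if (err.code === 'MODULE_NOT_FOUND') {
      return false;
    }
    throw err;
  }
}

// follow-redirects is the module named in the error above.
if (!isInstalled('follow-redirects')) {
  console.log('follow-redirects is missing; try running npm install next to app.js');
}
```

If the check reports the module as missing, running npm install in that directory, as described in the Installation section, is the likely fix.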
