a groovy script which downloads pdf files from slideshare with crazyworks.jp/slideshare_downloder .
$ groovy pdfdl.groovy <URLs list file> <folder>
- URLs list file
-
a file which contains URLs list. Please refer sample files at
sample
folder. - folder
-
a folder which stores downloaded pdf files.
If you are behind Proxy, please set JAVA_OPTS as following,
export JAVA_OPTS="-DproxyHost=proxy.example.com -DproxyPort=8080"
set JAVA_OPTS=-DproxyHost=proxy.example.com -DproxyPort=8080
For example,
$ groovy pdfdl.groovy sample/grailsDiary.txt grailsDiary sample/grailsDiary.txt ========== ========== ========== ========== ========== 1/9: making-the-most-of-your-gradle-build ... done 2/9: java-libraries-you-cant-afford-to-miss-66244982 ... done 3/9: ninja-tricks-for-groovy-system-scripting ... done 4/9: ratpack-the-core-for-your-microservices-for-jdkio-2016 ... done 5/9: mastering-grails-3-plugins-gr8conf-us-2016 ... done 6/9: creating-applications-with-grails-angular-js-and-spring-security-gr8conf-us-2016 ... done 7/9: reactive-streams-and-the-wide-world-of-groovy-64526341 ... done 8/9: an-introduction-to-jooq ... done 9/9: groovy-in-the-cloud-64312473 ... done
Then grailsDiary
folder has 9 pdf files.
If there are any errors like 503,
there is a error URLs list file like grailsDiary.txt.error
.
So please retry,
$ groovy pdfdl.groovy sample/grailsDiary.txt.error grailsDiary
-
Apache Groovy
-
jsoup: Java HTML Parser
-
slideshare downloder by crazyworks.jp.