Git Product home page Git Product logo

wenku_spider's Introduction

百度文库

爬取百度文库,支持doc,txt,ppt,pdf(word文件里的图片没有下载,用docx库的效果不太好还原度不高,在改进中,目前放出来版本的还原度都比较高)
也可网站上在线使用 http://106.15.231.202:8888
网页上的下载是返回docx文档 并支持豆丁word文档(觉得好用的各位赏个🌟呗)

使用

复制将网址复制进去即可
doc和txt保存为.txt在当前目录下,ppt和pdf保存为图片在img目录下
example效果:
examplae

wenku_spider's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

wenku_spider's Issues

技术

方便留个联系方式吗?

如何使用?

1  jigao  ~/workspace/wenku_spider  
> python 百度文库.py 
 File "百度文库.py", line 111
   print(text,end='')
                 ^
SyntaxError: invalid syntax
1  jigao  ~/workspace/wenku_spider  
> python 百度文库.py https://wenku.baidu.com/view/9f23f4ecf605cc1755270722192e453611665b54.html
 File "百度文库.py", line 111
   print(text,end='')
                 ^
SyntaxError: invalid syntax

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.