Git Product home page Git Product logo

doubanfilm_collector's Introduction

DoubanFilm_Collector

  • 支持版本: Python 3.6

  • 支持数据库: MongoDB

  • 使用:

    • 配置Config.ini
     [DB]
     ;Configure the database information
     ;type: mongodb
     type = mongodb
     host = 127.0.0.1
     port = 27017
     ;Configure the database name to save films info
     DBName = dbFilm
    
     [FilmCollector]
     ;choose the type of films in the following option:
     ;请填写[剧情,喜剧,动作,爱情,科幻,悬疑,惊悚,恐怖,犯罪,同性,音乐,歌舞,传记,历史,战争,西部,奇幻,冒险,灾难,武侠,情色]
     type  = 同性
     ;configure the number to collection.
     ;if the number is greater than the number of the films, the result will return the maxium films.
     number of films = 300
    
     [Proxy]
     ;configure the database name to save proxies
     DBName = proxiesName
     ;configure the collection name to save proxies
     CollectionName = proxy
     ;configure the interval to fetch the proxies
     interval = 5
    
    • 使用付费代理的可以直接修改GetProxy.py, 使用免费代理请参考https://github.com/jhao104/proxy_pool
    • 按顺序启动MongoDB,ProxyApi.py以及ProxyManager.py
    • 最后开始run.py

doubanfilm_collector's People

Contributors

luxiaoyuan666 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.