Git Product home page Git Product logo

findjob's Introduction

拉钩爬虫出图

Python 2.6

一、关于findjob

  • 爬取拉勾devops以及运维开发的数据
  • 进行可视化显示
  • 目的是为了方便找工作,爬去拉勾数据进行数据可视化展示分析以便找到一个称心如意的工作

二、依赖环境

pip install -r requirments.txt

三、具体部署步骤

  1. 进入项目目录
cd findjob/config/

2.创建数据库lagou,创建表lagou

CREATE DATABASE lagou;

CREATE TABLE `jobinfo` (
  `positionId` varchar(255) DEFAULT NULL,
  `positionName` varchar(255) DEFAULT NULL,
  `city` varchar(255) DEFAULT NULL,
  `createTime` varchar(255) DEFAULT NULL,
  `salary` varchar(255) DEFAULT NULL,
  `companyId` varchar(50) DEFAULT NULL,
  `companyName` varchar(255) DEFAULT NULL,
  `companyFullName` varchar(255) DEFAULT NULL,
  `minsalary` double DEFAULT NULL,
  `munsalary` double DEFAULT NULL,
  `maxsalary` double DEFAULT NULL
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

3.编辑service.conf, 配置数据库地址以及账号密码和flask启动绑定的端口和地址

[common]
    mysql_host = 127.0.0.1
    mysql_port = 3306
    mysql_user = root
    mysql_passwd = 123456
    mysql_db = lagou
    mysql_charset = utf8
    passport_key = 123456
[web]
    bind = 0.0.0.0
    port = 8000

四、如何访问

1.cd lagou
2.scrapy crawl lagou
3.cd lagou/runweb
4.python runweb.py
5.访问http://IP:8000/jobstomoney && 访问http://IP:8000/jobstocity

五、版权

代码内容采用 新 BSD 许可 文档内容采用 署名-禁止演绎 4.0 国际协议许可

六、TODO

  • 集成BOSS直聘等平台数据,进行综合评测
  • 针对不同的岗位可进行筛选操作

七、鸣谢

  • 核心代码贡献者: 国内大型互联网公司首席扛服务器工程师--刘老师
  • 场景的需求才会让这个项目诞生

八、Demo

九、特别提示

  • 本项目仅做沟通交流使用,请勿用作非法用途,谢谢
  • 欢迎大家提pr

findjob's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.