Git Product home page Git Product logo

yzw-spider's Introduction

研招网院校库和专业信息爬取

说明

  1. main.py脚本:从研招网院校库爬取院校信息并标注A区或B区和学校是否是985或211院校,文件输出在uinnfo/edudata.json
  2. main2.py脚本:获取某个学校某类专业目录,输出json文件保存至university_majors/
  3. main3.py 获取某个学校的具体专业的考试科目招生人数等一些备注信息,文件输出在college_majors_exam_scope/
  4. 其他

参考资料

A、B区:https://yzc.hsi.com.cn/kyzx/jybzc/202009/20200904/1972918872.html
211工程名单:http://www.moe.gov.cn/srcsite/A22/s7065/200512/t20051223_82762.html
985工程名单:http://www.moe.gov.cn/srcsite/A22/s7065/200612/t20061206_128833.html

整理后的学校目录在:eduList.py

整理资料下载

一般情况下只需要下载点击下载就好了

  1. 全部院校院校库.xlsx

  2. 重点院校(985,211)0812专业详细信息:院校专业0812.xlsx

  3. 重点院校0854(电子信息专硕)

在线脚本

在此之前我也写了一个油猴脚本:前往安装

其他

github搜索'考研'


请慎重对待和使用,程序逻辑可能产生了错误的结果,请您仔细详查院校信息

yzw-spider's People

Contributors

xx025 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.