Git Product home page Git Product logo

python-script's Introduction

My Python Script

  1. auc_pr_roc
    Python scikit-learn计算PR ROC曲线AUC值。
  2. excel_combine
    Python实现 - Excel多文件一键自动合并。
  3. geetest_offline
    Python破解GeeTest滑块验证码offline V5.10.10,以国家企业信用信息公示系统网站为例。
  4. geetest_offline_gd
    Python爬虫 - 国家企业信用信息公示系统(广东) 企业详细信息。
  5. geetest_online
    Python破解GeeTest滑块验证码online,以国家企业信用信息公示系统网站为例。
  6. gitstats
    Python实现 - Git commit log统计分析。
  7. gsxt_mobile
    Python爬虫 - 国家企业信用信息公示系统 App,通过 App HTTP API 查询企业信息。
  8. lagou
    Python爬虫 - Selenium 拉勾网 数据采集。
  9. level
    Python leveldb Utils 常用方法封装。
  10. nacao_v1
    Python爬虫 - 全国组织结构代码管理中心V1.0。
  11. nacao_v2
    Python爬虫 - 全国组织结构代码管理中心V2.0。
  12. MonkeyRunner
    MonkeyRunner is DEAD!
  13. PageSpeed Insights
    前端DevOps之PageSpeed Insights - 使用 Google Cloud Scheduler, Pub/Sub, Functions , Storage 等云服务,搭建 PageSpeed Insights 前端网站网页的质量和性能 benchmark 定时审查系统。与 CI/CD 流程结合,定时大批量审查网站技术性能指标。

License

   Copyright 2017 ChenQi

   Licensed under the Apache License, Version 2.0 (the "License");
   you may not use this file except in compliance with the License.
   You may obtain a copy of the License at

       http://www.apache.org/licenses/LICENSE-2.0

   Unless required by applicable law or agreed to in writing, software
   distributed under the License is distributed on an "AS IS" BASIS,
   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
   See the License for the specific language governing permissions and
   limitations under the License.

python-script's People

Contributors

9468305 avatar dependabot[bot] avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

python-script's Issues

python2.7 请指教!!!

因为手工收集资料太慢了,所以希望用python2.7能做一些事情。
我要在特定网站( http://**.gsxt.gov.cn ) 指定搜集"特定关键词"。然后获取搜索结果的某些资料保存到Excle
表格或txt文本。

我要的资料很简单(企业名称,法人,年报中的邮箱,电话,住址。)

因为我是python2.7的新手,所以请教高手,要学习哪些python2.7库?或知识?

------------------------------------网站分析F12浏览器记录----------------------------
http://gd.gsxt.gov.cn

search_form searchForm post /corp-query-search-1.html
search keyword searchword

查询按钮 btn_query

http://gd.gsxt.gov.cn/js/subpubsys_homepage.js ( 这个JS 控制输入? )
http://gd.gsxt.gov.cn/js/geetest/gt.js ( 这个JS 验证? )

省份 province 440000/450000,460000,470000.....

地区选择 ads-sci-list area selected
data-value="0" /默认0是全部 data-value="440100" , 后面440100会增长 440200,440300...

状态选择 ads-sci-title cStatus selected data-value="0" /默认0是全部, 增长1,2,3....

搜索结果列表页面 search_list_item db
(每页10个结果)
( 点击链接,进入详细页 href= )

公司名字 f20

法人 div-user2 fa fa-user2 ml20 g3

odd even

搜索结果页数选择 pagination

首页 href="javascript:turnOverPage(0)"
当前页 current
href="javascript:turnOverPage(*)" /*号,1,2,3....

年报 annual_menu_table
查看 jumpToAnnualDetail(this) clickToDetail
地址 addrForColor1 addr
电话 telForColor1 tel
邮件 emailForColor1 email
-------------------------------------------网站分析结束------------------------------------

nacao_v2方案失效

新版官网加了验证码,官方app也抓不了包,应该是在客户端写死了认证的证书,中间人抓包都看不了。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.