Git Product home page Git Product logo

python3-spider's Introduction

Python3 爬虫实战


python3 spider


Branch


简介

包含几十个 python3 爬虫实战案例。如果喜欢请 star 与 fork,这是对我继续更新下去的最大支持

Author Zok
Email [email protected]
博客 https://www.zhangkunzhi.com

QQ讨论群



Python 爬虫实战

字体加密

天眼查 | 大众点评 | 谷雨

验证码【仅作学术讨论】

w3c-滑块 | 腾讯-滑块识别腾讯滑块拖动 selenium

参数生成

拼多多 失效! | 小牛在线 | 开鑫贷 | 时光网 | 百度 | 公众号密码加密 | 移动 | 好莱客 | 青海移动 | 新浪微博 | 汽车之家 | steam | 百度wap端sig生成

自动登录

淘宝 | 5173平台 | 房天下 | Glidesky | 中关村 | 9377平台 | 逗游 | GitHub | 万创帮 | 空中网 | 易通贷 | DNS | TCL金融 | 国鑫所 | 满级网 | 试客联盟 | 人人网 | 豆瓣网 | 天翼

其他实战

文书网app查询接口 | 抖音无水印视频解析 | 企业名片查询百度找回密码美女壁纸下载 | 美女壁纸下载 | 美团 解析与token生成 | bilibili 视频下载 | 51job 查岗位 | 百度 翻译 | 美团 全国区域 | 企业名片查询 | 金逸电影 注册 | Python加密库Demo | 百度街拍图片下载京东商品数据爬取 | 房价获取

原创工具

此工具包在我另外一个项目中,欢迎 star

【推荐】爬虫练习网

一个很不错的爬虫练习题网,内涵十几个爬虫题目,由浅到深涵盖 ip反爬、js反爬、字体反爬、验证码 等题目。安利给大家,博主已撸完。


##淘宝:自动登录

自动登录

  • 打开 auto_login_pyppeteer.py Run 代码,输入淘宝账号、密码即可自动登录

##文书网app

《入门级安卓逆向 - 文书网app爬虫教程》


美女壁纸下载器

美女壁纸下载器

双色球头奖分布词云

双色球头奖 双色球

工具:解码器

可拓展式编码转换器

滑块还原识别

可拓展式编码转换器

可拓展式编码转换器

腾讯滑块缺口识别

缺口识别

QQ 讨论群

python3-spider's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

python3-spider's Issues

腾讯滑块验证问题

if 6000 < cv.contourArea(contour) < 8000 and 370 < cv.arcLength(contour, True) < 390:

这里的数值是如何求得的?

Taobao Login

Hi,
I used this script without problems a few months ago, but now, sometimes, I get this error:

Traceback (most recent call last):
  File "taobaoLogin.py", line 135, in <module>
    loop.run_until_complete(task)
  File "C:\Users\mihai\Anaconda3\lib\asyncio\base_events.py", line 584, in run_until_complete
    return future.result()
  File "taobaoLogin.py", line 100, in main
    await self.page.click('div.login-switch')
  File "C:\Users\mihai\Anaconda3\lib\site-packages\pyppeteer\page.py", line 1507, in click
    await frame.click(selector, options, **kwargs)
  File "C:\Users\mihai\Anaconda3\lib\site-packages\pyppeteer\frame_manager.py", line 569, in click
    raise PageError('No node found for selector: ' + selector)
pyppeteer.errors.PageError: No node found for selector: div.login-switch

问个问题

tb tm 最下方图片懒加载如何解决?尝试了scroll不太理想

make a suggestion

dianping-font.py-parse_ttf函数中的name_list最好直接从函数install_ttf直接传值,以免在解析的时候出现NoneType错误

对于淘宝反爬虫机制研究

我想请问一下,为什么selenium + webdriver已经将window.navigator.webdriver改为underfined了,但是只要代码移动鼠标或者点击图标就会无法登陆,人手工输入就可以登陆?

时间戳部分

str_json['ts'] = time.time() -->str_json['ts'] = int(time.time())*1000
str_json['cts'] = time.time() + 110--> str_json['cts'] = int(time.time())*1000+ 110
是不是这样,时间戳应该乘以1000

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.