Comments (1)
这个是微博自身搜索机制的问题,很难事前应对。可以增加一个事后补救模块,当检测到搜索进度里的较大跳跃,将该区间拆分成小区间再查一遍。
from weibo-crawler.
Related Issues (20)
- 增加脚本方便合并数据
- 整理搜索进度记录 HOT 1
- 创建待爬MID表
- 增加对搜索页面的初步记录
- base62引用
- README改进
- 自动解析一组Cookies
- 增加对无基础使用者的完整教程 HOT 1
- Program error HOT 1
- 邮件提示报错
- 任务完成后邮件提醒未触发 HOT 1
- 报错后退出信息不正确
- 任务完成后自动停止周期报告
- 自动更新 cookies
- 输出形式希望能改变 HOT 2
- 请问丁老师如果已有一列微博ID,如果利用get_content.py不通过SQL更简单地input id list得到微博内容 HOT 4
- 抓出了很多无关键词微博
- 待将查漏模块修改为自动迭代 HOT 1
- 增加地址字段
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from weibo-crawler.