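1) Baidu Index crawler code for scraping the combined "mobile + PC" search index;
2) Keywords can be defined as "short names of listed tourism companies", "tourist destination + one of the six elements of tourism"...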
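
A minimal sketch of how a "tourist destination + six elements" keyword list could be assembled before crawling. The destinations and the exact wording of the six element terms are illustrative assumptions, and `build_keywords` is a hypothetical helper, not part of Baidu_index_Davion.py.

```python
from itertools import product

# Hypothetical example destinations; replace with your own list.
destinations = ["北京", "上海"]

# Assumed wording for the six elements of tourism:
# food, lodging, transport, sightseeing, shopping, entertainment.
six_elements = ["美食", "住宿", "交通", "景点", "购物", "娱乐"]


def build_keywords(places, elements):
    """Return every 'destination + element' combination as one keyword string."""
    return [place + element for place, element in product(places, elements)]


keywords = build_keywords(destinations, six_elements)
print(len(keywords), keywords[:3])
```

Each generated keyword still has to exist in Baidu Index, so a list like this is only a starting point for manual checking.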
1) Place the .idea folder (it must be unzipped first) and the baidu_index folder in the root of the "Anaconda" folder on drive C;
2) Run the main program Baidu_index_Davion.py in Jupyter Notebook;
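1) Log in with a different Baidu account and switch browsers to obtain new Cookies;
2) Press F12, open Network > Headers > Cookies, and click the entry whose name starts with "Index";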
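
Once the Cookie string has been copied from the browser, it usually needs to be turned into request headers. The sketch below shows one way to do that with the standard library only. The cookie names and values are placeholders, and `parse_cookie_header` and `build_headers` are hypothetical helpers, not functions from this repository.

```python
def parse_cookie_header(raw):
    """Split a raw 'name=value; name2=value2' Cookie string into a dict."""
    cookies = {}
    for pair in raw.split(";"):
        # partition keeps any '=' inside the value intact
        name, sep, value = pair.strip().partition("=")
        if sep and name:
            cookies[name] = value
    return cookies


def build_headers(raw_cookie, user_agent="Mozilla/5.0"):
    """Build a headers dict that sends the copied Cookie string unchanged."""
    return {"Cookie": raw_cookie.strip(), "User-Agent": user_agent}


# Placeholder string; paste the value copied from the browser's developer tools.
raw = "name_a=value1; name_b=a=b; "
parsed = parse_cookie_header(raw)
headers = build_headers(raw)
print(parsed)
```

Parsing into a dict is handy for checking that the expected cookie is present, while the unchanged string is what goes into the `Cookie` header.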
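1) Make sure every keyword you enter has corresponding search values in Baidu Index;
2) Each set of Cookies can only crawl a limited amount of data; once the limit is reached, switch to new Cookies or wait about half a day;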
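
Because each set of Cookies has a crawl limit, one option is to keep several Cookie strings and move to the next when the current one is exhausted. The following is a sketch under assumptions: `CookiePool`, `QuotaExceeded`, and `fetch_with_rotation` are hypothetical names, and the `fetch` callable stands in for whatever function actually requests the index data and is assumed to raise `QuotaExceeded` when the limit is hit.

```python
class QuotaExceeded(Exception):
    """Raised by the (assumed) fetch function when a cookie hits its limit."""


class CookiePool:
    """Hold several Cookie strings and hand them out one at a time."""

    def __init__(self, cookies):
        if not cookies:
            raise ValueError("at least one cookie is required")
        self._cookies = list(cookies)
        self._index = 0

    @property
    def current(self):
        return self._cookies[self._index]

    def rotate(self):
        """Move to the next cookie; return False if none are left."""
        if self._index + 1 >= len(self._cookies):
            return False
        self._index += 1
        return True


def fetch_with_rotation(keyword, pool, fetch):
    """Call fetch(keyword, cookie), switching cookies on QuotaExceeded."""
    while True:
        try:
            return fetch(keyword, pool.current)
        except QuotaExceeded:
            if not pool.rotate():
                raise  # every cookie is exhausted; wait before retrying
```

When the final `QuotaExceeded` propagates, the remaining choice is the one described above: obtain new Cookies or wait before crawling again.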
👋 Hi, I’m @DavionWu2018
👀 I’m interested in sustainable tourism, tourism firm management, text mining, and event study.
🌱 I’m currently learning tourism management.
💞️ I’m looking to collaborate on text mining of tourism big data.
📫 How to reach me: [email protected].