igxe CS:GO item wear (float) price comparison crawler, built with Python and BeautifulSoup

A self-study project on the web and Python crawlers; my first demo

level: easy

techniques: requests, re, jsonpath, BeautifulSoup

what I learned:

how to use the XHR filter of the Network tab in the browser's developer tools to find the asynchronous requests a page sends (for example, the request that loads the next page of results)

how to use jsonpath to extract data from JSON responses

basic use of re to extract simple pieces of information
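
The three points above can be sketched together. This is a minimal illustration, not the project's actual code: the endpoint URL, the `page_no` parameter, and the JSON field names (`unit_price`, `wear`) are hypothetical placeholders. The standard library's `urllib` and `json`, plus a small recursive helper, stand in for `requests` and `jsonpath` so the snippet runs without extra installs.

```python
import json
import re
from urllib.parse import urlencode

# Hypothetical endpoint: in practice, copy the real URL and query parameters
# from the request listed under Network > XHR in the browser's developer tools.
BASE_URL = "https://example.com/api/products"


def build_page_url(keyword, page_no):
    """Build the URL that would be requested for one page of search results."""
    query = urlencode({"keyword": keyword, "page_no": page_no})
    return f"{BASE_URL}?{query}"


def find_all(node, key):
    """Collect every value stored under `key` at any depth.

    This is roughly what the JSONPath expression `$..key` selects.
    """
    found = []
    if isinstance(node, dict):
        for k, v in node.items():
            if k == key:
                found.append(v)
            found.extend(find_all(v, key))
    elif isinstance(node, list):
        for item in node:
            found.extend(find_all(item, key))
    return found


def parse_number(text):
    """Pull the first decimal number out of a string such as '¥ 123.50'."""
    match = re.search(r"\d+(?:\.\d+)?", text)
    return float(match.group()) if match else None


# A made-up response body, shaped like a typical paginated JSON payload.
SAMPLE_RESPONSE = json.dumps({
    "page": {"page_no": 1, "page_count": 3},
    "items": [
        {"name": "item A", "unit_price": "¥ 123.50", "wear": "0.1523"},
        {"name": "item B", "unit_price": "¥ 98.00", "wear": "0.2871"},
    ],
})

data = json.loads(SAMPLE_RESPONSE)
prices = [parse_number(p) for p in find_all(data, "unit_price")]
wears = [parse_number(w) for w in find_all(data, "wear")]

print(build_page_url("AWP", 2))
print(list(zip(wears, prices)))
```

With the jsonpath package installed, `jsonpath.jsonpath(data, "$..unit_price")` should select the same values as `find_all(data, "unit_price")` does here.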

Shortcoming: igxe has weak anti-crawler protection, with no login validation or slider CAPTCHA, so this is a very simple crawler project

how to use

requirements: requests, BeautifulSoup, jsonpath (install all of the packages with pip install)

details:

Enter a keyword. Entering the full product name is recommended, e.g. AWP | 二西莫夫 (久经沙场), which is AWP | Asiimov (Field-Tested).

Enter the highest price you are willing to pay. The program returns the lowest wear you can buy within that budget.

The program generates a data.text file in the root directory, containing every wear value found for the item and its corresponding price.

In the source code, the choice variable is 0 by default. If you want the results to include StatTrak items, change choice to 1.
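
The steps above can be sketched as follows. This is an illustration under assumptions, not the project's source: the `Listing` tuple, the function names, the sample values, and the tab-separated layout of the output file are all hypothetical. How `choice` is actually applied in the source is not described here, so the name-based StatTrak check is a guess.

```python
from typing import NamedTuple, Optional


class Listing(NamedTuple):
    name: str
    wear: float   # float value of the skin; lower means less worn
    price: float


def filter_by_choice(listings, choice=0):
    """Assumed behaviour: 0 hides StatTrak listings, 1 keeps everything."""
    if choice == 1:
        return list(listings)
    return [item for item in listings if "StatTrak" not in item.name]


def lowest_wear_within(listings, max_price) -> Optional[Listing]:
    """Return the listing with the lowest wear whose price fits the budget."""
    affordable = [item for item in listings if item.price <= max_price]
    return min(affordable, key=lambda item: item.wear, default=None)


def save_listings(listings, path="data.text"):
    """Write one 'wear<TAB>price' line per listing, sorted by wear."""
    with open(path, "w", encoding="utf-8") as f:
        for item in sorted(listings, key=lambda item: item.wear):
            f.write(f"{item.wear:.6f}\t{item.price:.2f}\n")


# Made-up sample data standing in for the crawled results.
listings = [
    Listing("AWP | Asiimov (Field-Tested)", 0.2871, 520.0),
    Listing("AWP | Asiimov (Field-Tested)", 0.1823, 610.0),
    Listing("AWP | Asiimov (Field-Tested)", 0.3402, 480.0),
    Listing("StatTrak™ AWP | Asiimov (Field-Tested)", 0.1950, 590.0),
]

candidates = filter_by_choice(listings, choice=0)
best = lowest_wear_within(candidates, max_price=550.0)
print(best)
```

Calling `save_listings(candidates)` would then write the full wear and price list to data.text in the current directory.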

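Postscript

This project is only my first practice exercise. Depending on how things go, I may add price comparison programs for other skin trading sites (NetEase BUFF is the most requested), but those sites may be harder to crawl, so we will see (。・∀・)ノ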

dazhadazha / crawlers-for-csgo-igxe