Web scraping of house prices in Madrid, using Python and BeautifulSoup
Below are the dependencies needed to run the project:
pandas
BeautifulSoup
fake_headers
After installing dependencies, the script can be run with:
python3 main.py [options]
Positional arguments:
name: website to scrape (pisos, tucasa or habitaclia).
type: type of dwellings to target (houses or flats).
Example:
python3 main.py pisos houses
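As an illustration of how the two positional arguments described above could be validated, here is a minimal sketch using the standard library's argparse. It is not taken from main.py: the function name build_parser and the help strings are assumptions, and only the argument names and their allowed values come from this README.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the two positional arguments (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Scrape house prices in Madrid.",
    )
    # First positional argument: which website to scrape.
    parser.add_argument(
        "name",
        choices=["pisos", "tucasa", "habitaclia"],
        help="website to scrape",
    )
    # Second positional argument: which kind of dwelling to target.
    parser.add_argument(
        "type",
        choices=["houses", "flats"],
        help="type of dwellings to target",
    )
    return parser


# Parse the same arguments as in the example command above.
args = build_parser().parse_args(["pisos", "houses"])
print(f"Scraping {args.type} from {args.name}")
```

Using choices makes argparse reject any value outside the listed ones and exit with a usage message, so a typo in the website name fails before any request is sent.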
The extracted data can be found in the file /dataset/madrid_immo.csv (~30,000 entries in total).
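To check the size of the extracted file, a short sketch like the one below may help. It uses only the standard library's csv module and assumes the file is comma-separated with a single header row, which this README does not state. The helper name count_entries is hypothetical.

```python
import csv
from pathlib import Path


def count_entries(csv_path):
    """Return (column names, number of data rows) for a CSV file.

    Assumes a comma-separated file whose first row is a header.
    """
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])  # empty list if the file is empty
        n_rows = sum(1 for _ in reader)  # count the remaining rows
    return header, n_rows
```

Calling count_entries on the dataset file named above would be expected to report a row count close to the 30,000 entries mentioned, along with whatever column names the scraper wrote.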