- Log in to the AWS EC2 console.
- Click on "Instances" and select the instance named "ScraperRealty"
- In the top right, click the "Connect" button.
- In the EC2 Instance Connect dialog, click "Connect"
- Ensure the following parameters are filled by default, if not:
- Connection type: Connect using EC2 Instance Connect
- Username: ubuntu
- Open your terminal (commands are similar for Windows).
- Run the following command:
ssh -i "scraperKey.pem" [email protected]
(Make sure the scraperKey.pem is in the same folder, or provide the absolute path.)
- Change directory to the project folder:
cd scraper
- Activate the virtual environment for Python:
source env/bin/activate
- Run the main script:
python main.py
- For code comments in complex sections, refer to comments marked in the code.
- main.py imports all parsers.
- base.py contains classes and constants for parsing.
- scraper_fazwaz.py and scraper_thailandproperty.py are the actual parsers.
- watermark_resolver.py handles watermark removal (a simple version; editing may require modifying the watermark or its mask)
- Ensure Python is installed (preferably version 3.11)
- Create a virtual environment:
python -m venv env
- Activate the virtual environment:
source env/bin/activate
- Install required dependencies:
pip install -r requirements.txt
- Configure AWS:
aws configure
- Run the main script:
python main.py
Note: The application should work with both earlier and future versions of Python, though conflicts may arise in extreme cases.
Feel free to reach out for further assistance or enhancements!