This project demonstrates how to analyze a weather dataset using both Python and SQL. The dataset contains various weather attributes such as temperature, humidity, wind speed, and visibility. The project is divided into two parts:
Part 1: Python Analysis
Part 2: SQL Analysis
- Find all records where the weather was exactly clear.
- Find the number of times the wind speed was exactly 4 km/hr.
- Check if there are any NULL values present in the dataset.
- Rename the column "Weather" to "Weather_Condition."
- What is the mean visibility of the dataset?
- Find the number of records where the wind speed is greater than 24 km/hr and visibility is equal to 25 km.
- What is the mean value of each column for each weather condition?
- Find all instances where the weather is clear and the relative humidity is greater than 50, or visibility is above 40.
- Find the number of weather conditions that include snow.
The analysis uses the pandas library to load and manipulate the dataset. Various operations such as filtering, aggregating, and checking for null values are performed to extract meaningful insights from the data.
weather.ipynb
- Find all records where the weather was exactly clear.
- Find the number of times the wind speed was exactly 4 km/hr.
- Check if there are any NULL values present in the dataset.
- Find the mean visibility of the dataset.
The dataset is loaded into a SQLite database, and SQL queries are used to extract insights from the data.
weather.py
weather_data.db
- Python
- pandas
- sqlite3
- Clone the repository.
- Ensure you have the necessary Python libraries installed (pandas, sqlite3).
- Run the Python script to perform the analysis.