The given directory consists of two files: The directory structure is as follows:
- association.py All the code for the Association Rules is in this file. To run the file :
python3 association.py
- online_retail_data.xlsx Dataset that we are using.
List of questions :
- What is the total number of sales incurred by the company ?
- What is the total profit earned by the company ?
- Who are the top 20 customers based on the shopping amount that they spent ?
- What are the frequently sold items by quantitiy ?
- What are the frequently sold items by total amount ?
- Analyse number of sales for every month.
- What are the monthly earnings of the company ?
- Region (Country) wise sales.
- What are the number of active customers in each country ?
- Get United Kingdom top ranked customers based on the total amount.
- What are United Kingdom's frequently sold items by quantitiy ?
- What are United Kingdom's frequently sold items by total amount ?
KDD Process involved :
- Data Selection
- Data Pre Processing
- Data Transformation
- Data Mining Algo (Apriori)
- Interpretation of Results