Binary classification, with every feature a categorical (and interactions!). Second challenge of cat-in-the-dat from kaggle.
NOTES:
- Logistic Regression works much better than Random Forest, or even XGBoost!
cd src
conda activate ml
python create_folds.py
python -W ignore train.py [--model=lr] # [lr|rf|svd|xgb]