Python-based data preprocessing pipeline that handles missing values, scales features, and encodes categorical variables. Optimize for efficiency.
This repository contains a Python script for preprocessing data using scikit-learn pipelines.
Make sure you have the following dependencies installed:
- pandas
- scikit-learn
You can install them using:
pip install -r requirements.txt