A short example of using Apache Spark for ETL. The example creates multiple aggregations (Sum, Counts, Median, a "phonebook" with call count etc.) from call recordsand stores the result into parquet files
The code is free to use - licensed unders Apache 2.0