Code repository for post, Big Data Analytics with Java and Python, using Cloud Dataproc, Google’s Fully-Managed Spark and Hadoop Service.
To run InternationalLoansAppDataproc.java
use the following arguments, locally:
"data"
"ibrd-statement-of-loans-latest-available-snapshot.csv"
"ibrd-small-spark"
.master("yarn") must be changes to .master("local[*]")
To run InternationalLoansAppDataproc.java
on Dataproc:
"gs://dataproc-demo-bucket"
"ibrd-statement-of-loans-historical-data.csv"
"ibrd-large-spark"