In this workshop you will learn how to develop and deploy applications in DSX Local. The workshop has been divided into several stand-alone parts for those who are interested in a certain development tool or a certain deployment task.
This repository contains several lab subfolders. Some labs include notebooks and data, while others have additional instructions that are located in the Lab Instructions folder.
- Knowledge of analytics. These labs do not teach you the basics of analytics or how to implement analytics in R, Python and SPSS. The purpose of this workshop is to provide hands-on experience with analytics tools and deployment functions in DSX.
- To run this workshop you need an instance of DSX Local. Please note that while most code is the same between DSX Local and DSX Cloud, the notebooks included in sample projects will work in DSX Local only
- Download and unzip this this repository. Unzip the repository only, not files in subfolders.
- Rename DSX_Local_Workshop.zip located in DSX_Local_Projects folder of the unzipped repository to a unique name, for example, add your initials. Note: Project names in DSX Local cluster must be unique. When we create a project "from file", the project name is inherited from the file name.
- Log in to DSX Local.
- Select "Create New Project" and select "From File".
- Browse to the .zip file and click Create. .
- Open the project you just created.
- Navigate to Assets view and open TelcoChurn_SparkML Jupyter notebook. This notebook has been implemented for the Python 2.7 runtime. You can verify the runtime by running the first cell in the notebook.
- Follow instructions in the notebook.
- Open the project you just created.
- Navigate to Assets view and open CreditCardDefault_SkLearn notebook. If you want to stay with the telco churn example, you can work through the TelcoChurn_SkLearn notebook.
- Follow instructions in the notebook.
- Open the project you just created.
- Navigate to Assets view and open TelcoChurn_Zeppelin notebook.
- Follow instructions in the notebook.
- Follow instructions in the R_in_DSX.pdf in the Lab Instructions folder of the unzipped repository.
- Follow instructions in the SPSS_Modeler_in_DSX.pdf document in the Lab Instructions folder of the unzipped repository.
- Follow instructions in the DSX_Batch_Scoring.pdf document in the Lab Instructions folder of the unzipped repository.
- Follow instructions in the DSX_PMML_Lab.pdf document in the Lab Instructions folder of the unzipped repository.
Use Case 8: DSX - a platform that supports analytics application lifecycle (Model Evaluation in DSX)
- Follow instructions in the DSX_Evaluation_in_DSX.pdf document in the Lab Instructions folder of the unzipped repository.
- Follow instructions in the DSX_Data_Access.pdf document in the Lab Instructions folder of the unzipped repository.
- Create a project from DSX_Deep_Learning.zip located in the DSX_Local_Projects folder (make sure to rename the project before you create a project from file).
- Work through sample notebooks.
- Follow instructions in the Remote Spark Execution.pdf document in the Lab Instructions folder of the unzipped repository.