Git Product home page Git Product logo

machine-learning-das's Introduction

CMS Data Analsyis School (DAS) - Machine Learning Short Exercise

Based on CMSDAS 2019 @LPC Machine Learning Exercise, written by Xavier Duarte Editted and expanded by CMSDAS 2019 Pisa Facilitators: Thong Nguyen, Sitong An, Leonardo Giannini.

To refer to the lecture slides: click here.

Introduction

This is a set of tutorials for the CMS Machine Learning Data Analysis School (DAS) Short Exercise. They are intended to show you how to build machine learning models in python (Keras/TensorFlow) and use them in your ROOT-based analyses. We will build event-level classifiers for differentiating VBF Higgs and standard model background 4 muon events and jet-level classifiers for differentiating boosted W boson jets from QCD jets.

Main notebooks in this tutorial

 0.  0-setup-libraries.ipynb: setting up libraries using CMSSW
 1.  1-TMVA-Classification-py.ipynb: Simple BDT Demonstration with 'TMVA' in Python
 2. 2-datasets.ipynb: reading/writing datasets from ROOT files with 'root2hdf5'
 2.1 2.1-datasets-uproot.ipynb: Same with notebook 2, but with uproot
 3.  3-plotting.ipynb: plotting with pyROOT and matplotlib
 4.  4-dense.ipynb: building, training, and evaluating a fully connected (dense) neural network in Keras
 4.1 4.1-dense-pytorch.ipynb: Same with notebook 4, but with PyTorch
 4.2 4.2-dense-bayesian-optimization.ipynb: Optimize a dense network with Bayesian optimization
 5.  5-preprocessing.ipynb: preprocessing CMS open data to build jet-images (optional)
 6.  6-conv2d.ipynb: building, training, and evaluating a 2D convolutional neural network in Keras

Setup

We will be using the Vanderbilt JupyterHub. Point your browser to:

https://jupyter.accre.vanderbilt.edu/

If this is the first time using this JupyterHub, you should see:

Click the "Sign in with Jupyter ACCRE" button. On the following page, select CERN as your identity provider and click the "Log On" button. Then, enter your CERN credentials or use your CERN grid certificate to autheticate. Click "Spawn" to start a "Default ACCRE Image v2" image.

Now you should see the JupyterHub home directory. Click on "New" then "Terminal" in the top right to launch a new terminal.

To download the tutorials, type

git clone https://github.com/CMSDASPisa/machine-learning-das.git

Now, in your directory tab, there should be a new directory called machine-learning-das. All of the tutorials and exercises are in there. Start by clicking on 0-setup-libraries.ipynb and running it.

Important Notice

Please shut off notebooks after you are done running them. Leaving too many notebooks open (and kernels running) may drain your memory and cause notebooks to work improperly.

To check which notebooks are currently running, click on the "running" tab under "jupyter at ACCRE" LOGO. Click on the shutdown button on the right of the list of notebooks to shut them down.

Links

The indico page is: https://indico.cern.ch/event/758324/

The Mattermost for live support is: https://mattermost.web.cern.ch/cmsdaspisa2019/channels/shortexercise-ml

The twiki is: https://twiki.cern.ch/twiki/bin/view/CMS/SWGuideCMSDataAnalysisSchoolPisa2019MLExercise

machine-learning-das's People

Contributors

jpivarski avatar jhakala avatar jmduarte avatar sitongan avatar kpedro88 avatar ubparker avatar thongonary avatar perilousapricot avatar holzman avatar tmitchel avatar gbenelli avatar mtonjes avatar caleb-james-smith avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.