Git Product home page Git Product logo

bldap-python-data-science's Introduction

BLDAP Intro to Python / Data Science Curriculum

Introduction

This repository contains the Jupyter notebooks for the intro to Python / Data Science course for Berkeley Lab Director's Apprenticeship Program (BLDAP).

This course is designed for students with little to no experience in coding to learn skills in Python necessary for data science. Students utilize Jupyter notebooks throughout the course.

The overall goal is for students to learn how to use Python to clean, analyze, and visualize large data sets in order to communicate effectively their conclusions about the data set. Students apply the skills they learned on actual data sets provided by researchers in Berkeley Lab.

This repo contains the following:

  • 00 Intro to Python: Students learn the basics of Python, as well as libraries utilized in data science (numPy, pandas, matplotlib)
  • 01 Cell Phone Design Challenge: Students work with a dataset compiled from the Materials Project database of calculated materials properties to design a cell phone that fulfills criteria such as low cost, lightweight, and durability
  • 02 Energy Consumption Data Challenge: Students work with energy consumption data from a pilot site that was used to test a software platform, “Solar+ Optimizer” (SPO), which aims to reduce the overall operational cost for buildings by taking into account time-varying costs of energy and status of the grid connection
  • 03 Genomics Data Challenge: Students work with RNA sequencing data of the algae Chromochloris zofingiensis, which is of interest due to its potential as a biofuel sources

Installing Anaconda

The easiest way to run Python and Jupyter notebooks locally on the computer is through Anaconda. Anaconda is an open-source (free!) Python distribution platform.

Click here to download Anaconda Distribution (Python 3.9, 64-Bit Graphical Installer). Make sure to download the correct version for the computer's operating system (Windows, Mac, or Linux).

Running Jupyter Notebooks

First, download the coding bootcamp materials from this repository:

  1. Click the green "Code" button on the top right of the repository information page.
  2. Click "Download Zip". Extract the file to a folder where it can be easily accessed (such as the Desktop).

Then, use the Anaconda Navigator application to open the notebooks:

  1. Open the Anaconda Navigator application (note that it might take a few minutes to load up when the application is launched for the first time).
  2. Click the 'Launch' button under 'Jupyter Notebook' (NOT JupyterLab). A new tab in the internet browser should open with a file system. Navigate through the file system to the BLDAP-Python-Data-Science folder that was just downloaded.
  3. Go to any folder and click on a notebook file (ends in .ipynb). A new tab should open with the notebook.

Alternative to Anaconda

An alternative option is to run the Jupyter notebooks in the cloud through binder. Please keep in mind that work is not saved in binder, so it is recommended to run the notebooks locally on the computer instead.

Binder

Please note that binder takes a while to load when launching it for the first time.

Feedback and Suggestions

We welcome any feedback and suggestions for our curriculum resources so we can continuously improve them. Any questions about the curriculum resources can also be asked and a K-12 team member will email back.

Please also use the feedback form to request the answer keys for the Jupyter notebooks. In an effort to keep this curriculum as open source, please do not post the answer keys on any public websites.

Fill out the feedback form

Acknowledgements

The Berkeley Lab K-12 team is truly grateful to the following who contributed to the development of these notebooks:

Intro to Python:

UC Berkeley's Data Science Modules Program, Division of Computing, Data Science, and Society

Kseniya Usovich, Karla Palos, Eric Van Dusen, Rachel McCarty (UC Berkeley); Baishakhi Bose, Samanvitha Murthy (Berkeley Lab); Laurel Hales (Stanford)

Cell Phone Design Challenge:

Alex Ganose (Imperial College London); Ryan Kingsbury, Jianli Cheng (Berkeley Lab)

Energy Consumption Data Challenge:

Anand Krishnan Prakash, Kun Zhang, Lazlo Paul and Dave Blum, Marco Pritoni and Rich Brown (Berkeley Lab); Marc Marshall, James Zoellick, Peter Alstone (Schatz Energy Research Center, Humboldt State University); Rachel McCarty, Kseniya Usovich (UC Berkeley); Laurel Hales (Stanford)

Genomics Data Challenge:

Sharon Greenblum (Berkeley Lab); Ciara Acosta, Kseniya Usovich (UC Berkeley)

Repo format inspired from UC Berkeley's D-Lab

bldap-python-data-science's People

Contributors

lbnlnext avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.