Git Product home page Git Product logo

ds30's Introduction

Data Science in 30 Minutes

Check out neural_nets.ipynb for the full talk and example code for building a word2vec Neural Network in Python.

System requirements

  • Ipython notebook, numpy, scipy, pandas, matplotlib, seaborn
  • gensim (a C compiler will allow you to train more quickly, though isn't necessary).

Installation

You can easily install all of the above with Continuum Analytics' conda - if you haven't heard of it yet, we'd highly recommend taking a look!

The easiest way to install all these packages is the following, once you've gotten conda installed:

conda create --name ds30 --file environment.yaml

Pre-trained model to download (optional)

We use the following dataset in a few examples. Warning: It's 1.5GB, so sit back and relax while the download happens!

The Google News Model from the "pre-trained" section on this page.

Run:

To run this demo, you will need to startup an ipython notebook instance:

ipython notebook

Then go to http://localhost:8888 and click on neural_nets.ipynb.

To follow along:

You need visit our youtube channel.

More:

This is meant to just give you a brief guided tour of just a few topics in data science.

If you enjoyed this and want to learn more about doing data science in industry, consider applying to be a fellow at The Data Incubator

If you would like to hire data scientists, introduce data science corporate training, or partner to bring The Data Incubator to your country, reach out here.

ds30's People

Contributors

tianhuil avatar cmoscardi avatar awaemmanuel avatar synbiolucas avatar matt-jay avatar

Watchers

Steven Melendez avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.