Git Product home page Git Product logo

tcga's Introduction

#Software Overview

This repository contains instructions for reproduction and extension of Multi-tiered genomic analysis of head and neck cancer ties TP53 mutation to 3p loss by Gross et al. In general code for data-processing and computation is enclosed in standard python modules, while high level analyis was recorded in IPython Notebooks. The analysis for this project was relatively non-linear and has thus been split into a number of notebooks as described in Analysis Notebooks, but results should be able to be replicated by running these notebooks.

As of July 1, 2014 all error bars are off due to a Pandas bug. They now show the difference between the mean and the lower bound as the uncertanty for the upper and lower bound rather than show the true 95% confidence interval... hopefully this will be addressed soon.

##Dependencies

This code uses a number of features in the scientific python stack as well as a small set of standard R libraries. Thus far, this code has only been tested in a Linux enviroment, it may take some modification to run on other operating systems.

I highly recomend installing a scientific Python distribution such as Anaconda or Enthought to handle the majority of the Python dependencies in this project (other than rPy2 and matplotlib_venn). These are both free for academic use.

###Python Dependencies

  • Numpy and Scipy, numeric calculations and statistics in Python
  • matplotlib, plotting in Python
  • Pandas, data-frames for Python, handles the majority of data-structures
  • statsmodels, used for statstics
  • scikit-learn, used for supervised learning
  • rPy2, communication between R and Python
    • NOT IN DISTRIBUTIONS
    • I recommend installing with pip install rpy2
    • Needs R to be compiled with shared libraries
  • matplotlib_venn
    • NOT IN DISTRIBUTIONS
    • I recommend installing with pip install matplotlib_venn
    • Only used for Venn diagrams, not essential

###R Dependencies

  • Needs to be compiled with shared libraries to communicate with Python (this can be tricky)
  • Packages
    • base
    • survival
    • MASS

###Command Line Dependencies

tcga's People

Contributors

theandygross avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.