Git Product home page Git Product logo

matplotlib-challenge's Introduction

The Power of Plots - Matplotlib Challenge

If you would like to see my finding, please scroll down to the findings section.

Table of Contents

Background

What good is data without a good plot to tell the story? In this assignment, I took on the role of of a data scientist, working for Pymaceuticals, Inc. a burgeoning pharmaceutical company. The company began screening for potential treatments for squamous cell carcinoma(SCC), a commonly occurring form of skin cancer. I've been given to the complete data from their most recent study covering 250 mie idetntifed with SCC tumor growth. Over the course of 45 days, tumor development was observed and measured. The purpose of this study was to compare the performance of Capomulin versus the other treatment reignemnts. I have asked to generall all of the tables and figures needed for the technical report of the study. The executive team also hased for a top-level summary of the study results.

Reporting Steps

Summary Statistics Table

  • I created a table including the following:
    • Mean
    • Median
    • Variance
    • Standard Deviation
    • SEM of the tumor volume of each drug regimen

Bar plot

  • I generated bar plots showing the number of data points for each treatment regimen This was done using:
    • Matplotlib's pyplot
    • Pandas' DataFrame.plot

Here are the bar plots I generated:

Pie plot

  • I generated pie plots showing the distribution of mice per gender using:
    • Matplotlib's pyplot
    • Pandas' DataFrame.plot

Box and whisker plot

  • I caluclated the final tumor volume of each mouse across four of the most promising treatment regimens and used it to create a box and whisker plot. The four regimens were:
    • Capomulin
    • Ramicane
    • Infubinol
    • Caftamin

Line plot

  • I created a line plot of time point versus average tumor volume for the Capolmulin treatment regimen.

Scatter plot

  • I created a scatter plot of the mouse weight versus average tumor volume for the Capomulin treatment regimen.

Linear regression model

  • I calculated the correlation coefficient and created a linear regression model between mouse weight and average tumor volume.

Findings

These are my three inferences or observations of the data:

  1. There is almost a near positive correlation between the mouse's weight and the tumor volume. In fact with a correlation coefficient at 0.95, it is almost certain that these two are positively correlated.
  2. On mouse ID s185, there was a significant decrease in the tumor volume over the time period. The tumor volume on this mouse went from 45 cubic millimeters to less than 25 cubic millimeters, showing real promise for this drug.
  3. With the exception of 1 mouse on Infubinol, all four reigmens showed consistent results regarding the tumor volume of the mice at the end. Looking at the box and whisker plot, Capomulin and Ramicane seemed to be the most promising of results with the tumor volumes taken at maximum time point having a median of less than 40 cubic millimeters. Infubinol and Ceftamin still had median tumor volumes closer to 60 cubic millimeters.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.