Git Product home page Git Product logo

applied-data-science's Introduction

Applied Data Science Project: Mental Health and Screen Time

Investigating whether and how the amount of screen time (using phones, computers, and tablets) a person spends at age 16 affects their levels of depression and anxiety at age 18.

Visualisations

NaN Values in Diagnosis Variables

Heatmap of the four diagnosis variables: has_dep_diag, dep_score, dep_thoughts, secd_diag. NaN entries represented as 0 (dark) and not NaN entries represented as 1 (bright). The lines show that there are rows of missing values across all four diagnosis variables. The variable with the least data is dep_thoughts, the other three diagnosis variables have almost equal amounts of data.

NaN vs not NaN entries fo diagnosis variables

Correlation Matrix Heatmap

Heatmap of the pearson correlation matrix of the 4513 rows that have at least one not NaN diagnosis variable. It shows that none of the screen time columns are correlated with a diagnosis column. All correlation values all <0.03. Method used:

pearson Correlation Matrix Heatmap showing no correlation between mental health and screen time

Heatmap of the spearman correlation matrix: Spearman Correlation Matrix Heatmap showing no correlation between mental health and screen time

Strip Plots

Strip plots of the 4513 rows that have at least one at least one not NaN diagnosis variable. The rows of the plot are the different screen times features, the columns of the plot are the four different diagnosis values. A value of 0 represents a NaN value other than for has_dep_diag where 0 means No and 1 means Yes. The strip plot shows that there is no correlation between screen time and any of the diagnosis values.

Strip Plot

Correlation Matrix Heatmap for Negatively Correlated Variables

A heatmap of the correlation matrix of the 4513 rows that have at least one diagnosis available. The aim with this visualisation was to try and find a negative correlation between these variables and diagnosis variables. No correlation was found, the most negative correlation found was -.031

Correlation Matrix Heatmap showing no correlation between mental health and variables thought to positively affect a childs mental health

Dimensionality Reduction - PCA

Principal Component Analysis for all of the data. 1 (yellow) implies a diagnosis of depression, 0 (purple) implies no diagnosis of depression.

PCA for mental health and screen time

Dimensionality Reduction - UMAP

Unified Manifold Approximation Projection for all the data. The first demonstrates when the parameter nneighbours is 50 and the second shows nneighbours at 500 which allows a demonstration of a local and global view of the data. Very little changes between them. Like PCA, 1 (yellow) implies a diagnosis of depression, 0 (purple) implies no diagnosis of depression. UMAP for mental health and screen time nneighours 50 UMAP for mental health and screen time nneighbours 500

Confusion Matrix for Machine Learning Algorithms

A single confusion matrix as despite several different models (and variations of the same model) being ran, they all created the same resultant confusion matrix. Confusion Matrix

applied-data-science's People

Contributors

isabelladegen avatar phillipsloan avatar travelingsubin avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.