Git Product home page Git Product logo

analytics's Introduction

Overview

We're working to make every step of the Code for SF experience great, from finding your first project to making your first contribution.

Goals / Scope

The goal of this project is to build an understanding of member engagement and diversity with Code for San Francisco.

How to contribute

Roles

  • Product Owner (Greg)
  • Project Coordinator (Ti)
  • Data Scientists (Rocky)

Looking for Data Scientists Critical Thinkers

How to Get Started

For creating workbooks:

  1. Request permission from Greg to google drive that has data already available
  2. Create a local repository via github (or GitHub Desktop)
  3. Add files to the folder called "data" for organization.
  4. Create workbooks in the folder called "workbooks" for organization.

Available Raw Data

user_list_data -> A list of all the users in our slack community conversation_list_data -> A list of all channels in our slack community conversations_history_data -> A list of all the public messages in our slack community members -> A list of members and demographic information attendance -> A list of attendance information

For processing data yourself

=======

  1. Ensure Pip is updated (some packages may fail otherwise):

    pip3 install --upgrade pip
    
  2. Create and activate a virtual environment by running this in terminal (and install virtualenv if needed):

    pip3 install virtualenv
    
    python3 -m venv venv
    
  3. Set up local environmnetal variables. There are 2 api keys, one for slack and one for airtable that need to be added to the end of the file "venv/bin/activate". This is the environment config file that is run when you activate your virtual environment.

  4. Activate virtual environment by navigating to your local report and running:

    source venv/bin/activate
    
  5. Run requirements.txt so that you have all of the proper packages:

    pip install -r requirements.txt
    

analytics's People

Contributors

gregboyer avatar rgowni1 avatar tdevwk avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

tdevwk

analytics's Issues

User List Not Running

Getting a Python error when trying to run latest version of users_list.py.

The file itself seems to have a combination of stashed/local development changes.

Validate using emails as IDs

Validate that email address can be used to join Airtable Demographic data with slack data.

Answer the following questions:

  1. How many users are in the airtable?
  2. How many users are in the slack list?
  3. How many users are in both the airtable and slack?
  4. For those who are in list 1, but not in list 2, How many can we match manually if we were to use name and not email addresses?

Integrate team.accessLogs

https://api.slack.com/methods/team.accessLogs

For each user in users.list, add the following to the data set.

first access
most recent access
number of logins
country
region

Note, does not have paging, need to increment through page until all results are found.

Load entire log, identify the aggregate values based on userid, then join to users.list info

Analysis: Does Gender of Lead Impact Gender of Participants?

Compare proportion of public posts by male vs female in project by gender of project lead.

  | M | F.  <-- Project Leads

M. |. |
F. |. |
^ Participants

Take from conversations.list for channel name, conversation history for number of public messages, and member form for genders; create crosstab.

project channels with female leads:
discuss-data-science
proj-datasci-bay-releif
proj-waterthetrees

Mix leads
proj-intentional-walk (mix)
proj-teambats

Project channels with male leads:
issue-covid-19
proj-3d-street
proj-compass
proj-tinytown
proj-brigade-analytics
proj-brigade-website
proj-police-budgets

Brigade Channels
general
discuss-new-member
share-civic-jobs
share-events

Metric is count of posts by gender

% of Slack Activity covered

One question that comes here is in terms of percent of slack activity, how much can be associated with demographic data?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.