coriassessment's Introduction

Senior Data Engineer Assessment

At CORI, we believe that everyone everywhere deserves the opportunity to participate in and benefit from our growing tech economy. We also understand that data based on different rural definitions can lead us to tell different stories about the state of rural America. Therefore, our organizational goal of "advancing economic prosperity in rural America through the creation of inclusive tech economy ecosystems that support scalable entrepreneurship and tech job creation" requires that our data team and products are informed by context-sensitive, flexible and nuanced approaches to defining rural. In a recent publication, we delve into a number of rural definitions. While putting together the background resources for this article, we identified the USDA's Rural-Urban Commuting Area (RUCA) codes as an important Census tract definition that we want to incorporate in our work moving forward.

In the US, many elected state and federal officials represent districts that are intended to contain roughly equal populations. Every ten years, the US Census Bureau surveys the population to assess population trends and updates its underlying geographies to reflect them. As a team, we’re committed to incorporating the best available data in our products, and we currently find ourselves in a transitional phase when it comes to our underlying geographic data definitions.

For this assessment, we ask you to translate our rural definition (based on RUCA) from the 2010 census tract definitions to the 2020 census tract definitions.

Assessment assets:

Assessment guidance:

  1. Use the 2010 RUCA codes to generate CORI’s rural definition.
  2. Join CORI’s rural definition to the 2010 TIGER/Line Census tracts.
  3. Use the relationship file to associate the 2010 TIGER/Line tract list with the 2020 TIGER/Line tract list and crosswalk CORI’s rural definition.
  4. Note: there are multiple ways to associate a 2020 tract with a rural definition. Please develop a methodological recommendation on how we can associate 2020 TIGER/Line tracts with CORI’s rural definition.
  5. Produce a dataset with 2 columns: tract GEOIDs (census tract unique identifiers) and the CORI rural indicator.
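
Step 4 leaves the association method open. One candidate, sketched below, gives each 2020 tract the rural flag of the 2010 tract that contributes the most land area to it. This is a minimal sketch under stated assumptions, not CORI's method: the column names `GEOID_TRACT_20`, `GEOID_TRACT_10` and `AREALAND_PART` are assumed for the relationship file, the function name `crosswalk_largest_overlap` is hypothetical, and the GEOIDs, areas and rural flags are made-up sample values rather than output of CORI's rural definition.

```python
def crosswalk_largest_overlap(relationship_rows, rural_2010):
    """Assign each 2020 tract the rural flag of its largest-overlap 2010 tract.

    relationship_rows: iterable of dicts with keys GEOID_TRACT_20,
        GEOID_TRACT_10 and AREALAND_PART (assumed column names).
    rural_2010: dict mapping 2010 GEOID -> bool (hypothetical rural flag).
    Returns a list of (2020 GEOID, rural flag) pairs sorted by GEOID.
    """
    best = {}  # 2020 GEOID -> (overlap land area, 2010 GEOID)
    for row in relationship_rows:
        geoid20 = row["GEOID_TRACT_20"]
        geoid10 = row["GEOID_TRACT_10"]
        if geoid10 not in rural_2010:
            continue  # anomaly: 2010 tract without a rural flag; skip it
        area = int(row["AREALAND_PART"])
        if geoid20 not in best or area > best[geoid20][0]:
            best[geoid20] = (area, geoid10)
    return [(g20, rural_2010[g10]) for g20, (_, g10) in sorted(best.items())]


# Made-up sample: one unchanged tract, and one 2020 tract built from two 2010 tracts.
rows = [
    {"GEOID_TRACT_20": "50001960100", "GEOID_TRACT_10": "50001960100", "AREALAND_PART": "1000"},
    {"GEOID_TRACT_20": "50001960201", "GEOID_TRACT_10": "50001960200", "AREALAND_PART": "700"},
    {"GEOID_TRACT_20": "50001960201", "GEOID_TRACT_10": "50001960300", "AREALAND_PART": "300"},
]
rural_2010 = {"50001960100": True, "50001960200": False, "50001960300": True}

result = crosswalk_largest_overlap(rows, rural_2010)
for geoid, is_rural in result:
    print(geoid, is_rural)
```

The result already has the two-column shape asked for in step 5. Land area is only one possible weight; a recommendation could equally argue for an area-share threshold or a population-based weight, and the report should state which one was chosen and why.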

Note:

  • The US geographic definitions are quirky. If you find anomalies in the data, please take note of them, but do not let them stop your progress.
  • This exercise does not necessarily require analysis or processing of the geometry included in these data sets.
  • State FIPS codes should be < 60. This limits the data to the 50 states and the District of Columbia and excludes US territories.
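
The state FIPS rule above can be applied directly to tract GEOIDs, whose first two characters are the state FIPS code. The sketch below is illustrative only: the helper name `in_states` is hypothetical and the tract GEOIDs are made-up examples.

```python
def in_states(geoid):
    """True when the GEOID's 2-digit state FIPS prefix is below 60."""
    return int(geoid[:2]) < 60


# Keep GEOIDs as strings so leading zeros (e.g. "01" for Alabama) survive.
geoids = ["01001020100", "11001000100", "72001956300", "60010950100"]
kept = [g for g in geoids if in_states(g)]
print(kept)
```

Under this rule the District of Columbia (FIPS 11) is kept, while territories such as American Samoa (60) and Puerto Rico (72) are dropped.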

Data files

In the data folder, there are three files:

  1. tab20_tract20_tract10_natl.txt

This file is the 2020 census tract to 2010 census tract relationship file. You can find an explanation of the relationship file here.

  2. census_tract_populations_2010.csv

This file provides population counts for 2010 census tracts.

  3. census_tract_populations_2020.csv

This file provides population counts for 2020 census tracts.
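
As a starting point for reading the relationship file, the sketch below parses a small in-memory sample with the standard `csv` module. It assumes the file is pipe-delimited with a header row that includes `GEOID_TRACT_20`, `GEOID_TRACT_10` and `AREALAND_PART`; check those names against the file itself before relying on them. The function name `read_relationship` is hypothetical and the sample rows are made up.

```python
import csv
import io


def read_relationship(handle):
    """Read pipe-delimited relationship rows, keeping GEOIDs as strings."""
    reader = csv.DictReader(handle, delimiter="|")
    return [
        {
            "GEOID_TRACT_20": row["GEOID_TRACT_20"],
            "GEOID_TRACT_10": row["GEOID_TRACT_10"],
            "AREALAND_PART": int(row["AREALAND_PART"]),
        }
        for row in reader
    ]


# Made-up sample standing in for the real file (assumed layout).
sample = io.StringIO(
    "GEOID_TRACT_20|GEOID_TRACT_10|AREALAND_PART\n"
    "01001020100|01001020100|9825304\n"
    "01001020200|01001020200|3320818\n"
)
pairs = read_relationship(sample)
print(pairs[0])
```

With the real data, the same function can be given a handle opened on `tab20_tract20_tract10_natl.txt` in the data folder. Reading GEOIDs as strings avoids losing the leading zero of state codes below 10.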

Instructions

To begin, clone this repository.

Using the provided files, we would like you to analyze the tract relationship changes between 2010 and 2020 and provide a brief report summarizing what you found.
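
One way to start summarizing the relationship changes is to label each 2020 tract by how many 2010 tracts it overlaps, and whether its 2010 tract was divided. The sketch below is a hedged illustration, not a prescribed method: the function name `classify_changes`, the three labels, and the placeholder tract IDs are all hypothetical.

```python
from collections import Counter, defaultdict


def classify_changes(pairs):
    """Label each 2020 tract by how it relates to 2010 tracts.

    pairs: iterable of (geoid_2020, geoid_2010) tuples.
    """
    parents = defaultdict(set)   # 2020 tract -> 2010 tracts it overlaps
    children = defaultdict(set)  # 2010 tract -> 2020 tracts it overlaps
    for g20, g10 in pairs:
        parents[g20].add(g10)
        children[g10].add(g20)

    labels = {}
    for g20, g10s in parents.items():
        if len(g10s) > 1:
            labels[g20] = "multiple_parents"  # built from several 2010 tracts
        elif len(children[next(iter(g10s))]) > 1:
            labels[g20] = "split"             # its single 2010 tract was divided
        else:
            labels[g20] = "one_to_one"
    return labels


# Placeholder IDs, not real GEOIDs.
pairs = [
    ("A20", "A10"),
    ("B20", "B10"), ("C20", "B10"),
    ("D20", "C10"), ("D20", "D10"),
]
labels = classify_changes(pairs)
summary = Counter(labels.values())
print(summary)
```

If the relationship file lists very small overlaps as separate rows, many tracts may land in `multiple_parents`; filtering pairs by a minimum overlap area before classifying is one option worth discussing in the report.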

This exercise is designed to be open-ended and there is no right answer.

Contributors

camdenblatchly, defuneste, dancingfrog
