Git Product home page Git Product logo

delawareopendata's Introduction

Overview

DeSODP was built by Team 311 for the Stanford Open Datathon. We followed the Open Data Handbook.

Data Collection:

We collected our data from Delaware State University’s publicly shared fact sheets.

Additionally, we implemented a form for users to submit their own datasets

Data Workflow

Our workflow is as follows:

Source data from Delaware State University Fact Book Scrape data using Tabula Clean and format data using Microsoft Excel Log metadata in a Google Sheet, noting necessary fields Export metadata spreadsheet as CSV Run an offline script to convert the metadata file into json format. Manually upload metadata json file to AWS bucket along with datasets

We chose this as our approach to stay within our technical skill level while pushing ourselves to figure out a way to complete our mission of getting a data portal up and running. In the future, the workflow could be streamlined by implementing API requests to automate some of the steps.

Converting from CSV to JSON

Part of our workflow requires some light coding since we run an offline python script to convert our metadata from csv to json. Essentially, all that would need to be done is opening the csv_to_json python file and editing the file paths to correspond with the csv and json files in the metadata folder. Once that’s done, anyone can do the conversion by double clicking the python file so long as the csv and json files are present and named properly.

Data Storage and Access

Datasets are stored in an AWS bucket, which can be managed through the AWS root account and IAM user accounts. Each IAM user account has the ability to add and see the datasets. Only one account will have full access to add, modify, and delete objects if necessary.

delawareopendata's People

Contributors

vishustle avatar

Watchers

 avatar  avatar

Forkers

kidalecks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.