Git Product home page Git Product logo

isabella232 / 10x-mlaas Goto Github PK

View Code? Open in Web Editor NEW

This project forked from 18f/10x-mlaas

0.0 0.0 0.0 8.12 MB

Repository for machine learning tool, MeL, that assist in providing insights for open text data. This tool is part of the 10x Machine Learning as a Service project (formerly known as Qualitative Data Management).

Home Page: https://10x.gsa.gov

License: Other

Jupyter Notebook 93.31% Python 6.61% Dockerfile 0.05% Shell 0.03%

10x-mlaas's Introduction

MlaaS

10x Machine Learning as a Service

[Formerly known as Qualitative Data Management]

Making is easier to identity what information the public is looking for.

We are currently in the third phase of 10x: Development. We estimate this phase will end in May 2020.

Overview

USA.gov and Gobierno.USA.gov provide a vital service: connecting members of the public with official information about government services. In an effort to better serve the public, both sites collect (and act on) feedback from users. They collect this feedback through various channels: call centers, the chat feature, sitewide surveys, and page-level surveys. (Our research focused almost entirely on page-level surveys.) For context, page-level surveys appear on “nearly every content page, directory record page, and state details page” — in other words, pages with the content that users need to answer their questions.

The problem statement

As a government employee, how can I more quickly and effectively analyze comments provided by site visitors to identify timely issues, improve the usability of the site and information provided, and further direct visitors to important services and information they need?

Our challenge

Help the USAgov team better serve their users by (a) introducing process improvements and simple automation in analyzing open-ended comments submitted through their website and (b) introduce experimental sentiment-analysis tools and other qualitative data analysis techniques to more robustly analyze these data with less human intervention.

As expected, the scope of our project has shifted to offering these machine learning tools to the entire Office of Customer Experience. During Phase II, we prototyped and delivered a machine learning tool to aid the USAgov team but we believe this tool (or similar SaaS) could be leveraged to reduce the burden on other teams in the Office of Customer Experience, as well as outside GSA.

During Phase III, we narrowed the scope from development of an expansive machine learning service to building a MVP that will use open text data to (1)provide data insights, and (2) decrease time to classify and identify themes manually. We would like to introduce you to MeL which uses machine learning to filter, classify and provide user sentiment, so that you have greater insights into your text data in less time.

We continue to work the Office of Customer Experience on the development of this MVP and are looking to work with other federal agencies and datasets to explore different use cases for MeL.

You can follow MeL development journey here.

Who are we?

Team members:

  • Tiffany Andrews, Innovation Specialist, 18F
  • Adam Gerber, Flexion Data Scientist and Machine Learning Engineer
  • Will Cahoe, Program Analyst, 10x

Advisers:

Former team members:

  • Amy Mok, Innovation Specialist, 18F
  • Amos Stone, Innovation Specialist, Login
  • Colin Craig, Innovation Specialist, 18F
  • Chris Goranson, research, 18F
  • Kate Saul, research, 18F

Progress

We are tracking the work for this Phase on our Kanban board.

Any issues or ideas that we want to keep track of for later are being noted in the GitHub issues.

We post weekly progress updates in updates.

Investigation

  1. USA.gov’s data-management process is entirely manual. Although the team uses HHS’s Voice of the Customer tool to capture survey data, all of the review and analysis are manual.
  2. This manual process takes time and creates significant challenges. Manual review and analysis take a considerable amount of time — time that could be spent creating more effective content and replying to urgent user inquiries.
  3. Workflow improvements would complement automation. In addition to automating data processing, identifying a product owner, documenting the workflow, and finding other ways to streamline the process would increase efficiency.
  4. Automation improvements will speed up the workflow improvements and reduce errors. Steps that are largely duplicative between analysis periods can be automated, thereby allowing the USA.gov team to spend more time gleaning insights from the valuable data.
  5. The USA.gov team enthusiastically seeks process improvements. Everyone we spoke to emphasized the need for process improvements and an openness toward change.
  6. Novel approaches to qualitative data enhancements can be applied without getting in the way. Once workflow and automation enhancements are complete, the analyses can be further explored using methods and tools that work well for analyzing qualitative data (natural language processing, sentiment analysis).
  7. We recommend moving forward with this project. Based on the potential for improving USA.gov’s service offerings, the applicability of automation tactics to other federal agencies and other qualitative data held by the government, the impact improvements will have on the lives of American citizens, and the team’s openness to change, we recommend that the next phase of this project should be funded.

The full Phase I investigation report is available here.

Contributing

See CONTRIBUTING for additional information.

Join us in #10x-mlaas or ask us a question.

Public domain

This project is in the worldwide public domain. As stated in CONTRIBUTING:

This project is in the public domain within the United States, and copyright and related rights in the work worldwide are waived through the CC0 1.0 Universal public domain dedication.

All contributions to this project will be released under the CC0 dedication. By submitting a pull request, you are agreeing to comply with this waiver of copyright interest.

10x-mlaas's People

Contributors

adamgerbergsa avatar amoose avatar amymok avatar csmcallister avatar dependabot[bot] avatar gsamoose avatar openglobe avatar tiffanyandrews avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.