Git Product home page Git Product logo

mscsshiny's Introduction

Welcome to MSCSShiny!

Phil Ferriere
June 2016

MSCSShiny is a test/demo application for R packages like {mscstexta4r} and {mscsweblm4r} that interface with the Microsoft Cognitive Services REST APIs.

Demo: Try it live on shinyapps.io!

Microsoft Cognitive Services? What's that about?

Microsoft Cognitive Services -- formerly known as Project Oxford -- are a set of large, diverse, truly awesome APIs, SDKs and services that developers can use to add AI features to their apps. Those features include emotion and video detection; facial, speech and vision recognition; as well as speech and NLP.

MSCS Language Services

Our interest, at this stage, is limited to the exploration and evaluation of the NLP features of MSCS:

https://www.microsoft.com/cognitive-services/en-us/documentation

As should be clear from the above, this subset itself isn't exactly small...

Text Analytics API

The {mscstexta4r} package is a wrapper around the MSCS Text Analytics REST API. This API offers a suite of text analytics web services - built with Azure Machine Learning - that can be used to analyze unstructured text. The API supports the following operations:

  • Sentiment analysis - Is a sentence or document generally positive or negative?
  • Topic detection - What's being discussed across a list of documents/reviews/articles?
  • Language detection - What language is a document written in?
  • Key talking points extraction - What's being discussed in a single document?

For more information about the {mscstexta4r} package (on CRAN, or on GitHub), please check out the Text Analytics API tab at the top of this page.

Web Language Model API

The {mscsweblm4r} R package exposes bindings for the MSCS Web Language Model REST API. Per Microsoft's website, this API uses smoothed backoff N-gram language models (supporting Markov order up to 5) that were trained on four web-scale American English corpora collected by Bing (web page body, title, anchor and query). The following operations are supported:

  • Calculate the joint probability that a sequence of words will appear together.
  • Compute the conditional probability that a specific word will follow an existing sequence of words.
  • Get the list of words (completions) most likely to follow a given sequence of words.
  • Insert spaces into a string of words adjoined together without any spaces (hashtags, URLs, etc.).
  • Retrieve the list of supported language models.

For additional information on the {mscsweblm4r} package (on CRAN, or on GitHub), please click the Web Language Model API tab at the top of this page.

Text Analytics Screenshots

Sentiment analysis

Topic detection

Language detection

Key talking points extraction

Web Language Model Screenshots

Supported web language models

Words most likely to follow a sequence of words

Break concatenated words into individual words

Conditional probability that a particular word follows a given sequence of words

Joint probability that a particular sequence of words appears together

Credits

All Microsoft Cognitive Services components are Copyright (c) Microsoft.

Customized progress bar style, courtesy of @jackolney.

Meta

Please report any MSCSShiny issues or bugs here.

License: MIT + file

This project is released with a Contributor Code of Conduct. By participating in this project, you agree to abide by its terms.

mscsshiny's People

Contributors

philferriere avatar

Watchers

James Chang avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.