Git Product home page Git Product logo

iphone-speech-to-text's Introduction

Speech-to-Text library used by Astrid for iPhone!

This library leverages the Google Voice API and the Speex audio codec to achieve speech-to-text on iOS devices where Siri is not available. The library includes version 1.2-rc1 of the Speex codec and three supporting classes to make speech-to-text on iOS as simple as possible. There are a few steps required to configure the project in Xcode:

  1. Clone the project

  2. Add the SpeechToText Xcode project to your project

  3. In your project's settings, update the target's build phases by adding SpeechToText to the "Target Dependencies" section and libSpeechToText.a under the "Link Binary with Libraries" section.

  4. Under the target's Build Settings tab, add the iPhone-Speech-to-Text directory (i.e. the path on your hard drive to wherever it exists) to the "Header Search Paths" field, and click the box marked "Recursive"

  5. Import "SpeechToTextModule.h" wherever you want to use it!

The SpeechToTextModule class is the core of the library, but is very simple. An instance of this class can start and stop audio recording, and will pass an NSData * object back to its delegate when it receives a response from the Google Voice API. This data should be interpreted as JSON and can be decoded by your favorite JSON parsing library. The resulting JSON object should contain an array under the key "hypotheses", and each entry in this array will be a possible interpretation of the audio with the key "utterance" for the text and the key "confidence" for the confidence.

Optional delegate methods for SpeechToTextModuleDelegate allow you to display a rolling sine wave view that indicates volume amplitude as recording takes place (showSineWaveView and dismissSineWaveView) or display an optional loading view while the module communicates with the voice API.

Contributors workflow

If you want to help make this library better, read this!

Setup:

git clone [email protected]:your-github-id/iPhone-Speech-to-Text.git (your-github-id should obviously be replaced)

git remote add upstream [email protected]:todoroo/iPhone-Speech-to-Text.git

Working on new features/fixes:

git checkout -b my-new-features upstream/master

work, work, work!

git commit (a separate commit for each bug fix, feature change, style or copy edit please!)

git fetch upstream

git rebase -i upstream/master (i like to rebase -i to verify what I'm committing and squish small commits)

git push origin HEAD

then go to github and submit a pull request!

For further information, read Tim's Collaborator Guide.

iphone-speech-to-text's People

Contributors

bearpelican avatar jonparis avatar sbosley avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

iphone-speech-to-text's Issues

Compiling issue

Maybe I am doing something wrong, but I can not compile with XCode 4.5 for arch armv7 and armv7s.
This is the "error"

Build operation failed without specifying any errors. Individual build tasks may have failed for unknown reasons. Some of these (up to 12) may be listed below.

Not working with iOS 7

iPhone Speech to Text code was not working for iOS 7. In this always return blank because recording is not working in iOS 7.

Please give me any solution for that.

How to Integrate speech-to-text

Hi,
I integrate the speech-to-text .when i click add button it show's the Speak Now alert dialog box.When i click done button the json return empty data.I dont know it's correct or not please help me.

Problem with audio html5 tag

Hi, I'm using this library and works great, the issue is when i use a html 5 audio tag with a hello message, to make this works I have to set mediaPlaybackRequiresUserAction = NO. But when the voice start the mic doesn't get any sound and the wave its static. Can you help me?.

No longer working with google voice api

I'm getting an internal error when I try to run the library

<TITLE>Internal Server Error</TITLE>

Internal Server Error

Error 500

Is any one getting this? If so, anyone have a workaround to fix it?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.