Git Product home page Git Product logo

speaking-docs's People

Contributors

gregpriday avatar vkazakevich avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

isabella232

speaking-docs's Issues

Implement Speaking Docs

For our writers at Briefer, we need to know what text will sound like when read by AWS Polly. This addon should make it very easy for them to listen to sentences and paragraphs.

For a good idea on how this addon should work, take a look at our Froala/Nova integration with AWS Polly.

https://github.com/brieferbooks/nova-froala-field/tree/develop

The quickstart guide seems to very useful. There's even a function for getting the selected text that we can utilize (unless it's part of the core API by now). The rest of the docs gives a lot more details.

Settings to add AWS credentials

We need to give a user a way of adding their AWS credentials for AWS Polly calls. Google docs seems to have a good way of creating forms and modals, so this should be fairly straight forward. I think the PropertiesService can store data like this.

We need to give users a way to access this settings form from the main menu.

Listening to selected text

Ideally, we need a button right inside the main Google Docs toolbar.

Screenshot 2020-06-18 at 16 17 51

Something like a speaker icon or a speech bubble should work fine. I'm not sure if Google docs has an entire icon pack or if we need to somehow add our own.

Clicking the button should start playing the selected text. Just like the Translate quickstart example, this could display a sidebar with a simple HTML5 audio player so the user can stop and replay the audio if they need to. It's also possible the Cards service will be useful for this.

In addition to the toolbar button, we should have a keyboard shortcut to read the selected text.

Additionally, clicking the listen button while audio is playing should pause the audio.

Challenges

Our Froala integration has the advantage of having a server side component that makes requests to AWS Polly. In our case, we'll need to do everything using Javascript. Maybe this just means we'll need to use data URLs with the fetched MP3s in HTML audio tags.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.