Git Product home page Git Product logo

amazon-polly-developer-guide's Introduction

amazon-polly-developer-guide's People

Contributors

brentkellmer avatar cbigham avatar joshbean avatar jpeddicord avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

amazon-polly-developer-guide's Issues

[Errno 13] Permission denied: 'hello.mp3'

Hello,

Just getting started with AWS in general so not sure what's going on here.

I ran the Polly sample command in bash to get started.

aws polly synthesize-speech
--output-format mp3
--voice-id Joanna
--text 'Hello, my name is Joanna. I learned about the W3C on 10/3 of last year.'
hello.mp3

Returned this error: [Errno 13] Permission denied: 'hello.mp3'

I tried redirecting the file using > ~/hello.mp3 and that did not work.

I also thought I tried changing permissions on various files and directories using sudo chmod +x, but I may have missed something there.

Ubuntu 22.10

********Update

I reconfigured aws using sudo aws configure.

I then ran the Polly command aws using sudo and now it works fine.

Thanks,

Matthew

Android Studio cannot resolve 'AmazonPollyClientBuilder' or 'AWSClientBuilder'. everything else seems wroking

What i'm trying to do?

Get a specific piece of text outputted from amazon polly to mp3 format.
i tried following the amazon aws android example in the polly guide. that crashed my app with a null point exception

i tried following the java example but it shows me that AmazonPollyClient constructor is deprecated.

i then tried this:

https://docs.aws.amazon.com/sdk-for-java/v1/developer-guide/credentials.html

My Code :
`import com.amazonaws.auth.BasicAWSCredentials;
import com.amazonaws.regions.Regions;
import com.amazonaws.services.polly.AmazonPollyClient;
import com.amazonaws.services.polly.model.DescribeVoicesRequest;
import com.amazonaws.services.polly.model.DescribeVoicesResult;
import com.amazonaws.services.polly.model.OutputFormat;
import com.amazonaws.services.polly.model.SynthesizeSpeechPresignRequest;
import com.amazonaws.services.polly.model.Voice;

public class AmazonPolly {

public URL getRingtones(String name) {
    BasicAWSCredentials awsCred = new BasicAWSCredentials("XXXXXXXXXXX","XXXXXXXXXX");
    AmazonPollyClient client = AmazonPollyClientBuilder.standard().withCredentials(new AWSStaticCredentialsProvider(awsCred)).build();
    DescribeVoicesRequest describeVoicesRequest = new DescribeVoicesRequest();

    // voice options
    DescribeVoicesResult describeVoicesResult = client.describeVoices(describeVoicesRequest);
    List<Voice> voices = describeVoicesResult.getVoices();
    SynthesizeSpeechPresignRequest synthesizeSpeechPresignRequest =
            new SynthesizeSpeechPresignRequest()
                    // Set text
                    .withText("You have a call from " + name + "   ")
                    // Set voice
                    .withVoiceId(voices.get(0).getId()) // "Joanna"
                    // Set format
                    .withOutputFormat(OutputFormat.Mp3);

// Get the presigned URL for synthesized speech audio stream.
URL presignedSynthesizeSpeechUrl = client.getPresignedSynthesizeSpeechUrl(synthesizeSpeechPresignRequest);
return presignedSynthesizeSpeechUrl;

}

My dependencies in build.gradle: module: app >dependencies {
implementation fileTree(dir: 'libs', include: ['*.jar'])
implementation 'com.amazonaws:aws-android-sdk-polly:2.15.2'
implementation ('com.amazonaws:aws-android-sdk-mobile-client:2.6.23@aar') { transitive = true }
implementation group: 'com.amazonaws', name: 'aws-android-sdk-core', version: '2.15.2'
implementation ('com.amazonaws:aws-android-sdk-auth-core:2.15.+@aar') { transitive = true }
implementation "org.jetbrains.kotlin:kotlin-stdlib-jdk7:$kotlin_version"
implementation 'androidx.appcompat:appcompat:1.0.2'
implementation 'androidx.core:core-ktx:1.0.2'
implementation 'androidx.constraintlayout:constraintlayout:1.1.3'
testImplementation 'junit:junit:4.12'
androidTestImplementation 'androidx.test.ext:junit:1.1.0'
androidTestImplementation 'androidx.test.espresso:espresso-core:3.1.1'
}`

my dependencies in build.gradle( project level) >
dependencies { classpath 'com.android.tools.build:gradle:3.6.1' classpath 'com.amazonaws:aws-android-sdk-appsync-gradle-plugin:2.9.+' classpath "org.jetbrains.kotlin:kotlin-gradle-plugin:$kotlin_version" // NOTE: Do not place your application dependencies here; they belong // in the individual module build.gradle files }

Support for mixed languages

Can a text with both Chinese and English support voice conversion? Without specifying a language type。And you don't need to use an SSML tag to set whether it's in Chinese or English

About Phoneme/Viseme Tables

This is more of a question than an issue.

There are Phoneme/Viseme tables available at https://docs.aws.amazon.com/polly/latest/dg/ref-phoneme-tables-shell.html which are supported by Amazon Polly.

I require a text to speech speech system for my research. I want to select certain words which will be spoken with exclusively with certain visemes (like T, @, O, etc) in various languages and generate these sounds of words using amazon polly to further study it in speech signal domain.

However, Hindi, which is a very important language I wish to include does not have Phoneme/Viseme mapping info on the website. Only phonemes (viseme column is absent from the table) are available. I am willing to generate the table myself but what are the visemes which you support? For ex: some linguists say there are 13 visemes others say there are 16 (or more). Also some linguistics may map a phoneme to a different one than other.

How were these visemes shortlisted. Perhaps a citation to the original publication would be helpful.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.