awsdocs / amazon-polly-developer-guide Goto Github PK

The open source version of the Amazon Polly docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.

License: Other

amazon-polly-developer-guide's Introduction

NOTICE

This repository is archived, read-only, and no longer updated. For more information, read the announcement on the AWS News Blog.

You can find up-to-date AWS technical documentation on the AWS Documentation website, where you can also submit feedback and suggestions for improvement.

amazon-polly-developer-guide's People

Contributors

Stargazers

Watchers

amazon-polly-developer-guide's Issues

[Errno 13] Permission denied: 'hello.mp3'

Hello,

Just getting started with AWS in general so not sure what's going on here.

I ran the Polly sample command in bash to get started.

aws polly synthesize-speech
--output-format mp3
--voice-id Joanna
--text 'Hello, my name is Joanna. I learned about the W3C on 10/3 of last year.'
hello.mp3

Returned this error: [Errno 13] Permission denied: 'hello.mp3'

I tried redirecting the file using > ~/hello.mp3 and that did not work.

I also thought I tried changing permissions on various files and directories using sudo chmod +x, but I may have missed something there.

Ubuntu 22.10

********Update

I reconfigured aws using sudo aws configure.

I then ran the Polly command aws using sudo and now it works fine.

Thanks,

Matthew

examples-python.md adminuser not found

I've run the python script on python 3 and it says the adminuser set on line 68 is not found?

AWS for Wordpress plugin no longer available

The AWS for Wordpress plugin, referenced in the help doc at https://docs.aws.amazon.com/polly/latest/dg/WordPressPlugin-install.html, is no longer available via Wordpress. See https://wordpress.org/plugins/ It is also not available here: https://github.com/awslabs/aws-for-wordpress

Android Studio cannot resolve 'AmazonPollyClientBuilder' or 'AWSClientBuilder'. everything else seems wroking

What i'm trying to do?

Get a specific piece of text outputted from amazon polly to mp3 format.
i tried following the amazon aws android example in the polly guide. that crashed my app with a null point exception

i tried following the java example but it shows me that AmazonPollyClient constructor is deprecated.

i then tried this:

https://docs.aws.amazon.com/sdk-for-java/v1/developer-guide/credentials.html

My Code :
`import com.amazonaws.auth.BasicAWSCredentials;
import com.amazonaws.regions.Regions;
import com.amazonaws.services.polly.AmazonPollyClient;
import com.amazonaws.services.polly.model.DescribeVoicesRequest;
import com.amazonaws.services.polly.model.DescribeVoicesResult;
import com.amazonaws.services.polly.model.OutputFormat;
import com.amazonaws.services.polly.model.SynthesizeSpeechPresignRequest;
import com.amazonaws.services.polly.model.Voice;

public class AmazonPolly {

public URL getRingtones(String name) {
    BasicAWSCredentials awsCred = new BasicAWSCredentials("XXXXXXXXXXX","XXXXXXXXXX");
    AmazonPollyClient client = AmazonPollyClientBuilder.standard().withCredentials(new AWSStaticCredentialsProvider(awsCred)).build();
    DescribeVoicesRequest describeVoicesRequest = new DescribeVoicesRequest();

    // voice options
    DescribeVoicesResult describeVoicesResult = client.describeVoices(describeVoicesRequest);
    List<Voice> voices = describeVoicesResult.getVoices();
    SynthesizeSpeechPresignRequest synthesizeSpeechPresignRequest =
            new SynthesizeSpeechPresignRequest()
                    // Set text
                    .withText("You have a call from " + name + "   ")
                    // Set voice
                    .withVoiceId(voices.get(0).getId()) // "Joanna"
                    // Set format
                    .withOutputFormat(OutputFormat.Mp3);

// Get the presigned URL for synthesized speech audio stream.
URL presignedSynthesizeSpeechUrl = client.getPresignedSynthesizeSpeechUrl(synthesizeSpeechPresignRequest);
return presignedSynthesizeSpeechUrl;

My dependencies in build.gradle: module: app >dependencies {
implementation fileTree(dir: 'libs', include: ['*.jar'])
implementation 'com.amazonaws:aws-android-sdk-polly:2.15.2'
implementation ('com.amazonaws:aws-android-sdk-mobile-client:2.6.23@aar') { transitive = true }
implementation group: 'com.amazonaws', name: 'aws-android-sdk-core', version: '2.15.2'
implementation ('com.amazonaws:aws-android-sdk-auth-core:2.15.+@aar') { transitive = true }
implementation "org.jetbrains.kotlin:kotlin-stdlib-jdk7:$kotlin_version"
implementation 'androidx.appcompat:appcompat:1.0.2'
implementation 'androidx.core:core-ktx:1.0.2'
implementation 'androidx.constraintlayout:constraintlayout:1.1.3'
testImplementation 'junit:junit:4.12'
androidTestImplementation 'androidx.test.ext:junit:1.1.0'
androidTestImplementation 'androidx.test.espresso:espresso-core:3.1.1'
}`

my dependencies in build.gradle( project level) >
dependencies { classpath 'com.android.tools.build:gradle:3.6.1' classpath 'com.amazonaws:aws-android-sdk-appsync-gradle-plugin:2.9.+' classpath "org.jetbrains.kotlin:kotlin-gradle-plugin:$kotlin_version" // NOTE: Do not place your application dependencies here; they belong // in the individual module build.gradle files }

Code snippet rendered wrong

looks correct in source at doc_source/get-started-what-next.md

but at https://docs.aws.amazon.com/polly/latest/dg/get-started-what-next.html there are two extra comments on lines 30 and 35

About Phoneme/Viseme Tables

This is more of a question than an issue.

There are Phoneme/Viseme tables available at https://docs.aws.amazon.com/polly/latest/dg/ref-phoneme-tables-shell.html which are supported by Amazon Polly.

I require a text to speech speech system for my research. I want to select certain words which will be spoken with exclusively with certain visemes (like T, @, O, etc) in various languages and generate these sounds of words using amazon polly to further study it in speech signal domain.

However, Hindi, which is a very important language I wish to include does not have Phoneme/Viseme mapping info on the website. Only phonemes (viseme column is absent from the table) are available. I am willing to generate the table myself but what are the visemes which you support? For ex: some linguists say there are 13 visemes others say there are 16 (or more). Also some linguistics may map a phoneme to a different one than other.

How were these visemes shortlisted. Perhaps a citation to the original publication would be helpful.

Support for mixed languages

Can a text with both Chinese and English support voice conversion? Without specifying a language type。And you don't need to use an SSML tag to set whether it's in Chinese or English

awsdocs / amazon-polly-developer-guide Goto Github PK

amazon-polly-developer-guide's Introduction

NOTICE

amazon-polly-developer-guide's People

Contributors

Stargazers

Watchers

Forkers

amazon-polly-developer-guide's Issues

[Errno 13] Permission denied: 'hello.mp3'

examples-python.md adminuser not found

AWS for Wordpress plugin no longer available

Android Studio cannot resolve 'AmazonPollyClientBuilder' or 'AWSClientBuilder'. everything else seems wroking

Code snippet rendered wrong

About Phoneme/Viseme Tables

Support for mixed languages

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent