Amazon Polly as a provider for the text-to-speech feature. #734

iamdharmesh · 2024-03-04T05:32:50Z

Description of the Change

This PR adds Amazon Polly as a provider for the Text to Speech feature.

Closes #728

@jeffpaul @dkotter Amazon Polly provides additional options such as the Newscaster speaking style and SSML, which supports features like breathing sounds and emphasis on specific words or phrases. I haven't implemented these in this PR, but we can consider integrating them in the future if there are specific client requirements around these features.
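For reference, here is a rough sketch of what those SSML options look like on the wire. It is written in Python purely for illustration and is not part of this PR; the helper names (`newscaster_ssml`, `emphasis_ssml`, `synthesize_params`) are hypothetical. The tags themselves (`amazon:domain`, `amazon:breath`, `emphasis`) come from Polly's SSML support. To my understanding the Newscaster style is only available with certain neural voices, while breath and emphasis tags apply to standard voices, so voice and engine selection would need checking before building on this.

```python
from xml.sax.saxutils import escape


def newscaster_ssml(text: str) -> str:
    """Wrap text in Polly's news domain tag (the Newscaster speaking style)."""
    return f'<speak><amazon:domain name="news">{escape(text)}</amazon:domain></speak>'


def emphasis_ssml(before: str, stressed: str, after: str) -> str:
    """Insert an explicit breath, then emphasize one phrase."""
    return (
        "<speak>"
        f"{escape(before)}"
        '<amazon:breath duration="medium" volume="x-loud"/>'
        f'<emphasis level="strong">{escape(stressed)}</emphasis>'
        f"{escape(after)}"
        "</speak>"
    )


def synthesize_params(ssml: str, voice_id: str, engine: str) -> dict:
    """Build parameters in the shape the SynthesizeSpeech operation accepts.

    Hypothetical helper: it only assembles the request, it does not call AWS.
    """
    return {
        "Engine": engine,
        "OutputFormat": "mp3",
        "Text": ssml,
        "TextType": "ssml",  # tells Polly to parse the text as SSML
        "VoiceId": voice_id,
    }
```

The only change relative to a plain-text request is `TextType` set to `ssml` plus the markup itself, so adding this later should mostly be a settings and content-preparation change.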

@dkotter, I haven't added E2E tests for this because I haven't figured out yet how to mock the API, given that we are using the AWS PHP SDK here. Please let me know if you have any ideas on this.

How to test the Change

1. Go to Tools > ClassifAI > Language Processing > Text to Speech.
2. Select the "Amazon Polly" option as the provider.
3. Add AWS credentials and save settings.
4. Create/Edit a post and ensure that the Text-to-Speech feature is working as expected.

Changelog Entry

Added - Amazon Polly as a provider for the text-to-speech feature.

Credits

Props @jeffpaul @iamdharmesh

Checklist:

I agree to follow this project's Code of Conduct.
I have updated the documentation accordingly.
I have added tests to cover my change.
All new and existing tests pass.

includes/Classifai/Providers/AWS/AmazonPolly.php

dkotter · 2024-03-14T17:27:50Z

> @dkotter, I haven't added E2E tests for this because I haven't figured out yet how to mock the API, given that we are using the AWS PHP SDK here. Please let me know if you have any ideas on this.

I guess my first question would be do we need to use the SDK here? I know that can help simplify things but we haven't used any SDKs for the other Providers up to this point. I'm not opposed to it, just wondering if there was a specific reason.

But I can think of two approaches we can take to mock the requests:

1. Add a short-circuit filter right before we make the request to AWS, allowing us to return our own results. This is basically what WordPress does. My only concern is that we'd be adding a filter for testing purposes only, which I don't love.
2. Because the main request goes through a custom REST endpoint, there is a filter that fires before any callbacks are called: rest_pre_dispatch. We could use this and return a hardcoded result, similar to how we're currently using the pre_http_request filter. This wouldn't work for all scenarios (like triggering Text to Speech from the inline row action) but should work to test the main use case of publishing content.
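To make the first approach concrete, here is a minimal, language-neutral sketch of the short-circuit pattern, written in Python for illustration only. In the plugin this would be a WordPress filter in PHP; the tiny `add_filter`/`apply_filters` registry below is a stand-in for WordPress hooks, and the filter name `pre_synthesize_speech` and the `synthesize_speech` function are hypothetical, not names from this PR.

```python
from typing import Any, Callable, Dict, List, Optional

# Stand-in for a hook registry: filter name -> list of callbacks.
_filters: Dict[str, List[Callable[..., Any]]] = {}


def add_filter(name: str, callback: Callable[..., Any]) -> None:
    """Register a callback on a named filter."""
    _filters.setdefault(name, []).append(callback)


def apply_filters(name: str, value: Any, *args: Any) -> Any:
    """Pass value through every callback registered on the named filter."""
    for callback in _filters.get(name, []):
        value = callback(value, *args)
    return value


def synthesize_speech(text: str, send_request: Callable[[str], bytes]) -> bytes:
    """Return audio for text, letting a filter short-circuit the remote call."""
    # Hypothetical filter name. None means "no override, do the real request".
    pre: Optional[bytes] = apply_filters("pre_synthesize_speech", None, text)
    if pre is not None:
        return pre
    return send_request(text)
```

A test would register a callback that returns canned audio bytes, so the remote service is never contacted:

```python
add_filter("pre_synthesize_speech", lambda pre, text: b"fake-mp3-bytes")
```

Because the check sits directly in front of the remote call, it covers every code path that ends up synthesizing speech, which the REST-level hook in the second approach would not.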

iamdharmesh · 2024-03-15T15:10:12Z

> I guess my first question would be do we need to use the SDK here? I know that can help simplify things but we haven't used any SDKs for the other Providers up to this point. I'm not opposed to it, just wondering if there was a specific reason.

The main reason for using the SDK was to keep things simple, especially concerning signing and authenticating REST requests. I believe we don't have this complex authentication process with existing providers. I'm open to getting rid of the SDK here and writing a custom class for handling authentication and REST operations (similar to what we did for OpenAI). Please let me know if you think we should remove the SDK here.
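To give a sense of what the SDK handles here, below is a minimal sketch of one part of AWS Signature Version 4: deriving the signing key and signing a string with it. It is written in Python with only the standard library and is an illustration, not code from this PR. The function names are hypothetical. A full implementation would also need to build the canonical request, the string to sign, and the Authorization header, all of which are omitted.

```python
import hashlib
import hmac


def _hmac_sha256(key: bytes, msg: str) -> bytes:
    """HMAC-SHA256 of msg under key, returned as raw bytes."""
    return hmac.new(key, msg.encode("utf-8"), hashlib.sha256).digest()


def derive_signing_key(secret_key: str, date_stamp: str, region: str, service: str) -> bytes:
    """Derive the SigV4 signing key through a chain of four HMACs.

    date_stamp is the request date as YYYYMMDD; service would be "polly"
    for Amazon Polly requests.
    """
    k_date = _hmac_sha256(("AWS4" + secret_key).encode("utf-8"), date_stamp)
    k_region = _hmac_sha256(k_date, region)
    k_service = _hmac_sha256(k_region, service)
    return _hmac_sha256(k_service, "aws4_request")


def sign(signing_key: bytes, string_to_sign: str) -> str:
    """Return the hex signature that goes into the Authorization header."""
    return hmac.new(signing_key, string_to_sign.encode("utf-8"), hashlib.sha256).hexdigest()
```

The key is scoped to a date, region, and service, so a custom class would have to recompute it per day and per region, on top of canonicalizing every request.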

> I can think of two approaches we can take to mock the requests:

Approach #1 seems like the better choice to me as it allows us to cover all scenarios.

Thanks

dkotter · 2024-03-19T13:58:07Z

> The main reason for using the SDK was to keep things simple, especially concerning signing and authenticating REST requests. I believe we don't have this complex authentication process with existing providers. I'm open to getting rid of the SDK here and writing a custom class for handling authentication and REST operations (similar to what we did for OpenAI). Please let me know if you think we should remove the SDK here.

I think we're fine to proceed with keeping the SDK here. It does increase the size of the final release zip because of all the code the SDK brings in, but that should be fine.

> Approach #1 seems like the better choice to me as it allows us to cover all scenarios.

That works for me.

iamdharmesh added 3 commits February 23, 2024 16:24

Added "aws-sdk-php" composer dependency.

aba4abc

Add AmazonPolly provider.

0e4ba4d

Merge branch 'develop' of github.com:10up/classifai into enhancement/728

6e56949

iamdharmesh self-assigned this Mar 4, 2024

github-actions bot added this to the 3.1.0 milestone Mar 4, 2024

iamdharmesh added 6 commits March 8, 2024 15:05

Add support for select voice engine.

149c2c8

Merge branch 'develop' of github.com:10up/classifai into enhancement/728

d18bb7e

Readme updates.

169d911

fix composer.lock file PHP version issue.

ac590bf

PHPCS fixes.

0324cdd

Sort voices.

27c557a

iamdharmesh changed the title from "[WIP] Add Amazon Polly provider for Text to speech feature" to "Amazon Polly as a provider for the text-to-speech feature." Mar 8, 2024

iamdharmesh marked this pull request as ready for review March 8, 2024 12:13

iamdharmesh requested review from dkotter, jeffpaul and a team as code owners March 8, 2024 12:13

github-actions bot added the needs:code-review label Mar 8, 2024

Minor cleanup

e902bf6

dkotter reviewed Mar 14, 2024

includes/Classifai/Providers/AWS/AmazonPolly.php (resolved)

iamdharmesh added 3 commits March 20, 2024 20:28

Add E2E tests.

adfabb6

Fix azure text to speech E2E tests.

f5fa87c

Added error handing for the text to speech feature.

8aaa664

iamdharmesh requested a review from dkotter March 29, 2024 13:52

dkotter previously approved these changes Mar 29, 2024

includes/Classifai/Providers/AWS/AmazonPolly.php (outdated, resolved)

includes/Classifai/Providers/AWS/AmazonPolly.php (outdated, resolved)

Update hook docs.

e13824e

iamdharmesh dismissed dkotter’s stale review via e13824e April 1, 2024 05:42

iamdharmesh requested a review from dkotter April 1, 2024 05:43

dkotter approved these changes Apr 1, 2024

dkotter merged commit fe28ee7 into develop Apr 1, 2024
13 checks passed

dkotter deleted the enhancement/728 branch April 1, 2024 14:00
