Automatic excerpt generation using OpenAI's ChatGPT API #405

dkotter · 2023-03-08T22:43:18Z

Description of the Change

This PR adds an integration with OpenAI as a new provider in the Language Processing service, specifically integrating with ChatGPT. The specific integration being added here is utilizing ChatGPT to provide a summary of a piece of content and then storing that summary in the excerpt field.

Setup

Setup with this provider only requires an API key. There's validation done on the settings page, anytime settings are saved, to verify if the API key is valid. This can give you one of three errors:

No API Key	Invalid API Key	Rate limit reached

If the API key is valid, you won't get an error message and you'll get a success message instead. You then have a few settings to change. The most important is turning on the Generate excerpt option. If a valid API key is added but this setting isn't on, no integration will happen. The Allowed roles setting allows you to control which user roles see the Generate excerpt button (and are allowed to access the REST API endpoint). The Excerpt length setting controls how many words the final excerpt will be. This defaults to the excerpt length that WordPress has set (that can be changed by the core excerpt_length filter). The Temperature value is one of the config options the API supports. There are other options there but I decided not to bring those over for now.

Excerpt integration

I debated a few different approaches on how to actually integrate with the excerpt generation. My priorities were the following:

Limit how often we hit the API since OpenAI charges by usage
Only try generating an excerpt once the content is (mostly) finalized. No sense in generating an excerpt on an in-progress post that doesn't have much content
Have the ability to see what the generated excerpt will be before publishing
Have the ability to add your own excerpt, modify the generated excerpt or remove the excerpt all together without those changes being overwritten

I initially was thinking of automatically generating an excerpt on save (draft, publish, ...) but this goes against point 1 and 2. I then considered automatically generating only on publish but that goes against 3 (and possibly 4). I eventually landed on the approach of adding a Generate excerpt button in the Excerpt panel that will send content to OpenAI when clicked and populate the excerpt with whatever value is returned.

This solves all the points above, as you are able to choose how often the API is hit and when the excerpt is generated (only when the button is pressed). And if you don't want to generate the excerpt, you don't have to. It does make the process more manual, as you have to click the button but I think that's a fine trade-off. I am open to other ideas on the best integration here though (I had considered adding something to the pre-publish panel but I don't think that's enough by itself. May be worth adding in addition to what else we have here).

| | |

API integration

When the Generate excerpt button is clicked, a request is made to a new REST endpoint (wp-json/classifai/v1/generate-excerpt/POST_ID). This endpoint verifies the current user has permission to edit the post, we are properly authenticated with OpenAI and the Generate excerpt setting is on.

Assuming all that passes, a new Tokenizer class has been added that will try and determine how many tokens the content has and how many tokens the final excerpt will be. The ChatGPT API has a limit of 4096 tokens per request and this includes both the data you send and the data that is sent back. Unfortunately tokens are equivalent to words or characters (roughly 4 characters is 1 token) but we do some basic calculations, erring on the side of being too aggressive, to ensure our request doesn't go over the limit.

A new APIRequest class has been added here as well (followed the approach in the Watson APIRequest class) to make it easier to integrate with the API, not only for this feature but for any other OpenAI features we may add in the future (the Tokenizer class should also be reusable for future integrations).

The request is sent and then the response is parsed and returned, whether we get a successful response including our excerpt or we get an error. If it's an error, that will be shown to the user. If success, we set the returned value as our excerpt.

Reviewer notes

I couldn't find a way to add a custom button to the core Excerpt panel so I removed that panel all together and replaced it with our own, copying most of the code from Gutenberg and adding in our custom handling
There's no WP-CLI integration in this PR. I think it makes sense to add that in a followup PR
I added new details about OpenAI into the readme files and also updated those in a few places to make it more clear that we have multiple Language Processing providers now
I added a new image to the readme as well and noticed the existing images were not optimized, so I ran those through an optimization step as well as reduced the dimensions on one image. I can revert that last change if we want that image to stay super large but currently displays weird in the readme
get_plugin_settings was updated to account for multiple providers instead of just always using the first provider. There are other places in the code that should be updated to account for this but I'm planning to tackle that in a followup PR

How to test the Change

A valid OpenAI API key is needed to fully test this feature. OpenAI does offer a free $5 credit for new users so if you haven't signed up before, you can sign up and get an API key (ping me in Slack if you want to use my API key for testing).

Log in to your OpenAI account and go to your API key section. Generate a new API key there and copy it
Go to ClassifAI > Language Processing > OpenAI and paste in your API key
Turn on the Generate excerpt setting. The other settings can be left default. Save changes and ensure no error message is shown
Create a new post, ensuring it has at least a few paragraphs of content
Open the Excerpt panel, ensure you can see the Generate excerpt button then click on that
Ensure an excerpt gets populated and no errors are shown
Can run through these same tests with no API key entered, an invalid API key entered and/or the Generate excerpt option is off, ensuring proper error messages are shown and functionality is removed

Changelog Entry

Added - Automatic excerpt generation using OpenAI's ChatGPT API

Credits

Props @dkotter, @jeffpaul, @zamanq

Checklist:

I agree to follow this project's Code of Conduct.
I have updated the documentation accordingly.
I have added tests to cover my change.
All new and existing tests pass.

…t. Trim content to be within the ChatGPT limits. Modify the settings helper function to work for multiple services. Add custom excerpt panel, replacing the core one

…on. Small style and text tweaks

…sable. Shorten test prompt. Fix display of last response

…king

dkotter · 2023-03-08T23:11:46Z

Note that E2E tests are failing here but they seem to have been failing for a bit (all recent PRs are failing as well). I'm going to look to see what needs fixed on those and tackle that separately from this PR

src/js/post-excerpt/panel.js

includes/Classifai/Providers/OpenAI/ChatGPT.php

jeffpaul · 2023-03-09T19:56:17Z

@fabiankaegy per Darin's comment of:

I couldn't find a way to add a custom button the core Excerpt panel so I removed that panel all together and replaced it with our own, copying most of the code from Gutenberg and adding in our custom handling

...are you aware of a way to add a button into that panel or is the approach here the best given the current state of the editor?

…ate to determine when this shows. This allows us to only show the panel if an excerpt was added prior to the panel showing.

…s. Don't load our custom JS if the current user role doesn't match

…lowed roles setting

jeffpaul · 2023-03-14T20:59:56Z

@iamdharmesh tagging you for code review here as this week you've got some OSS time, hoping to get this ready for release as expeditiously as we can (hoping to get 1-2 features released before Summit as feasible)

…ound our prompt and update the other filters we have

iamdharmesh

Thanks for adding this @dkotter. This looks amazing. 🎉

I just added 2-3 minor notes to discuss but otherwise, all looks great.

includes/Classifai/Helpers.php

includes/Classifai/Providers/OpenAI/APIRequest.php

includes/Classifai/Providers/OpenAI/ChatGPT.php

…void timeouts on slow requests.

…for handling excerpts. Minor code tweaks for consistency

…issing false

Excerpt Pre-publish Check

dkotter added 11 commits March 3, 2023 16:44

Add the beginnings of an OpenAI integration

a303429

Add additional settings

3ffce38

Add REST endpoint and initial round of code that runs on this endpoin…

5766ce6

…t. Trim content to be within the ChatGPT limits. Modify the settings helper function to work for multiple services. Add custom excerpt panel, replacing the core one

Store last response and use that in our debug data

f0777be

Don't override the excerpt panel if the excerpt setting isn't turned …

528ca50

…on. Small style and text tweaks

Move tokenizer code into it's own class and make it a little more reu…

fc49cc6

…sable. Shorten test prompt. Fix display of last response

Convert sentence length into text for a better prompt

0322f48

Add some filters around data, allowing easy modification if needed.

d38a11f

Test with a valid API key and make a few tweaks to get everything wor…

8561ece

…king

Modify prompt a bit to handle plurals correctly

1623bb1

Update readmes. Optimize images

4225571

dkotter self-assigned this Mar 8, 2023

dkotter requested review from jeffpaul and a team as code owners March 8, 2023 22:43

dkotter linked an issue Mar 8, 2023 that may be closed by this pull request

Auto-populate missing meta tags and descriptions #159

Closed

4 tasks

dkotter mentioned this pull request Mar 8, 2023

Refactor ClassifAI so that it's easier to add more Providers under a Service. #404

Closed

1 task

dkotter added 3 commits March 8, 2023 15:55

Fix eslint error

019f74c

Fix eslint error, maybe?

4bafa63

Fix eslint error, maybe?

e0f4a5e

jeffpaul added this to the 1.9.0 milestone Mar 9, 2023

ravinderk reviewed Mar 9, 2023

View reviewed changes

src/js/post-excerpt/panel.js Outdated Show resolved Hide resolved

ravinderk reviewed Mar 9, 2023

View reviewed changes

src/js/post-excerpt/panel.js Outdated Show resolved Hide resolved

ravinderk reviewed Mar 9, 2023

View reviewed changes

src/js/post-excerpt/panel.js Show resolved Hide resolved