Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API #5012

whiskyboy · 2023-05-20T02:37:15Z

Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API: achieve some multimodal capabilities

This PR adds a toolkit named AzureCognitiveServicesToolkit which bundles the following tools:

AzureCogsImageAnalysisTool: calls Azure Cognitive Services image analysis API to extract caption, objects, tags, and text from images.
AzureCogsFormRecognizerTool: calls Azure Cognitive Services form recognizer API to extract text, tables, and key-value pairs from documents.
AzureCogsSpeech2TextTool: calls Azure Cognitive Services speech to text API to transcribe speech to text.
AzureCogsText2SpeechTool: calls Azure Cognitive Services text to speech API to synthesize text to speech.

This toolkit can be used to process image, document, and audio inputs.

@hwchase17 and @vowelparrot Would be glad to hear your thoughts!

hwchase17

this looks really solid! some lint errors, but we can help fix those up if you dont get to it

thanks for this!! really excited about this

whiskyboy · 2023-05-21T06:52:35Z

Thanks @hwchase17 for reviewing it!

I just did the reformatting and linting, hope it could pass the checks now.

@hwchase17 @vowelparrot Could you help to approve triggering the workflows? Thanks!

langchain/tools/azure_cognitive_services/form_recognizer.py

dev2049

couple small comments but overall looks great!

langchain/tools/azure_cognitive_services/image_analysis.py

#5012) # Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API: achieve some multimodal capabilities This PR adds a toolkit named AzureCognitiveServicesToolkit which bundles the following tools: - AzureCogsImageAnalysisTool: calls Azure Cognitive Services image analysis API to extract caption, objects, tags, and text from images. - AzureCogsFormRecognizerTool: calls Azure Cognitive Services form recognizer API to extract text, tables, and key-value pairs from documents. - AzureCogsSpeech2TextTool: calls Azure Cognitive Services speech to text API to transcribe speech to text. - AzureCogsText2SpeechTool: calls Azure Cognitive Services text to speech API to synthesize text to speech. This toolkit can be used to process image, document, and audio inputs. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>

whiskyboy and others added 14 commits May 19, 2023 08:24

add AzureCogsImageAnalysisTool

79ba6ad

add AzureCogsFormRecognizerTool

b58930e

add AzureCogsSpeech2TextTool

288a2db

add AzureCogsText2SpeechTool

9ff7270

use key and region to construct speech config

b569be6

add AzureCognitiveServicesToolkit

806c73e

update __init__.py for AzureCognitiveServicesToolkit

1728cf2

add docs

86df679

check text2speech exceptions

29438cf

clear audio output

4776bb3

add ut

fed2e05

add dependencies

ade20d8

typo

f5e34ae

Merge branch 'master' into master

fa6df7f

hwchase17 reviewed May 21, 2023

View reviewed changes

update poetry.lock

6a06142

whiskyboy and others added 5 commits May 22, 2023 00:36

Merge branch 'master' into master

194bd70

update poetry.lock

c509ecd

reformat and lint

8cbcac0

Merge branch 'master' into master

2978dd3

update poetry.lock

1d87758

dev2049 added 03 enhancement Enhancement of existing functionality lgtm PR looks good. Use to confirm that a PR is ready for merging. labels May 22, 2023

dev2049 reviewed May 22, 2023

View reviewed changes

langchain/tools/azure_cognitive_services/form_recognizer.py Outdated Show resolved Hide resolved

dev2049 reviewed May 22, 2023

View reviewed changes

langchain/tools/azure_cognitive_services/image_analysis.py Outdated Show resolved Hide resolved

whiskyboy and others added 3 commits May 23, 2023 02:27

remove storing api key

4d08cdf

Merge branch 'master' into master

8349628

update poetry.lock

cbc640d

dev2049 reviewed May 23, 2023

View reviewed changes

langchain/tools/azure_cognitive_services/image_analysis.py Show resolved Hide resolved

whiskyboy and others added 4 commits May 23, 2023 03:42

add os check for AzureCogsImageAnalysisTool

d52272c

Merge branch 'master' into master

1769d85

update poetry.lock

8771d5f

nit

bb60ede

dev2049 merged commit d7f807b into langchain-ai:master May 23, 2023

danielchalef mentioned this pull request Jun 5, 2023

Zep Hybrid Search #5742

Merged

This was referenced Jun 25, 2023

Zep Authentication #6725

Closed

Zep Authentication #6728

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API #5012

Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API #5012

whiskyboy commented May 20, 2023

hwchase17 left a comment

whiskyboy commented May 21, 2023 •

edited

Loading

dev2049 left a comment

Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API #5012

Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API #5012

Conversation

whiskyboy commented May 20, 2023

Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API: achieve some multimodal capabilities

hwchase17 left a comment

Choose a reason for hiding this comment

whiskyboy commented May 21, 2023 • edited Loading

dev2049 left a comment

Choose a reason for hiding this comment

whiskyboy commented May 21, 2023 •

edited

Loading