Transformers Agents #23214

sgugger · 2023-05-08T17:53:20Z

Introducing Transformers Agents

This PR adds a new API called Transformers Agents. Agents let you use Transformers with no coding experience by talking to Transformers or Diffusers directly in natural language. The API is built on two concepts, agents and tools: the agent is an LLM prompted to generate code that calls the tools, and each tool is a simple function that performs a single task.

Tools can live in Transformers or on the Hub; this PR introduces both. You can read more about this in the added documentation, but here is an example:

Define an agent using the starcoder model:

from transformers import HfAgent

agent = HfAgent("https://api-inference.huggingface.co/models/bigcode/starcoder")

Use the command run to have the agent solve a given problem:

agent.run("Draw me a picture of rivers and lakes")

Use the command chat to chat with the agent and execute instructions one after the other:

agent.chat("Draw me a picture of rivers and lakes")

agent.chat("Transform the picture so that there is a rock in there")
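The agent/tool split described above can be illustrated without any model. The following is only a toy sketch: the tools, `build_prompt`, `fake_llm`, and `run` are hypothetical names made up for this illustration and are not the API added by the PR. The LLM call is replaced by a canned answer, so the code shows the flow (describe tools in a prompt, get code back, run it with only the tools in scope) and nothing more.

```python
from typing import Callable, Dict


# Hypothetical tools: each one is a plain function that performs a single task.
def text_upper(text: str) -> str:
    """Return the text in upper case."""
    return text.upper()


def text_reverse(text: str) -> str:
    """Return the text reversed."""
    return text[::-1]


TOOLS: Dict[str, Callable] = {"text_upper": text_upper, "text_reverse": text_reverse}


def build_prompt(task: str, tools: Dict[str, Callable]) -> str:
    """List every tool with its description, then state the task."""
    lines = ["You can call the following tools:"]
    for name, fn in tools.items():
        lines.append(f"- {name}: {fn.__doc__}")
    lines.append(f"Task: {task}")
    lines.append("Answer with Python code that uses the tools.")
    return "\n".join(lines)


def fake_llm(prompt: str) -> str:
    """Stand-in for the LLM call: returns canned code instead of querying a model."""
    return 'result = text_reverse(text_upper("rivers and lakes"))'


def run(task: str, tools: Dict[str, Callable] = TOOLS) -> object:
    """Ask the (fake) LLM for code, execute it with only the tools in scope, return `result`."""
    code = fake_llm(build_prompt(task, tools))
    # Emptying the builtins is NOT a real sandbox; it only keeps this toy example small.
    namespace = {"__builtins__": {}, **tools}
    exec(code, namespace)
    return namespace["result"]


print(run("Shout the phrase backwards"))
```

In a real agent the canned string would come from a model, which is why the quality of the tool descriptions in the prompt matters so much.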

patrickvonplaten

Hope the review helps a bit, mostly nits / suggestions

docs/source/en/custom_tools.mdx

patrickvonplaten · 2023-05-09T19:13:15Z

docs/source/en/transformers_agents.mdx

+For demonstration purposes and so that this can be used with all setups, we have created remote executors for several 
+of the default tools the agent has access to. These are created using 
+[inference endpoints](https://huggingface.co/inference-endpoints). To see how to setup remote executors tools yourself,
+we recommend reading the custom tool guide [TODO LINK].


Here is still a TODO - not 100% sure what should go there

src/transformers/tools/image_question_answering.py

src/transformers/tools/image_segmentation.py

src/transformers/utils/__init__.py

src/transformers/utils/import_utils.py


amyeroberts

🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥

Super exciting PR!

🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥 🔥

I ignored the TODOs dotted around as I know it's still being iterated on. Just left a few nits here and there, but overall a super impressive piece of work and one I can't wait to start playing with!

src/transformers/tools/text_to_speech.py

src/transformers/tools/text_summarization.py

src/transformers/tools/python_interpreter.py

docs/source/en/custom_tools.mdx

src/transformers/tools/agents.py

src/transformers/tools/README.md

amyeroberts · 2023-05-09T21:03:45Z

src/transformers/tools/base.py

+TOOL_CONFIG_FILE = "tool_config.json"
+
+
+def get_repo_type(repo_id, repo_type=None, **hub_kwargs):


Bit of a sneaky name as it's downloading in the background :D

Yeah but get_repo_type_by_downloading_tool_config is a bit long ;-p
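To make the "downloading in the background" remark concrete, here is a guess at the pattern being discussed. The body of `get_repo_type` is not shown in this thread, so everything below is an assumption: the function name `guess_repo_type`, the `RepoNotFound` error, the order in which repo types are probed, and the injected `download` callable (used here so the sketch runs without network access).

```python
from typing import Callable, Optional

TOOL_CONFIG_FILE = "tool_config.json"


class RepoNotFound(Exception):
    """Hypothetical error raised by the downloader when the repo does not exist."""


def guess_repo_type(
    repo_id: str,
    download: Callable[[str, str, str], str],
    repo_type: Optional[str] = None,
) -> str:
    """Return repo_type if given, else probe each candidate by downloading the tool config."""
    if repo_type is not None:
        return repo_type
    # Assumed probing order; the real function may differ.
    for candidate in ("space", "model"):
        try:
            download(repo_id, TOOL_CONFIG_FILE, candidate)
            return candidate
        except RepoNotFound:
            continue
    raise EnvironmentError(f"{repo_id} has no {TOOL_CONFIG_FILE} as a space or a model.")


def fake_download(repo_id: str, filename: str, repo_type: str) -> str:
    """Offline stand-in for a Hub download: only one (repo, type) pair exists."""
    known = {("user/my-tool", "model")}
    if (repo_id, repo_type) not in known:
        raise RepoNotFound(repo_id)
    return f"/tmp/{repo_id}/{filename}"


print(guess_repo_type("user/my-tool", fake_download))
```

The side effect is the point of the review comment: a function whose name starts with get ends up fetching a file as its way of answering the question.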

amyeroberts · 2023-05-09T21:09:11Z

src/transformers/tools/base.py

+                raise ValueError("This tool does not implement a default checkpoint, you need to pass one.")
+            model = self.default_checkpoint
+        if pre_processor is None:
+            pre_processor = model


Does this mean we do a double forward pass if there's no preprocessor?

No, unless the user passed an instantiated model (which we could check for in further development). Normally, at this stage, model is a checkpoint name.
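The fallback in the hunk above can be reproduced in isolation. This is a minimal sketch assuming `model` and `pre_processor` are still plain checkpoint names at this point, as the reply says; `SketchTool`, `CaptionTool`, and the checkpoint string are made up for the illustration and are not classes from the PR.

```python
class SketchTool:
    """Hypothetical stand-in for a pipeline-style tool; not the class from the PR."""

    default_checkpoint = None

    def __init__(self, model=None, pre_processor=None):
        if model is None:
            if self.default_checkpoint is None:
                raise ValueError("This tool does not implement a default checkpoint, you need to pass one.")
            model = self.default_checkpoint
        if pre_processor is None:
            # Only the checkpoint *name* is reused here; nothing is loaded or run yet.
            pre_processor = model
        self.model = model
        self.pre_processor = pre_processor


class CaptionTool(SketchTool):
    default_checkpoint = "org/some-captioning-checkpoint"  # made-up name


tool = CaptionTool()
print(tool.model, tool.pre_processor)
```

If a caller passed an already instantiated model object instead of a name, `pre_processor` would point at that same object, which is the case the reply suggests checking for later.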

src/transformers/tools/evaluate_agent.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove nestedness in tool config * Really do it * Use remote tools descriptions * Work * Clean up eval * Changes * Tools * Tools * tool * Fix everything * Use last result/assign for evaluation * Prompt * Remove hardcoded selection * Evaluation for chat agents * correct some spelling * Small fixes * Change summarization model (#23172) * Fix link displayed * Update description of the tool * Fixes in chat prompt * Custom tools, custom prompt * Tool clean up * save_pretrained and push_to_hub for tool * Fix init * Tests * Fix tests * Tool save/from_hub/push_to_hub and tool->load_tool * Clean push_to_hub and add app file * Custom inference API for endpoints too * Clean up * old remote tool and new remote tool * Make a requirements * return_code adds tool creation * Avoid redundancy between global variables * Remote tools can be loaded * Tests * Text summarization tests * Quality * Properly mark tests * Test the python interpreter * And the CI shall be green. * fix loading of additional tools * Work on RemoteTool and fix tests * General clean up * Guard imports * Fix tools * docs: Fix broken link in 'How to add a model...' 
(#23216) fix link * Get default endpoint from the Hub * Add guide * Simplify tool config * Docs * Some fixes * Docs * Docs * Docs * Fix code returned by agent * Try this * Match args with signature in remote tool * Should fix python interpreter for Python 3.8 * Fix push_to_hub for tools * Other fixes to push_to_hub * Add API doc page * Docs * Docs * Custom tools * Pin tensorflow-probability (#23220) * Pin tensorflow-probability * [all-test] * [all-test] Fix syntax for bash * PoC for some chaining API * Text to speech * J'ai pris des libertés * Rename * Basic python interpreter * Add agents * Quality * Add translation tool * temp * GenQA + LID + S2T * Quality + word missing in translation * Add open assistance, support f-strings in evaluate * captioning + s2t fixes * Style * Refactor descriptions and remove chain * Support errors and rename OpenAssistantAgent * Add setup * Deal with typos + example of inference API * Some rename + README * Fixes * Update prompt * Unwanted change * Make sure everyone has a default * One prompt to rule them all. 
* SD * Description * Clean up remote tools * More remote tools * Add option to return code and update doc * Image segmentation * ControlNet * Gradio demo * Diffusers protection * Lib protection * ControlNet description * Cleanup * Style * Remove accelerate and try to be reproducible * No randomness * Male Basic optional in token * Clean description * Better prompts * Fix args eval in interpreter * Add tool wrapper * Tool on the Hub * Style post-rebase * Big refactor of descriptions, batch generation and evaluation for agents * Make problems easier - interface to debug * More problems, add python primitives * Back to one prompt * Remove dict for translation * Be consistent * Add prompts * New version of the agent * Evaluate new agents * New endpoints agents * Make all tools a dict variable * Typo * Add problems * Add to big prompt * Harmonize * Add tools * New evaluation * Add more tools * Build prompt with tools descriptions * Tools on the Hub * Let's chat! * Cleanup * Temporary bs4 safeguard * Cache agents and clean up * Blank init * Fix evaluation for agents * New format for tools on the Hub * Add method to reset state * Remove nestedness in tool config * Really do it * Use remote tools descriptions * Work * Clean up eval * Changes * Tools * Tools * tool * Fix everything * Use last result/assign for evaluation * Prompt * Remove hardcoded selection * Evaluation for chat agents * correct some spelling * Small fixes * Change summarization model (#23172) * Fix link displayed * Update description of the tool * Fixes in chat prompt * Custom tools, custom prompt * Tool clean up * save_pretrained and push_to_hub for tool * Fix init * Tests * Fix tests * Tool save/from_hub/push_to_hub and tool->load_tool * Clean push_to_hub and add app file * Custom inference API for endpoints too * Clean up * old remote tool and new remote tool * Make a requirements * return_code adds tool creation * Avoid redundancy between global variables * Remote tools can be loaded * Tests * Text 
summarization tests * Quality * Properly mark tests * Test the python interpreter * And the CI shall be green. * Work on RemoteTool and fix tests * fix loading of additional tools * General clean up * Guard imports * Fix tools * Get default endpoint from the Hub * Simplify tool config * Add guide * Docs * Some fixes * Docs * Docs * Fix code returned by agent * Try this * Docs * Match args with signature in remote tool * Should fix python interpreter for Python 3.8 * Fix push_to_hub for tools * Other fixes to push_to_hub * Add API doc page * Fixes * Doc fixes * Docs * Fix audio * Custom tools * Audio fix * Improve custom tools docstring * Docstrings * Trigger CI * Mode docstrings * More docstrings * Improve custom tools * Fix for remote tools * Style * Fix repo consistency * Quality * Tip * Cleanup on doc * Cleanup toc * Add disclaimer for starcoder vs openai * Remove disclaimer * Small fixed in the prompts * 4.29 * Update src/transformers/tools/agents.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Complete documentation * Small fixes * Agent evaluation * Note about gradio-tools & LC * Clean up agents and prompt * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Note about gradio-tools & LC * Add copyrights and address review comments * Quality * Add all language codes * Add remote tool tests * Move custom prompts to other docs * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * TTS tests * Quality --------- Co-authored-by: Lysandre <hi@lyand.re> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com> Co-authored-by: Connor Henderson <connor.henderson@talkiatry.com> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre 
<lysandre@huggingface.co> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

ltm920716 · 2023-05-10T13:09:56Z

picture = agent.run("Draw me a picture of rivers and lakes")
==Explanation from the agent==
I will use the following tool: image_segmenter to generate a segmentation mask for the image.

==Code generated by the agent==
prompt = "rivers and lakes"
mask = image_segmenter(image, prompt)

==Result==
Evaluation of the code stopped at line 1 before the end because of the following error:
The variable image is not defined.

sgugger · 2023-05-10T13:27:15Z

Ah yes, we did that example with OpenAI. We will fine-tune the prompt so that the example works before the release. Thanks for the pointer!
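For anyone curious why the run above stops instead of crashing, here is a rough sketch of how a restricted interpreter can produce that kind of message. It is not the python_interpreter.py from the PR: `evaluate`, `_eval`, and `InterpreterError` are names chosen for this illustration, only assignments, constants, variable lookups, and tool calls are handled, and the zero-based statement index in the message is an assumption made to match the log.

```python
import ast


class InterpreterError(ValueError):
    """Raised when the generated code does something the sketch does not allow."""


def evaluate(code: str, tools: dict, state: dict = None):
    """Run the code statement by statement; return the last value or an error message."""
    state = {} if state is None else state
    result = None
    for idx, node in enumerate(ast.parse(code).body):
        try:
            result = _eval(node, tools, state)
        except InterpreterError as error:
            return (
                f"Evaluation of the code stopped at line {idx} before the end "
                f"because of the following error:\n{error}"
            )
    return result


def _eval(node, tools, state):
    if isinstance(node, ast.Assign):
        value = _eval(node.value, tools, state)
        for target in node.targets:
            if not isinstance(target, ast.Name):
                raise InterpreterError("Only simple assignments are supported.")
            state[target.id] = value
        return value
    if isinstance(node, ast.Expr):
        return _eval(node.value, tools, state)
    if isinstance(node, ast.Constant):
        return node.value
    if isinstance(node, ast.Name):
        if node.id in state:
            return state[node.id]
        raise InterpreterError(f"The variable {node.id} is not defined.")
    if isinstance(node, ast.Call):
        # Only functions registered as tools may be called.
        if not isinstance(node.func, ast.Name) or node.func.id not in tools:
            raise InterpreterError("Only calls to known tools are allowed.")
        if any(keyword.arg is None for keyword in node.keywords):
            raise InterpreterError("Unpacking with ** is not supported.")
        args = [_eval(arg, tools, state) for arg in node.args]
        kwargs = {keyword.arg: _eval(keyword.value, tools, state) for keyword in node.keywords}
        return tools[node.func.id](*args, **kwargs)
    raise InterpreterError(f"{type(node).__name__} is not supported.")


generated = 'prompt = "rivers and lakes"\nmask = image_segmenter(image, prompt)'
fake_tools = {"image_segmenter": lambda image, prompt: f"mask of {prompt}"}
print(evaluate(generated, fake_tools))
```

Passing `state={"image": ...}` makes the same code succeed, which shows the failure comes from the model picking a tool that needs an input the user never supplied, not from the interpreter itself.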


sgugger and others added 30 commits May 8, 2023 10:29

Work

1a2f127

Clean up eval

ac508c8

Changes

1ee146b

Tools

08b97bd

Tools

e072f4f

tool

ef7d989

Fix everything

30a39d8

Use last result/assign for evaluation

2ef47fb

Prompt

5a212fc

Remove hardcoded selection

d558620

Evaluation for chat agents

a56a008

correct some spelling

0ddbb67

Small fixes

d3d0958

Change summarization model (#23172)

c548087

Fix link displayed

9ba0342

Update description of the tool

b89e1c1

Fixes in chat prompt

d621762

Custom tools, custom prompt

d17f559

Tool clean up

8b2a900

save_pretrained and push_to_hub for tool

78d178f

Fix init

df0b4e0

Tests

ff01bc3

Fix tests

36f2e55

Tool save/from_hub/push_to_hub and tool->load_tool

4670626

Clean push_to_hub and add app file

b4d0a2c

Custom inference API for endpoints too

4705ab9

Clean up

0032ad2

old remote tool and new remote tool

9fd81ed

Make a requirements

f60032d

return_code adds tool creation

44e71f6

Agent evaluation

f48f7b3

patrickvonplaten reviewed May 9, 2023


LysandreJik and others added 11 commits May 9, 2023 15:55

Note about gradio-tools & LC

6acc104

Clean up agents and prompt

f7eb64a

Merge branch 'test_composition' of github.com:huggingface/transformer…

51e6758

…s into test_composition

Apply suggestions from code review

34da8dd

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Apply suggestions from code review

059d68c

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Note about gradio-tools & LC

e25d386

Add copyrights and address review comments

700c39a

Merge branch 'test_composition' of github.com:huggingface/transformer…

a7d9b56

…s into test_composition

Quality

c53168d

Add all language codes

66d09cf

Add remote tool tests

f1f833e

amyeroberts approved these changes May 9, 2023


patrickvonplaten and others added 7 commits May 10, 2023 00:09

Move custom prompts to other docs

dbe81d8

Apply suggestions from code review

4c355a6

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

TTS tests

2ea8339

Quality

2d55901

Address review comments

c5439f1

Quality

ec9d168

Last fix

4038274

sgugger merged commit 3335724 into main May 10, 2023

sgugger deleted the test_composition branch May 10, 2023 00:37

sgugger changed the title ~~Test composition~~ Transformers Agents May 10, 2023

apbard mentioned this pull request May 11, 2023

Better check for packages availability #23163

Merged
