Draft: Support execution limits in `run_` functions #374

sydney-runkle · 2024-12-17T15:05:08Z

Fix #70
Should also fix #267

TODO:

docs, once we verify this is the API we want (decide on name for this dataclass)
tests, same blocker as above
Add execution_limit_settings to RunContext
Confirm correct behavior for streaming - how to deal with StreamedRunResult?

There's a part of me that's tempted to call this AgentSettings or something, though I think that's misleading bc it still only takes effect on a run call, not across multiple agent run calls...

cloudflare-workers-and-pages · 2024-12-17T15:05:20Z

Deploying pydantic-ai with Cloudflare Pages

Latest commit:	`bc08546`
Status:	✅ Deploy successful!
Preview URL:	https://3853196a.pydantic-ai.pages.dev
Branch Preview URL:	https://cost-and-msg-limits.pydantic-ai.pages.dev

View logs

sydney-runkle · 2024-12-17T16:48:32Z

API here is changing, this is not up to date.

sydney-runkle · 2024-12-17T19:23:15Z

pydantic_ai_slim/pydantic_ai/settings.py

+    _request_count: int = 0
+    _request_tokens_count: int = 0
+    _response_tokens_count: int = 0
+    _total_tokens_count: int = 0


Should these be public?

I would say yes if we want to also include this structure in RunContext...

I don't like the idea that the settings object also holds state, it feels to me like there should be a separate object for tracking state, and we can check the state against the settings. If I were a user I'd be inclined to reuse an instance of ExecutionLimitSettings which obviously will cause issues.

I would imagine we make a private type _UsageState or similar (which holds all the fields you are talking about here), and have one of ExecutionLimits and _UsageState have a method that accepts the other and raises an error if appropriate.

If we want to put the usage state on the runcontext we can make it public, but I feel like we can do that later/separately. I'll note that I could imagine Samuel disagreeing with all this, and I wouldn't find that unreasonable.

sydney-runkle · 2024-12-17T19:24:20Z

pydantic_ai_slim/pydantic_ai/agent.py

            model_settings = merge_model_settings(self.model_settings, model_settings)
+            execution_limit_settings = execution_limit_settings or ExecutionLimitSettings(request_limit=50)


Is this where we want to set the default?

dmontagu · 2024-12-17T20:58:45Z

pydantic_ai_slim/pydantic_ai/agent.py

@@ -191,6 +191,7 @@ async def run(
        model: models.Model | models.KnownModelName | None = None,
        deps: AgentDeps = None,
        model_settings: ModelSettings | None = None,
+        execution_limit_settings: ExecutionLimitSettings | None = None,


Suggested change

execution_limit_settings: ExecutionLimitSettings | None = None,

execution_limits: ExecutionLimits | None = None,

this would be my preference

dmontagu · 2024-12-17T20:59:41Z

pydantic_ai_slim/pydantic_ai/settings.py

+
+    def _check_limit(self, limit: int | None, count: int, limit_name: str) -> None:
+        if limit and limit < count:
+            raise UnexpectedModelBehavior(f'Exceeded {limit_name} limit of {limit} by {count - limit}')


I feel like this deserves its own exception, and probably one that doesn't inherit from UnexpectedModelBehavior (as this is more or less expected behavior)

dmontagu · 2024-12-17T21:06:30Z

pydantic_ai_slim/pydantic_ai/agent.py

@@ -254,6 +256,8 @@ async def run(

                messages.append(model_response)
                cost += request_cost
+                # TODO: is this the right location? Should we move this earlier in the logic?
+                execution_limit_settings.increment(request_cost)


I personally would prefer if we added a request_count field to the Cost type, and then just did execution_limit_settings.validate(cost) here (rather than incrementing both cost and the limits).

I'd also prefer we rename Cost to Usage or similar, since that's really what it's representing now, and would make it feel less weird to add the request_count field. But even if we don't rename it like that, I think it's reasonable to add request_count: int (or requests: int) as a field on the type currently known as Cost

API

39bd780

initial API idea

ccc9fd9

sydney-runkle changed the title ~~Draft: Support message_limit and token_limit params in run_ functions~~ Draft: Support execution limits in run_ functions Dec 17, 2024

sydney-runkle commented Dec 17, 2024

View reviewed changes

dmontagu reviewed Dec 17, 2024

View reviewed changes

Merge branch 'main' into cost-and-msg-limits

bc08546

dmontagu mentioned this pull request Dec 18, 2024

Add support for usage limits #409

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draft: Support execution limits in `run_` functions #374

Draft: Support execution limits in `run_` functions #374

sydney-runkle commented Dec 17, 2024 •

edited

Loading

cloudflare-workers-and-pages bot commented Dec 17, 2024 •

edited

Loading

sydney-runkle commented Dec 17, 2024

sydney-runkle Dec 17, 2024

sydney-runkle Dec 17, 2024

dmontagu Dec 17, 2024 •

edited

Loading

dmontagu Dec 17, 2024

sydney-runkle Dec 17, 2024

dmontagu Dec 17, 2024

dmontagu Dec 17, 2024

dmontagu Dec 17, 2024

		model_settings = merge_model_settings(self.model_settings, model_settings)
		execution_limit_settings = execution_limit_settings or ExecutionLimitSettings(request_limit=50)

	execution_limit_settings: ExecutionLimitSettings \| None = None,
	execution_limits: ExecutionLimits \| None = None,

Draft: Support execution limits in run_ functions #374

Are you sure you want to change the base?

Draft: Support execution limits in run_ functions #374

Conversation

sydney-runkle commented Dec 17, 2024 • edited Loading

cloudflare-workers-and-pages bot commented Dec 17, 2024 • edited Loading

Deploying pydantic-ai with Cloudflare Pages

sydney-runkle commented Dec 17, 2024

sydney-runkle Dec 17, 2024

Choose a reason for hiding this comment

sydney-runkle Dec 17, 2024

Choose a reason for hiding this comment

dmontagu Dec 17, 2024 • edited Loading

Choose a reason for hiding this comment

dmontagu Dec 17, 2024

Choose a reason for hiding this comment

sydney-runkle Dec 17, 2024

Choose a reason for hiding this comment

dmontagu Dec 17, 2024

Choose a reason for hiding this comment

dmontagu Dec 17, 2024

Choose a reason for hiding this comment

dmontagu Dec 17, 2024

Choose a reason for hiding this comment

Draft: Support execution limits in `run_` functions #374

Draft: Support execution limits in `run_` functions #374

sydney-runkle commented Dec 17, 2024 •

edited

Loading

cloudflare-workers-and-pages bot commented Dec 17, 2024 •

edited

Loading

dmontagu Dec 17, 2024 •

edited

Loading