bug: Adaptative batching with max_batch_size=1 crashes the API #4856

Closed
bruno-hays opened this issue Jul 11, 2024 · 0 comments · Fixed by #4895
Labels
bug Something isn't working

Comments

@bruno-hays

Describe the bug

With this decorator on my function:

    @bentoml.api(batchable=True,
                 max_batch_size=1,
                 max_latency_ms=3600000)

I get this cryptic error:

  File "/home/gcpuser/sky_workdir/whisperapi/venv/lib/python3.11/site-packages/bentoml/_internal/utils/metrics.py", line 44, in exponential_buckets
    assert start < end
           ^^^^^^^^^^^
AssertionError

Now, using this decorator instead:

    @bentoml.api(batchable=True,
                 max_batch_size=2,
                 max_latency_ms=3600000)

And it works.
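The traceback is consistent with the metrics helper receiving a collapsed bucket range when `max_batch_size=1`. Below is a minimal sketch (a hypothetical reconstruction, not BentoML's actual `exponential_buckets` implementation) showing how such an assertion can fire when the batch-size range degenerates to `[1, 1]`:

```python
def exponential_buckets(start: float, factor: float, end: float) -> tuple:
    """Hypothetical sketch of an exponential histogram-bucket helper:
    yields start, start*factor, start*factor**2, ... capped by end."""
    assert start < end  # <- the AssertionError from the traceback
    buckets = []
    value = start
    while value < end:
        buckets.append(value)
        value *= factor
    return tuple(buckets) + (end,)

# If the buckets are derived from the range [1, max_batch_size],
# then max_batch_size=1 gives start == end and the assertion fires:
try:
    exponential_buckets(1, 2.0, 1)
except AssertionError:
    print("AssertionError: start must be < end")

# With max_batch_size=2 the range is [1, 2] and bucket creation succeeds:
print(exponential_buckets(1, 2.0, 2))  # → (1, 2)
```

If this reading is right, the helper (or its caller) would need to special-case `start == end` rather than assert, which would explain why `max_batch_size=2` works.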

Amusingly, this happens when I run my API on Debian, but I am not able to reproduce the error on my Mac M1.

To reproduce

No response

Expected behavior

No response

Environment

bentoml==1.2.19
python==3.11.9

@bruno-hays bruno-hays added the bug Something isn't working label Jul 11, 2024