You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I have a question regarding the termination condition for each batch.
longspec_benchmark.py terminates a batch if a single instance in the batch reaches desired amount or produces eos token.
But shouldn't the system wait for all other instances to finish?
Or is this a technique used commonly to estimate performance of speculatvie decoding?
The text was updated successfully, but these errors were encountered:
Hi, I have a question regarding the termination condition for each batch.
longspec_benchmark.py terminates a batch if a single instance in the batch reaches desired amount or produces eos token.
But shouldn't the system wait for all other instances to finish?
Or is this a technique used commonly to estimate performance of speculatvie decoding?
The text was updated successfully, but these errors were encountered: