You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1. I have searched related issues but cannot get the expected help.
2. The bug has not been fixed in the latest version.
3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
Checklist
Describe the bug
您好:我遇到了这样的问题:我希望在第一个流式推理满足一定条件时提前终止(比如图中的tokens>500),紧接着进行第二个问题的流式推理,但是我发现尽管break退出了第一个循环,第一个问题的推理仍在继续(占用了第二个问题的计算资源)
请问是否提供了相应的接口来终止第一个问题的请求而不影响第二个问题的推理?
![Image](https://private-user-images.githubusercontent.com/125688164/408463746-c3578db0-a2e6-4819-8532-44c254830acb.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkxMjQyMDIsIm5iZiI6MTczOTEyMzkwMiwicGF0aCI6Ii8xMjU2ODgxNjQvNDA4NDYzNzQ2LWMzNTc4ZGIwLWEyZTYtNDgxOS04NTMyLTQ0YzI1NDgzMGFjYi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjA5JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIwOVQxNzU4MjJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03NzE1NzNkNWMwYTZhMDhiY2VmZGNlNTI2ZjdiNTU0N2Q3MTcwOWVmYTFkM2IxZTE5ZDg1NWNmMzRmMmYyNzhkJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.oFcxwkYLsJkHVrH_y-wbrx5pZzH0z8aHtsPN_liA8m0)
Reproduction
codes above
Environment
Error traceback
The text was updated successfully, but these errors were encountered: