-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix kv_cache_type issue #2219
Fix kv_cache_type issue #2219
Conversation
Fix kv_cache_type issue
@Barry-Delaney ^^ mind helping take a look? Seems to help resolve some issue here #1930 Thank you! |
Thanks for the fix. We'll merge you changes into internal code base. |
Closing this out as it's been merged. |
Hi @qingquansong , thanks a lot for the contribution! Your changes will be included in the next main branch update, and we'll mark you as co-author. Please also note that, the Python benchmark is not suggested to be used and will soon be deprecated. Please take a look at the on-going support to a benchmarking suite, as well as the C++ benchmark for the support to the latest features.
@hchings To clarify in case there are going to be confusion - the changes are merged in the internal repo, but not updated to the external GitHub repo yet. For future cases I would suggest to only close the PR after we pushed the main branch update that includes the changes. Please let me know if there are any questions, thanks! Thanks again for your support! |
Sound great. Thank you! Besides the C++ throughput API benchmarking, I'm currently also switching to using the hlapi and benchmarking it with this perf evaluator script, is the a suggested one to use? It contains both latency and throughput results which is quite nice to use and the only thing that I'm modifying to add is the concurrency part (which I'm planning to use the Poisson request). Not sure if you think that's a good feature to add here . |
@qingquansong Thanks a lot for your attention on those details! The |
Hello @qingquansong. I was in charge of the publishing for this week. |
Hello @qingquansong, we credited you in the last push to main .Thank you for the contribution again! |
Fix
kv_cache_type
issue related to #1930Details described: #1930 (comment)