TGI stability fixes #226

Merged Sep 14, 2023 (2 commits)

@dacorvo (Collaborator) commented Sep 12, 2023

Two small fixes to improve Inf2 TGI stability.

The first one disables the optimum-neuron logs, because they are not displayed by the CLI and seem to lock the server at some point.

With that change applied, I verified that the TGI server no longer collapses under heavy load.
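For illustration only, here is a minimal sketch of silencing a library's logs with Python's standard `logging` module. The PR description does not say how the logs are disabled, so this is not necessarily what the commit does; the logger name `optimum.neuron` and the helper `silence_logger` are assumptions made for the example.

```python
import logging

# Assumption: the library emits its records under this logger name.
LOGGER_NAME = "optimum.neuron"


def silence_logger(name: str) -> logging.Logger:
    """Suppress every record emitted under `name` and its child loggers."""
    logger = logging.getLogger(name)
    # A threshold above CRITICAL filters out all standard levels. Child
    # loggers that do not set their own level inherit this threshold.
    logger.setLevel(logging.CRITICAL + 1)
    # Do not forward anything to the root logger's handlers.
    logger.propagate = False
    return logger


silenced = silence_logger(LOGGER_NAME)
# This record is dropped before any handler sees it.
silenced.error("this message is not emitted")
```

Raising the threshold on the parent logger also covers child loggers such as `optimum.neuron.something`, provided they do not set a level of their own.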

The second change is more of a cleanup: it avoids keeping a reference to the global input tensor in each assigned slot.
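The idea behind this cleanup can be sketched in plain Python, without any tensor library. The `Batch` and `Slot` classes below are hypothetical stand-ins, not the actual server classes: each slot copies only the data it needs instead of storing the shared batch object, so the batch can be released once the assignment is done.

```python
import gc
import weakref


class Batch:
    """Hypothetical stand-in for the global input tensor of a whole batch."""

    def __init__(self, rows):
        self.rows = rows


class Slot:
    """Hypothetical per-request slot that keeps only its own data."""

    def __init__(self, slot_id):
        self.id = slot_id
        self.tokens = []

    def assign(self, batch, index):
        # Copy this slot's row instead of storing `batch` itself, so the
        # slot does not keep the whole batch alive.
        self.tokens = list(batch.rows[index])


batch = Batch([[1, 2, 3], [4, 5, 6]])
batch_ref = weakref.ref(batch)  # lets us observe when the batch is freed

slots = [Slot(i) for i in range(2)]
for i, slot in enumerate(slots):
    slot.assign(batch, i)

# Drop the only strong reference; the slots hold copies, not the batch.
del batch
gc.collect()
batch_released = batch_ref() is None
```

Had `assign` stored `batch` on the slot, the weak reference would still resolve after `del batch`, because every assigned slot would keep the whole batch alive.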

@dacorvo dacorvo marked this pull request as ready for review September 12, 2023 15:42
@dacorvo dacorvo merged commit 1c4afc8 into main Sep 14, 2023
@dacorvo dacorvo deleted the tgi_fixes branch September 14, 2023 07:41