Skip to content
This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

Rs/marlin downstream v0.3.2 #43

Merged

Conversation

robertgshaw2-redhat
Copy link
Collaborator

No description provided.

afeldman-nm and others added 30 commits February 22, 2024 17:29
…anch safe_expose_semi_structured_sparse_tensor
…size by running multiple parallel problems of size 64. (2) Refactor the workspace to be dynamic per layer
cleanup to undo autoformatting
Copy link
Member

@andy-neuma andy-neuma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cool

tests/models/test_marlin.py Outdated Show resolved Hide resolved
tests/models/test_marlin.py Outdated Show resolved Hide resolved
vllm/config.py Outdated Show resolved Hide resolved
@robertgshaw2-redhat robertgshaw2-redhat merged commit 4b44479 into rs/bump-main-to-v0.3.2 Feb 22, 2024
@robertgshaw2-redhat robertgshaw2-redhat deleted the rs/marlin-downstream-v0.3.2 branch February 22, 2024 22:46
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants