habana_main rebase#81
Merged
kzawora-intel merged 537 commits into habana_main from private/kzawora/rebase_v3 on Jul 2, 2024
+64,037 / -19,646
Commits
This pull request is big! We're only showing the most recent 250 commits.
Commits on Jun 13, 2024
- 10 commits (titles not shown)
Commits on Jun 14, 2024
- [CI/Build][Misc] Add CI that benchmarks vllm performance on those PRs with perf-benchmarks label (vllm-project#5073)
- 14 other commits (titles not shown)
Commits on Jun 15, 2024
- 6 commits (titles not shown)
Commits on Jun 17, 2024
- 11 commits (titles not shown)
Commits on Jun 18, 2024
- [Speculative Decoding 1/2] Add typical acceptance sampling as one of the sampling techniques in the verifier (vllm-project#5131)
- [Misc] Add channel-wise quantization support for w8a8 dynamic per token activation quantization (vllm-project#5542)
- [Bugfix] Fix for inconsistent behaviour related to sampling and repetition penalties (vllm-project#5639)
- 14 other commits (titles not shown)
Commits on Jun 19, 2024
- [Bugfix][CI/Build][AMD][ROCm] Fixed the cmake build bug which generate garbage on certain devices (vllm-project#5641)
- [Frontend][Bugfix] Fix preemption_mode -> preemption-mode for CLI arg in arg_utils.py (vllm-project#5688)
- [Misc] Add per channel support for static activation quantization; update w8a8 schemes to share base classes (vllm-project#5650)
- 13 other commits (titles not shown)
Commits on Jun 20, 2024
- [Frontend] Add FlexibleArgumentParser to support both underscore and dash in names (vllm-project#5718)
- 4 other commits (titles not shown)
Commits on Jun 21, 2024
- 8 commits (titles not shown)
Commits on Jun 22, 2024
- [Misc] Remove vllm-project#4789 workaround left in vllm/entrypoints/openai/run_batch.py (vllm-project#5756)
- 4 other commits (titles not shown)
Commits on Jun 23, 2024
- 1 commit (title not shown)
Commits on Jun 24, 2024
- 8 commits (titles not shown)
Commits on Jun 25, 2024
- [Speculative Decoding] Support draft model on different tensor-parallel size than target model (vllm-project#5414)
- [Hardware][AMD][CI/Build][Doc] Upgrade to ROCm 6.1, Dockerfile improvements, test fixes (vllm-project#5422)
- 18 other commits (titles not shown)
Commits on Jun 26, 2024
- [Core] Refactor Worker and ModelRunner to consolidate control plane communication (vllm-project#5408)
- 11 other commits (titles not shown)
Commits on Jun 27, 2024
- 17 commits (titles not shown)
Commits on Jun 28, 2024
- [VLM][BugFix] Make sure that multi_modal_kwargs can broadcast properly with ring buffer. (vllm-project#5905)
- [Bugfix] Better error message for MLPSpeculator when num_speculative_tokens is set too high (vllm-project#5894)
- [Misc] Remove fp8_shard_indexer from Col/Row Parallel Linear (Simplify Weight Loading) (vllm-project#5928)
- 10 other commits (titles not shown)
Commits on Jun 29, 2024
- 10 commits (titles not shown)
Commits on Jun 30, 2024
Commits on Jul 1, 2024
- [Speculative Decoding 2/2] Integrate typical acceptance sampler into Spec Decode Worker (vllm-project#5348)
- 26 other commits (titles not shown)
Commits on Jul 2, 2024
- 6 commits (titles not shown)