Skip to content
@wenxcs-msft

wenxcs-msft

Popular repositories Loading

  1. flash-attention-rocm flash-attention-rocm Public archive

    Forked from ROCm/flash-attention

    Fast and memory-efficient exact attention

    C++ 1

  2. tvm.dx tvm.dx Public archive

    TVM with DirectX support

    Python 1

  3. DirectX-Headers DirectX-Headers Public archive

    Forked from microsoft/DirectX-Headers

    Official DirectX headers available under an open source license

    C

  4. DirectXShaderCompiler DirectXShaderCompiler Public archive

    Forked from microsoft/DirectXShaderCompiler

    This repo hosts the source for the DirectX Shader Compiler which is based on LLVM/Clang.

    C++

  5. dxpy dxpy Public archive

    DirectX Python Runtime

    C++

  6. deort deort Public archive

    Python

Repositories

Showing 10 of 12 repositories
  • vllm-xx Public Forked from xiaoxiawu-microsoft/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    wenxcs-msft/vllm-xx’s past year of commit activity
    Python 0 Apache-2.0 4,900 0 8 Updated Sep 26, 2024
  • vllm-pr Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    wenxcs-msft/vllm-pr’s past year of commit activity
    Python 0 Apache-2.0 4,900 0 0 Updated Jul 10, 2024
  • wenxcs-msft/foundation-model-stack’s past year of commit activity
    Python 0 Apache-2.0 58 0 0 Updated May 10, 2024
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    wenxcs-msft/vllm’s past year of commit activity
    Python 0 Apache-2.0 4,900 0 0 Updated May 8, 2024
  • WizardLM Public archive Forked from nlpxucan/WizardLM

    WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions

    wenxcs-msft/WizardLM’s past year of commit activity
    Python 0 769 0 0 Updated May 12, 2023
  • azure-openai-proxy Public archive Forked from diemus/azure-openai-proxy

    A proxy for Azure OpenAI API that can convert an OpenAI request into an Azure OpenAI request.

    wenxcs-msft/azure-openai-proxy’s past year of commit activity
    Go 0 MIT 69 0 0 Updated Apr 7, 2023
  • flash-attention-rocm Public archive Forked from ROCm/flash-attention

    Fast and memory-efficient exact attention

    wenxcs-msft/flash-attention-rocm’s past year of commit activity
    C++ 1 BSD-3-Clause 1,371 0 0 Updated Feb 25, 2023
  • tvm.dx Public archive

    TVM with DirectX support

    wenxcs-msft/tvm.dx’s past year of commit activity
    Python 0 Apache-2.0 1 0 1 Updated Dec 12, 2022
  • deort Public archive
    wenxcs-msft/deort’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Sep 24, 2022
  • dxpy Public archive

    DirectX Python Runtime

    wenxcs-msft/dxpy’s past year of commit activity
    C++ 0 MIT 0 0 0 Updated Apr 3, 2022

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…