Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Inference] Fix inference latency issue when weights/neff are separated #584

Merged
merged 36 commits into from
May 28, 2024

Commits on Apr 30, 2024

  1. Configuration menu
    Copy the full SHA
    5bc3516 View commit details
    Browse the repository at this point in the history

Commits on May 7, 2024

  1. Configuration menu
    Copy the full SHA
    469a58c View commit details
    Browse the repository at this point in the history
  2. add workaround

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    1e12227 View commit details
    Browse the repository at this point in the history
  3. fix

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    0c63ce8 View commit details
    Browse the repository at this point in the history
  4. fix style

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    7cc8288 View commit details
    Browse the repository at this point in the history
  5. remove comments

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    532b2a2 View commit details
    Browse the repository at this point in the history
  6. fix doc build

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    96144d8 View commit details
    Browse the repository at this point in the history
  7. fix doc build

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    7416320 View commit details
    Browse the repository at this point in the history
  8. fix doc build

    JingyaHuang committed May 7, 2024
    Configuration menu
    Copy the full SHA
    353858b View commit details
    Browse the repository at this point in the history

Commits on May 8, 2024

  1. Configuration menu
    Copy the full SHA
    604ba9a View commit details
    Browse the repository at this point in the history
  2. bump dev version

    JingyaHuang committed May 8, 2024
    Configuration menu
    Copy the full SHA
    04b2e14 View commit details
    Browse the repository at this point in the history
  3. lazy loading

    JingyaHuang committed May 8, 2024
    Configuration menu
    Copy the full SHA
    237e159 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    83acdfe View commit details
    Browse the repository at this point in the history
  5. fix?

    JingyaHuang committed May 8, 2024
    Configuration menu
    Copy the full SHA
    1a54150 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    45f2a4f View commit details
    Browse the repository at this point in the history

Commits on May 20, 2024

  1. Configuration menu
    Copy the full SHA
    d15d22d View commit details
    Browse the repository at this point in the history
  2. add docstring

    JingyaHuang committed May 20, 2024
    Configuration menu
    Copy the full SHA
    af35486 View commit details
    Browse the repository at this point in the history
  3. fix import

    JingyaHuang committed May 20, 2024
    Configuration menu
    Copy the full SHA
    b47967e View commit details
    Browse the repository at this point in the history
  4. fix tests

    JingyaHuang committed May 20, 2024
    Configuration menu
    Copy the full SHA
    052447e View commit details
    Browse the repository at this point in the history

Commits on May 21, 2024

  1. fix test

    JingyaHuang committed May 21, 2024
    Configuration menu
    Copy the full SHA
    2425138 View commit details
    Browse the repository at this point in the history
  2. fix test

    JingyaHuang committed May 21, 2024
    Configuration menu
    Copy the full SHA
    4f3377a View commit details
    Browse the repository at this point in the history
  3. fix for decoder as well

    JingyaHuang committed May 21, 2024
    Configuration menu
    Copy the full SHA
    1e924b0 View commit details
    Browse the repository at this point in the history

Commits on May 22, 2024

  1. try fix

    JingyaHuang committed May 22, 2024
    Configuration menu
    Copy the full SHA
    a9345d9 View commit details
    Browse the repository at this point in the history
  2. try fix

    JingyaHuang committed May 22, 2024
    Configuration menu
    Copy the full SHA
    00d1d5d View commit details
    Browse the repository at this point in the history
  3. try fix

    JingyaHuang committed May 22, 2024
    Configuration menu
    Copy the full SHA
    e211d41 View commit details
    Browse the repository at this point in the history
  4. try fix

    JingyaHuang committed May 22, 2024
    Configuration menu
    Copy the full SHA
    fda3303 View commit details
    Browse the repository at this point in the history
  5. try fix

    JingyaHuang committed May 22, 2024
    Configuration menu
    Copy the full SHA
    a17a3e8 View commit details
    Browse the repository at this point in the history
  6. try fix

    JingyaHuang committed May 22, 2024
    Configuration menu
    Copy the full SHA
    abf45ce View commit details
    Browse the repository at this point in the history

Commits on May 24, 2024

  1. Configuration menu
    Copy the full SHA
    c37c9d1 View commit details
    Browse the repository at this point in the history
  2. fix style

    JingyaHuang committed May 24, 2024
    Configuration menu
    Copy the full SHA
    00a48e8 View commit details
    Browse the repository at this point in the history
  3. fix typo

    JingyaHuang committed May 24, 2024
    Configuration menu
    Copy the full SHA
    8107d83 View commit details
    Browse the repository at this point in the history
  4. add back previous fix

    JingyaHuang committed May 24, 2024
    Configuration menu
    Copy the full SHA
    87c3902 View commit details
    Browse the repository at this point in the history

Commits on May 27, 2024

  1. Configuration menu
    Copy the full SHA
    0ad907a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1a91d1a View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    30c6736 View commit details
    Browse the repository at this point in the history
  4. for sdxl as well

    JingyaHuang committed May 27, 2024
    Configuration menu
    Copy the full SHA
    068cf90 View commit details
    Browse the repository at this point in the history