moe

Star

Here are 134 public repositories matching this topic...

hiyouga / LLaMA-Factory

Star

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Updated Feb 4, 2025
Python

sgl-project / sglang

Star

SGLang is a fast serving framework for large language models and vision language models.

cuda inference pytorch transformer moe llama vlm llm llm-serving llava llama2 deepseek-llm deepseek llama3 llama3-1 deepseek-v3

Updated Feb 4, 2025
Python

An unofficial https://bgm.tv ui first app client for Android and iOS, built with React Native. 一个无广告、以爱好为驱动、不以盈利为目的、专门做 ACG 的类似豆瓣的追番记录，bgm.tv 第三方客户端。为移动端重新设计，内置大量加强的网页端难以实现的功能，且提供了相当的自定义选项。目前已适配 iOS / Android / WSA、mobile / 简单 pad、light / dark theme、移动端网页。

react android ios design react-native mobx ios-app moe bangumi android-app expo

Updated Jan 27, 2025
TypeScript

PKU-YuanGroup / MoE-LLaVA

Star

Mixture-of-Experts for Large Vision-Language Models

moe multi-modal mixture-of-experts large-vision-language-model

Updated Dec 3, 2024
Python

davidmrau / mixture-of-experts

Star

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

pytorch moe re-implementation mixture-of-experts sparsely-gated-mixture-of-experts

Updated Apr 19, 2024
Python

pjlab-sys4nlp / llama-moe

Star

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

moe llama mixture-of-experts llm continual-pre-training expert-partition

Updated Dec 6, 2024
Python

sail-sg / Adan

Star

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Updated Jul 2, 2024
Python

open-compass / MixtralKit

Star

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

moe mistral llm

Updated Dec 15, 2023
Python

microsoft / Tutel

Star

Tutel MoE: An Optimized Mixture-of-Experts Implementation

nlp pytorch transformer moe mixture-of-experts

Updated Jan 18, 2025
Python

ymcui / Chinese-Mixtral

Star

中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）

nlp moe 64k mixture-of-experts 32k large-language-models llm mixtral

Updated Apr 30, 2024
Python

mindspore-courses / step_into_llm

Star

MindSpore online courses: Step into LLM

Updated Jan 6, 2025
Jupyter Notebook

kokororin / pixiv.moe

Star

😘 A pinterest-style layout site, shows illusts on pixiv.net order by popularity.

react redux website typescript comic comics lovelive webapp moe pixiv illust illusts

Updated Mar 8, 2023
TypeScript

LISTEN-moe / android-app

Star

Official LISTEN.moe Android app

android kotlin music music-player anime jpop japan moe kpop android-auto

Updated Feb 2, 2025
Kotlin

inferflow / inferflow

Star

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

bloom falcon moe gemma mistral mixture-of-experts model-quantization multi-gpu-inference m2m100 llamacpp llm-inference internlm llama2 qwen baichuan2 mixtral phi-2 deepseek minicpm

Updated Mar 15, 2024
C++

libgdx / gdx-pay

Star

A libGDX cross-platform API for InApp purchasing.

android java ios libgdx moe robovm iap in-app-purchase multi-os-engine gdx-pay

Updated Jan 2, 2025
Java

IBM / ModuleFormer

Star

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.

lm moe

Updated Apr 10, 2024
Python

SkyworkAI / MoH

Star

MoH: Multi-Head Attention as Mixture-of-Head Attention

transformer moe attention vit dit mixture-of-experts llms

Updated Oct 29, 2024
Python

SkyworkAI / MoE-plus-plus

Star

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

moe mixture-of-experts large-language-models llms

Updated Oct 16, 2024
Python

shalldie / chuncai

Star

A lovely Page Wizard, is responsible for selling moe.

moe chuncai

Updated Jul 13, 2018
TypeScript

LISTEN-moe / desktop-app

Star

Official LISTEN.moe Desktop Client

music windows macos linux client app anime jpop desktop listen moe

Updated Nov 10, 2022
Vue

Improve this page

Add a description, image, and links to the moe topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the moe topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

moe

Here are 134 public repositories matching this topic...

hiyouga / LLaMA-Factory

sgl-project / sglang

czy0729 / Bangumi

PKU-YuanGroup / MoE-LLaVA

davidmrau / mixture-of-experts

pjlab-sys4nlp / llama-moe

sail-sg / Adan

open-compass / MixtralKit

microsoft / Tutel

ymcui / Chinese-Mixtral

mindspore-courses / step_into_llm

kokororin / pixiv.moe

LISTEN-moe / android-app

inferflow / inferflow

libgdx / gdx-pay

IBM / ModuleFormer

SkyworkAI / MoH

SkyworkAI / MoE-plus-plus

shalldie / chuncai

LISTEN-moe / desktop-app

Improve this page

Add this topic to your repo