efficient-model

Here are 35 public repositories matching this topic...

mit-han-lab / temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

acceleration low-latency video-understanding tsm nvidia-jetson-nano efficient-model temporal-modeling

Updated Jul 11, 2024
Python

mit-han-lab / once-for-all

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

acceleration nas automl edge-ai efficient-model tinyml

Updated Dec 14, 2023
Python

mit-han-lab / proxylessnas

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

acceleration automl specialization efficient-model on-device-ai hardware-aware

Updated Aug 30, 2024
C++

mit-han-lab / amc

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

automl model-compression channel-pruning automl-for-compression efficient-model on-device-ai

Updated Nov 22, 2023
Python

mit-han-lab / haq

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

quantization automl mixed-precision efficient-model

Updated Feb 26, 2021
Python

microsoft / nn-Meter

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

python machine-learning deep-neural-networks deep-learning latency inference pytorch tensorflow-models edge-computing neural-architecture-search edge-ai efficient-model onnx-models

Updated Jul 30, 2024
Python

mit-han-lab / hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

natural-language-processing machine-translation transformer specialization efficient-model hardware-aware

Updated Jul 14, 2024
Python

SqueezeAILab / KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

natural-language-processing compression text-generation transformer llama quantization mistral model-compression efficient-inference efficient-model large-language-models llm small-models localllm localllama

Updated Aug 13, 2024
Python

amirgholami / ZeroQ

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

compression quantization quantized-neural-networks efficient-model efficient-neural-networks

Updated Dec 8, 2023
Python

kssteven418 / I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

natural-language-processing transformer quantization bert model-compression efficient-model efficient-neural-networks

Updated Jan 29, 2023
Python

mit-han-lab / amc-models

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

automl model-compression efficient-model on-device-ai

Updated Feb 26, 2021
Python

youngwanLEE / VoV3D

Efficient 3D Backbone Network for Temporal Modeling

video-understanding vovnet efficient-model temporal-modeling backbone-networks 3d-cnn-architecture vov3d

Updated Apr 20, 2021
Python

d-li14 / HBONet

[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2

pytorch imagenet pretrained-models mobilenetv2 efficient-model iccv2019

Updated Apr 30, 2020
Python

kssteven418 / LTP

[KDD'22] Learned Token Pruning for Transformers

natural-language-processing transformer pruning bert model-compression efficient-model efficient-neural-networks

Updated Feb 27, 2023
Python

szq0214 / S2-BNN

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

binary-neural-networks contrastive-loss self-supervised-learning efficient-model contrastive-learning distillation-loss

Updated Aug 18, 2021
Python

SHI-Labs / Any-Precision-DNNs

Any-Precision Deep Neural Networks (AAAI 2021)

on-demand efficient-model any-precision

Updated May 2, 2020
Python

xvyaward / owq

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

quantization efficient-model large-language-models llm

Updated Mar 7, 2024
Python

mit-han-lab / neurips-micronet

[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

natural-language-processing language-modeling pruning quantization knowledge-distillation efficient-model

Updated Feb 26, 2021
Jupyter Notebook

tiangexiang / BiX-NAS

[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation

segmentation semantic-segmentation miccai neural-architecture-search efficient-model miccai-2021

Updated Sep 12, 2022
Python

lironui / ABCNet

The semantic segmentation of remote sensing images

real-time uav remote-sensing segmentation semantic-segmentation potsdam efficient-model uavid isprs vaihingen

Updated Sep 29, 2023
Python
