Embedded and mobile deep learning research notes.
- EfficientDNNs [Repo]
- Awesome ML Model Compression [Repo]
- TinyML Papers and Projects [Repo]
- TinyML Platforms Benchmarking [arXiv '21]
- TinyML: A Systematic Review and Synthesis of Existing Research [ICAIIC '21]
- TinyML Meets IoT: A Comprehensive Survey [Internet of Things '21]
- A review on TinyML: State-of-the-art and prospects [Journal of King Saud Univ. '21]
- TinyML Benchmark: Executing Fully Connected Neural Networks on Commodity Microcontrollers [IEEE '21]
- Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better [arXiv '21]
- Benchmarking TinyML Systems: Challenges and Direction [arXiv '20]
- Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey [IEEE '20]
- The Deep Learning Compiler: A Comprehensive Survey [arXiv '20]
- Recent Advances in Efficient Computation of Deep Convolutional Neural Networks [arXiv '18]
- A Survey of Model Compression and Acceleration for Deep Neural Networks [arXiv '17]
- EtinyNet: Extremely Tiny Network for TinyML [AAAI '22]
- MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning [NeurIPS '21, MIT]
- SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems [MLSys '20, IBM]
- Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets [NeurIPS '20, Huawei]
- MCUNet: Tiny Deep Learning on IoT Devices [NeurIPS '20, MIT]
- GhostNet: More Features from Cheap Operations [CVPR '20, Huawei]
- MicroNet for Efficient Language Modeling [NeurIPS '19, MIT]
- Searching for MobileNetV3 [ICCV '19, Google]
- MobileNetV2: Inverted Residuals and Linear Bottlenecks: Mobile Networks for Classification, Detection and Segmentation [CVPR '18, Google]
- ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware [arXiv '18, MIT]
- DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices [AAAI'18, Samsung]
- NASNet: Learning Transferable Architectures for Scalable Image Recognition [arXiv '17, Google]
- ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices [arXiv '17, Megvii]
- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications [arXiv '17, Google]
- CondenseNet: An Efficient DenseNet using Learned Group Convolutions [arXiv '17]
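Several of the mobile architectures above (MobileNets, MobileNetV2, ShuffleNet) are built around depthwise-separable convolutions, which factor a dense k×k convolution into a per-channel spatial filter plus a 1×1 pointwise channel mix. A minimal PyTorch sketch of the idea (layer sizes are illustrative, not taken from any of the papers):

```python
import torch
import torch.nn as nn

class DepthwiseSeparableConv(nn.Module):
    """A dense KxK conv factored into depthwise + pointwise (MobileNet-style)."""
    def __init__(self, c_in, c_out, k=3, stride=1):
        super().__init__()
        # Depthwise: one KxK filter per input channel (groups=c_in).
        self.depthwise = nn.Conv2d(c_in, c_in, k, stride=stride,
                                   padding=k // 2, groups=c_in, bias=False)
        # Pointwise: 1x1 conv mixes information across channels.
        self.pointwise = nn.Conv2d(c_in, c_out, 1, bias=False)
        self.bn = nn.BatchNorm2d(c_out)
        self.act = nn.ReLU6(inplace=True)

    def forward(self, x):
        return self.act(self.bn(self.pointwise(self.depthwise(x))))

x = torch.randn(1, 32, 56, 56)
y = DepthwiseSeparableConv(32, 64)(x)  # ~8x fewer MACs than a dense 3x3 conv here
```

For k=3 the factorization costs roughly 1/8 to 1/9 of the multiply-accumulates of the dense convolution, which is where most of the MobileNets speedup comes from.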
- BSC: Block-based Stochastic Computing to Enable Accurate and Efficient TinyML [ASP-DAC '22]
- CFU Playground: Full-Stack Open-Source Framework for Tiny Machine Learning (tinyML) Acceleration on FPGAs [arXiv '22, Google]
- UDC: Unified DNAS for Compressible TinyML Models [arXiv '22, Arm]
- AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator [arXiv '21, Arm]
- TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning [NeurIPS '20, MIT]
- Once for All: Train One Network and Specialize it for Efficient Deployment [ICLR '20, MIT]
- DeepMon: Mobile GPU-based Deep Learning Framework for Continuous Vision Applications [MobiSys '17]
- DeepEye: Resource Efficient Local Execution of Multiple Deep Vision Models using Wearable Commodity Hardware [MobiSys '17]
- MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU [EMDL '17]
- fpgaConvNet: A Toolflow for Mapping Diverse Convolutional Neural Networks on Embedded FPGAs [NIPS '17]
- DeepSense: A GPU-based deep convolutional neural network framework on commodity mobile devices [WearSys '16]
- DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices [IPSN '16]
- EIE: Efficient Inference Engine on Compressed Deep Neural Network [ISCA '16]
- MCDNN: An Approximation-Based Execution Framework for Deep Stream Processing Under Resource Constraints [MobiSys '16]
- DXTK: Enabling Resource-efficient Deep Learning on Mobile and Embedded Devices with the DeepX Toolkit [MobiCASE '16]
- Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference on Wearables [SenSys '16]
- An Early Resource Characterization of Deep Learning on Wearables, Smartphones and Internet-of-Things Devices [IoT-App '15]
- CNNdroid: GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android [MM '16]
- Quantizing deep convolutional networks for efficient inference: A whitepaper [arXiv '18]
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks [ECCV'18]
- Training and Inference with Integers in Deep Neural Networks [ICLR'18]
- The ZipML Framework for Training Models with End-to-End Low Precision: The Cans, the Cannots, and a Little Bit of Deep Learning [ICML'17]
- Loss-aware Binarization of Deep Networks [ICLR'17]
- Towards the Limit of Network Quantization [ICLR'17]
- Deep Learning with Low Precision by Half-wave Gaussian Quantization [CVPR'17]
- ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks [arXiv'17]
- Quantized Convolutional Neural Networks for Mobile Devices [CVPR '16]
- Fixed-Point Performance Analysis of Recurrent Neural Networks [ICASSP'16]
- Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations [arXiv'16]
- Compressing Deep Convolutional Networks using Vector Quantization [arXiv'14]
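Most of the quantization papers above build on uniform quantization: real-valued weights are mapped to low-bit integers plus a scale factor. A minimal NumPy sketch of symmetric per-tensor int8 weight quantization, the simplest scheme discussed in the whitepaper above (function names are mine, not from any paper):

```python
import numpy as np

def quantize_symmetric_int8(w):
    """Map float weights to int8 with one per-tensor scale: w ~= scale * q."""
    scale = np.abs(w).max() / 127.0          # largest magnitude maps to +/-127
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.random.randn(256, 128).astype(np.float32)
q, scale = quantize_symmetric_int8(w)
max_err = np.abs(w - dequantize(q, scale)).max()  # bounded by ~scale / 2
```

Per-channel scales, asymmetric (zero-point) variants, and quantization-aware training, all covered in the papers above, refine this basic recipe.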
- Awesome-Pruning [Repo]
- Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration [CVPR'19]
- To prune, or not to prune: exploring the efficacy of pruning for model compression [ICLR'18]
- Pruning Filters for Efficient ConvNets [ICLR'17]
- Pruning Convolutional Neural Networks for Resource Efficient Inference [ICLR'17]
- Soft Weight-Sharing for Neural Network Compression [ICLR'17]
- Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning [CVPR'17]
- ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression [ICCV'17]
- Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding [ICLR'16]
- Dynamic Network Surgery for Efficient DNNs [NIPS'16]
- Learning both Weights and Connections for Efficient Neural Networks [NIPS'15]
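The pruning papers above split roughly into unstructured pruning (zeroing individual weights, as in Learning both Weights and Connections) and structured pruning (removing whole filters, as in Pruning Filters for Efficient ConvNets). A minimal sketch of both using PyTorch's built-in pruning utilities (the toy model is illustrative):

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

model = nn.Sequential(nn.Conv2d(3, 16, 3), nn.ReLU(), nn.Conv2d(16, 32, 3))

# Unstructured: zero the 50% smallest-magnitude weights of the first conv.
prune.l1_unstructured(model[0], name="weight", amount=0.5)

# Structured: zero 25% of the second conv's output filters, ranked by L1 norm.
prune.ln_structured(model[2], name="weight", amount=0.25, n=1, dim=0)

# After fine-tuning, bake the masks into the weight tensors.
for m in (model[0], model[2]):
    prune.remove(m, "weight")
```

Note that unstructured sparsity only pays off with sparse kernels or compressed storage, while structured pruning shrinks the dense computation directly, a trade-off several of the papers above analyze.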
- High performance ultra-low-precision convolutions on mobile devices [NIPS'17]
- Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications [ICLR'16]
- Efficient and Accurate Approximations of Nonlinear Convolutional Networks [CVPR'15]
- Accelerating Very Deep Convolutional Networks for Classification and Detection (extended version of the above) [TPAMI '16]
- Convolutional neural networks with low-rank regularization [arXiv'15]
- Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation [NIPS'14]
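The approximation papers above exploit the (near) low-rank structure of trained weight matrices. The classic trick (cf. Exploiting Linear Structure, NIPS '14) replaces one m×n dense layer with two thinner layers of rank k via truncated SVD, cutting parameters and MACs from m·n to k·(m+n). A minimal NumPy sketch:

```python
import numpy as np

def low_rank_factorize(W, k):
    """Approximate W (m x n) as A @ B with A: m x k and B: k x n."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :k] * s[:k]   # absorb singular values into the left factor
    B = Vt[:k, :]
    return A, B

W = np.random.randn(1024, 1024).astype(np.float32)
A, B = low_rank_factorize(W, k=64)

x = np.random.randn(1024).astype(np.float32)
y_full = W @ x                 # ~1.05M MACs
y_approx = A @ (B @ x)         # ~131K MACs at rank 64
rel_err = np.linalg.norm(y_full - y_approx) / np.linalg.norm(y_full)
```

In practice the factorization is applied layer by layer and followed by fine-tuning to recover accuracy, as the papers above describe.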
- A First Look at Deep Learning Apps on Smartphones [WWW'19]
- Machine Learning at Facebook: Understanding Inference at the Edge [HPCA'19]
- NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications [ECCV '18]
- Latency and Throughput Characterization of Convolutional Neural Networks for Mobile Computer Vision [MMSys '18]
- Alibaba - MNN - is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba.
- Apple - CoreML - integrates machine learning models into your app (see BERT and GPT-2 on iPhone).
- Arm - ComputeLibrary - is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies (see the intro).
- Arm - Arm NN - is a high-performance machine learning (ML) inference engine for Android and Linux, accelerating ML on Arm Cortex-A CPUs and Arm Mali GPUs.
- Baidu - Paddle Lite - is a multi-platform, high-performance deep learning inference engine.
- DeepLearningKit - is an open-source deep learning framework for Apple's iOS, OS X, and tvOS.
- Edge Impulse - interactive platform to generate models that can run on microcontrollers. They are also quite active on social networks, discussing recent news on EdgeAI/TinyML.
- Google - TensorFlow Lite - is an open-source deep learning framework for on-device inference (see the conversion sketch after this list).
- Intel - OpenVINO - comprehensive toolkit for optimizing and deploying deep learning models for faster inference.
- JDAI Computer Vision - dabnn - is an accelerated binary neural networks inference framework for mobile platforms.
- Meta - PyTorch Mobile - is a new framework for helping mobile developers and machine learning engineers embed PyTorch ML models on-device.
- Microsoft - DeepSpeed - is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
- Microsoft - ELL - allows you to design and deploy intelligent machine-learned models onto resource constrained platforms and small single-board computers, like Raspberry Pi, Arduino, and micro:bit.
- Microsoft - ONNX Runtime - cross-platform, high-performance ML inference and training accelerator.
- Nvidia - TensorRT - is a C++ library for high-performance inference on NVIDIA GPUs and deep learning accelerators.
- OAID - Tengine - is a lightweight, high-performance, modular inference engine for embedded devices.
- Qualcomm - Neural Processing SDK for AI - libraries that help developers run NN models on Snapdragon mobile platforms, taking advantage of the CPU, GPU, and/or DSP.
- Tencent - ncnn - is a high-performance neural network inference framework optimized for the mobile platform.
- uTensor - AI inference library based on mbed (an RTOS for ARM chipsets) and TensorFlow.
- XiaoMi - Mace - is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
- xmartlabs - Bender - easily craft fast neural networks on iOS using TensorFlow models, with Metal under the hood.
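As a concrete example of the deployment flow most of these engines share (train, convert to a compact on-device format, run with a lightweight interpreter), here is a minimal TensorFlow Lite sketch; the toy model is illustrative, and Optimize.DEFAULT enables post-training quantization:

```python
import numpy as np
import tensorflow as tf

# Stand-in for your trained network.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(28, 28, 1)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10),
])

# Convert to a .tflite flatbuffer with default (dynamic-range) quantization.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

# On-device-style inference through the interpreter.
interpreter = tf.lite.Interpreter(model_content=tflite_model)
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]
interpreter.set_tensor(inp["index"], np.zeros((1, 28, 28, 1), np.float32))
interpreter.invoke()
logits = interpreter.get_tensor(out["index"])
```

The other engines above (Core ML, ncnn, MNN, ONNX Runtime, ...) differ in model format and API, but follow the same convert-then-run pattern.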
- Neural Network Distiller - Python package for neural network compression research.
- PocketFlow - An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
- RSTensorFlow - GPU Accelerated TensorFlow for Commodity Android Devices.
- mil-tokyo/webdnn - Fastest DNN Execution Framework on Web Browser.
- Caffe2 AICamera
- TensorFlow Android Camera Demo
- TensorFlow iOS Example
- TensorFlow OpenMV Camera Module
- Tiny-MLOps: a framework for orchestrating ML applications at the far edge of IoT systems [EAIS '22]
- MLOps for TinyML: Challenges & Directions in Operationalizing TinyML at Scale [TinyML Talks '22]
- TinyMLOps: Operational Challenges for Widespread Edge AI Adoption [arXiv '22]
- A TinyMLaaS Ecosystem for Machine Learning in IoT: Overview and Research Challenges [VLSI-DAT '21]
- SOLIS: The MLOps journey from data acquisition to actionable insights [arXiv '21]
- Edge MLOps: An Automation Framework for AIoT Applications [IC2E '21]
- SensiX++: Bringing MLOPs and Multi-tenant Model Serving to Sensory Edge Devices [arXiv '21, Nokia]
- Squeezing Deep Learning Into Mobile Phones
- Deep Learning – Tutorial and Recent Trends
- Tutorial on Hardware Architectures for Deep Neural Networks
- Efficient Convolutional Neural Network Inference on Mobile GPUs
- ARM® Mali™ GPU OpenCL Developer Guide (PDF)
- Optimal Compute on ARM Mali™ GPUs
- GPU Compute for Mobile Devices
- Compute for Mobile Devices: performance focused
- Hands On OpenCL
- Adreno OpenCL Programming Guide
- Better OpenCL Performance on Qualcomm Adreno GPU
- Bifrost GPU architecture and ARM Mali-G71 GPU
- Midgard GPU Architecture, ARM Mali-T880 GPU
- Mobile GPU market share
- [Adreno] csarron/qcom_vendor_binaries: Common Proprietary Qualcomm Binaries
- [Mali] Fevax/vendor_samsung_hero2ltexx: Blobs from s7 Edge G935F
- EfficientDNNs by @MingSun-Tse
- Awesome ML Model Compression by @cedrickchee
- Awesome Pruning by @he-y
- Model Compression by @j-marple-dev
- awesome-AutoML-and-Lightweight-Models by @guan-yuan
- knowledge-distillation-papers by @lhyfst
- Awesome-model-compression-and-acceleration by @memoiry
- Embedded Neural Network by @ZhishengWang