Update 2023-04-23: Added Arm support for the HSplit operation (Vulkan and CUDA are not yet supported).
Update 2023-02-23: Added the HSplit layer for horizontal partitioning.
This repository provides a unified interface for specifying a CNN model (with Open Neural Network Exchange (ONNX) support), the model partitioning, and the target edge devices. Inside the CNN inference library, we integrate hybrid OpenMP and MPI to exploit parallelism both among and within the edge devices (i.e., multi-core execution within each device).
With different Mapping Specifications, users can quickly and flexibly change the CNN model partitioning and the mapping of partitions onto the resources of edge devices. The hardware configurations in the automated code-generation step can also be modified to adapt to user requirements targeting other heterogeneous edge platforms.
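As an illustration, a Mapping Specification pairs each device (and its compute resource) with the set of CNN layers assigned to it. The sketch below is hypothetical; the actual schema used by AutoDiCE is defined in the tutorial, and all device and layer names here are made up for illustration:

{
    "node01_cpu": ["conv1", "pool1", "conv2"],
    "node02_gpu": ["pool2", "fc1", "fc2", "softmax"]
}

Changing the partitioning or the device assignment is then just a matter of editing this specification and regenerating the code.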
Thanks to Nihui's and atanmarko's NCNN, which provides a cross-platform inference-engine library supporting GPU acceleration via, e.g., the Vulkan and CUDA APIs. We extend NCNN with OpenMP + MPI to support multi-node inference and the distribution of the most commonly used CNN networks over multiple devices/nodes at the edge.
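To illustrate the hybrid parallelism model (a minimal sketch, not AutoDiCE's actual source code): each edge device runs one MPI process, and OpenMP spreads that process's work across the device's cores.

// hybrid.cpp -- minimal MPI + OpenMP sketch of the execution model
#include <mpi.h>
#include <omp.h>
#include <cstdio>

int main(int argc, char** argv) {
    int provided;
    // Request thread support, since OpenMP threads run inside each MPI rank.
    MPI_Init_thread(&argc, &argv, MPI_THREAD_FUNNELED, &provided);

    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    // In AutoDiCE terms: each rank (device) would execute its CNN partition
    // here, with OpenMP parallelizing the per-device work across cores.
    #pragma omp parallel
    {
        printf("device (rank) %d of %d, thread %d of %d\n",
               rank, size, omp_get_thread_num(), omp_get_num_threads());
    }

    MPI_Finalize();
    return 0;
}

Compile with mpicxx -fopenmp hybrid.cpp -o hybrid and launch one process per device, e.g., mpirun -np 2 ./hybrid.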
ncnn is a high-performance neural-network inference computing framework optimized for mobile platforms. ncnn has been designed with deployment and use on mobile phones in mind from the beginning. It has no third-party dependencies, is cross-platform, and runs faster than all known open-source frameworks on mobile-phone CPUs. Developers can easily deploy deep-learning models to mobile platforms using the efficient ncnn implementation, creating intelligent apps and bringing artificial intelligence to your fingertips. ncnn is currently used in many Tencent applications, such as QQ, Qzone, WeChat, and Pitu.
MPI is a standard interface that implementations follow; as a result, there is a wide variety of MPI implementations, such as OpenMPI and MPICH. We use MPI for multi-node communication because of its outstanding performance. Users are free to use any implementation they wish; the only constraint when installing MPI is to ensure that the MPI version is consistent on every device.
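For example, the following small sketch (assuming an MPI-3 implementation) prints each node's MPI library version so that consistency across devices can be verified; MPI_Get_library_version is part of the MPI standard:

// check_mpi.cpp -- print the MPI library version on every rank
#include <mpi.h>
#include <cstdio>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, len;
    char version[MPI_MAX_LIBRARY_VERSION_STRING];
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Get_library_version(version, &len);
    printf("rank %d: %s\n", rank, version);  // versions should match across nodes
    MPI_Finalize();
    return 0;
}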
You can install AutoDiCE by following the instructions below. We also provide an installation script that automates these instructions.
# install cmake (version 3.20 or newer)
# install Opencv & protobuf & vulkan (if needed)
sudo apt install build-essential git libprotobuf-dev protobuf-compiler libvulkan-dev vulkan-utils libopencv-dev
cd AutoDiCE && mkdir -p build && cd build
### Laptop with GPUs…
cmake -DNCNN_VULKAN=OFF -DNCNN_CUDA=ON -DLOG_LAYERS=ON -DNCNN_MPI=ON -DCMAKE_CUDA_ARCHITECTURES=75 -DNCNN_BUILD_BENCHMARK=OFF -DNCNN_BUILD_EXAMPLES=ON ..
### NVIDIA Jetson NANO/TX2/NX series
cmake -DNCNN_VULKAN=ON -DNCNN_CUDA=OFF -DLOG_LAYERS=ON -DCMAKE_TOOLCHAIN_FILE=../toolchains/aarch64-linux-gnu.toolchain.cmake -DNCNN_OPENMP=OFF ..
### CPU-only Machine
cmake -DNCNN_VULKAN=OFF -DNCNN_CUDA=OFF -DNCNN_MPI=ON -DNCNN_BUILD_BENCHMARK=OFF -DNCNN_BUILD_EXAMPLES=ON ..
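After configuring with one of the CMake invocations above, building follows the usual CMake workflow (the parallelism level below is just an example):

# build with all available cores
make -j$(nproc)
make install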
Please check our step-by-step Tutorial.
If you use these models in your research, please cite:
@article{guo2022autodice,
title={AutoDiCE: Fully Automated Distributed CNN Inference at the Edge},
author={Guo, Xiaotian and Pimentel, Andy D and Stefanov, Todor},
journal={arXiv preprint arXiv:2207.12113},
year={2022}
}
Please also cite Nihui's NCNN if your work involves the ARM implementation of NCNN.
@online{ncnn,
author = {Nihui and Tencent},
title = {NCNN},
year = {2017},
publisher = {GitHub},
journal = {GitHub repository},
url = {https://github.com/Tencent/ncnn},
}