Skip to content

Latest commit

 

History

History
351 lines (265 loc) · 15.1 KB

overview.md

File metadata and controls

351 lines (265 loc) · 15.1 KB

NPU software stack

 ──[User─space]──────────────────────────────────────────────────────────────────────────────
                                                                                             
     ┌──────────────────────────(optional)─┐   ┌────────────────────────────────────────┐    
     │ User Mode Driver tests              │   │ OpenVINO + NPU plugin                  │    
     │                                     │   │                                        │    
     │         intel-validation-npu        │   │            openvino_toolkit            │    
     │            (npu-umd-test)           │   │                                        │
     │                                     │   │                                        │    
     └──────────────────▲──────────────────┘   └────────────────────▲───────────────────┘    
                        ╚════════════════════╦══════════════════════╝                        
 ............................................║.............................................. 
     ┌───────────────────────────────────────▼──────────────────────────────────────────┐    
     │ oneAPI Level-Zero                 level-zero                                     │    
     │                                (libze_loader.so)                                 │    
     │                                                                                  │    
     └───────────────────────────────────────▲──────────────────────────────────────────┘    
                        ╔════════════════════╝                                               
     ┌──────────────────▼──────────────────┐   ┌────────────────────────────────────────┐    
     │ User Mode Driver                    │   │ NPU compiler                           │    
     │                                     │   │                                        │    
     │        intel-level-zero-npu         ◀═══▶     intel-driver-compiler-npu         │    
     │        (libze_intel_vpu.so)         │   │     (libnpu_driver_compiler.so)        │    
     │                                     │   │                                        │    
     └──────────────────▲──────────────────┘   └────────────────────────────────────────┘    
                        ╚════════════════════╗                                               
 ──[Kernel─space]────────────────────────────║───────────────────────────────────────────────
                         ┌───────────────────▼─────────────────────┐                         
                         │ Kernel Mode Driver                      │                         
                         │            intel-npu-driver             │                         
                         │             (intel_vpu.ko)              │                         
                         │                                         │                         
                         └───────────────────▲─────────────────────┘                         
 ──[Device]──────────────────────────────────║────────────────────────────────────────────── 
                         ┌───────────────────▼─────────────────────┐                         
                         │ Firmware                                │                         
                         │             intel-fw-npu                │                         
                         │                                         │                         
                         └─────────────────────────────────────────┘                         

Changelog

zeMutableCommandList extension implementation (from v1.6.0)

Implemented zeMutableCommandList extension from Level Zero API in v1.6.0 driver release. This feature allows to change arguments of inference recorded in zeCommandList by zeGraph->appendGraphExecute call. Using zeMutableCommandList user can easily exchange argument of Graph Execute without recording new command.

Ref.: https://spec.oneapi.io/level-zero/latest/core/EXT_Exp_MutableCommandList.html#mutable-command-list

Model driver caching (from v1.5.0)

Starting from v1.5.0 release the driver allows to cache compiled model in filesystem. Whenever user compiles a model (using zeGraphCreate), the driver saves the compiled model representation in the user directory ~/.cache/ze_intel_npu_cache/ or in the directory specified in ZE_INTEL_NPU_CACHE_DIR environment variable. The cache directory size is controlled by the driver. Whenever the cache directory exceeds 1GB the least used compiled models are removed to save the filesystem space.

Environment variable Description
ZE_INTEL_NPU_CACHE_DIR= The cache path. Set it to empty ("") to disable driver cache
ZE_INTEL_NPU_CACHE_SIZE= The cache directory size. Whenever the cached files exceeds the size, some cached files are removed using the least recently used strategy
Kernel module functional tests - npu-kmd-test (from v1.5.0)

Introduced npu-kmd-test in v1.5.0 release that can be used for intel_vpu module validation. npu-kmd-test is built together with the driver and can be found in <build-dir>/bin

Support for Arrow Lake and Lunar Lake CPUs (from v1.5.0)

In v1.5.0 release introduced support for Arrow Lake and Lunar Lake family processors.

Log mask for granular control over logging (from v1.5.0)

In v1.5.0 release there is new environment variable ZE_INTEL_NPU_LOGMASK that allows the user to enable specific log groups in driver. The ZE_INTEL_NPU_LOGLEVEL variable is used to enable logging. The available log level has been reduced to ERROR, WARNING and INFO. Whenever INFO is set, the ZE_INTEL_NPU_LOGMASK allows to target specific log group. The log group are listed in umd/vpu_driver/source/utilities/log.hpp

# Set log level to INFO
export ZE_INTEL_NPU_LOGLEVEL=INFO

# Set log mask to only print from DEVICE, DRIVER and CACHE groups
export ZE_INTEL_NPU_LOGMASK=$((1<<4|1<<3|1<<17))

Supported devices

The Intel® NPU driver supports following Intel® processor families

Platform Kernel support NPU driver release
Meteor Lake v6.5.0-35-generic - Ubuntu22.04 Stock kernel v1.1.0
Arrow Lake v6.5.0-45-generic - Ubuntu22.04 Stock kernel v1.5.0
Lunar Lake v6.8.0-40-generic - Ubuntu24.04 Stock kernel v1.5.0

Kernel module driver

The kernel module driver intel_vpu can be found in drivers/accel tree in the kernel sources.

Enable the intel_vpu module in the kernel config for compilation

In the Kernel Menu Configuration from Device Driver select Compute Acceleration Framework and set "modularize" for Intel NPU (Neural Processing Unit).

Finding the intel_vpu kernel module in the system
# check if the intel_vpu exists is in the system
modinfo intel_vpu

# check if the intel_vpu is loaded in the kernel
lsmod | grep intel_vpu

# if the previous command nothing produced, load the intel_vpu
sudo modprobe intel_vpu

# verify that the intel_vpu has been loaded successfully
sudo dmesg | tail -n 20

# check if accel device is available
ls /dev/accel/accel0

Building a standalone driver

Install the required dependencies in Ubuntu:

sudo apt update
sudo apt install -y build-essential git git-lfs cmake libudev-dev libboost-all-dev libssl-dev libudev-dev

Commands to build the driver:

cd linux-npu-driver
git submodule update --init --recursive

cmake -B build -S .
cmake --build build --parallel $(nproc)

# install the driver in the system
sudo cmake --install build --prefix /usr

# reload the intel_vpu module to load new firmware
sudo rmmod intel_vpu
sudo modprobe intel_vpu

Building a driver together with the compiler

The driver has a target in compiler/npu_compiler.cmake to build Compiler-in-Driver component from NPU plugin repository.

OpenVINO runtime is required by compiler. About the dependencies for building OpenVINO, please check the OpenVINO build document.

To build a compiler from the driver repository the ENABLE_NPU_COMPILER_BUILD flag has to be set:

cd linux-npu-driver
git submodule update --init --recursive

cmake -B build -S . -DENABLE_NPU_COMPILER_BUILD=ON
cmake --build build --parallel $(nproc)

# install the driver in the system
sudo cmake --install build --prefix /usr

# reload the intel_vpu module to load new firmware
sudo rmmod intel_vpu
sudo modprobe intel_vpu

The compiler binary libnpu_driver_compiler.so can be found in build/lib/.

Driver test application

The validation/umd-test directory contains npu-umd-test functional tests for driver. The binary npu-umd-test is located in the build folder, ex. build/bin/.

Command line to run functional tests (after driver installation):

npu-umd-test

It is expected that Umd.ConfigurationCheck test fails when --config option is not used. --config points to YAML configuration file that allows to control the inference test content. Those tests require compiler in system.

Config file requires to download any OpenVINO model. Command line to setup a basic.yaml:

# Prepare the add_abc model in path pointed by basic.yaml
mkdir -p models/add_abc
curl -o models/add_abc/add_abc.xml https://raw.githubusercontent.com/openvinotoolkit/openvino/master/src/core/tests/models/ir/add_abc.xml
touch models/add_abc/add_abc.bin

# Run tests with add_abc.xml
npu-umd-test --config=validation/umd-test/configs/basic.yaml

More information about config can be found in validation/umd-test/configs.

Troubleshooting

Non-root access to the NPU device

To access the NPU device, the user must be in the "render" or "video" group. A group depends on system configuration:

# check user groups
groups

# add user to the "render" group
sudo usermod -a -G render <user-name>

# log out and log in to apply the new group

The patch for systemd to set the "render" group for accel subsystem has been merged, but might not be available in your Linux distribution. See systemd change.

If setting the "render" group does not resolve the non-root access issue, this must be done by an administrator manually:

# check device permissions
ls -l /dev/accel/

# change group for accel device
sudo chown root:render /dev/accel/accel0

# allow to read/write from device for group
sudo chmod g+rw /dev/accel/accel0

# check final permissions
$ ls -lah /dev/accel/accel0
crw-rw---- 1 root render 261, 0 Jan 31 15:58 /dev/accel/accel0
Compilation problem due to lack of memory

The compilation may fail due to memory shortage. The recommendation is to use the Ninja generator instead of Unix Makefiles. If it does not help, please file a new issue.

# install Ninja
sudo apt update
sudo apt install -y ninja-build

# remove the old build and create a new one
rm build -rf
cmake -B build -S . -G Ninja
Enable driver log using an environment variable

Valid logging levels are ERROR, WARNING, INFO (and VERBOSE for driver older than v1.5.0 release)

Seting the logging level using the ZE_INTEL_NPU_LOGLEVEL environment variable:

export ZE_INTEL_NPU_LOGLEVEL=<logging_level>

Command to clear an exported value:

unset ZE_INTEL_NPU_LOGLEVEL

Setting ZE_INTEL_NPU_LOGMASK allows to print specific log groups in driver. The log group are listed in umd/vpu_driver/source/utilities/log.hpp

# Set log level to INFO to enable LOGMASK
export ZE_INTEL_NPU_LOGLEVEL=INFO

# Set log mask to only print from DEVICE, DRIVER and CACHE groups
export ZE_INTEL_NPU_LOGMASK=$((1<<4|1<<3|1<<17))
Firmware version mismatch

The user can use different kernel and firmware combination for NPU device. The user might receive the following error message:

ERROR! MAPPED_INFERENCE_VERSION is NOT compatible with the ELF Expected: 6.1.0 vs received: 7.0.0

It means that NPU compiler mismatches the NPU firmware. To fix this issue the user needs to upgrade the firmware. Firmware update should be done from driver repository using release tag that matches the NPU compiler:

cmake -B build -S .
cmake --install build/ --component fw-npu --prefix /