app/virtio-ha: add PF reset before DMA table clean up #105

Ch3n60x · 2024-07-01T09:17:50Z

When vfe-vhostd crashes, an adminQ command may still be in flight, for example one sent just before the crash. Before this commit, the DMA mapping of the global container was cleaned up when vfe-vhostd quit. For PFs, the adminQ command response could therefore arrive after the DMA mapping clean-up, resulting in an IO_PAGE_FAULT in the kernel.

This commit fixes the issue by resetting the PF before the DMA clean-up.

RM: 3957706
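
The ordering described above can be illustrated with a small model. This is only a sketch: `pf_model`, `dma_table_model`, `pf_reset`, `dma_table_cleanup` and `ha_teardown` are hypothetical names, not the functions touched by this PR, and the model assumes that a PF reset makes the device drop every in-flight adminQ command.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model of a PF; not the real vfe-vhostd data structure. */
struct pf_model {
    int inflight_admin_cmds; /* adminQ commands sent but not yet answered */
    int page_faults;         /* responses that hit an unmapped DMA region */
};

/* Hypothetical model of the global container's DMA table. */
struct dma_table_model {
    bool mapped;
};

/* The device writes one adminQ response; it faults if the mapping is gone. */
static void pf_deliver_response(struct pf_model *pf,
                                const struct dma_table_model *tbl)
{
    if (pf->inflight_admin_cmds == 0)
        return;
    pf->inflight_admin_cmds--;
    if (!tbl->mapped)
        pf->page_faults++;
}

/* Assumption of this model: a reset drops all in-flight adminQ commands. */
static void pf_reset(struct pf_model *pf)
{
    pf->inflight_admin_cmds = 0;
}

static void dma_table_cleanup(struct dma_table_model *tbl)
{
    tbl->mapped = false;
}

/* Teardown in the order this PR describes: reset every PF, then clean up DMA. */
static void ha_teardown(struct pf_model *pfs, size_t nr_pfs,
                        struct dma_table_model *tbl)
{
    assert(pfs != NULL || nr_pfs == 0);
    for (size_t i = 0; i < nr_pfs; i++)
        pf_reset(&pfs[i]);
    dma_table_cleanup(tbl);
}
```

In this model, calling `dma_table_cleanup` first and then `pf_deliver_response` increments `page_faults`, which corresponds to the IO_PAGE_FAULT case; with `ha_teardown` no response is left to arrive after the mapping is removed.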

When vfe-vhostd crashes, an adminQ command may still be in flight, for example one sent just before the crash. Before this commit, the DMA mapping of the global container was cleaned up when vfe-vhostd quit. For PFs, the adminQ command response could therefore arrive after the DMA mapping clean-up, resulting in an IO_PAGE_FAULT in the kernel.

This commit fixes the issue by resetting the PF before the DMA clean-up.

RM: 3957706

Signed-off-by: Chenbo Xia <chenbox@nvidia.com>

When doing a hot upgrade, we need to know whether vfe-vhostd and vfe-vhostd-ha have finished init. This commit adds the related log and the corresponding HA IPC message so that vfe-vhostd can notify vfe-vhostd-ha that init has finished.

Signed-off-by: Chenbo Xia <chenbox@nvidia.com>
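
As a rough illustration of the notification described in the commit message above, the sketch below sends a fixed-size header over a file descriptor and lets the receiver record that init finished. The message layout, the `HA_MSG_APP_INIT_FINISH` value and both function names are hypothetical; the real HA IPC definitions are not shown in this PR, and the use of a stream socket is an assumption.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical message type; the real HA IPC enum is not shown in this PR. */
enum ha_msg_type {
    HA_MSG_APP_INIT_FINISH = 1,
};

/* Hypothetical fixed-size header for one IPC message. */
struct ha_msg_hdr {
    uint32_t type;
    uint32_t size; /* payload bytes following the header; 0 for this message */
};

/* vfe-vhostd side: tell the HA daemon that init has finished. */
static int ha_send_init_finish(int fd)
{
    struct ha_msg_hdr hdr = { .type = HA_MSG_APP_INIT_FINISH, .size = 0 };
    ssize_t n = write(fd, &hdr, sizeof(hdr));

    return n == (ssize_t)sizeof(hdr) ? 0 : -1;
}

/* vfe-vhostd-ha side: read one header and record the notification. */
static int ha_recv_msg(int fd, bool *app_init_done)
{
    struct ha_msg_hdr hdr;
    ssize_t n;

    assert(app_init_done != NULL);
    n = read(fd, &hdr, sizeof(hdr));
    if (n != (ssize_t)sizeof(hdr))
        return -1; /* short read or error; a real receiver would retry */
    if (hdr.type != HA_MSG_APP_INIT_FINISH || hdr.size != 0)
        return -1; /* unknown or malformed message in this sketch */
    *app_init_done = true;
    return 0;
}
```

A hot-upgrade script could then wait for the log line that the HA side prints once `app_init_done` is set, rather than guessing when init has completed.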

Before this commit, we used the HPA to check whether the old and new memory regions are the same. This covers a corner case: when vhostd restarts and QEMU also restarts, QEMU could send a different memory region with the same info (QEMU_VA, GPA, SIZE). Using the HPA handled this case, but as a side effect the mmap call needed the MAP_POPULATE flag, which makes mmap take longer. In a real environment, mapping hundreds of GB of memory could take several seconds. This commit removes the use of the HPA and the MAP_POPULATE flag, and uses the QEMU process ID to handle the corner case instead.

Signed-off-by: Chenbo Xia <chenbox@nvidia.com>
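
The comparison described in the commit message above might look roughly like the following. The struct and function names are hypothetical, and how the QEMU process ID is obtained is not covered by this sketch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <sys/types.h>

/* Hypothetical identity of a guest memory region as seen by vhostd. */
struct mem_region_id {
    uint64_t qemu_va;  /* QEMU virtual address of the region */
    uint64_t gpa;      /* guest physical address */
    uint64_t size;     /* region size in bytes */
    pid_t    qemu_pid; /* process ID of the QEMU that sent the region */
};

/*
 * Two regions count as the same only if the address info matches and they
 * come from the same QEMU process. A restarted QEMU gets a new pid, so a
 * region with identical (QEMU_VA, GPA, SIZE) is still treated as new,
 * without needing the HPA or a MAP_POPULATE mmap to look it up.
 */
static bool mem_region_same(const struct mem_region_id *a,
                            const struct mem_region_id *b)
{
    assert(a != NULL && b != NULL);
    return a->qemu_va == b->qemu_va &&
           a->gpa == b->gpa &&
           a->size == b->size &&
           a->qemu_pid == b->qemu_pid;
}
```

The trade-off is that the check now relies on process identity rather than on the physical pages, which is what lets the mmap call skip pre-faulting the whole region.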

Ch3n60x added 3 commits July 1, 2024 09:16

Ch3n60x force-pushed the fix_pf_pagefault branch from 8b8d939 to a198a00 Compare July 4, 2024 04:22

kailiangz1 merged commit 814e8b0 into Mellanox:main Jul 9, 2024

