Frequent discovery DATABASE_ERROR
during WiFi brownouts and roaming
#3544
Labels
triage
Issue pending classification
DATABASE_ERROR
during WiFi brownouts and roaming
#3544
Is there an already existing issue for this?
Expected behavior
The context:
Expected behaviour:
Current behavior
For most part, the behaviour is as we expect (described over). However, occasionally, after the WiFi hosts rejoin network, the discovery service is seen to throw error messages like these:
Once we see these errors on the discovery service, we notice that discovery is no more reliable. Certain publishers and subscribers may not match anymore and data flow may not ever recover.
Steps to reproduce
As this is an occasional behaviour, this is quite hard to reproduce. The way to reproduce this using multiple virtual machines (VMs) on a host
DATABASE_ERROR
reported on the console running discovery serverFast DDS version/commit
Happens in master. But certainly on release 2.10.1
Platform/Architecture
Ubuntu Focal 20.04 amd64, Ubuntu Focal 20.04 arm64
Transport layer
UDPv4
Additional context
We seem to have solved this by delayed reconciliation of readers and writers reported to have
no associated participants
. Essentially push such readers and writers into a list, and upon discovery of new participants, we try to run the association again. This seems to resolve the errors. I will add a pull request demonstrating the solution later.XML configuration file
No response
Relevant log output
No response
Network traffic capture
No response
The text was updated successfully, but these errors were encountered: