If the eureka client frequently sends register or renew requests, nodes in the eureka server cluster may be inconsistent #1509

tkf0707 · 2023-07-25T02:41:44Z

Scene description: An abnormal eureka client sends dozens of renew requests within 1s. As a result, the number of nodes in the eureka server cluster is inconsistent, and node information cannot be synchronized between the two servers.

Analyze：When nodes are synchronized in the eureka server cluster, a batchTaskDispatcher is created to package and send tasks in batches, and tasks are queued in the AcceptorExecutor.
If the renewal frequency of an application instance(eureka.instance.lease-renewal-interval-in-seconds) is smaller than the synchronization execution frequency(MAX_BATCHING_DELAY_MS=500), and the Task of the application instance is at the head of the pendingTasks to be processed, the registry server fails to determine whether the difference between the task creation time and the current time is smaller than the synchronization execution time. As a result, the registry server does not put the renewal task in the batchWorkQueue to be executed.

AcceptorExecutor.java:      

        void assignBatchWork() {
            if (hasEnoughTasksForNextBatch()) {
                if (batchWorkRequests.tryAcquire(1)) {
                    long now = System.currentTimeMillis();
                    int len = Math.min(maxBatchingSize, processingOrder.size());
                    List<TaskHolder<ID, T>> holders = new ArrayList<>(len);
                    while (holders.size() < len && !processingOrder.isEmpty()) {
                        ID id = processingOrder.poll();
                        TaskHolder<ID, T> holder = pendingTasks.remove(id);
                        if (holder.getExpiryTime() > now) {
                            holders.add(holder);
                        } else {
                            expiredTasks++;
                        }
                    }
                    if (holders.isEmpty()) {
                        batchWorkRequests.release();
                    } else {
                        batchSizeMetric.record(holders.size(), TimeUnit.MILLISECONDS);
                        batchWorkQueue.add(holders);
                    }
                }
            }
        }

        private boolean hasEnoughTasksForNextBatch() {
            if (processingOrder.isEmpty()) {
                return false;
            }
            if (pendingTasks.size() >= maxBufferSize) {
                return true;
            }

            TaskHolder<ID, T> nextHolder = pendingTasks.get(processingOrder.peek());
            long delay = System.currentTimeMillis() - nextHolder.getSubmitTimestamp();
            return delay >= maxBatchingDelay;
        }

How should the server solve this problem to protect the synchronization between the server clusters？

The text was updated successfully, but these errors were encountered:

error0702 · 2023-07-25T02:42:45Z

这是来自QQ邮箱的假期自动回复邮件。你好，我最近正在休假中，无法亲自回复你的邮件。我将在假期结束后，尽快给你回复。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

If the eureka client frequently sends register or renew requests, nodes in the eureka server cluster may be inconsistent #1509

If the eureka client frequently sends register or renew requests, nodes in the eureka server cluster may be inconsistent #1509

tkf0707 commented Jul 25, 2023

error0702 commented Jul 25, 2023 via email

If the eureka client frequently sends register or renew requests, nodes in the eureka server cluster may be inconsistent #1509

If the eureka client frequently sends register or renew requests, nodes in the eureka server cluster may be inconsistent #1509

Comments

tkf0707 commented Jul 25, 2023

error0702 commented Jul 25, 2023 via email