Network Performance Degradation #121

zhouxiaobo500 · 2018-02-08T07:17:14Z

I'm running on Arch Linux, using two versions of zen kernels: 4.14.5-1 and 4.15.1-2.
After the rolling update, the new kernels drops its network performance up to A-HALF, compared with SCP transmission. (eg. 100M/s vs. 50M+/s)
Testing environment keeps the system configuration remains the same excepts the versions of linux-zen and linux-zen-headers are different.

I don't know if the problem comes from upstream or not.

heftig · 2018-02-08T07:41:00Z

Does sysctl net.ipv4.tcp_congestion_control=cubic help?

zhouxiaobo500 · 2018-02-08T07:46:04Z

Not tried it yet. But I have tried to load the latest stock kernel (4.15.1-2), and there's no issue on network performance. @heftig

zhouxiaobo500 · 2018-02-08T08:21:16Z

After reloading the kernels and rebooting 4+ times, it seems as normal. Weird bug..
Anyway, tks your help!

zhouxiaobo500 · 2018-02-08T10:29:12Z

The bug appears again..
Maybe it really comes from upstream..

The screenshot is captured from the client, while the host is using linux-zen 4.14.5-1.

[ Upstream commit 80bf6ce ] When we get into activate_mm(), lockdep complains that we're doing something strange: WARNING: possible circular locking dependency detected 5.1.0-10252-gb00152307319-dirty #121 Not tainted ------------------------------------------------------ inside.sh/366 is trying to acquire lock: (____ptrval____) (&(&p->alloc_lock)->rlock){+.+.}, at: flush_old_exec+0x703/0x8d7 but task is already holding lock: (____ptrval____) (&mm->mmap_sem){++++}, at: flush_old_exec+0x6c5/0x8d7 which lock already depends on the new lock. the existing dependency chain (in reverse order) is: -> #1 (&mm->mmap_sem){++++}: [...] __lock_acquire+0x12ab/0x139f lock_acquire+0x155/0x18e down_write+0x3f/0x98 flush_old_exec+0x748/0x8d7 load_elf_binary+0x2ca/0xddb [...] -> #0 (&(&p->alloc_lock)->rlock){+.+.}: [...] __lock_acquire+0x12ab/0x139f lock_acquire+0x155/0x18e _raw_spin_lock+0x30/0x83 flush_old_exec+0x703/0x8d7 load_elf_binary+0x2ca/0xddb [...] other info that might help us debug this: Possible unsafe locking scenario: CPU0 CPU1 ---- ---- lock(&mm->mmap_sem); lock(&(&p->alloc_lock)->rlock); lock(&mm->mmap_sem); lock(&(&p->alloc_lock)->rlock); *** DEADLOCK *** 2 locks held by inside.sh/366: #0: (____ptrval____) (&sig->cred_guard_mutex){+.+.}, at: __do_execve_file+0x12d/0x869 #1: (____ptrval____) (&mm->mmap_sem){++++}, at: flush_old_exec+0x6c5/0x8d7 stack backtrace: CPU: 0 PID: 366 Comm: inside.sh Not tainted 5.1.0-10252-gb00152307319-dirty #121 Stack: [...] Call Trace: [<600420de>] show_stack+0x13b/0x155 [<6048906b>] dump_stack+0x2a/0x2c [<6009ae64>] print_circular_bug+0x332/0x343 [<6009c5c6>] check_prev_add+0x669/0xdad [<600a06b4>] __lock_acquire+0x12ab/0x139f [<6009f3d0>] lock_acquire+0x155/0x18e [<604a07e0>] _raw_spin_lock+0x30/0x83 [<60151e6a>] flush_old_exec+0x703/0x8d7 [<601a8eb8>] load_elf_binary+0x2ca/0xddb [...] I think it's because in exec_mmap() we have down_read(&old_mm->mmap_sem); ... task_lock(tsk); ... activate_mm(active_mm, mm); (which does down_write(&mm->mmap_sem)) I'm not really sure why lockdep throws in the whole knowledge about the task lock, but it seems that old_mm and mm shouldn't ever be the same (and it doesn't deadlock) so tell lockdep that they're different. Signed-off-by: Johannes Berg <johannes.berg@intel.com> Signed-off-by: Richard Weinberger <richard@nod.at> Signed-off-by: Sasha Levin <sashal@kernel.org>

Passing a sufficient amount of imix entries leads to invalid access to the pkt_dev->imix_entries array because of the incorrect boundary check. UBSAN: array-index-out-of-bounds in net/core/pktgen.c:874:24 index 20 is out of range for type 'imix_pkt [20]' CPU: 2 PID: 1210 Comm: bash Not tainted 6.10.0-rc1 #121 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996) Call Trace: <TASK> dump_stack_lvl lib/dump_stack.c:117 __ubsan_handle_out_of_bounds lib/ubsan.c:429 get_imix_entries net/core/pktgen.c:874 pktgen_if_write net/core/pktgen.c:1063 pde_write fs/proc/inode.c:334 proc_reg_write fs/proc/inode.c:346 vfs_write fs/read_write.c:593 ksys_write fs/read_write.c:644 do_syscall_64 arch/x86/entry/common.c:83 entry_SYSCALL_64_after_hwframe arch/x86/entry/entry_64.S:130 Found by Linux Verification Center (linuxtesting.org) with SVACE. Fixes: 52a62f8 ("pktgen: Parse internet mix (imix) input") Signed-off-by: Artem Chernyshev <artem.chernyshev@red-soft.ru> [ fp: allow to fill the array completely; minor changelog cleanup ] Signed-off-by: Fedor Pchelkin <pchelkin@ispras.ru> Signed-off-by: David S. Miller <davem@davemloft.net>

zhouxiaobo500 closed this as completed Feb 8, 2018

zhouxiaobo500 reopened this Feb 8, 2018

zhouxiaobo500 closed this as completed Feb 8, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Network Performance Degradation #121

Network Performance Degradation #121

zhouxiaobo500 commented Feb 8, 2018

heftig commented Feb 8, 2018

zhouxiaobo500 commented Feb 8, 2018 •

edited

Loading

zhouxiaobo500 commented Feb 8, 2018

zhouxiaobo500 commented Feb 8, 2018

Network Performance Degradation #121

Network Performance Degradation #121

Comments

zhouxiaobo500 commented Feb 8, 2018

heftig commented Feb 8, 2018

zhouxiaobo500 commented Feb 8, 2018 • edited Loading

zhouxiaobo500 commented Feb 8, 2018

zhouxiaobo500 commented Feb 8, 2018

zhouxiaobo500 commented Feb 8, 2018 •

edited

Loading