WSL2-Linux-Kernel

История

Vitaly Kuznetsov e47679c06a x86/hyperv: Properly deal with empty cpumasks in hyperv_flush_tlb_multi() [ Upstream commit `51500b71d5` ] KASAN detected the following issue: BUG: KASAN: slab-out-of-bounds in hyperv_flush_tlb_multi+0xf88/0x1060 Read of size 4 at addr ffff8880011ccbc0 by task kcompactd0/33 CPU: 1 PID: 33 Comm: kcompactd0 Not tainted 5.14.0-39.el9.x86_64+debug #1 Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS Hyper-V UEFI Release v4.0 12/17/2019 Call Trace: dump_stack_lvl+0x57/0x7d print_address_description.constprop.0+0x1f/0x140 ? hyperv_flush_tlb_multi+0xf88/0x1060 __kasan_report.cold+0x7f/0x11e ? hyperv_flush_tlb_multi+0xf88/0x1060 kasan_report+0x38/0x50 hyperv_flush_tlb_multi+0xf88/0x1060 flush_tlb_mm_range+0x1b1/0x200 ptep_clear_flush+0x10e/0x150 ... Allocated by task 0: kasan_save_stack+0x1b/0x40 __kasan_kmalloc+0x7c/0x90 hv_common_init+0xae/0x115 hyperv_init+0x97/0x501 apic_intr_mode_init+0xb3/0x1e0 x86_late_time_init+0x92/0xa2 start_kernel+0x338/0x3eb secondary_startup_64_no_verify+0xc2/0xcb The buggy address belongs to the object at ffff8880011cc800 which belongs to the cache kmalloc-1k of size 1024 The buggy address is located 960 bytes inside of 1024-byte region [ffff8880011cc800, ffff8880011ccc00) 'hyperv_flush_tlb_multi+0xf88/0x1060' points to hv_cpu_number_to_vp_number() and '960 bytes' means we're trying to get VP_INDEX for CPU#240. 'nr_cpus' here is exactly 240 so we're trying to access past hv_vp_index's last element. This can (and will) happen when 'cpus' mask is empty and cpumask_last() will return '>=nr_cpus'. Commit `ad0a6bad44` ("x86/hyperv: check cpu mask after interrupt has been disabled") tried to deal with empty cpumask situation but apparently didn't fully fix the issue. 'cpus' cpumask which is passed to hyperv_flush_tlb_multi() is 'mm_cpumask(mm)' (which is '&mm->cpu_bitmap'). This mask changes every time the particular mm is scheduled/unscheduled on some CPU (see switch_mm_irqs_off()), disabling IRQs on the CPU which is performing remote TLB flush has zero influence on whether the particular process can get scheduled/unscheduled on _other_ CPUs so e.g. in the case where the mm was scheduled on one other CPU and got unscheduled during hyperv_flush_tlb_multi()'s execution will lead to cpumask becoming empty. It doesn't seem that there's a good way to protect 'mm_cpumask(mm)' from changing during hyperv_flush_tlb_multi()'s execution. It would be possible to copy it in the very beginning of the function but this is a waste. It seems we can deal with changing cpumask just fine. When 'cpus' cpumask changes during hyperv_flush_tlb_multi()'s execution, there are two possible issues: - 'Under-flushing': we will not flush TLB on a CPU which got added to the mask while hyperv_flush_tlb_multi() was already running. This is not a problem as this is equal to mm getting scheduled on that CPU right after TLB flush. - 'Over-flushing': we may flush TLB on a CPU which is already cleared from the mask. First, extra TLB flush preserves correctness. Second, Hyper-V's TLB flush hypercall takes 'mm->pgd' argument so Hyper-V may avoid the flush if CR3 doesn't match. Fix the immediate issue with cpumask_last()/hv_cpu_number_to_vp_number() and remove the pointless cpumask_empty() check from the beginning of the function as it really doesn't protect anything. Also, avoid the hypercall altogether when 'flush->processor_mask' ends up being empty. Fixes: `ad0a6bad44` ("x86/hyperv: check cpu mask after interrupt has been disabled") Signed-off-by: Vitaly Kuznetsov <vkuznets@redhat.com> Reviewed-by: Michael Kelley <mikelley@microsoft.com> Link: https://lore.kernel.org/r/20220106094611.1404218-1-vkuznets@redhat.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>		2022-03-08 19:12:36 +01:00
..
alpha	alpha: enable GENERIC_PCI_IOMAP unconditionally	2021-09-19 10:37:00 -07:00
arc	signal: Replace force_sigsegv(SIGSEGV) with force_fatal_sig(SIGSEGV)	2021-11-25 09:49:06 +01:00
arm	ARM: OMAP2+: adjust the location of put_device() call in omapdss_init_of	2022-02-23 12:03:17 +01:00
arm64	arm64: Mark start_backtrace() notrace and NOKPROBE_SYMBOL	2022-03-08 19:12:32 +01:00
csky	perf: Protect perf_guest_cbs with RCU	2022-01-20 09:13:14 +01:00
h8300	Merge branch 'akpm' (patches from Andrew)	2021-09-08 12:55:35 -07:00
hexagon	hexagon: clean up timer-regs.h	2021-11-25 09:48:42 +01:00
ia64	PCI/sysfs: Find shadow ROM before static attribute initialization	2022-02-01 17:27:05 +01:00
m68k	signal: Replace force_fatal_sig with force_exit_sig when in doubt	2021-11-25 09:49:07 +01:00
microblaze	Microblaze patches for 5.15-rc1	2021-09-08 16:02:13 -07:00
mips	MIPS: fix local_{add,sub}_return on MIPS64	2022-03-08 19:12:33 +01:00
nds32	perf: Protect perf_guest_cbs with RCU	2022-01-20 09:13:14 +01:00
nios2	nios2: Make NIOS2_DTB_SOURCE_BOOL depend on !COMPILE_TEST	2021-10-27 09:29:07 -05:00
openrisc	openrisc: Add clone3 ABI wrapper	2022-01-27 11:04:10 +01:00
parisc	parisc/unaligned: Fix ldw() and stw() unalignment handlers	2022-03-02 11:47:49 +01:00
powerpc	powerpc/lib/sstep: fix 'ptesync' build error	2022-02-23 12:03:14 +01:00
riscv	riscv: fix oops caused by irqsoff latency tracer	2022-03-02 11:48:08 +01:00
s390	KVM: s390: Ensure kvm_arch_no_poll() is read once when blocking vCPU	2022-03-08 19:12:34 +01:00
sh	Documentation, arch: Remove leftovers from CIFS_WEAK_PW_HASH	2022-01-27 11:05:21 +01:00
sparc	signal: Replace force_fatal_sig with force_exit_sig when in doubt	2021-11-25 09:49:07 +01:00
um	um: gitignore: Add kernel/capflags.c	2022-01-27 11:05:34 +01:00
x86	x86/hyperv: Properly deal with empty cpumasks in hyperv_flush_tlb_multi()	2022-03-08 19:12:36 +01:00
xtensa	xtensa: xtfpga: Try software restart before simulating CPU reset	2021-10-05 12:19:05 -07:00
.gitignore	…
Kconfig	arch/cc: Introduce a function to check for confidential computing features	2021-11-18 19:17:21 +01:00