WSL2-Linux-Kernel

History

Michael Ellerman 6627b96851 powerpc/mm: Fix boot crash with FLATMEM	2023-11-08 17:26:48 +01:00

[ Upstream commit `daa9ada209` ]

Erhard reported that his G5 was crashing with v6.6-rc kernels:

  mpic: Setting up HT PICs workarounds for U3/U4
  BUG: Unable to handle kernel data access at 0xfeffbb62ffec65fe
  Faulting instruction address: 0xc00000000005dc40
  Oops: Kernel access of bad area, sig: 11 [#1]
  BE PAGE_SIZE=4K MMU=Hash SMP NR_CPUS=2 PowerMac
  Modules linked in:
  CPU: 0 PID: 0 Comm: swapper/0 Tainted: G T 6.6.0-rc3-PMacGS #1
  Hardware name: PowerMac11,2 PPC970MP 0x440101 PowerMac
  NIP: c00000000005dc40 LR: c000000000066660 CTR: c000000000007730
  REGS: c0000000022bf510 TRAP: 0380 Tainted: G T (6.6.0-rc3-PMacGS)
  MSR: 9000000000001032 <SF,HV,ME,IR,DR,RI> CR: 44004242 XER: 00000000
  IRQMASK: 3
  GPR00: 0000000000000000 c0000000022bf7b0 c0000000010c0b00 00000000000001ac
  GPR04: 0000000003c80000 0000000000000300 c0000000f20001ae 0000000000000300
  GPR08: 0000000000000006 feffbb62ffec65ff 0000000000000001 0000000000000000
  GPR12: 9000000000001032 c000000002362000 c000000000f76b80 000000000349ecd8
  GPR16: 0000000002367ba8 0000000002367f08 0000000000000006 0000000000000000
  GPR20: 00000000000001ac c000000000f6f920 c0000000022cd985 000000000000000c
  GPR24: 0000000000000300 00000003b0a3691d c0003e008030000e 0000000000000000
  GPR28: c00000000000000c c0000000f20001ee feffbb62ffec65fe 00000000000001ac
  NIP hash_page_do_lazy_icache+0x50/0x100
  LR __hash_page_4K+0x420/0x590
  Call Trace:
    hash_page_mm+0x364/0x6f0
    do_hash_fault+0x114/0x2b0
    data_access_common_virt+0x198/0x1f0
  --- interrupt: 300 at mpic_init+0x4bc/0x10c4
  NIP: c000000002020a5c LR: c000000002020a04 CTR: 0000000000000000
  REGS: c0000000022bf9f0 TRAP: 0300 Tainted: G T (6.6.0-rc3-PMacGS)
  MSR: 9000000000001032 <SF,HV,ME,IR,DR,RI> CR: 24004248 XER: 00000000
  DAR: c0003e008030000e DSISR: 40000000 IRQMASK: 1
  ...
  NIP mpic_init+0x4bc/0x10c4
  LR mpic_init+0x464/0x10c4
  --- interrupt: 300
    pmac_setup_one_mpic+0x258/0x2dc
    pmac_pic_init+0x28c/0x3d8
    init_IRQ+0x90/0x140
    start_kernel+0x57c/0x78c
    start_here_common+0x1c/0x20

A bisect pointed to the breakage beginning with commit `9fee28baa6` ("powerpc: implement the new page table range API").

Analysis of the oops pointed to a struct page with a corrupted compound_head being loaded via page_folio() -> _compound_head() in hash_page_do_lazy_icache().

The access by the mpic code is to an MMIO address, so the expectation is that the struct page for that address would be initialised by init_unavailable_range(), as pointed out by Aneesh.

Instrumentation showed that was not the case, which eventually led to the realisation that pfn_valid() was returning false for that address, causing the struct page to not be initialised.

Because the system is using FLATMEM, the version of pfn_valid() in memory_model.h is used:

  static inline int pfn_valid(unsigned long pfn)
  {
          ...
          return pfn >= pfn_offset && (pfn - pfn_offset) < max_mapnr;
  }

Which relies on max_mapnr being initialised. Early in boot max_mapnr is zero meaning no PFNs are valid.

max_mapnr is initialised in mem_init() called via:

  start_kernel()
    mm_core_init()            # init/main.c:928
      mem_init()

But that is too late for the usage in init_unavailable_range() called via:

  start_kernel()
    setup_arch()              # init/main.c:893
      paging_init()
        free_area_init()
          init_unavailable_range()

Although max_mapnr is currently set in mem_init(), the value is actually already available much earlier, as soon as mem_topology_setup() has completed, which is also before paging_init() is called. So move the initialisation there, which causes paging_init() to correctly initialise the struct page and fixes the bug.

This bug seems to have been lurking for years, but went unnoticed because the pre-folio code was inspecting the uninitialised page->flags but not dereferencing it.

Thanks to Erhard and Aneesh for help debugging.

Reported-by: Erhard Furtner <erhard_f@mailbox.org>
Closes: https://lore.kernel.org/all/20230929132750.3cd98452@yea/
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
Link: https://msgid.link/20231023112500.1550208-1-mpe@ellerman.id.au
Signed-off-by: Sasha Levin <sashal@kernel.org>
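
The ordering problem described in the commit message can be modelled in user space. The sketch below is not kernel code: it copies only the FLATMEM pfn_valid() comparison quoted in the message. NR_PFNS, page_initialised[], init_unavailable_range_model() and boot_model() are hypothetical names invented for illustration, and pfn_offset is assumed to be 0.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

#define NR_PFNS 16UL /* hypothetical size of the modelled memory map */

/* Stand-ins for the kernel globals named in the commit message. */
static unsigned long max_mapnr;            /* zero until it is initialised */
static const unsigned long pfn_offset = 0; /* assumption: map starts at PFN 0 */

/* Hypothetical per-PFN flag standing in for "struct page was initialised". */
static bool page_initialised[NR_PFNS];

/* Same comparison as the FLATMEM pfn_valid() quoted in the commit message. */
static inline int pfn_valid(unsigned long pfn)
{
	return pfn >= pfn_offset && (pfn - pfn_offset) < max_mapnr;
}

/* Simplified model: only PFNs that pfn_valid() accepts get initialised. */
static void init_unavailable_range_model(unsigned long start, unsigned long end)
{
	unsigned long pfn;

	for (pfn = start; pfn < end; pfn++)
		if (pfn_valid(pfn))
			page_initialised[pfn] = true;
}

/*
 * Replays the boot order and returns how many pages were initialised.
 * set_max_mapnr_early == false models the old order (max_mapnr set in
 * mem_init(), after paging_init()); true models the fixed order (set
 * once mem_topology_setup() has completed, before paging_init()).
 */
static unsigned long boot_model(bool set_max_mapnr_early)
{
	unsigned long pfn, initialised = 0;

	max_mapnr = 0;
	memset(page_initialised, 0, sizeof(page_initialised));

	if (set_max_mapnr_early)
		max_mapnr = NR_PFNS;                  /* fixed ordering */

	init_unavailable_range_model(0, NR_PFNS);     /* paging_init() stage */

	max_mapnr = NR_PFNS;                          /* mem_init() stage */
	assert(pfn_valid(0));                         /* valid by now in both orders */

	for (pfn = 0; pfn < NR_PFNS; pfn++)
		if (page_initialised[pfn])
			initialised++;

	return initialised;
}
```

Calling boot_model(false) leaves every modelled page uninitialised, because pfn_valid() rejects all PFNs while max_mapnr is still zero. boot_model(true) initialises all NR_PFNS of them, which mirrors why moving the assignment earlier fixes the crash.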
..
book3s32	powerpc/32s: Do kuep_lock() and kuep_unlock() in assembly	2023-10-25 11:58:59 +02:00
book3s64	powerpc: Don't include lppaca.h in paca.h	2023-09-19 12:22:42 +02:00
kasan	powerpc/kasan: Disable KCOV in KASAN code	2023-08-26 14:23:26 +02:00
nohash	powerpc/32: Call mmu_mark_initmem_nx() regardless of data block mapping.	2022-08-17 14:24:12 +02:00
ptdump	powerpc/ptdump: Fix display of RW pages on FSL_BOOK3E	2022-08-17 14:22:58 +02:00
Makefile	powerpc/ptdump: Convert powerpc to GENERIC_PTDUMP	2021-08-25 13:35:48 +10:00
cacheflush.c	powerpc/mem: Use kmap_local_page() in flushing functions	2021-04-14 23:04:19 +10:00
copro_fault.c	mm: clean up the last pieces of page fault accountings	2020-08-12 10:58:04 -07:00
dma-noncoherent.c	dma-mapping: merge <linux/dma-noncoherent.h> into <linux/dma-map-ops.h>	2020-10-06 07:07:06 +02:00
drmem.c	pseries/drmem: update LMBs after LPM	2021-08-10 23:14:55 +10:00
fault.c	powerpc/64s: Don't use DSISR for SLB faults	2022-04-08 14:23:38 +02:00
hugetlbpage.c	hugetlb: pass vma into huge_pte_alloc() and huge_pmd_share()	2021-05-05 11:27:20 -07:00
init-common.c	powerpc: Inline setup_kup()	2020-12-15 13:13:49 +11:00
init_32.c	powerpc: Enable KFENCE for PPC32	2021-03-24 14:09:30 +11:00
init_64.c	powerpc/mm/altmap: Fix altmap boundary check	2023-08-11 15:13:59 +02:00
ioremap.c	mm/vmalloc: remove unmap_kernel_range	2021-04-30 11:20:40 -07:00
ioremap_32.c	powerpc/mm: Leave a gap between early allocated IO areas	2021-06-25 00:07:10 +10:00
ioremap_64.c	powerpc/mm: Leave a gap between early allocated IO areas	2021-06-25 00:07:10 +10:00
maccess.c	powerpc: Don't use 'struct ppc_inst' to reference instruction location	2021-06-17 00:09:00 +10:00
mem.c	powerpc/mm: Fix boot crash with FLATMEM	2023-11-08 17:26:48 +01:00
mmap.c	…
mmu_context.c	powerpc/mm: Switch obsolete dssall to .long	2022-06-14 18:36:27 +02:00
mmu_decl.h	powerpc/ptdump: Convert powerpc to GENERIC_PTDUMP	2021-08-25 13:35:48 +10:00
numa.c	powerpc/papr_scm: Update the NUMA distance table for the target node	2023-04-20 12:13:56 +02:00
pageattr.c	powerpc/set_memory: Avoid spinlock recursion in change_page_attr()	2022-04-13 20:59:05 +02:00
pgtable-frag.c	powerpc/mm/radix: Fix PTE/PMD fragment count for early page table mappings	2020-07-20 22:57:56 +10:00
pgtable.c	powerpc/fixmap: Fix VM debug warning on unmap	2022-02-16 12:56:12 +01:00
pgtable_32.c	powerpc/32: Call mmu_mark_initmem_nx() regardless of data block mapping.	2022-08-17 14:24:12 +02:00
pgtable_64.c	powerpc/64s/radix: Fix huge vmap false positive	2022-01-27 11:05:12 +01:00
slice.c	powerpc: Replace _ALIGN_UP() by ALIGN()	2020-05-11 23:15:15 +10:00