Re: Fwd: kernel bug when performing heavy IO operations

From: Matthew Wilcox
Date: Wed Sep 27 2023 - 03:20:07 EST


On Wed, Sep 27, 2023 at 01:36:52PM +0800, dianlujitao@xxxxxxxxx wrote:
> Hello, I got some logs with 6.5.4 kernel from the official linux package of
> Arch, no zen patches this time. Full dmesg is uploaded to
> https://fars.ee/F1yM and below is a small snippet for your convenience, from
> which PG_offline is no longer set:
>
> [177850.039441] BUG: Bad page map in process ld.lld pte:8000000edacc4025
> pmd:147f96067
> [177850.039454] page:000000007415dd6c refcount:22 mapcount:-237
> mapping:00000000b0c37ca6 index:0x1075 pfn:0xedacc4

It still looks like memory corruption to me. If you go back to an older
kernel (say 5.10 or 5.15) does the problem go away? It's not really
dispositive either way, since a newer kernel might drive the hardware
closer to the edge, but it might give some clue.