Re: [RFC PATCH v2 00/19] PKS write protected page tables

From: Boris Lukashev
Date: Fri Mar 15 2024 - 23:14:29 EST


IIRC shoot-downs are one of the reasons for using per-cpu PGDs, which
can in-turn enable/underpin other hardening functions... presuming the
churn of recent years has softened attitudes toward such core MM
changes.
https://forum.osdev.org/viewtopic.php?f=15&t=29661

-Boris


On Mon, Aug 30, 2021 at 8:02 PM Rick Edgecombe
<rick.p.edgecombe@xxxxxxxxx> wrote:
>
> Hi,
>
> This is a second RFC for the PKS write protected tables concept. I'm sharing to
> show the progress to interested people. I'd also appreciate any comments,
> especially on the direct map page table protection solution (patch 17).
>
> Since v1[1], the improvements are:
> - Fully handle direct map page tables, and handle hotplug/unplug path.
> - Create a debug time checker that scans page tables and verifies
> their protection.
> - Fix odds-and-ends kernel page tables that showed up with debug
> checker. At this point all of the typical normal page tables should be
> protected.
> - Fix toggling of writablility for odds-and-ends page table modifications found
> that don't use the normal helpers.
> - Create atomic context grouped page allocator, after finding some page table
> allocations that are passing GFP_ATOMIC.
> - Create "soft" mode that warns and disables protection on violation instead
> of oopsing.
> - Boot parameters for disabling pks tables
> - Change PageTable set clear to ctor/dtor (peterz)
> - Remove VM_BUG_ON_PAGE in alloc_table() (Shakeel Butt)
> - PeterZ/Vlastimil had suggested to also build a non-PKS mode for use in
> debugging. I skipped it for now because the series was too big.
> - Rebased to latest PKS core v7 [2]
>
> Also, Mike Rapoport has been experimenting[3] with this usage to work on how to
> share caches of permissioned/broken pages between use cases. This RFCv2 still
> uses the "grouped pages" concept, where each usage would maintain its own
> cache, but should be able to integrate with a central solution if something is
> developed.
>
> Next I was planning to look into characterizing/tuning the performance, although
> what page allocation scheme is ultimately used will probably impact that.
>
> This applies on top of the PKS core v7 series[2] and this patch[4]. Testing is
> still pretty light.
>
> This RFC has been acked by Dave Hansen.
>
> [1] https://lore.kernel.org/lkml/20210505003032.489164-1-rick.p.edgecombe@xxxxxxxxx/
> [2] https://lore.kernel.org/lkml/20210804043231.2655537-1-ira.weiny@intelcom/
> [3] https://lore.kernel.org/lkml/20210823132513.15836-1-rppt@xxxxxxxxxx/
> [4] https://lore.kernel.org/lkml/20210818221026.10794-1-rick.p.edgecombe@xxxxxxxxx/
>
> Rick Edgecombe (19):
> list: Support getting most recent element in list_lru
> list: Support list head not in object for list_lru
> x86/mm/cpa: Add grouped page allocations
> mm: Explicitly zero page table lock ptr
> x86, mm: Use cache of page tables
> x86/mm/cpa: Add perm callbacks to grouped pages
> x86/cpufeatures: Add feature for pks tables
> x86/mm/cpa: Add get_grouped_page_atomic()
> x86/mm: Support GFP_ATOMIC in alloc_table_node()
> x86/mm: Use alloc_table() for fill_pte(), etc
> mm/sparsemem: Use alloc_table() for table allocations
> x86/mm: Use free_table in unmap path
> mm/debug_vm_page_table: Use setters instead of WRITE_ONCE
> x86/efi: Toggle table protections when copying
> x86/mm/cpa: Add set_memory_pks()
> x86/mm: Protect page tables with PKS
> x86/mm/cpa: PKS protect direct map page tables
> x86/mm: Add PKS table soft mode
> x86/mm: Add PKS table debug checking
>
> .../admin-guide/kernel-parameters.txt | 4 +
> arch/x86/boot/compressed/ident_map_64.c | 5 +
> arch/x86/include/asm/cpufeatures.h | 2 +-
> arch/x86/include/asm/pgalloc.h | 6 +-
> arch/x86/include/asm/pgtable.h | 31 +-
> arch/x86/include/asm/pgtable_64.h | 33 +-
> arch/x86/include/asm/pkeys_common.h | 1 -
> arch/x86/include/asm/set_memory.h | 24 +
> arch/x86/mm/init.c | 90 +++
> arch/x86/mm/init_64.c | 29 +-
> arch/x86/mm/pat/set_memory.c | 527 +++++++++++++++++-
> arch/x86/mm/pgtable.c | 183 +++++-
> arch/x86/mm/pkeys.c | 4 +
> arch/x86/platform/efi/efi_64.c | 8 +
> include/asm-generic/pgalloc.h | 46 +-
> include/linux/list_lru.h | 26 +
> include/linux/mm.h | 16 +-
> include/linux/pkeys.h | 1 +
> mm/Kconfig | 23 +
> mm/debug_vm_pgtable.c | 36 +-
> mm/list_lru.c | 38 +-
> mm/memory.c | 1 +
> mm/sparse-vmemmap.c | 22 +-
> mm/swap.c | 6 +
> mm/swap_state.c | 5 +
> .../arch/x86/include/asm/disabled-features.h | 8 +-
> 26 files changed, 1123 insertions(+), 52 deletions(-)
>
> --
> 2.17.1
>


--
Boris Lukashev
Systems Architect
Semper Victus