[RFC PATCH v0.1 0/9] UMCG early preview/RFC patchset

From: Peter Oskolkov
Date: Thu May 20 2021 - 14:36:23 EST


As indicated earlier in the FUTEX_SWAP patchset:

https://lore.kernel.org/lkml/20200722234538.166697-1-posk@xxxxxxx/

"Google Fibers" is a userspace scheduling framework
used widely and successfully at Google to improve in-process workload
isolation and response latencies. We are working on open-sourcing
this framework, and UMCG (User-Managed Concurrency Groups) kernel
patches are intended as the foundation of this.

This patchset is "early preview/RFC" - an earlier version of the
"core UMCG API" was discussed offlist, and I was asked to post
"what I have" to LKML before I consider the work ready for a full
review.

Notes:
- the first six patches cover "core UMCG API" and are more "ready"
than the last three, in the sense that I expect to see few, if any,
material changes to them, unless the whole approach is NACKed;
- the last three patches cover "server/worker UMCG API" and need
more work and testing:
- while I'm not aware of any specific issues with them, I have not
implemented and/or tested, yet, many important use cases, such as:
- tracing
- signals/interrupts
- explicit preemption of workers other than cooperative wait/swap
- comments/documentation is missing in many important places, or
maybe even wrong/outdated.

As such, please pay more attention to the high-level intended behavior
and design than to things like patch organization, contents of commit
messages, comments or indentation, especially in the last three patches.

Unless the feedback here points to a different approach, my next step
is to add timeout handling to sys_umcg_wait/sys_umcg_swap, as this
will open up a lot of Google-internal tests that cover most of
use/corner cases other than explicit preemption of workers (Google
Fibers use cooperative scheduling features only). Then I'll
work on issues uncovered by those tests. Then I'll address preemption
and tracing.


This work is loosely based on Google-internal SwitchTo and SwitchTo
Groups kernel patches developed by Paul Turner and Ben Segall.

Peter Oskolkov (9):
sched/umcg: add UMCG syscall stubs and CONFIG_UMCG
sched/umcg: add uapi/linux/umcg.h and sched/umcg.c
sched: add WF_CURRENT_CPU and externise ttwu
sched/umcg: implement core UMCG API
lib/umcg: implement UMCG core API for userspace
selftests/umcg: add UMCG core API selftest
sched/umcg: add UMCG server/worker API (early RFC)
lib/umcg: add UMCG server/worker API (early RFC)
selftests/umcg: add UMCG server/worker API selftest

arch/x86/entry/syscalls/syscall_64.tbl | 11 +
include/linux/mm_types.h | 5 +
include/linux/sched.h | 7 +-
include/linux/syscalls.h | 14 +
include/uapi/asm-generic/unistd.h | 25 +-
include/uapi/linux/umcg.h | 70 ++
init/Kconfig | 10 +
kernel/fork.c | 11 +
kernel/sched/Makefile | 1 +
kernel/sched/core.c | 17 +-
kernel/sched/fair.c | 4 +
kernel/sched/sched.h | 15 +-
kernel/sched/umcg.c | 1114 +++++++++++++++++
kernel/sched/umcg.h | 96 ++
kernel/sys_ni.c | 13 +
mm/init-mm.c | 4 +
tools/lib/umcg/.gitignore | 4 +
tools/lib/umcg/Makefile | 11 +
tools/lib/umcg/libumcg.c | 572 +++++++++
tools/lib/umcg/libumcg.h | 262 ++++
tools/testing/selftests/umcg/.gitignore | 3 +
tools/testing/selftests/umcg/Makefile | 15 +
tools/testing/selftests/umcg/umcg_core_test.c | 347 +++++
tools/testing/selftests/umcg/umcg_test.c | 475 +++++++
24 files changed, 3096 insertions(+), 10 deletions(-)
create mode 100644 include/uapi/linux/umcg.h
create mode 100644 kernel/sched/umcg.c
create mode 100644 kernel/sched/umcg.h
create mode 100644 tools/lib/umcg/.gitignore
create mode 100644 tools/lib/umcg/Makefile
create mode 100644 tools/lib/umcg/libumcg.c
create mode 100644 tools/lib/umcg/libumcg.h
create mode 100644 tools/testing/selftests/umcg/.gitignore
create mode 100644 tools/testing/selftests/umcg/Makefile
create mode 100644 tools/testing/selftests/umcg/umcg_core_test.c
create mode 100644 tools/testing/selftests/umcg/umcg_test.c

--
2.31.1.818.g46aad6cb9e-goog