Re: [PATCH v8 11/12] user_events: Validate user payloads for size and null termination

From: Beau Belgrave
Date: Thu Jan 06 2022 - 20:02:42 EST


On Fri, Jan 07, 2022 at 08:32:52AM +0900, Masami Hiramatsu wrote:
> Hi Beau,
>
> On Mon, 3 Jan 2022 10:53:08 -0800
> Beau Belgrave <beaub@xxxxxxxxxxxxxxxxxxx> wrote:
>
> [...]
> > > > typedef void (*user_event_func_t) (struct user_event *user, struct iov_iter *i,
> > > > - void *tpdata);
> > > > + void *tpdata, bool *faulted);
> > >
> > > Why don't you just return "int" value? ;-)
> > >
> >
> > There can be more than one callback attached per-probe, and in all cases
> > where a return value is needed is for a faulted (or would have faulted)
> > case. This allows less branches when data is being traced/logged as the
> > return value does not need to be checked (nor should it short circuit
> > other probes that are attached).
>
> Would you mean overwriting the 'faulted' ? If so, you can do something like
>
> faulted = 0;
> for_each_user_event_func(user_event_func) {
> faulted |= user_event_func();
> }
> if (faulted)
> ...
>

Yeah, could OR it in, I don't see a big difference though to be honest
:)

> But I think if one user_event_func() fails to access the user data,
> other funcs also fail. In this case, it is faster to skip others than
> repeating faults.

eBPF will not fault when perf/ftrace could, at the very least we want to
ensure that callbacks get a chance to see data even if it faulted
elsewhere. This ensures that we are not blind to the fact it is happening
at least when eBPF is being used. We cannot guarantee probe/callback order.

This has been a problem in the past for us, we've seen data disappear later
to find out it was possibly due to a page fault occurring.

In later versions I would like to add internal tracepoints for these
conditions (and some others) so we can further track when they occur.

Thanks,
-Beau