Re: [PATCH 00/13] [RFC] Rust support

From: Olliver Schinagl
Date: Thu Jul 28 2022 - 16:43:34 EST


Hey Gary,

On 28-07-2022 12:21, Gary Guo wrote:
Hi Olliver,

On Wed, 27 Jul 2022 10:05:31 +0200
Olliver Schinagl <oliver+list@xxxxxxxxxxx> wrote:

Consitency is absolutly important! Zero argument there. My argument
is, the consistency should be within the kernel tree, not 'but the
rest of the world is using style X/Y/Z, lets be consistent with that.
In an utopia, maybe, but the real world doesn't work that way, sadly.
So in an attempt to standardize (rustfmt) they just "invented" a new
standard. Which btw is common, we see this happening every so often,
right?

Difference languages have different characteristics and I don't think
it's necessarily good (and often isn't) to blindly apply coding style
of one language onto another. So I don't see rustfmt as "inventing yet
another standard" really, because there aren't many conflicting coding
style standards in Rust world; almost everyone just settled on using
rustfmt, mostly using the default, maybe with a few small
project-specific configuration tweaks.
I was mostly arguing about a) lets look at this and b) having said configuration tweaks, rather then blindly (pun not really intended) going with rust's defaults :)


A small example for C and Rust differences:

Rust requires braces around branches of if expression, and C doesn't.
So in kernel coding style you often have:

if (condition) do_something();

Or

if (condition)
do_something();

But in Rust it will be:

if condition {
do_something();
}
So kernel style kind of says 'no braces around single statements'; but if your rust compiler doesn't allow this; well then there's nothing to do. You could even argue to update the kernel C style on this to make it consistent again. BUT, this inconsistency makes it cognative 'hard'. If this if a C or a rust function? for example during a review. During authoring, when writing both C and rust code (due to nececity, not constant context switching) you cognitivly constantly have to go back/foward. While I'm sure there's people here that can do this all day without problem, some of of find this harder then needs to be. Hence the request to _try_ to keep consistency within the kernel tree.


That's just an example of one control flow constructions. There are
differences between Rust match and C switch, etc. Rust's official
coding style takes properties of Rust into consideration, so in many
regards it's a more suitable coding style for Rust code in kernel, then
applying kernel's C coding standard directly on kernel's Rust code.

Your earlier email in the thread also mentions about indentation, and I
have a few things to point out as well.

First, Rust code typically requires more levels of indentation than C
code. For example, many functions might be methods and they are inside
an impl block, which creates one extra level of indentation.
Statements inside match arms' block are two levels more indented than
the match statement itself, as opposed to C's switch (as kernel coding
style doesn't indent the case labels). As a result, 8 spaces for 1 level
can be a bit excessive for Rust code, and thus the 4 space indentation
used in rustfmt default.

Secondly, I don't think the argument about tabs being customisable
holds; in kernel coding style tabs are strictly 8 characters. For line
Sure, this rule implies that for alignment, tabs should be set to 8 so things align nicely. However, nobody forces people to set their editor to 8 character width. Not doing so, doesn't break anything. At worst, you may commit something that is poorly aligned (but we _should_ be using tabs to indent, spaces to align anyway :p, tab == indent has meaning).

With non-tab indentation, this is no longer really possible, or at least, editors haven't solved that problem yet, as it tends to still break (due to the mixing of indentation and alignment using a single character). Maybe once we have AI and ML in our editors though :p

continuation it's not uncommon to use a series of tabs followed by a
few spaces, e.g.

int function_name(int first_argument,
< tab >< tab >..int second_argument)

changing tab into 4 spaces will break the layout. (and I'll not go into
well-known reasons about non-4-space-tab messing up code in terminal
etc).

Copy/pasting is known to cause bugs. There's actually research from
NASA on that. Code-reuse (libraries/functions) are not bad. But
(worst kind of example) copy paste from stack-overflow, or
copy/pasting stuff without actually looking at the content and
forgetting to rename something, causes bugs. Why is this relevant?
The whole 'lets be consistent with the rust codebase of the wrold'
argument. E.g. if everybody uses the same style (which is idealistic
and great) then copy/pasting becomes consistent. Where I say, try to
be careful when copy/pasting code.

When we vendor in code as a whole (e.g. like we currently do for
alloc crate), it is proper code reuse. With different coding style the
vendored code either diverges from upstream (which makes upstreaming
much more difficult) or diverge from rest of kernel's Rust code base.
Very fair point of course. Though really, we should fix the upstream rust preferred format, but there it was already stated, that 'too bad, sorry' which from a developer point of view is fine, your project, your choice. From a disabilities point of view, sucks of course.


But if that is the case, why not try to follow the kernels existing
code-style as close as possible with the rust-fmt configuration? I
know code-style has been discussed a few times over the decades; but
not many changes have been done, surely, if there's some codestyle
changes that are best for the kernel, they would have been 'advised'?
'4 space indents are better then 8-size tabs, on new code, try to use
them' for example :p

You do realize that you are creating a new coding style by doing this,
right? It feels like creating problems rather than solving problems.

My personal feeling is that it's easier for me to adapt to different
coding style when switching between languages, but it's rather awkward
for me when trying to use different coding styles with the same
language. I find myself no problem switching between 2 spaces when
coding JavaScript to 4 spaces when coding Rust to 8 spaces(tab) when
coding C, but it's rather painful to switch between C projects with
different coding styles. I certainly don't want to switch between Rust
projects with vastly different coding styles.
And I'm happy for you that you can easily take in 2 and 4 spaces. For me, it is extremly hard to read. So it's not a 'personal preference' thing. But I suggest to read the earlier posted links, where others at length explain it as well, how it is like to feel excluded becaues its just hard to read.


But why? Why should we not be consistent with the kernels' code-base
(while yes, that is not rust, but C, but we can follow the same
style?)

Difference languages have different characteristics, and one size
doesn't fit them all :)
I'm not even arguing this at all :)

I think the biggest issues i'm speaking of really are the braces and the spaces really, where the braces can be argued for/against, it's cognitive harder, but can be dealth with (and we can expect inconsitencies; but the sapces vs tabs thing, personal configuration vs forced with is the point I was trying to raise.

As said before, 'every building is different, some offer wheelchair ramps, others do' kind of point, not like 'this building is red, and that one is blue, and not every color fits all :p


Sadly, I've seen so much vendor code (yeah, I know) which doesn't
even have consistency in their own files ...

That's very true. So when all other Rust code currently follow
(roughly) the same coding style and this situation doesn't currently
exist, let's not make it worse...

Best,
Gary