Re: RFC: Exporting NOCMTIME to userspace

From: Aleksandr Koltsoff
Date: Mon Jun 21 2010 - 05:12:17 EST


Andi Kleen wrote:
> Aleksandr Koltsoff <aleksandr.koltsoff@xxxxxxx> writes:
>> On the other hand, if someone can suggest a way to avoid timestamp
>> updates/causing inode writes, I'm all ears and eyes. (using the
>> block-layer directly or writing a custom fs is not really an elegant
>> solution, IMO).
>
> I think what would be better would be to have flush intervals
> that specify that m/c time are only flushed with longer
> intervals (similar to the deferred atime that's now in there)
>
> This would still cause the inode to be written if it gets flushed from
> memory on low memory and occasionally depending on the interval, but
> most of the writes would be gone. All still with the same semantics.

While this might solve the performance aspect of the problem, it will
only migitate the reliability aspect with NANDs, since the inodes will
eventually be flushed anyway (thus causing irreversible wear).

Also, a tunable like you're suggesting would affect all files on a
single filesystem, instead of just a subset of files. Having an option
to "disable" m/ctime updates per file would be optimal in our case,
since we're talking about several years of runtime with the use case.
There are other issues with rrdtool which make this hard, but those are
all solvable without kernel modifications.

That said, the migitative solution would be better than none :-).

Regards,

ak.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/