Re: [PATCH 1/2] fs: Do not dispatch FITRIM through separate super_operation

From: Greg Freemyer
Date: Fri Nov 19 2010 - 11:21:29 EST


On Fri, Nov 19, 2010 at 3:55 AM, Christoph Hellwig <hch@xxxxxxxxxxxxx> wrote:
> On Thu, Nov 18, 2010 at 05:16:36PM -0800, Greg Freemyer wrote:
>> I agree with Mark.  When you say "make coalescing work" it sounds like
>> major overkill.
>>
>> FITRIM should be able to lock a group of non-contiguous free ranges,
>> send them down to the block layer as a single pre-coalesced set, and
>> the block layer just needs to pass it on in a synchronous way.  Then
>> when that group of ranges is discarded, FITRIM releases the locks.
>
> Given that you know the Linux I/O stack and hardware so well may I
> volunteer you to implement it?  Including remapping the multiple ranges
> in device mapper, and dealing with the incompatible range formats for
> TRIM vs UNMAP.  If your implementation is clean enough I'm pretty
> sure no one will object to merge it.
>
>> Every TRIM causes a cache flush anyway on the SSD, so the synchronous
>> aspect of the process should not be a problem.  (Maybe SCSI with thin
>> provisioning would see a performance hit?)
>
> What's your evidence?

The kernel team has been coding around some Utopian SSD TRIM
implementation for at least 2 years with the basic assumption that
SSDs can handle thousands of trims per second. Just mix em in with
the rest of the i/o. No problem. Intel swore to us its the right
thing to do.

I'm still waiting to see the first benchmark report from anywhere
(SSD, Thin Provisioned SCSI) that the online approach used by mount -o
discard is a win performance wise. Linux has a history of designing
for reality, but for some reason when it comes to SSDs reality seems
not to be a big concern.

Greg
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/