Re: [Scst-devel] ISCSI-SCST performance (with also IET and STGT data)

From: Vladislav Bolkhovitin
Date: Tue Mar 31 2009 - 13:37:57 EST


Bart Van Assche, on 03/30/2009 10:53 PM wrote:
On Mon, Mar 30, 2009 at 8:33 PM, Vladislav Bolkhovitin <vst@xxxxxxxx> wrote:
Bart Van Assche, on 03/30/2009 10:06 PM wrote:
These are indeed interesting results. There are some aspects of the
test setup I do not understand however:
* All tests have been run with buffered I/O instead of direct I/O
(iflag=direct / oflag=direct). My experience is that the results of
tests with direct I/O are easier to reproduce (less variation between
runs). So I have been wondering why the tests have been run with
buffered I/O instead ?
Real applications use buffered I/O, hence it should be used in tests. It
evaluates all the storage stack on both initiator and target as a whole.
The results are very reproducible, variation is about 10%.

Most applications do indeed use buffered I/O. Database software
however often uses direct I/O. It might be interesting to publish
performance results for both buffered I/O and direct I/O.

Yes, sure

A quote from
the paper "Asynchronous I/O Support in Linux 2.5" by Bhattacharya e.a.
(Linux Symposium, Ottawa, 2003):

Direct I/O (raw and O_DIRECT) transfers data between a user buffer and
a device without copying the data through the kernel’s buffer cache.
This mechanism can boost performance if the data is unlikely to be
used again in the short term (during a disk backup, for example), or
for applications such as large database management systems that
perform their own caching.

Please don't misread phrase "unlikely to be used again in the short term". If you have read-ahead, all your cached data is *likely* to be used "again" in the near future after they were read from storage, although only once in the first read by application. The same is true for write-back caching, where data written to the cache once for each command. Both read-ahead and write back are very important for good performance and O_DIRECT throws them away. All the modern HDDs have a memory buffer (cache) at least 2MB big on the cheapest ones. This cache is essential for performance, although how can it make any difference if the host computer has, say, 1000 times more memory?

Thus, to work effectively with O_DIRECT an application has to be very smart to workaround the lack of read-ahead and write back.

I personally consider O_DIRECT (as well as BLOCKIO) as nothing more than a workaround for possible flaws in the storage subsystem. If O_DIRECT works better, then in 99+% cases there is something in the storage subsystem, which should be fixed to perform better.

To be complete, there is one case where O_DIRECT and BLOCKIO have an advantage: both of them transfer data zero-copy. So they are good if your memory is too slow comparing to storage (InfiniBand case, for instance) and additional data copy hurts performance noticeably.

Bart.

------------------------------------------------------------------------------
_______________________________________________
Scst-devel mailing list
Scst-devel@xxxxxxxxxxxxxxxxxxxxx
https://lists.sourceforge.net/lists/listinfo/scst-devel


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/