Re: [PATCH v3 5/6] virtio-scsi: fix various bad behavior on aborted requests

From: Venkatesh Srinivas
Date: Wed Jun 04 2014 - 13:29:28 EST


On 6/4/14, Paolo Bonzini <pbonzini@xxxxxxxxxx> wrote:
> Even though the virtio-scsi spec guarantees that all requests related
> to the TMF will have been completed by the time the TMF itself completes,
> the request queue's callback might not have run yet. This causes requests
> to be completed more than once, and as a result triggers a variety of
> BUGs or oopses.
>
> Cc: stable@xxxxxxxxxxxxxxx
> Signed-off-by: Paolo Bonzini <pbonzini@xxxxxxxxxx>
> ---
> drivers/scsi/virtio_scsi.c | 22 ++++++++++++++++++++++
> 1 file changed, 22 insertions(+)
>
> diff --git a/drivers/scsi/virtio_scsi.c b/drivers/scsi/virtio_scsi.c
> index d66c4ee2c774..fda9fb358888 100644
> --- a/drivers/scsi/virtio_scsi.c
> +++ b/drivers/scsi/virtio_scsi.c
> @@ -235,6 +235,16 @@ static void virtscsi_req_done(struct virtqueue *vq)
> virtscsi_vq_done(vscsi, req_vq, virtscsi_complete_cmd);
> };
>
> +static void virtscsi_poll_requests(struct virtio_scsi *vscsi)
> +{
> +	int i, num_vqs;
> +
> +	num_vqs = vscsi->num_queues;
> +	for (i = 0; i < num_vqs; i++)
> +		virtscsi_vq_done(vscsi, &vscsi->req_vqs[i],
> +				 virtscsi_complete_cmd);
> +}
> +
> static void virtscsi_complete_free(struct virtio_scsi *vscsi, void *buf)
> {
> struct virtio_scsi_cmd *cmd = buf;
> @@ -579,6 +589,18 @@ static int virtscsi_tmf(struct virtio_scsi *vscsi, struct virtio_scsi_cmd *cmd)
> cmd->resp.tmf.response == VIRTIO_SCSI_S_FUNCTION_SUCCEEDED)
> ret = SUCCESS;
>
> +	/*
> +	 * The spec guarantees that all requests related to the TMF have
> +	 * been completed, but the callback might not have run yet if
> +	 * we're using independent interrupts (e.g. MSI). Poll the
> +	 * virtqueues once.
> +	 *
> +	 * In the abort case, sc->scsi_done will do nothing, because
> +	 * the block layer must have detected a timeout and as a result
> +	 * REQ_ATOM_COMPLETE has been set.
> +	 */
> +	virtscsi_poll_requests(vscsi);

Do you really want to poll the request VQs for completions if the TMF
was rejected?

For example, TMF ABORT may return FUNCTION REJECTED if the command
being aborted completed before the device saw the TMF ABORT message.
In such cases, polling the request VQs would unnecessarily lengthen
the EH path.
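
To make the question concrete, here is a rough userspace sketch of the
alternative: poll the request queues only when the TMF reports success.
It is a model, not a proposed patch. All mock_* names and types are
hypothetical stand-ins for the driver's structures, and the poll is
reduced to a counter. Whether skipping the poll on rejection is
actually safe is the open question.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins; these are NOT the kernel's definitions. */
enum mock_tmf_response {
	MOCK_FUNCTION_SUCCEEDED,
	MOCK_FUNCTION_REJECTED,
};

enum mock_eh_result { MOCK_FAILED, MOCK_SUCCESS };

struct mock_vscsi {
	int num_queues;
	int polls;	/* counts per-queue polls, for illustration only */
};

/* Models virtscsi_poll_requests(): visit every request queue once. */
static void mock_poll_requests(struct mock_vscsi *vscsi)
{
	int i;

	for (i = 0; i < vscsi->num_queues; i++)
		vscsi->polls++;
}

/* Models the tail of virtscsi_tmf(), polling only on the success path. */
static enum mock_eh_result mock_tmf_finish(struct mock_vscsi *vscsi,
					   enum mock_tmf_response resp)
{
	assert(vscsi != NULL);

	if (resp != MOCK_FUNCTION_SUCCEEDED)
		return MOCK_FAILED;	/* rejected: skip the poll */

	mock_poll_requests(vscsi);
	return MOCK_SUCCESS;
}
```

With four queues, a rejected TMF leaves the poll counter untouched,
while a successful one visits each queue once.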

> +
> out:
> mempool_free(cmd, virtscsi_cmd_pool);
> return ret;
> --
> 1.8.3.1

Thanks for looking into this,
-- vs;