Re: [PATCH v3 12/25] bus: mhi: ep: Add support for ring management

From: Manivannan Sadhasivam
Date: Fri Feb 18 2022 - 10:23:39 EST


On Fri, Feb 18, 2022 at 01:37:04PM +0530, Manivannan Sadhasivam wrote:
> On Tue, Feb 15, 2022 at 02:03:13PM -0600, Alex Elder wrote:
> > On 2/12/22 12:21 PM, Manivannan Sadhasivam wrote:
> > > Add support for managing the MHI ring. The MHI ring is a circular queue
> > > of data structures used to pass the information between host and the
> > > endpoint.
> > >
> > > MHI support 3 types of rings:
> > >
> > > 1. Transfer ring
> > > 2. Event ring
> > > 3. Command ring
> > >
> > > All rings reside inside the host memory and the MHI EP device maps it to
> > > the device memory using blocks like PCIe iATU. The mapping is handled in
> > > the MHI EP controller driver itself.
> > >
> > > Signed-off-by: Manivannan Sadhasivam <manivannan.sadhasivam@xxxxxxxxxx>
> >
> > Great explanation. One more thing to add, is that the command
> > and transfer rings are directed from the host to the MHI EP device,
> > while the event rings are directed from the EP device toward the
> > host.
> >
>
> That's correct, will add.
>
> > I notice that you've improved a few things I had notes about,
> > and I don't recall suggesting them. I'm very happy about that.
> >
> > I have a few more comments here, some worth thinking about
> > at least.
> >
> > -Alex
> >
> > > ---
> > > drivers/bus/mhi/ep/Makefile | 2 +-
> > > drivers/bus/mhi/ep/internal.h | 33 +++++
> > > drivers/bus/mhi/ep/main.c | 59 +++++++-
> > > drivers/bus/mhi/ep/ring.c | 267 ++++++++++++++++++++++++++++++++++
> > > include/linux/mhi_ep.h | 11 ++
> > > 5 files changed, 370 insertions(+), 2 deletions(-)
> > > create mode 100644 drivers/bus/mhi/ep/ring.c
> > >
> > > diff --git a/drivers/bus/mhi/ep/Makefile b/drivers/bus/mhi/ep/Makefile
> > > index a1555ae287ad..7ba0e04801eb 100644
> > > --- a/drivers/bus/mhi/ep/Makefile
> > > +++ b/drivers/bus/mhi/ep/Makefile
> > > @@ -1,2 +1,2 @@
> > > obj-$(CONFIG_MHI_BUS_EP) += mhi_ep.o
> > > -mhi_ep-y := main.o mmio.o
> > > +mhi_ep-y := main.o mmio.o ring.o
> > > diff --git a/drivers/bus/mhi/ep/internal.h b/drivers/bus/mhi/ep/internal.h
> > > index 2c756a90774c..48d6e9667d55 100644
> > > --- a/drivers/bus/mhi/ep/internal.h
> > > +++ b/drivers/bus/mhi/ep/internal.h
> > > @@ -112,6 +112,18 @@ enum mhi_ep_execenv {
> > > MHI_EP_UNRESERVED
> > > };
> > > +/* Transfer Ring Element macros */
> > > +#define MHI_EP_TRE_PTR(ptr) (ptr)
> > > +#define MHI_EP_TRE_DWORD0(len) (len & MHI_MAX_MTU)
> >
> > The above looks funny. This assumes MHI_MAX_MTU is
> > a mask value (likely one less than a power-of-2).
> > That doesn't seem obvious to me; use modulo if you
> > must, but better, just ensure len is in range rather
> > than silently truncating it if it's not.
> >
> > > +#define MHI_EP_TRE_DWORD1(bei, ieot, ieob, chain) ((2 << 16) | (bei << 10) \
> > > + | (ieot << 9) | (ieob << 8) | chain)
> >
> > You should probably use FIELD_PREP() to compute the value
> > here, since you're using FIELD_GET() to extract the field
> > values below.
> >
> > > +#define MHI_EP_TRE_GET_PTR(tre) ((tre)->ptr)
> > > +#define MHI_EP_TRE_GET_LEN(tre) ((tre)->dword[0] & 0xffff)
> > > +#define MHI_EP_TRE_GET_CHAIN(tre) FIELD_GET(BIT(0), (tre)->dword[1])
> >
> > #define TRE_FLAG_CHAIN BIT(0)
> >
> > Then just call
> > bei = FIELD_GET(TRE_FLAG_CHAIN, tre->dword[1]);
> >
> > But I haven't looked at the code where this is used yet.
> >
> > > +#define MHI_EP_TRE_GET_IEOB(tre) FIELD_GET(BIT(8), (tre)->dword[1])
> > > +#define MHI_EP_TRE_GET_IEOT(tre) FIELD_GET(BIT(9), (tre)->dword[1])
> > > +#define MHI_EP_TRE_GET_BEI(tre) FIELD_GET(BIT(10), (tre)->dword[1])
> > > +
> >
> > These macros should be shared/shareable between the host and endpoint.
> > They operate on external interfaces and so should be byte swapped
> > (where used) when updating actual memory. Unlike the patches from
> > Paul Davey early in this series, this does *not* byte swap the
> > values in the right hand side of these definitions, which is good.
> >
> > I'm pretty sure I mentioned this before... I don't really like these
> > "DWORD" macros that simply write compute register values to write
> > out to the TREs. A TRE is a structure, not a set of registers. And
> > a whole TRE can be written or read in a single ARM instruction in
> > some cases--but most likely you need to define it as a structure
> > for that to happen.
> >
> > struct mhi_tre {
> > __le64 addr;
> > __le16 len_opcode
> > __le16 reserved;
> > __le32 flags;
> > };
>
> Changing the TRE structure requires changes to both host and endpoint
> stack. So I'll tackle this as an improvement later.
>
> Added to TODO list.

Just did a comparision w/ IPA code and I convinced myself that this conversion
should happen now itself. So please ignore my above comment.

Thanks,
Mani

>
> >
> > Which reminds me, this shared memory area should probably be mapped
> > using memremap() rather than ioremap(). I haven't checked whether
> > it is...
> >
> > > enum mhi_ep_ring_type {
> > > RING_TYPE_CMD = 0,
> > > RING_TYPE_ER,
> > > @@ -131,6 +143,11 @@ union mhi_ep_ring_ctx {
> > > struct mhi_generic_ctx generic;
> > > };
> > > +struct mhi_ep_ring_item {
> > > + struct list_head node;
> > > + struct mhi_ep_ring *ring;
> > > +};
> > > +
> > > struct mhi_ep_ring {
> > > struct mhi_ep_cntrl *mhi_cntrl;
> > > int (*ring_cb)(struct mhi_ep_ring *ring, struct mhi_ep_ring_element *el);
> > > @@ -143,6 +160,9 @@ struct mhi_ep_ring {
> > > u32 db_offset_h;
> > > u32 db_offset_l;
> > > u32 ch_id;
> > > + u32 er_index;
> > > + u32 irq_vector;
> > > + bool started;
> > > };
> > > struct mhi_ep_cmd {
> > > @@ -168,6 +188,19 @@ struct mhi_ep_chan {
> > > bool skip_td;
> > > };
> > > +/* MHI Ring related functions */
> > > +void mhi_ep_ring_init(struct mhi_ep_ring *ring, enum mhi_ep_ring_type type, u32 id);
> > > +void mhi_ep_ring_reset(struct mhi_ep_cntrl *mhi_cntrl, struct mhi_ep_ring *ring);
> > > +int mhi_ep_ring_start(struct mhi_ep_cntrl *mhi_cntrl, struct mhi_ep_ring *ring,
> > > + union mhi_ep_ring_ctx *ctx);
> > > +size_t mhi_ep_ring_addr2offset(struct mhi_ep_ring *ring, u64 ptr);
> > > +int mhi_ep_process_ring(struct mhi_ep_ring *ring);
> > > +int mhi_ep_ring_add_element(struct mhi_ep_ring *ring, struct mhi_ep_ring_element *element);
> > > +void mhi_ep_ring_inc_index(struct mhi_ep_ring *ring);
> > > +int mhi_ep_process_cmd_ring(struct mhi_ep_ring *ring, struct mhi_ep_ring_element *el);
> > > +int mhi_ep_process_tre_ring(struct mhi_ep_ring *ring, struct mhi_ep_ring_element *el);
> > > +int mhi_ep_update_wr_offset(struct mhi_ep_ring *ring);
> > > +
> > > /* MMIO related functions */
> > > u32 mhi_ep_mmio_read(struct mhi_ep_cntrl *mhi_cntrl, u32 offset);
> > > void mhi_ep_mmio_write(struct mhi_ep_cntrl *mhi_cntrl, u32 offset, u32 val);
> > > diff --git a/drivers/bus/mhi/ep/main.c b/drivers/bus/mhi/ep/main.c
> > > index 950b5bcabe18..2c8045766292 100644
> > > --- a/drivers/bus/mhi/ep/main.c
> > > +++ b/drivers/bus/mhi/ep/main.c
> > > @@ -18,6 +18,48 @@
> > > static DEFINE_IDA(mhi_ep_cntrl_ida);
> >
> > The following function handles command or channel interrupt work.
> >
>
> Both
>
> > > +static void mhi_ep_ring_worker(struct work_struct *work)
> > > +{
> > > + struct mhi_ep_cntrl *mhi_cntrl = container_of(work,
> > > + struct mhi_ep_cntrl, ring_work);
> > > + struct device *dev = &mhi_cntrl->mhi_dev->dev;
> > > + struct mhi_ep_ring_item *itr, *tmp;
> > > + struct mhi_ep_ring *ring;
> > > + struct mhi_ep_chan *chan;
> > > + unsigned long flags;
> > > + LIST_HEAD(head);
> > > + int ret;
> > > +
> > > + /* Process the command ring first */
> > > + ret = mhi_ep_process_ring(&mhi_cntrl->mhi_cmd->ring);
> > > + if (ret) {
> >
> > At the moment I'm not sure where this work gets scheduled.
> > But what if there is no command to process? It looks
> > like you go update the cached pointer no matter what
> > to see if there's anything new. But it seems like you
> > ought to be able to do this when interrupted for a
> > command rather than all the time.
> >
>
> No, ring cache is not getting updated all the time. If you look into
> process_ring(), first the write pointer is read from MMIO and there is a
> check to see if there are elements in the ring or not. Only if that
> check passes, the ring cache will get updated.
>
> Since the same work item is used for both cmd and transfer rings, this
> check is necessary. The other option would be to use different work items
> for command and transfer rings. This is something I want to try once
> this initial version gets merged.
>
> Thanks,
> Mani