Re: [PATCH 01/10] PCI: designware-ep: Add multiple PFs support for DWC

From: Andrew Murray
Date: Fri Aug 16 2019 - 08:35:32 EST


On Fri, Aug 16, 2019 at 11:00:01AM +0000, Xiaowei Bao wrote:
>
>
> > -----Original Message-----
> > From: Andrew Murray <andrew.murray@xxxxxxx>
> > Sent: 2019å8æ16æ 17:45
> > To: Xiaowei Bao <xiaowei.bao@xxxxxxx>
> > Cc: jingoohan1@xxxxxxxxx; gustavo.pimentel@xxxxxxxxxxxx;
> > mark.rutland@xxxxxxx; shawnguo@xxxxxxxxxx; Leo Li
> > <leoyang.li@xxxxxxx>; kishon@xxxxxx; lorenzo.pieralisi@xxxxxxx;
> > arnd@xxxxxxxx; gregkh@xxxxxxxxxxxxxxxxxxx; M.h. Lian
> > <minghuan.lian@xxxxxxx>; Roy Zang <roy.zang@xxxxxxx>;
> > linux-pci@xxxxxxxxxxxxxxx; devicetree@xxxxxxxxxxxxxxx;
> > linux-kernel@xxxxxxxxxxxxxxx; linux-arm-kernel@xxxxxxxxxxxxxxxxxxx;
> > linuxppc-dev@xxxxxxxxxxxxxxxx; Z.q. Hou <zhiqiang.hou@xxxxxxx>
> > Subject: Re: [PATCH 01/10] PCI: designware-ep: Add multiple PFs support for
> > DWC
> >
> > On Fri, Aug 16, 2019 at 02:55:41AM +0000, Xiaowei Bao wrote:
> > >
> > >
> > > > -----Original Message-----
> > > > From: Andrew Murray <andrew.murray@xxxxxxx>
> > > > Sent: 2019å8æ15æ 19:32
> > > > To: Xiaowei Bao <xiaowei.bao@xxxxxxx>
> > > > Cc: jingoohan1@xxxxxxxxx; gustavo.pimentel@xxxxxxxxxxxx;
> > > > bhelgaas@xxxxxxxxxx; robh+dt@xxxxxxxxxx; mark.rutland@xxxxxxx;
> > > > shawnguo@xxxxxxxxxx; Leo Li <leoyang.li@xxxxxxx>; kishon@xxxxxx;
> > > > lorenzo.pieralisi@xxxxxxx; arnd@xxxxxxxx;
> > > > gregkh@xxxxxxxxxxxxxxxxxxx; M.h. Lian <minghuan.lian@xxxxxxx>;
> > > > Mingkai Hu <mingkai.hu@xxxxxxx>; Roy Zang <roy.zang@xxxxxxx>;
> > > > linux-pci@xxxxxxxxxxxxxxx; devicetree@xxxxxxxxxxxxxxx;
> > > > linux-kernel@xxxxxxxxxxxxxxx; linux-arm-kernel@xxxxxxxxxxxxxxxxxxx;
> > > > linuxppc-dev@xxxxxxxxxxxxxxxx
> > > > Subject: Re: [PATCH 01/10] PCI: designware-ep: Add multiple PFs
> > > > support for DWC
> > > >
> > > > On Thu, Aug 15, 2019 at 04:37:07PM +0800, Xiaowei Bao wrote:
> > > > > Add multiple PFs support for DWC, different PF have different
> > > > > config space, we use pf-offset property which get from the DTS to
> > > > > access the different pF config space.
> > > >
> > > > Thanks for the patch. I haven't seen a cover letter for this series,
> > > > is there one missing?
> > > Maybe I miss, I will add you to review next time, thanks a lot for your
> > comments.
> > > >
> > > >
> > > > >
> > > > > Signed-off-by: Xiaowei Bao <xiaowei.bao@xxxxxxx>
> > > > > ---
> > > > > drivers/pci/controller/dwc/pcie-designware-ep.c | 97
> > > > +++++++++++++---------
> > > > > drivers/pci/controller/dwc/pcie-designware.c | 105
> > > > ++++++++++++++++++++++--
> > > > > drivers/pci/controller/dwc/pcie-designware.h | 10 ++-
> > > > > include/linux/pci-epc.h | 1 +
> > > > > 4 files changed, 164 insertions(+), 49 deletions(-)
> > > > >
> > > > > diff --git a/drivers/pci/controller/dwc/pcie-designware-ep.c
> > > > > b/drivers/pci/controller/dwc/pcie-designware-ep.c
> > > > > index 2bf5a35..75e2955 100644
> > > > > --- a/drivers/pci/controller/dwc/pcie-designware-ep.c
> > > > > +++ b/drivers/pci/controller/dwc/pcie-designware-ep.c
> > > > > @@ -19,12 +19,14 @@ void dw_pcie_ep_linkup(struct dw_pcie_ep
> > *ep)
> > > > > pci_epc_linkup(epc);
> > > > > }
> > > > >
> > > > > -static void __dw_pcie_ep_reset_bar(struct dw_pcie *pci, enum
> > > > > pci_barno
> > > > bar,
> > > > > - int flags)
> > > > > +static void __dw_pcie_ep_reset_bar(struct dw_pcie *pci, u8 func_no,
> > > > > + enum pci_barno bar, int flags)
> > > > > {
> > > > > u32 reg;
> > > > > + struct pci_epc *epc = pci->ep.epc;
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > >
> > > > > - reg = PCI_BASE_ADDRESS_0 + (4 * bar);
> > > > > + reg = pf_base + PCI_BASE_ADDRESS_0 + (4 * bar);
> > > >
> > > > I think I'd rather see this arithmetic (and the one for determining
> > > > pf_base) inside a macro or inline header function. This would make
> > > > this code more readable and reduce the chances of an error by avoiding
> > duplication of code.
> > > >
> > > > For example look at cdns_pcie_ep_fn_writeb and
> > > > ROCKCHIP_PCIE_EP_FUNC_BASE for examples of other EP drivers that do
> > > > this.
> > > Agree, this looks fine, thanks a lot for your comments, I will use
> > > this way to access the registers in next version patch.
> > > >
> > > >
> > > > > dw_pcie_dbi_ro_wr_en(pci);
> > > > > dw_pcie_writel_dbi2(pci, reg, 0x0);
> > > > > dw_pcie_writel_dbi(pci, reg, 0x0); @@ -37,7 +39,12 @@ static
> > > > > void __dw_pcie_ep_reset_bar(struct dw_pcie *pci, enum pci_barno
> > > > > bar,
> > > > >
> > > > > void dw_pcie_ep_reset_bar(struct dw_pcie *pci, enum pci_barno bar)
> > {
> > > > > - __dw_pcie_ep_reset_bar(pci, bar, 0);
> > > > > + u8 func_no, funcs;
> > > > > +
> > > > > + funcs = pci->ep.epc->max_functions;
> > > > > +
> > > > > + for (func_no = 0; func_no < funcs; func_no++)
> > > > > + __dw_pcie_ep_reset_bar(pci, func_no, bar, 0);
> > > > > }
> > > > >
> > > > > static u8 __dw_pcie_ep_find_next_cap(struct dw_pcie *pci, u8
> > > > > cap_ptr, @@ -78,28 +85,29 @@ static int
> > > > > dw_pcie_ep_write_header(struct pci_epc *epc, u8 func_no, {
> > > > > struct dw_pcie_ep *ep = epc_get_drvdata(epc);
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > >
> > > > > dw_pcie_dbi_ro_wr_en(pci);
> > > > > - dw_pcie_writew_dbi(pci, PCI_VENDOR_ID, hdr->vendorid);
> > > > > - dw_pcie_writew_dbi(pci, PCI_DEVICE_ID, hdr->deviceid);
> > > > > - dw_pcie_writeb_dbi(pci, PCI_REVISION_ID, hdr->revid);
> > > > > - dw_pcie_writeb_dbi(pci, PCI_CLASS_PROG, hdr->progif_code);
> > > > > - dw_pcie_writew_dbi(pci, PCI_CLASS_DEVICE,
> > > > > + dw_pcie_writew_dbi(pci, pf_base + PCI_VENDOR_ID,
> > hdr->vendorid);
> > > > > + dw_pcie_writew_dbi(pci, pf_base + PCI_DEVICE_ID, hdr->deviceid);
> > > > > + dw_pcie_writeb_dbi(pci, pf_base + PCI_REVISION_ID, hdr->revid);
> > > > > + dw_pcie_writeb_dbi(pci, pf_base + PCI_CLASS_PROG,
> > hdr->progif_code);
> > > > > + dw_pcie_writew_dbi(pci, pf_base + PCI_CLASS_DEVICE,
> > > > > hdr->subclass_code | hdr->baseclass_code << 8);
> > > > > - dw_pcie_writeb_dbi(pci, PCI_CACHE_LINE_SIZE,
> > > > > + dw_pcie_writeb_dbi(pci, pf_base + PCI_CACHE_LINE_SIZE,
> > > > > hdr->cache_line_size);
> > > > > - dw_pcie_writew_dbi(pci, PCI_SUBSYSTEM_VENDOR_ID,
> > > > > + dw_pcie_writew_dbi(pci, pf_base + PCI_SUBSYSTEM_VENDOR_ID,
> > > > > hdr->subsys_vendor_id);
> > > > > - dw_pcie_writew_dbi(pci, PCI_SUBSYSTEM_ID, hdr->subsys_id);
> > > > > - dw_pcie_writeb_dbi(pci, PCI_INTERRUPT_PIN,
> > > > > + dw_pcie_writew_dbi(pci, pf_base + PCI_SUBSYSTEM_ID,
> > > > hdr->subsys_id);
> > > > > + dw_pcie_writeb_dbi(pci, pf_base + PCI_INTERRUPT_PIN,
> > > > > hdr->interrupt_pin);
> > > > > dw_pcie_dbi_ro_wr_dis(pci);
> > > > >
> > > > > return 0;
> > > > > }
> > > > >
> > > > > -static int dw_pcie_ep_inbound_atu(struct dw_pcie_ep *ep, enum
> > > > pci_barno bar,
> > > > > - dma_addr_t cpu_addr,
> > > > > +static int dw_pcie_ep_inbound_atu(struct dw_pcie_ep *ep, u8
> > func_no,
> > > > > + enum pci_barno bar, dma_addr_t cpu_addr,
> > > > > enum dw_pcie_as_type as_type) {
> > > > > int ret;
> > > > > @@ -112,7 +120,7 @@ static int dw_pcie_ep_inbound_atu(struct
> > > > dw_pcie_ep *ep, enum pci_barno bar,
> > > > > return -EINVAL;
> > > > > }
> > > > >
> > > > > - ret = dw_pcie_prog_inbound_atu(pci, free_win, bar, cpu_addr,
> > > > > + ret = dw_pcie_prog_inbound_atu(pci, func_no, free_win, bar,
> > > > > +cpu_addr,
> > > > > as_type);
> > > > > if (ret < 0) {
> > > > > dev_err(pci->dev, "Failed to program IB window\n"); @@
> > -125,7
> > > > > +133,8 @@ static int dw_pcie_ep_inbound_atu(struct dw_pcie_ep *ep,
> > > > enum pci_barno bar,
> > > > > return 0;
> > > > > }
> > > > >
> > > > > -static int dw_pcie_ep_outbound_atu(struct dw_pcie_ep *ep,
> > > > > phys_addr_t phys_addr,
> > > > > +static int dw_pcie_ep_outbound_atu(struct dw_pcie_ep *ep, u8
> > func_no,
> > > > > + phys_addr_t phys_addr,
> > > > > u64 pci_addr, size_t size) {
> > > > > u32 free_win;
> > > > > @@ -137,8 +146,8 @@ static int dw_pcie_ep_outbound_atu(struct
> > > > dw_pcie_ep *ep, phys_addr_t phys_addr,
> > > > > return -EINVAL;
> > > > > }
> > > > >
> > > > > - dw_pcie_prog_outbound_atu(pci, free_win, PCIE_ATU_TYPE_MEM,
> > > > > - phys_addr, pci_addr, size);
> > > > > + dw_pcie_prog_ep_outbound_atu(pci, func_no, free_win,
> > > > PCIE_ATU_TYPE_MEM,
> > > > > + phys_addr, pci_addr, size);
> > > > >
> > > > > set_bit(free_win, ep->ob_window_map);
> > > > > ep->outbound_addr[free_win] = phys_addr; @@ -154,7 +163,7
> > @@
> > > > static
> > > > > void dw_pcie_ep_clear_bar(struct pci_epc *epc, u8 func_no,
> > > > > enum pci_barno bar = epf_bar->barno;
> > > > > u32 atu_index = ep->bar_to_atu[bar];
> > > > >
> > > > > - __dw_pcie_ep_reset_bar(pci, bar, epf_bar->flags);
> > > > > + __dw_pcie_ep_reset_bar(pci, func_no, bar, epf_bar->flags);
> > > > >
> > > > > dw_pcie_disable_atu(pci, atu_index,
> > DW_PCIE_REGION_INBOUND);
> > > > > clear_bit(atu_index, ep->ib_window_map); @@ -170,14 +179,16
> > @@
> > > > > static int dw_pcie_ep_set_bar(struct pci_epc *epc, u8 func_no,
> > > > > size_t size = epf_bar->size;
> > > > > int flags = epf_bar->flags;
> > > > > enum dw_pcie_as_type as_type;
> > > > > - u32 reg = PCI_BASE_ADDRESS_0 + (4 * bar);
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > + u32 reg = PCI_BASE_ADDRESS_0 + (4 * bar) + pf_base;
> > > > >
> > > > > if (!(flags & PCI_BASE_ADDRESS_SPACE))
> > > > > as_type = DW_PCIE_AS_MEM;
> > > > > else
> > > > > as_type = DW_PCIE_AS_IO;
> > > > >
> > > > > - ret = dw_pcie_ep_inbound_atu(ep, bar, epf_bar->phys_addr,
> > as_type);
> > > > > + ret = dw_pcie_ep_inbound_atu(ep, func_no, bar,
> > > > > + epf_bar->phys_addr, as_type);
> > > > > if (ret)
> > > > > return ret;
> > > > >
> > > > > @@ -235,7 +246,7 @@ static int dw_pcie_ep_map_addr(struct pci_epc
> > > > *epc, u8 func_no,
> > > > > struct dw_pcie_ep *ep = epc_get_drvdata(epc);
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > >
> > > > > - ret = dw_pcie_ep_outbound_atu(ep, addr, pci_addr, size);
> > > > > + ret = dw_pcie_ep_outbound_atu(ep, func_no, addr, pci_addr,
> > > > > +size);
> > > > > if (ret) {
> > > > > dev_err(pci->dev, "Failed to enable address\n");
> > > > > return ret;
> > > > > @@ -248,12 +259,13 @@ static int dw_pcie_ep_get_msi(struct pci_epc
> > > > > *epc, u8 func_no) {
> > > > > struct dw_pcie_ep *ep = epc_get_drvdata(epc);
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > u32 val, reg;
> > > > >
> > > > > if (!ep->msi_cap)
> > > > > return -EINVAL;
> > > > >
> > > > > - reg = ep->msi_cap + PCI_MSI_FLAGS;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_FLAGS;
> > > > > val = dw_pcie_readw_dbi(pci, reg);
> > > > > if (!(val & PCI_MSI_FLAGS_ENABLE))
> > > > > return -EINVAL;
> > > > > @@ -267,12 +279,13 @@ static int dw_pcie_ep_set_msi(struct pci_epc
> > > > > *epc, u8 func_no, u8 interrupts) {
> > > > > struct dw_pcie_ep *ep = epc_get_drvdata(epc);
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > u32 val, reg;
> > > > >
> > > > > if (!ep->msi_cap)
> > > > > return -EINVAL;
> > > > >
> > > > > - reg = ep->msi_cap + PCI_MSI_FLAGS;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_FLAGS;
> > > > > val = dw_pcie_readw_dbi(pci, reg);
> > > > > val &= ~PCI_MSI_FLAGS_QMASK;
> > > > > val |= (interrupts << 1) & PCI_MSI_FLAGS_QMASK; @@ -287,12
> > > > > +300,13 @@ static int dw_pcie_ep_get_msix(struct pci_epc *epc, u8
> > func_no) {
> > > > > struct dw_pcie_ep *ep = epc_get_drvdata(epc);
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > u32 val, reg;
> > > > >
> > > > > if (!ep->msix_cap)
> > > > > return -EINVAL;
> > > > >
> > > > > - reg = ep->msix_cap + PCI_MSIX_FLAGS;
> > > > > + reg = ep->msix_cap + pf_base + PCI_MSIX_FLAGS;
> > > > > val = dw_pcie_readw_dbi(pci, reg);
> > > > > if (!(val & PCI_MSIX_FLAGS_ENABLE))
> > > > > return -EINVAL;
> > > > > @@ -306,12 +320,13 @@ static int dw_pcie_ep_set_msix(struct
> > > > > pci_epc *epc, u8 func_no, u16 interrupts) {
> > > > > struct dw_pcie_ep *ep = epc_get_drvdata(epc);
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > u32 val, reg;
> > > > >
> > > > > if (!ep->msix_cap)
> > > > > return -EINVAL;
> > > > >
> > > > > - reg = ep->msix_cap + PCI_MSIX_FLAGS;
> > > > > + reg = ep->msix_cap + pf_base + PCI_MSIX_FLAGS;
> > > > > val = dw_pcie_readw_dbi(pci, reg);
> > > > > val &= ~PCI_MSIX_FLAGS_QSIZE;
> > > > > val |= interrupts;
> > > > > @@ -400,6 +415,7 @@ int dw_pcie_ep_raise_msi_irq(struct
> > dw_pcie_ep
> > > > *ep, u8 func_no,
> > > > > unsigned int aligned_offset;
> > > > > u16 msg_ctrl, msg_data;
> > > > > u32 msg_addr_lower, msg_addr_upper, reg;
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > u64 msg_addr;
> > > > > bool has_upper;
> > > > > int ret;
> > > > > @@ -408,19 +424,19 @@ int dw_pcie_ep_raise_msi_irq(struct
> > > > dw_pcie_ep *ep, u8 func_no,
> > > > > return -EINVAL;
> > > > >
> > > > > /* Raise MSI per the PCI Local Bus Specification Revision 3.0, 6.8.1.
> > */
> > > > > - reg = ep->msi_cap + PCI_MSI_FLAGS;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_FLAGS;
> > > > > msg_ctrl = dw_pcie_readw_dbi(pci, reg);
> > > > > has_upper = !!(msg_ctrl & PCI_MSI_FLAGS_64BIT);
> > > > > - reg = ep->msi_cap + PCI_MSI_ADDRESS_LO;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_ADDRESS_LO;
> > > > > msg_addr_lower = dw_pcie_readl_dbi(pci, reg);
> > > > > if (has_upper) {
> > > > > - reg = ep->msi_cap + PCI_MSI_ADDRESS_HI;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_ADDRESS_HI;
> > > > > msg_addr_upper = dw_pcie_readl_dbi(pci, reg);
> > > > > - reg = ep->msi_cap + PCI_MSI_DATA_64;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_DATA_64;
> > > > > msg_data = dw_pcie_readw_dbi(pci, reg);
> > > > > } else {
> > > > > msg_addr_upper = 0;
> > > > > - reg = ep->msi_cap + PCI_MSI_DATA_32;
> > > > > + reg = ep->msi_cap + pf_base + PCI_MSI_DATA_32;
> > > > > msg_data = dw_pcie_readw_dbi(pci, reg);
> > > > > }
> > > > > aligned_offset = msg_addr_lower & (epc->mem->page_size - 1);
> > @@
> > > > > -439,7 +455,7 @@ int dw_pcie_ep_raise_msi_irq(struct dw_pcie_ep
> > > > > *ep,
> > > > > u8 func_no, }
> > > > >
> > > > > int dw_pcie_ep_raise_msix_irq(struct dw_pcie_ep *ep, u8 func_no,
> > > > > - u16 interrupt_num)
> > > > > + u16 interrupt_num)
> > > > > {
> > > > > struct dw_pcie *pci = to_dw_pcie_from_ep(ep);
> > > > > struct pci_epc *epc = ep->epc;
> > > > > @@ -447,16 +463,17 @@ int dw_pcie_ep_raise_msix_irq(struct
> > > > dw_pcie_ep *ep, u8 func_no,
> > > > > u32 bar_addr_upper, bar_addr_lower;
> > > > > u32 msg_addr_upper, msg_addr_lower;
> > > > > u32 reg, msg_data, vec_ctrl;
> > > > > + u32 pf_base = func_no * epc->pf_offset;
> > > > > u64 tbl_addr, msg_addr, reg_u64;
> > > > > void __iomem *msix_tbl;
> > > > > int ret;
> > > > >
> > > > > - reg = ep->msix_cap + PCI_MSIX_TABLE;
> > > > > + reg = ep->msix_cap + pf_base + PCI_MSIX_TABLE;
> > > > > tbl_offset = dw_pcie_readl_dbi(pci, reg);
> > > > > bir = (tbl_offset & PCI_MSIX_TABLE_BIR);
> > > > > tbl_offset &= PCI_MSIX_TABLE_OFFSET;
> > > > >
> > > > > - reg = PCI_BASE_ADDRESS_0 + (4 * bir);
> > > > > + reg = PCI_BASE_ADDRESS_0 + pf_base + (4 * bir);
> > > > > bar_addr_upper = 0;
> > > > > bar_addr_lower = dw_pcie_readl_dbi(pci, reg);
> > > > > reg_u64 = (bar_addr_lower &
> > PCI_BASE_ADDRESS_MEM_TYPE_MASK);
> > > > @@
> > > > > -592,13 +609,17 @@ int dw_pcie_ep_init(struct dw_pcie_ep *ep)
> > > > > ep->epc = epc;
> > > > > epc_set_drvdata(epc, ep);
> > > > >
> > > > > - if (ep->ops->ep_init)
> > > > > - ep->ops->ep_init(ep);
> > > > > -
> > > > > ret = of_property_read_u8(np, "max-functions",
> > &epc->max_functions);
> > > > > if (ret < 0)
> > > > > epc->max_functions = 1;
> > > > >
> > > > > + ret = of_property_read_u32(np, "pf-offset", &epc->pf_offset);
> > > > > + if (ret < 0)
> > > > > + epc->pf_offset = 0;
> > > >
> > > > Bad things will likely happen if max_functions > 1 and pf-offset isn't set.
> > > > I think the driver should bail in this situation. It would be very
> > > > easy for someone to misconfigure this.
> > > Yes, you are right, but if the max-functions have defined in DTS,
> > > require the pf-offset must define in DTS, I am not sure the correct
> > > value of pf-offsetfor other platforms, so I think the max-functions and
> > pf-offset should not have the dependence.
> >
> > Yes you're correct. I hadn't really thought about this beyond layerscape. It's
> > also possible that other hardware could support multiple PFs without relying
> > on an offset and perhaps employ some other mechanism to access different
> > functions. So whilst this property can be optional for the majority of dwc
> > controllers - it must be set and cannot be zero for layerscape.
> >
> > Perhaps inside ls_pcie_ep_init, you can set max_functions to 1 if pf_offset is
> > 0 and print a WARN to explain why? (Or ls_pcie_ep_init returns failure and
> > dw_pcie_ep_init checks it and bails).
> >
> > The assumption is being made here that future dw controllers may also use
> > pf_offset (is this likely?) - otherwise why is this in pcie-designware-ep.c and
> > not pci-layerscape-ep.c and why is this value not just hard-coded for lp?
>
> Thanks a lot for your detail comments, this give me a lot of help.
> Yes, I agree your point, and I will seriously consider a best way to fix this potential issue.
> Based on your experience, how do other platforms implement the multiple functions?
> The DWC core difference the different PF by signal "client0_tlp_func_num[(PF_WD-1):0]"

I don't know, though looking at the kernel drivers suggests that the
existing EP controllers have a large address space which contains multiple
PFs. They are accessed via macros (ROCKCHIP_PCIE_EP_FUNC_BASE(fn),
CDNS_PCIE_EP_FUNC_BASE(fn)). It would be possible, but probably not desirable
to have a smaller address space (window) and a register that selects which
function the window refers to. This is why I'm slight nervous of assuming
that a pf-offset will cover all future dw drivers - I may be wrong.

> >
> >
> > > even though I didn't define pf-offset when I defined max-functions,
> > > the pf-offset is 0, the DWC ep driver can continue run the progress of
> > > INIT but not return, of course, thus the PF1 will not work, I don't know which
> > way is better.
> Hi Andrew,
> > > >
> > > >
> > > > > +
> > > > > + if (ep->ops->ep_init)
> > > > > + ep->ops->ep_init(ep);
> > > > > +
> > > > > ret = __pci_epc_mem_init(epc, ep->phys_base, ep->addr_size,
> > > > > ep->page_size);
> > > > > if (ret < 0) {
> > > > > diff --git a/drivers/pci/controller/dwc/pcie-designware.c
> > > > > b/drivers/pci/controller/dwc/pcie-designware.c
> > > > > index 7d25102..c99cee4 100644
> > > > > --- a/drivers/pci/controller/dwc/pcie-designware.c
> > > > > +++ b/drivers/pci/controller/dwc/pcie-designware.c
> > > > > @@ -158,6 +158,43 @@ static void dw_pcie_writel_ob_unroll(struct
> > > > dw_pcie *pci, u32 index, u32 reg,
> > > > > dw_pcie_writel_atu(pci, offset + reg, val); }
> > > > >
> > > > > +static void dw_pcie_prog_ep_outbound_atu_unroll(struct dw_pcie
> > > > > +*pci, u8
> > > > func_no,
> > > > > + int index, int type,
> > > > > + u64 cpu_addr, u64 pci_addr,
> > > > > + u32 size)
> > > > > +{
> > > > > + u32 retries, val;
> > > > > +
> > > > > + dw_pcie_writel_ob_unroll(pci, index,
> > PCIE_ATU_UNR_LOWER_BASE,
> > > > > + lower_32_bits(cpu_addr));
> > > > > + dw_pcie_writel_ob_unroll(pci, index,
> > PCIE_ATU_UNR_UPPER_BASE,
> > > > > + upper_32_bits(cpu_addr));
> > > > > + dw_pcie_writel_ob_unroll(pci, index, PCIE_ATU_UNR_LIMIT,
> > > > > + lower_32_bits(cpu_addr + size - 1));
> > > > > + dw_pcie_writel_ob_unroll(pci, index,
> > PCIE_ATU_UNR_LOWER_TARGET,
> > > > > + lower_32_bits(pci_addr));
> > > > > + dw_pcie_writel_ob_unroll(pci, index,
> > PCIE_ATU_UNR_UPPER_TARGET,
> > > > > + upper_32_bits(pci_addr));
> > > > > + dw_pcie_writel_ob_unroll(pci, index,
> > PCIE_ATU_UNR_REGION_CTRL1,
> > > > > + type | PCIE_ATU_FUNC_NUM(func_no));
> > > >
> > > > With the exception of this line, the rest of this function is
> > > > identical to dw_pcie_prog_outbound_atu_unroll.
> > > Yes, I can integrate the same code, but I think we'd better use the
> > > different outbound window set function between RC and EP, because the RC
> > don't need the func_num parameter.
> >
> >
> >
> > > >
> > > > > + dw_pcie_writel_ob_unroll(pci, index,
> > PCIE_ATU_UNR_REGION_CTRL2,
> > > > > + PCIE_ATU_ENABLE);
> > > > > +
> > > > > + /*
> > > > > + * Make sure ATU enable takes effect before any subsequent config
> > > > > + * and I/O accesses.
> > > > > + */
> > > > > + for (retries = 0; retries < LINK_WAIT_MAX_IATU_RETRIES; retries++)
> > {
> > > > > + val = dw_pcie_readl_ob_unroll(pci, index,
> > > > > + PCIE_ATU_UNR_REGION_CTRL2);
> > > > > + if (val & PCIE_ATU_ENABLE)
> > > > > + return;
> > > > > +
> > > > > + mdelay(LINK_WAIT_IATU);
> > > > > + }
> > > > > + dev_err(pci->dev, "Outbound iATU is not being enabled\n"); }
> > > > > +
> > > > > static void dw_pcie_prog_outbound_atu_unroll(struct dw_pcie *pci,
> > > > > int
> > > > index,
> > > > > int type, u64 cpu_addr,
> > > > > u64 pci_addr, u32 size) @@ -194,6
> > +231,51 @@ static
> > > > > void
> > > > dw_pcie_prog_outbound_atu_unroll(struct dw_pcie *pci, int index,
> > > > > dev_err(pci->dev, "Outbound iATU is not being enabled\n"); }
> > > > >
> > > > > +void dw_pcie_prog_ep_outbound_atu(struct dw_pcie *pci, u8
> > > > > +func_no, int
> > > > index,
> > > > > + int type, u64 cpu_addr, u64 pci_addr,
> > > > > + u32 size)
> > > > > +{
> > > > > + u32 retries, val;
> > > > > +
> > > > > + if (pci->ops->cpu_addr_fixup)
> > > > > + cpu_addr = pci->ops->cpu_addr_fixup(pci, cpu_addr);
> > > > > +
> > > > > + if (pci->iatu_unroll_enabled) {
> > > > > + dw_pcie_prog_ep_outbound_atu_unroll(pci, func_no, index,
> > type,
> > > > > + cpu_addr, pci_addr, size);
> > > > > + return;
> > > > > + }
> > > > > +
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_VIEWPORT,
> > > > > + PCIE_ATU_REGION_OUTBOUND | index);
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_LOWER_BASE,
> > > > > + lower_32_bits(cpu_addr));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_UPPER_BASE,
> > > > > + upper_32_bits(cpu_addr));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_LIMIT,
> > > > > + lower_32_bits(cpu_addr + size - 1));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_LOWER_TARGET,
> > > > > + lower_32_bits(pci_addr));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_UPPER_TARGET,
> > > > > + upper_32_bits(pci_addr));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_CR1, type |
> > > > > + PCIE_ATU_FUNC_NUM(func_no));
> > > >
> > > > The same here, this is identical to dw_pcie_prog_outbound_atu with
> > > > the exception of this line.
> > > >
> > > > Is there a way you can avoid all of this duplicated code?
> > > As above, I can integrate the same code, but I keep to think the
> > > different outbound Window set function should be used between RC and EP.
> >
> > Or, is it possible to keep and use the existing functions, but use them
> > differently, e.g:
> >
> >
> > @@ -137,8 +146,8 @@ static int dw_pcie_ep_outbound_atu(struct
> > dw_pcie_ep *ep, phys_addr_t phys_addr,
> > return -EINVAL;
> > }
> >
> > - dw_pcie_prog_outbound_atu(pci, free_win, PCIE_ATU_TYPE_MEM,
> > - phys_addr, pci_addr, size);
> > + dw_pcie_prog_outbound_atu(pci, free_win,
> > PCIE_ATU_TYPE_MEM_FUNC(func_no),
> > + phys_addr, pci_addr, size);
> >
> > set_bit(free_win, ep->ob_window_map);
> > ep->outbound_addr[free_win] = phys_addr;
> >
> >
> > Supported with:
> >
> > #define PCIE_ATU_TYPE_MEM 0x0
> > #define PCIE_ATU_TYPE_MEM_FUNC(func_no) (PCIE_ATU_TYPE_MEM |
> > PCIE_ATU_FUNC_NUM(func_no))
> >
> >
> > This is just a suggestion, but I'm keen to avoid code duplication.
> Thanks, I have think of a way as follow:
>
> This is a good way, but I think PCIE_ATU_TYPE_MEM_FUNC(func_no) will give
> Someone confused meaning, because PCIE_ATU_TYPE_MEM indicate the type of TLP,
> and the location in the bit[0:3] of register CR1, but the PCIE_ATU_FUNC_NUM is bit[20:24],
> I have another way:
> @@ -137,8 +146,8 @@ static int dw_pcie_ep_outbound_atu(struct
> dw_pcie_ep *ep, phys_addr_t phys_addr,
> return -EINVAL;
> }
>
> dw_pcie_prog_outbound_atu(pci, free_win, PCIE_ATU_TYPE_MEM,
> phys_addr, pci_addr, size);
> + val = dw_pcie_readl_dbi(pci, PCIE_ATU_CR1);
> + dw_pcie_writel_dbi(pci, PCIE_ATU_CR1, val | PCIE_ATU_FUNC_NUM(func_no));
> or
> +void dw_pcie_prog_ep_outbound_atu(struct dw_pcie *pci, u8 func_no, int index,
> + int type, u64 cpu_addr, u64 pci_addr,
> + u32 size)
> +{
> + dw_pcie_prog_outbound_atu(pci, index, type, cpu_addr, pci_addr, size);
> + val = dw_pcie_readl_dbi(pci, PCIE_ATU_CR1);
> + dw_pcie_writel_dbi(pci, PCIE_ATU_CR1, val | PCIE_ATU_FUNC_NUM(func_no));
> +}
>
> Which do you think is better of these three ways?

Building upon your idea, how about:


@@ -137,8 +146,8 @@ static int dw_pcie_ep_outbound_atu(struct
dw_pcie_ep *ep, phys_addr_t phys_addr,
return -EINVAL;
}

- dw_pcie_prog_outbound_atu(pci, free_win, PCIE_ATU_TYPE_MEM,
- phys_addr, pci_addr, size);
+ dw_pcie_prog_ep_outbound_atu(pci, func_no, free_win, PCIE_ATU_TYPE_MEM
+ phys_addr, pci_addr, size);

set_bit(free_win, ep->ob_window_map);
ep->outbound_addr[free_win] = phys_addr;


+void dw_pcie_prog_ep_outbound_atu(struct dw_pcie *pci, u8 func_no, int index,
+ int type, u64 cpu_addr, u64 pci_addr,
+ u32 size)
+{
+ __dw_pcie_prog_outbound_atu(pci, func_no, index, type, cpu_addr, pci_addr, size);
+}
+
+void dw_pcie_prog_outbound_atu(struct dw_pcie *pci, u8 func_no, int index,
+ int type, u64 cpu_addr, u64 pci_addr,
+ u32 size)
+{
+ __dw_pcie_prog_outbound_atu(pci, 0, index, type, cpu_addr, pci_addr, size);
+}

In other words dw_pcie_prog_outbound_atu is updated (and renamed) to always
take a func_no and for host controllers this is always set to zero. Or you
could follow the approach taken in the cadence drivers for their implementation
of cdns_pcie_set_outbound_region - this always takes a func_no and is used
by host controller and endpoint drivers (except they don't have the helper
wrapper functions above thus exposing fn=0 to host controllers).

> >
> > > >
> > > > Thanks,
> > > >
> > > > Andrew Murray
> > > >
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_CR2, PCIE_ATU_ENABLE);
> > > > > +
> > > > > + /*
> > > > > + * Make sure ATU enable takes effect before any subsequent config
> > > > > + * and I/O accesses.
> > > > > + */
> > > > > + for (retries = 0; retries < LINK_WAIT_MAX_IATU_RETRIES; retries++)
> > {
> > > > > + val = dw_pcie_readl_dbi(pci, PCIE_ATU_CR2);
> > > > > + if (val & PCIE_ATU_ENABLE)
> > > > > + return;
> > > > > +
> > > > > + mdelay(LINK_WAIT_IATU);
> > > > > + }
> > > > > + dev_err(pci->dev, "Outbound iATU is not being enabled\n"); }
> > > > > +
> > > > > void dw_pcie_prog_outbound_atu(struct dw_pcie *pci, int index, int
> > type,
> > > > > u64 cpu_addr, u64 pci_addr, u32 size) { @@
> > -252,8
> > > > +334,8
> > > > > @@ static void dw_pcie_writel_ib_unroll(struct dw_pcie *pci, u32
> > > > > index,
> > > > u32 reg,
> > > > > dw_pcie_writel_atu(pci, offset + reg, val); }
> > > > >
> > > > > -static int dw_pcie_prog_inbound_atu_unroll(struct dw_pcie *pci, int
> > index,
> > > > > - int bar, u64 cpu_addr,
> > > > > +static int dw_pcie_prog_inbound_atu_unroll(struct dw_pcie *pci,
> > > > > +u8
> > > > func_no,
> > > > > + int index, int bar, u64 cpu_addr,
> > > > > enum dw_pcie_as_type as_type) {
> > > > > int type;
> > > > > @@ -275,8 +357,10 @@ static int
> > > > > dw_pcie_prog_inbound_atu_unroll(struct
> > > > dw_pcie *pci, int index,
> > > > > return -EINVAL;
> > > > > }
> > > > >
> > > > > - dw_pcie_writel_ib_unroll(pci, index,
> > PCIE_ATU_UNR_REGION_CTRL1,
> > > > type);
> > > > > + dw_pcie_writel_ib_unroll(pci, index,
> > PCIE_ATU_UNR_REGION_CTRL1,
> > > > type |
> > > > > + PCIE_ATU_FUNC_NUM(func_no));
> > > > > dw_pcie_writel_ib_unroll(pci, index,
> > PCIE_ATU_UNR_REGION_CTRL2,
> > > > > + PCIE_ATU_FUNC_NUM_MATCH_EN |
> > > > > PCIE_ATU_ENABLE |
> > > > > PCIE_ATU_BAR_MODE_ENABLE | (bar << 8));
> > > > >
> > > > > @@ -297,14 +381,15 @@ static int
> > > > dw_pcie_prog_inbound_atu_unroll(struct dw_pcie *pci, int index,
> > > > > return -EBUSY;
> > > > > }
> > > > >
> > > > > -int dw_pcie_prog_inbound_atu(struct dw_pcie *pci, int index, int bar,
> > > > > - u64 cpu_addr, enum dw_pcie_as_type as_type)
> > > > > +int dw_pcie_prog_inbound_atu(struct dw_pcie *pci, u8 func_no, int
> > index,
> > > > > + int bar, u64 cpu_addr,
> > > > > + enum dw_pcie_as_type as_type)
> > > > > {
> > > > > int type;
> > > > > u32 retries, val;
> > > > >
> > > > > if (pci->iatu_unroll_enabled)
> > > > > - return dw_pcie_prog_inbound_atu_unroll(pci, index, bar,
> > > > > + return dw_pcie_prog_inbound_atu_unroll(pci, func_no, index,
> > > > > +bar,
> > > > > cpu_addr, as_type);
> > > > >
> > > > > dw_pcie_writel_dbi(pci, PCIE_ATU_VIEWPORT,
> > > > PCIE_ATU_REGION_INBOUND |
> > > > > @@ -323,9 +408,11 @@ int dw_pcie_prog_inbound_atu(struct dw_pcie
> > > > *pci, int index, int bar,
> > > > > return -EINVAL;
> > > > > }
> > > > >
> > > > > - dw_pcie_writel_dbi(pci, PCIE_ATU_CR1, type);
> > > > > - dw_pcie_writel_dbi(pci, PCIE_ATU_CR2, PCIE_ATU_ENABLE
> > > > > - | PCIE_ATU_BAR_MODE_ENABLE | (bar << 8));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_CR1, type |
> > > > > + PCIE_ATU_FUNC_NUM(func_no));
> > > > > + dw_pcie_writel_dbi(pci, PCIE_ATU_CR2, PCIE_ATU_ENABLE |
> > > > > + PCIE_ATU_FUNC_NUM_MATCH_EN |
> > > > > + PCIE_ATU_BAR_MODE_ENABLE | (bar << 8));
> > > > >
> > > > > /*
> > > > > * Make sure ATU enable takes effect before any subsequent
> > > > > config diff --git a/drivers/pci/controller/dwc/pcie-designware.h
> > > > > b/drivers/pci/controller/dwc/pcie-designware.h
> > > > > index ffed084..2b291e8 100644
> > > > > --- a/drivers/pci/controller/dwc/pcie-designware.h
> > > > > +++ b/drivers/pci/controller/dwc/pcie-designware.h
> > > > > @@ -71,9 +71,11 @@
> > > > > #define PCIE_ATU_TYPE_IO 0x2
> > > > > #define PCIE_ATU_TYPE_CFG0 0x4
> > > > > #define PCIE_ATU_TYPE_CFG1 0x5
> > > > > +#define PCIE_ATU_FUNC_NUM(pf) (pf << 20)
> > > > > #define PCIE_ATU_CR2 0x908
> > > > > #define PCIE_ATU_ENABLE BIT(31)
> > > > > #define PCIE_ATU_BAR_MODE_ENABLE BIT(30)
> > > > > +#define PCIE_ATU_FUNC_NUM_MATCH_EN BIT(19)
> > > > > #define PCIE_ATU_LOWER_BASE 0x90C
> > > > > #define PCIE_ATU_UPPER_BASE 0x910
> > > > > #define PCIE_ATU_LIMIT 0x914
> > > > > @@ -265,8 +267,12 @@ int dw_pcie_wait_for_link(struct dw_pcie
> > > > > *pci); void dw_pcie_prog_outbound_atu(struct dw_pcie *pci, int index,
> > > > > int type, u64 cpu_addr, u64 pci_addr,
> > > > > u32 size);
> > > > > -int dw_pcie_prog_inbound_atu(struct dw_pcie *pci, int index, int bar,
> > > > > - u64 cpu_addr, enum dw_pcie_as_type as_type);
> > > > > +void dw_pcie_prog_ep_outbound_atu(struct dw_pcie *pci, u8
> > > > > +func_no, int
> > > > index,
> > > > > + int type, u64 cpu_addr, u64 pci_addr,
> > > > > + u32 size);
> > > > > +int dw_pcie_prog_inbound_atu(struct dw_pcie *pci, u8 func_no, int
> > index,
> > > > > + int bar, u64 cpu_addr,
> > > > > + enum dw_pcie_as_type as_type);
> > > > > void dw_pcie_disable_atu(struct dw_pcie *pci, int index,
> > > > > enum dw_pcie_region_type type); void
> > dw_pcie_setup(struct
> > > > > dw_pcie *pci); diff --git a/include/linux/pci-epc.h
> > > > > b/include/linux/pci-epc.h index f641bad..fc2feee 100644
> > > > > --- a/include/linux/pci-epc.h
> > > > > +++ b/include/linux/pci-epc.h
> > > > > @@ -96,6 +96,7 @@ struct pci_epc {
> > > > > const struct pci_epc_ops *ops;
> > > > > struct pci_epc_mem *mem;
> > > > > u8 max_functions;
> > > > > + u32 pf_offset;
> >
> > Also pf_offset is an implementation detail needed only by the driver to
> > calculate where the PF is - it doesn't seem right that we share this with the EP
> > controller framework (whereas max_functions is used as a bounds check for
> > func_no in the framework calls).
> >
> > I'd suggest that pf_offset is moved to a dwc structure, perhaps dw_pcie_ep?
> I add the variable to this struct is consider that all PF is belong to a PCIe controller,
> and the pci_epc indicate a PCIe controller, so I add this variable to this struct, what
> do you think about this? I am not sure whether I should add this variable to dw_pcie_ep.

The EPC framework won't use the pf_offset and doesn't need it. It abstracts
the complexity of writing to the config address space (and similar) through
the pci_epc_ops. I'd suggest that the EPC framework (and pci_epc struct) only
needs to contain what *it* needs. Especially given that not all ep drivers
have a pf_offset or similar.

I understand the logic that pci_epc represents a EP controller, but I think
you should consider that it actually represents a *generic* EP controller in
the context of a framework which solely serves the purpose of connecting
controllers with functions. Whereas the dw_pcie_ep represents a specific type
of controller (DW) - as the pf_offset is (so far) relating to only DW
controllers (and as confirmed by the DT mapping) then it makes sense to not
move pf_offset from the specialised specific controller to the generic
controller. (Or at least this is how I rationalise it, though the EPC
framework is something quite unfamiliar to me).

Thanks,

Andrew Murray

> >
> > Thanks,
> >
> > Andrew Murray
> >
> > > > > struct config_group *group;
> > > > > /* spinlock to protect against concurrent access of EP controller */
> > > > > spinlock_t lock;
> > > > > --
> > > > > 2.9.5
> > > > >
> > > > >
> > > > > _______________________________________________
> > > > > linux-arm-kernel mailing list
> > > > > linux-arm-kernel@xxxxxxxxxxxxxxxxxxx
> > > > > http://lists
> > > > > .infradead.org%2Fmailman%2Flistinfo%2Flinux-arm-kernel&amp;data=0
> > 2
> > > > > %
> > > > 7C0
> > > > >
> > > >
> > 1%7Cxiaowei.bao%40nxp.com%7C0e39168f6f144db6840308d721742040%7
> > > > C686ea1d
> > > > >
> > > >
> > 3bc2b4c6fa92cd99c5c301635%7C0%7C1%7C637014654998524452&amp;sd
> > > > ata=bP7eh
> > > > > cjlGXCMVFE2b4f12Q6fGV7lQ%2F5i9qIi9FoPlbI%3D&amp;reserved=0