Re: [PATCH v3 1/1] net: fec: add xdp and page pool statistics

From: Alexander Lobakin
Date: Wed Nov 16 2022 - 09:34:31 EST


From: Shenwei Wang <shenwei.wang@xxxxxxx>
Date: Mon, 14 Nov 2022 21:17:48 +0000

> > -----Original Message-----
> > From: Alexander Lobakin <alexandr.lobakin@xxxxxxxxx>
> > Sent: Monday, November 14, 2022 9:23 AM
> > To: Shenwei Wang <shenwei.wang@xxxxxxx>
> > Cc: Alexander Lobakin <alexandr.lobakin@xxxxxxxxx>; Andrew Lunn
> > <andrew@xxxxxxx>; David S. Miller <davem@xxxxxxxxxxxxx>; Eric Dumazet
> > <edumazet@xxxxxxxxxx>; Jakub Kicinski <kuba@xxxxxxxxxx>; Paolo Abeni
> > <pabeni@xxxxxxxxxx>; Alexei Starovoitov <ast@xxxxxxxxxx>; Daniel Borkmann
> > <daniel@xxxxxxxxxxxxx>; Jesper Dangaard Brouer <hawk@xxxxxxxxxx>; John
> > Fastabend <john.fastabend@xxxxxxxxx>; Wei Fang <wei.fang@xxxxxxx>;
> > netdev@xxxxxxxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx; imx@xxxxxxxxxxxxxxx;
> > kernel test robot <lkp@xxxxxxxxx>
> > Subject: [EXT] Re: [PATCH v3 1/1] net: fec: add xdp and page pool statistics

[...]

> Did some testing with the atomic64_t counter, with the following codes to update
> the u64 counter in the end of every NAPI poll cycle.
>
> @@ -1764,7 +1768,13 @@ fec_enet_rx_queue(struct net_device *ndev, int budget, u16 queue_id)
>
> if (xdp_result & FEC_ENET_XDP_REDIR)
> xdp_do_flush_map();
> +#if 1
> + if (xdp_prog) {
> + int i;
> + for(i = 0; i < XDP_STATS_TOTAL; i++)
> + atomic64_add(xdp_stats[i], &rxq->stats[i]);
> + }
> +#endif
> return pkt_received;
> }
>
> With the codes above, the testing result is below:
> root@imx8qxpc0mek:~/bpf# ./xdpsock -i eth0
> sock0@eth0:0 rxdrop xdp-drv
> pps pkts 1.00
> rx 349399 1035008
> tx 0 0
>
> sock0@eth0:0 rxdrop xdp-drv
> pps pkts 1.00
> rx 349407 1384640
> tx 0 0
>
> Without the atomic_add codes above, the testing result is below:
> root@imx8qxpc0mek:~/bpf# ./xdpsock -i eth0
> sock0@eth0:0 rxdrop xdp-drv
> pps pkts 1.00
> rx 350109 1989130
> tx 0 0
>
> sock0@eth0:0 rxdrop xdp-drv
> pps pkts 1.00
> rx 350425 2339786
> tx 0 0
>
> And regarding the u32 counter solution, the testing result is below:
> root@imx8qxpc0mek:~/bpf# ./xdpsock -i eth0
> sock0@eth0:0 rxdrop xdp-drv
> pps pkts 1.00
> rx 361347 2637796
> tx 0 0
>
> There are about 10K pkts/s difference here. Do we really want the u64 counters?

Where did those atomic64_t come from? u64_stats_t use either plain
u64 for 32-bit platforms or local64_t for 64-bit ones. Take a look
at [0] for the example of how x86_64 does this, it is far from
atomic64_t.

>
> Regards,
> Shenwei
>
> >>
> >> Thanks,
> >> Shenwei
> >>
> >>>
> >>> Andrew
> >
> > Thanks,
> > Olek

[0] https://elixir.bootlin.com/linux/v6.1-rc5/source/arch/x86/include/asm/local.h#L31

Thanks,
Olek