Re: [PATCH v3] net: bonding: Add support for IPV6 ns/na

From: 孙守鑫
Date: Mon Dec 20 2021 - 06:13:11 EST

Next message: Pratyush Yadav: "Re: [PATCH v5 2/2] mtd: spi-nor: macronix: Add support for mx66lm1g45g"
Previous message: Mel Gorman: "Re: [PATCH 2/2] sched/fair: Adjust the allowed NUMA imbalance when SD_NUMA spans multiple LLCs"
In reply to: Jay Vosburgh: "Re: [PATCH v3] net: bonding: Add support for IPV6 ns/na"
Next in thread: Eric Dumazet: "Re: [PATCH v3] net: bonding: Add support for IPV6 ns/na"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

在 2021/12/18 7:09, Jay Vosburgh 写道:

For clarity, please add "to balance-alb mode" to the Subject.

Thanks your comment, I'll adjust it and send out V4 soon.

Sun Shouxin <sunshouxin@xxxxxxxxxxxxxxx> wrote:

Since ipv6 neighbor solicitation and advertisement messages
isn't handled gracefully in bonding6 driver, we can see packet
drop due to inconsistency bewteen mac address in the option
message and source MAC .

Another examples is ipv6 neighbor solicitation and advertisement
messages from VM via tap attached to host brighe, the src mac
mighe be changed through balance-alb mode, but it is not synced
with Link-layer address in the option message.

The patch implements bond6's tx handle for ipv6 neighbor
solicitation and advertisement messages.

Border-Leaf
/ \
/ \
Tunnel1 Tunnel2
/ \
/ \
Leaf-1--Tunnel3--Leaf-2
\ /
\ /
\ /
\ /
NIC1 NIC2
\ /
server

We can see in our lab the Border-Leaf receives occasionally
a NA packet which is assigned to NIC1 mac in ND/NS option
message, but actaully send out via NIC2 mac due to tx-alb,
as a result, it will cause inconsistency between MAC table
and ND Table in Border-Leaf, i.e, NIC1 = Tunnel2 in ND table
and NIC1 = Tunnel1 in mac table.

And then, Border-Leaf starts to forward packet destinated
to the Server, it will only check the ND table entry in some
switch to encapsulate the destination MAC of the message as
NIC1 MAC, and then send it out from Tunnel2 by ND table.
Then, Leaf-2 receives the packet, it notices the destination
MAC of message is NIC1 MAC and should forword it to Tunne1
by Tunnel3.

However, this traffic forward will be failure due to split
horizon of VxLAN tunnels.

I believe I understand what problem you're trying to solve here,
but the solution seems to be incomplete, as (from our prior discussion)
a rebalance event for balance-alb will apparently induce the same
problem. Granted, those do not occur frequently (only when interfaces
are added to the bond, or an interface link state changes), but have you
tested what happens if NIC1 or NIC2 (or in a situation with more than
two interfaces) undergoes a link state change?

The code in the bond_xmit_alb_slave_get should act for ns/na in the rebalance.
what's more, with NIC1/NIC2 link state change, I don't observe abnormal scene.

Suggested-by: Hu Yadi <huyd12@xxxxxxxxxxxxxxx>
Reviewed-by: Jay Vosburgh<jay.vosburgh@xxxxxxxxxxxxx>

I did not include this signoff tag in my prior message. Please
do not include such tags unless explicitly provided by the relevant
person. Discussion on the mailing list is not equivalent to providing
the tag; please review Documentation/process/submitting-patches.rst.