ArubaCX Cannot Count When Dealing with VXLAN

Thursday, June 12, 2025 07:08 +0200

ArubaCX Cannot Count When Dealing with VXLAN

This blog post describes yet another bizarre example of how reliable digital twins are, but don’t worry; they all work great in PowerPoint.

After “fixing” the integration tests to deal with ArubaCX’s notion of VXLAN VNI having 16 bits, the bridging test worked, but the IRB tests kept failing.

In the IRB test, the lab has two layer-3 switches. Each of them should be able to bridge within a VLAN/VXLAN segment and route across the segments.

For example, H1 should be able to ping H2 (bridging), as well as H3 and H4 (routing). The way netlab sets up the lab, all hosts use S1 (ArubaCX) as the default gateway.

This is the lab topology I was using (it has been adjusted in the meantime to deal with ArubaCX):

module: [ vlan, vxlan, ospf ]

groups:
  _auto_create: True
  hosts:
    members: [ h1, h2, h3, h4 ]
    device: linux
    provider: clab

nodes:
  s1:
    device: arubacx
  s2:
    device: frr
    provider: clab

vlans:
  red:
    vni: 5000
    ospf.passive: True
    links: [ s1-h1, s2-h2 ]
  blue:
    vni: 5001
    ospf.passive: True
    links: [ s1-h3, s2-h4 ]

links:
- interfaces: [ s1, s2 ]
  mtu: 1600

tools:
  edgeshark:

The test results were disheartening:

The lab worked with two ArubaCX virtual machines
It failed when S2 was replaced with an FRR container
It worked (again) when S2 was an Arista vEOS virtual machine, but not when it was an Arista cEOS container.
Everything started working when I rebooted my netlab server.

Fortunately, if the previous troubleshooting exercise taught me anything, it was to do a packet capture before wasting time on anything else. netlab includes support for the excellent Edgeshark tool, so I was able to perform packet capture on my Mac OS laptop from my Ubuntu server, which was ~50 km away (yay, Tailscale!).

Here’s the VXLAN-encapsulated ARP request (captured on the link between S1 and S2) sent from H1 before it tries to ping H2:

And here’s the VXLAN-encapsulated ARP request for H3 sent by S1 (ArubaCX) when it tries to route the packet from H1 to H3:

Can you spot the difference? Even though the two packets have the same size (Ethernet frames are 92 bytes long), the IP packet length and the UDP payload length don’t match. ArubaCX claims the packet contains four bytes more than it does.

Long story short: VXLAN packets generated by ArubaCX routing process have invalid length. The likely culprit is the VLAN tag attached to the packet before it enters the (software) VXLAN encapsulation process. The VLAN tag is removed (as it should be), but the packet length is not adjusted.

Now for the fun part: why did the test sometimes work? It’s evident that most VXLAN implementations don’t verify the IP or UDP packet length (a bad idea), or the test could never work with devices from other vendors. But why did Arista vEOS accept the packet when it never reached Arista cEOS?

Welcome to the wonderful world of Linux bridges that love to masquerade as firewalls and sometimes decide it’s their job to filter invalid IP traffic. Fortunately, we can do a packet capture on interfaces connected to a Linux bridge to verify who the culprit is, and here’s what was going on in my server:

netlab uses libvirt UDP tunnels to create point-to-point inter-VM links. ArubaCX VM and Arista vEOS VM are thus connected with a UDP tunnel, and the Linux bridge is not involved.
netlab has to use a Linux bridge to connect ArubaCX VM and Arista cEOS (or FRR) container, and the Linux bridge dropped the packets with an invalid IP length.
It seems that the FRR VM (Debian Bookworm) checks the IP packet length, while the FRR container does not.

Finally, why did things start to work after I rebooted the server? The firewall-on-a-bridge is an add-on module that is not loaded at boot time, so the test works after a server reboot. However, something (and I wasn’t able to figure out what) triggers the loading of that kernel module, and from that point onwards, the ArubaCX VM can no longer send VXLAN-encapsulated ARP requests to adjacent containers.

Maybe we should rename the Linux bridge to a Heisenberg bridge? If it works, you don’t know why, and if you think you know how it’s configured, it works in unpredictable ways.

netlab

UDP Checksum: It SHOULD be transmitted as zero. When a packet is received with a UDP checksum of zero, it MUST be accepted for decapsulation. Optionally, if the encapsulating end point includes a non-zero UDP checksum, it MUST be correctly calculated across the entire packet including the IP header, UDP header, VXLAN header, and encapsulated MAC frame. When a decapsulating end point receives a packet with a non-zero checksum, it MAY choose to verify the checksum value. If it chooses to perform such verification, and the verification fails, the packet MUST be dropped. If the decapsulating destination chooses not to perform the verification, or performs it successfully, the packet MUST be accepted for decapsulation.

3 comments:

Edmund R 12 June 2025 05:51

(disclaimer - I work for Arista) One thing to note is the RFC7348 says

so perhaps if the packet is allowed by the "firewall" any vxlan implementation has to process it if there is no udp checksum (which looks to be the case in your packet capture).

Ivan 12 June 2025 05:54

Thanks for pointing that out. I have no idea whether the UDP checksum is valid (but you can download the pcaps and check), but we have a bigger mismatch between the Ethernet frame length and the IP/UDP packet/payload length (which is too large for the Ethernet frame in which it's sitting).

Dave Williams 15 June 2025 02:09

I’m curious if we know if any of the issues that you are finding would exist on actual Aruba hardware. Are you finding discrepancies between the software implementation of ArubaCX and what would be executed in hardware on the switch?

I’ve never been impressed with the cx VM.

Matthew S 15 June 2025 10:02

The CX sim/VM is useful, but not at parity with the real switch. No impact to customer/production environments.

Recent posts in the same categories

netlab

3 comments: