Leaf-and-Spine Fabrics Between Theory and Reality

Tuesday, March 14, 2023 07:01 UTC

Leaf-and-Spine Fabrics Between Theory and Reality

I’m always envious of how easy networking challenges seem when you’re solving them in PowerPoint, for example, when an innovation specialist explains how scalability works in leaf-and-spine fabrics in a LinkedIn comment:

One of the main benefits of a CLOS folded spine topology is the scale out spine where you can scale out the number of spine nodes increasing your leaf-spine n-way ECMP as well as minimizing the blast radius with the more spine nodes the more redundancy and resiliency.

Isn’t that wonderful? If you need more bandwidth, sprinkle the magic spine powder on your fabric, add water, and voila! Problem solved. Also, it looks like adding spine switches reduces the blast radius. Who would have known?

In reality:

It doesn’t matter whether you have two or sixteen spines – the blast radius is the same. It is true that you’re pretty low on redundancy if you have just two spines and one of them exploded, so make that three.
Adding a spine switch often results in the rewiring of the physical fabric; the only exception would be going from three to four spines when you’re using leaf switches with four uplinks (and similarly for switches with eight uplinks). Next step: an exciting configuration exercise unless you’ve decided to use unnumbered leaf-to-spine links when deploying the fabric.
The number of uplink ports on the leaf switches limits the maximum number of spine switches. Most leaf switches used to have four uplinks. These days, a lot of switches come with six or eight uplinks, making it easier to build fabrics with more spines and thus lower oversubscription ratio. The maximum fabric size is still limited by the number of ports on the spine switches though.
Obviously you could also buy switches with high-speed ports (example: 100GE) and use some of those as four lower-speed ports (example: 25GE) with breakout cables. That makes your design totally flexible regarding the number of uplinks and the oversubscription ratio, but the breakout cables could get messy, although not as much as the next option¹.
You could build much larger fabrics if you split leaf switch uplinks into individual lanes (100GE ports into four 25GE lanes), but you don’t want to know how messy the cabling gets with the octopus cables or complex behind-the-scenes wiring between patch panels.

Brad Hedlund explained that idea in the Leaf-and-Spine Fabric Architectures webinar.

Another dose of reality: most of the above doesn’t matter. It’s easy to get a spine switch with 32 100GE or 400GE ports; some vendors are shipping spine switches with 64 ports. Sixty-four leaf switches connected to those ports give you over 3000 server-facing ports – probably good enough for 95% of the data centers out there.

Considering all that, what should we do with generic opinions like the one above? Charity Majors answered this thorny question in a recent tweet²:

I can opine all I want on your architecture or ours, but if I’m not carrying a pager for you, you should probably just smile politely and move along. People with skin in the game are the people you should listen to.

And also:

The antipattern I see in so many places with devs and architects is the same fucking problem they have with devs and ops. “No time to be on call, too busy writing important software” ~turns into~ “No time to write code, too busy telling other people how to write code.”

FWIW, you should read the whole thread (assuming Twitter still works when you’re reading this) and the resulting blog post, and continue with Martin Fowler ’s take on Who Needs an Architect.

Next: What Happened to Leaf Switches with Four Uplinks? Continue

Revision History

2023-03-14: Sander Steffann pointed out that there are more switches with six or even eight uplinks than I expected. Also added the ’local breakout cables’ option.
2023-03-15: Another dose of reality: Erik Auerswald pointed out that many switches using Trident3 or Trident4 ASICs have eight uplinks. More details in a follow-up blog post.
2023-03-16: The number of uplinks a switch has doesn’t matter (apart from the oversubscription ratio). The maximum fabric size is still limited by the number of ports on the spine switches.

And I don’t want to know how messy it gets when you decide you need extra uplink ports and have to rewire the whole fabric because some of the ports that used breakout cables became uplinks. ↩︎
Maybe I should do the same with LinkedIn comments, but sometimes they’re just too juicy to pass. ↩︎

Revision History

Recent posts in the same categories

fabric

design

3 comments: