Building network automation solutions

9 module online course

Start now!

iSCSI: Moore’s Law won

A while ago I was criticizing the network-blindness of the storage industry that decided to run 25-year old protocol (SCSI) over the most resource-intensive transport protocol (TCP) instead of fixing their stuff or choosing a more lightweight transport mechanism. My argument (although theoretically valid) became moot a few months ago: Intel and Microsoft have demonstrated an iSCSI solution that can saturate a 10GE link and perform more than 1 million I/O operations per second. Another clear victory for the Moore’s Law.

You’ll find introduction to SCSI, Fiber Channel, iSCSI and server virtualization in the Next Generation IP Services webinar.

Why is this important? iSCSI was always considered an inferior (but admittedly cheaper) solution when compared with Fiber Channel (FC). This benchmark clearly shows that iSCSI performance no longer lags behind FC. iSCSI is thus a viable alternative for network designers that want to build converged LAN/SAN networks.

How did they do it? Intel has built a large number of TCP acceleration techniques in its Gigabit Ethernet chipsets (here’s a quick summary). For example, the Intel 82599 10GE controller handles IP, TCP and UDP checksums, transmit-side TCP segmentation and receive-side coalescing. Going even further, recent Intel CPUs based on Nehalem and Westmere microarchitectures have dedicated instructions that can compute CRC checksums extremely efficiently.

You might think that you don’t need CRC32 checksum in iSCSI as TCP already checksums its payload and the transmission errors are discovered by layer-2 checksums. Not true, go through this presentation from Mark Bakke (IBM Research).

Why did Intel create such a chipset? Definitely not just to solve the iSCSI performance problems; iSCSI is just one of the high-bandwidth TCP applications, other such applications include web hosting of static content or Netflix-style video streaming. Without TCP offload, an Intel-based server tops out at Fast Ethernet or Gigabit Ethernet speeds (with CPU running at 100% utilization). Offload functions in the Gigabit Ethernet controllers help you get the 10GE speeds at reasonable CPU loads.

Last but not least, you might wonder why I’m interested in this topic. Chipsets and computer architectures were my first (geek) love before I’ve discovered networking. I’ve even designed and wire-wrapped my own Z80-based motherboard and run an emulation of CP/M environment and Turbo Pascal on it.

Add comment