Home / Solutions / Approaches Compared

Approaches Compared

A number of solutions to the server communications bottleneck have been proposed in recent years. Table 1 compares the most well-known alternative approaches on the critical attributes required by IT architects and managers (click on column headings for a discussion of the approach).

Table 1: Comparison on Critical Attributes

 

ETHERFABRIC APPROACH

The EtherFabric architecture is the direct result of thinking outside the box when compared to previous attempts at interconnect acceleration. The technology is based on the fundamentals of how the Ethernet-based TCP/IP implementations work but then takes a different approach to system architecture that produces significant performance breakthroughs. Increased bandwidth and significantly lower latency are both accomplished while preserving Ethernet standards compliance. The EtherFabric approach is illustrated in Figure 5.

Figure 5 – EtherFabric architecture

The fundamental breakthrough of this innovative architecture involves implementing the TCP/IP stack and driver as part of the user space instead of the kernel space. Thus, we can bypass the operating system and its kernel, which provides needed latency benefits but in a manner that is completely compatible. This allows the hardware acceleration to be both highly efficient and of modest cost, enabling easy LoM (LAN-on-Motherboard) deployment. The application API is unchanged from the market-standard Sockets MPI (and any other) APIs; thus all applications are unaffected, except for much improved performance! The wire protocol is still Ethernet, which leverages the $billions of installed equipment in computer networks, data centers, universities and corporations worldwide. And the transport protocol is still TCP/IP, bringing all the reliability benefits developed over the past years of TCP/IP evolution.

In EtherFabric, Level 5 Networks has created a high-performance interconnect that:

  • Is compatible with existing Ethernet servers, switches, copper and both long-reach and short-reach optical physical interfaces, and cables. This contrasts with Infiniband (and Quadrics and Myrinet), which each require special physical interfaces, cables, and switches
  • Is binary-software-compatible with the vast installed base of TCP/IP applications and systems software. This is in stark contrast to iWARP and Infiniband, both of which use many stacked new protocols, such as RDMA. DDP, and MPA to implement a reliable transport method different from TCP/IP.
  • Improves the performance of every server into which it is installed by delivering more CPU cycles to applications, and requiring far less for network processing.
  • Delivers sub-10 usec latency to every connection, in contrast to conventional Ethernet, which is frequently 50 usec or more.
  • Uniquely offers the capability of Single-Sided Acceleration, whereby connection performance is improved even if EtherFabric is installed on only one side of a link.

Back to top

The following sections describe previous interconnect approaches, including key fundamental architectural principles, as well as the strengths and weaknesses of each.

Conventional Ethernet

The following figure depicts the architecture of the standard Ethernet solution in use today. The major issues that have caused problems for many years are the inefficient use of the host CPU—in some cases 90% of the available processing power can be lost when handling TCP due to context switching and interrupt overhead, cache thrashing and inefficient data handling. These issues also lead to limited throughput, extremely long latencies for applications and have become increasingly problematic as the rest of the equipment in the data center has scaled by orders of magnitude in performance.

 

Figure 1 – Conventional Ethernet architecture

Back to top

 

TCP Offload

TCP Offload (see Figure 2) was the first alternative to Ethernet's shortcomings that was explored by the industry. Unfortunately it has fallen short as a solution.

 

Figure 2 – TCP offload architecture

The premise of TCP Offload is that a coprocessor (a TOE, or TCP Offload Engine) can execute TCP processing faster than the host CPU. However, while this approach solves the CPU loading issue for large data transfers, it does not solve the other critical issue of latency. Additionally, TCP Offload suffers the classic coprocessor problem of not being able to scale up in performance and down in cost through time as quickly as the system processor.

Back to top

InfiniBand Architecture

InfiniBand Architecture (see Figure 3), the next invention to address the issue, was a clean piece of paper approach. It addressed all the performance problems of interconnects. It used an architecture called RDMA (Remote Direct Memory Access) which bypassed the operating system kernel in delivering data to applications.

The major drawback of Infiniband is that several new protocols and a new API (Application Programming Interface) were required underneath applications, along with new control plane software. Attempts have since been made to provide shims to map existing APIs to the RDMA protocols, such as SDP (Sockets Direct Protocol), but such efforts often prove to be semantically incompatible with existing applications.Additionally, the performance loss of protocol conversion eliminates the benefit of the new approach. Thus, while Infiniband solved the technical interconnect issues, it was incompatible with existing software infrastructure. This has limited its adoption to arenas where performance gains were so compelling that the software reworking required was considered acceptable. These arenas are limited to large-scale scientific high-performance compute clusters. In other segments, Infiniband has not been accepted and likely will never be now that there is a much easier solution in EtherFabric.

 

Figure 3 – InfiniBand architecture

Back to top

 

iWARP

Recognizing the undesirability of requiring all new cabling and switching, iWARP is an attempt to use RDMA with TCP-Offloaded (see Figure 4). The RDMA implementation was borrowed from Infiniband, which gave iWARP the same benefits and drawbacks as the combination of TOE and Infiniband. Namely, latency should be better than TOE-only, but suffers from the same Infiniband issues of a new wire protocol, new incompatible application API and new incompatible control plane software also apply. Additionally, the cost issues seen with TOE approaches are also a factor, and are exacerbated by adding the hardware RDMA engines of Infiniband.

Figure 4 – iWARP architecture

Back to top

   
Copyright 2002-2005 Level 5 Networks, Inc. All right reserved.
Level 5 Networks, Inc., EtherFabric, and the Level 5 Networks, Inc. and EtherFabric identities are trademarks of Level 5 Networks, Inc.