Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina

  • Zhuolong Yu
  • , Bowen Su
  • , Wei Bai
  • , Shachar Raindel
  • , Vladimir Braverman
  • , Xin Jin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

Hardware offloaded network stacks are widely adopted in modern datacenters to meet the demand for high throughput, ultra-low latency and low CPU overhead. To fully leverage their exceptional performance, users need to have a deep understanding of their behaviors. Despite many efforts on testing software network stacks, hardware network stacks impose unique challenges to testing tools due to their kernel bypass nature and high performance.In this paper, we present Lumina, a tool to test the correctness and performance of hardware network stacks. Lumina leverages network programmability to emulate various network scenarios at line rate. With user-friendly interfaces, Lumina enables developers to inject deterministic events, thus facilitating the development of precise and reproducible tests. Given the limited resource and flexibility of programmable network devices, we mirror all the packets to dedicated servers and dump them for offline analysis. We leverage Lumina to test four RDMA NICs from NVIDIA and Intel, and identify bugs that can significantly degrade performance or mislead network operations. Lumina also enables us to capture unexpected micro-behaviors which are missing or not clearly described in public documents and specifications. Vendors have confirmed the critical bugs we discovered and will include bug fixes in future releases.

Original languageEnglish
Title of host publicationSIGCOMM 2023 - Proceedings of the ACM SIGCOMM 2023 Conference
PublisherAssociation for Computing Machinery, Inc
Pages1074-1087
Number of pages14
ISBN (Electronic)9798400702365
DOIs
StatePublished - 1 Sep 2023
Externally publishedYes
Event2023 ACM SIGCOMM Conference, ACM SIGCOMM 2023 - New York, United States
Duration: 10 Sep 202314 Sep 2023

Publication series

NameSIGCOMM 2023 - Proceedings of the ACM SIGCOMM 2023 Conference

Conference

Conference2023 ACM SIGCOMM Conference, ACM SIGCOMM 2023
Country/TerritoryUnited States
CityNew York
Period10/09/2314/09/23

Keywords

  • RDMA
  • event injection
  • hardware offloaded network stack
  • network testing
  • programmable networking

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina'. Together they form a unique fingerprint.

Cite this