A High-Performance and Scalable NVMe Controller Featuring Hardware Acceleration

Autor: Yunhui Qiu, Lingli Wang, Wenbo Yin
Rok vydání: 2022
Předmět:
Zdroj: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 41:1344-1357
ISSN: 1937-4151
0278-0070
Popis: Nonvolatile memory express (NVMe) is a highperformance and scalable PCIe-based interface for the host software communicating with NVMs including NAND Flash and the storage class memories (SCMs). NVMe solid-state drives (SSDs) have been deployed in cloud platforms and data-centers for a variety of I/O intensive applications due to their performance benefits compared with SATA/SAS SSDs. Considering the design flexibility, firmware-based NVMe controllers are typically used in Flash-based NVMe SSDs but may occupy a significant portion of processor resources and power consumption to achieve high performance. Moreover, the firmware component can be a critical performance bottleneck for SCMs that are an order of magnitude faster than Flash. To address these challenges, hardwareaccelerated NVMe controllers have emerged in both industry and academia. The commercial hardware controllers are confidential, whereas current academic studies still spare much room for architecture innovations. In this paper, we propose an open-source ultralow-latency and high-throughput NVMe controller with a highly parallel, pipelined, and scalable architecture that accommodates one admin controller and multiple fully hardwareautomated I/O controllers. We perform extensive empirical performance evaluations concerning the NVMe I/O size, queue depth, queue number, read-to-write ratio, and access pattern. The maximum read/write bandwidth can achieve 7.0GB/s, accounting for 89% of the PCIe bandwidth. The 4KB-sized read/write throughput can attain 1.7 million I/O operations per second (MIOPS), whereas the average latency is merely 2.4μs/3.2μs. Compared with state-of-the-art NVMe controllers in academia, the 4KB-sized read/write bandwidth of our controller reaches 2.2×/2.3× as high and the latency is 5.1×/4.9× lower.
Databáze: OpenAIRE