Loading…
Thursday, July 11 • 5:35pm - 5:55pm
Unification of Temporary Storage in the NodeKernel Architecture

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Efficiently exchanging temporary data between tasks is critical to the end-to-end performance of many data processing frameworks and applications. Unfortunately, the diverse nature of temporary data creates storage demands that often fall between the sweet spots of traditional storage platforms, such as file systems or key-value stores.

We present NodeKernel, a novel distributed storage architecture that offers a convenient new point in the design space by fusing file system and key-value semantics in a common storage kernel while leveraging modern networking and storage hardware to achieve high performance and cost-efficiency. NodeKernel provides hierarchical naming, high scalability, and close to bare-metal performance for a wide range of data sizes and access patterns that are characteristic of temporary data. We show that storing temporary data in Crail, our concrete implementation of the NodeKernel architecture which uses RDMA networking with tiered DRAM/NVMe-Flash storage, improves NoSQL workload performance by up to 4.8× and Spark application performance by up to 3.4×. Furthermore, by storing data across NVMe Flash and DRAM storage tiers, Crail reduces storage cost by up to 8× compared to DRAM-only storage systems.

Speakers
PS

Patrick Stuedi

IBM Research
AT

Animesh Trivedi

Vrije Universiteit
JP

Jonas Pfefferle

IBM Research
AK

Ana Klimovic

Stanford University
AS

Adrian Schuepbach

IBM Research
BM

Bernard Metzler

IBM Research


Thursday July 11, 2019 5:35pm - 5:55pm PDT
USENIX ATC Track II: Grand Ballroom VII–IX