Application Persistence, Performance, and Deployment in a UNIX-Compatible Single Level Store Operating System

No Thumbnail Available

Date

2025-04-01

Advisor

Mashtizadeh, Ali José

Journal Title

Journal ISSN

Volume Title

Publisher

University of Waterloo

Abstract

This thesis presents the Aurora single level store, an OS design that uses continuous checkpointing for application persistence and deployment. Aurora provides submillisecond application checkpoint and restore operations to efficiently turn applications into on-disk images and back. Fast checkpointing/restore as an OS service also serves as a foundation for further research into open problems like efficient persistence APIs for memory-mapped data and serverless computing. Aurora’s single level store-based persistence has recently become practical because of advances in hardware and file system technology. Modern SSD storage devices have low latency at 10µs, allowing us to persist application checkpoints to the disk with minimal latency overhead. Modern CPUs also have IO throughput that rivals that of their memory bandwidth, making it possible to continuously checkpoint and forward in-memory application state to the disk. This thesis describes three systems that demonstrate the efficiency and flexibility of the single level store paradigm. We first present Aurora (SOSP 2021), an OS design capable of continuous application checkpointing at a fast enough granularity to provide transparent persistence. We follow up with MemSnap (ASPLOS 2024), an OS single level store API and associated virtual memory mechanism. MemSnap persists application data, e.g., database data, more efficiently than the file API. Finally, we present Metropolis, a serverless invoker that uses the single level store paradigm to create serverless function instances at submillisecond latency.

Description

Keywords

Operating Systems, Single Level Store, Persistence, Storage, Checkpoint-Restore, Serverless

LC Subject Headings

Citation