OSDI '21 - Polyjuice: High-Performance Transactions via Learned Concurrency Control

Показать описание

Polyjuice: High-Performance Transactions via Learned Concurrency Control

Jiachen Wang, Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University; Shanghai AI Laboratory; Engineering Research Center for Domain-specific Operating Systems, Ministry of Education, China; Ding Ding, Department of Computer Science, New York University; Huan Wang, Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University; Shanghai AI Laboratory; Engineering Research Center for Domain-specific Operating Systems, Ministry of Education, China; Conrad Christensen, Department of Computer Science, New York University; Zhaoguo Wang and Haibo Chen, Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University; Shanghai AI Laboratory; Engineering Research Center for Domain-specific Operating Systems, Ministry of Education, China; Jinyang Li, Department of Computer Science, New York University

Concurrency control algorithms are key determinants of the performance of in-memory databases. Existing algorithms are designed to work well for certain workloads. For example, optimistic concurrency control (OCC) is better than two-phase-locking (2PL) under low contention, while the converse is true under high contention.

To adapt to different workloads, prior works mix or switch between a few known algorithms using manual insights or simple heuristics. We propose a learning-based framework that instead explicitly optimizes concurrency control via offline training to maximize performance. Instead of choosing among a small number of known algorithms, our approach searches in a "policy space" of fine-grained actions, resulting in novel algorithms that can outperform existing algorithms by specializing to a given workload.

We build Polyjuice based on our learning framework and evaluate it against several existing algorithms. Under different configurations of TPC-C and TPC-E, Polyjuice can achieve throughput numbers higher than the best of existing algorithms by 15% to 56%.

Рекомендации по теме

OSDI '21 - Polyjuice: High-Performance Transactions via Learned Concurrency Control

OSDI '21 - Polyjuice: High-Performance Transactions via Learned Concurrency Control

OSDI '21 - Marius: Learning Massive Graph Embeddings on a Single Machine

OSDI '21 - GoJournal: a verified, concurrent, crash-safe journaling system

OSDI '21 - Modernizing File System through In-Storage Indexing

Paper #73. Polyjuice: High-Performance Transactions via Learned Concurrency Control

OSDI '21 - Rearchitecting Linux Storage Stack for µs Latency and High Throughput

OSDI '21 - NrOS: Effective Replication and Sharing in an Operating System

OSDI '20 - AIFM: High-Performance, Application-Integrated Far Memory

OSDI '20 - PANIC: A High-Performance Programmable NIC for Multi-tenant Networks

USENIX ATC '21/OSDI '21 Joint Keynote Address-It's Time for Operating Systems to Redi...

OSDI '20 - Performance-Optimal Read-Only Transactions

USENIX ATC '21/OSDI '21 Joint Keynote Address - Distributed Trust: Is “Blockchain” the ans...

OSDI '21 - Privacy Budget Scheduling

OSDI '20 - Microsecond Consensus for Microsecond Applications

ShadowVM: Accelerating Data Plane for Data Analytics with Bare Metal CPUs and GPUs

Reading 'Global Capacity Management with Flux' from OSDI 2023

USENIX ATC '21 - Accelerating Encrypted Deduplication via SGX

USENIX ATC '22/OSDI '22 Joint Keynote Address - Surprise-Inspired Networking

SIGMOD2021-5min-EfficientlyAnsweringDurabilityPredictionQueries

OSDI '20 - The CacheLib Caching Engine: Design and Experiences at Scale

OSDI '20 - Fault-tolerant and transactional stateful serverless workflows

SOSP 2021 (Long Video): Caracal: Contention Management with Deterministic Concurrency Control

OSDI '20 - A large scale analysis of hundreds of in-memory cache clusters at Twitter

[SIGMOD 2021] Instance-Optimized Data Layouts for Cloud Analytics Workloads