
arxiv: 1205.6699 · v1 · submitted 2012-05-30 · 💻 cs.DB

Recognition: unknown

Minuet: A Scalable Distributed Multiversion B-Tree

classification 💻 cs.DB
keywords: minuet, analytics, b-tree, data, main-memory, queries, distributed, hundreds

Data management systems have traditionally been designed to support either long-running analytics queries or short-lived transactions, but an increasing number of applications need both. For example, online games, socio-mobile apps, and e-commerce sites need to not only maintain operational state, but also analyze that data quickly to make predictions and recommendations that improve user experience. In this paper, we present Minuet, a distributed, main-memory B-tree that supports both transactions and copy-on-write snapshots for in-situ analytics. Minuet uses main-memory storage to enable low-latency transactional operations as well as analytics queries without compromising transaction performance. In addition to supporting read-only analytics queries on snapshots, Minuet supports writable clones, so that users can create branching versions of the data. This feature can be quite useful, e.g. to support complex "what-if" analysis or to facilitate wide-area replication. Our experiments show that Minuet outperforms a commercial main-memory database in many ways. It scales to hundreds of cores and TBs of memory, and can process hundreds of thousands of B-tree operations per second while executing long-running scans.

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. BranchBench: Aligning Database Branching with Agentic Demands

    cs.DB · 2026-04 · unverdicted · novelty 5.0

    BranchBench shows that existing branchable DBMSes face severe trade-offs between branching speed and read/write performance, with no system supporting representative agentic workloads at scale.