[CS] Jun Hyuk Chang MS PresentationJun 9, 2025

via cs cs at mailman.cs.uchicago.edu
Fri May 30 09:12:06 CDT 2025


This is an announcement of Jun Hyuk Chang's MS Presentation
===============================================
Candidate: Jun Hyuk Chang

Date: Monday, June 09, 2025

Time:  2:30 pm CST

Location: JCL 298

Title: Multi-Version Hash Table

Abstract: Hash tables are essential and frequently used data structures in database systems, particularly for hash joins and aggregation operations. Typically, a hash table is built once and discarded after the query execution. However, many database workloads repeatedly execute similar or identical joins, suggesting potential for significant performance improvements if hash tables could be reused across multiple queries. In this research, we propose the Multi-Version Hash Table (MVHT), a novel data structure designed to leverage multi-version concurrency control (MVCC) semantics, enabling the reuse and sharing of hash tables across queries executed at different snapshot timestamps. MVHT extends traditional ephemeral hash tables by maintaining versioned records organized either by linear version chains, dual chains (recent vs. historical), or timestamp-based partitioning. This design allows concurrent access from multiple queries, including long-lived analytical queries and incremental view maintenance (IVM) workloads. We investigate various design choices for MVHT, including different structural organizations, repair strategies for version chains, and garbage collection (GC) policies. Our evaluation considers diverse workloads to analyze under which scenarios MVHT significantly improves query performance and memory utilization compared to traditional approaches. This research leaves several key questions regarding optimal MVHT structures, repair timing (eager versus lazy), and the most effective GC strategies open, highlighting promising directions for future exploration.

Advisors: Aaron Elmore

Committee Members: Aaron Elmore, Raul Castro Fernandez, Michael Franklin


More information about the cs mailing list