DobLIX: A Dual-Objective Learned Index for Log-Structured Merge Trees

  • 2025-09-02 00:35:44
  • Alireza Heidari, Amirhossein Ahmadi, Wei Zhang

Abstract

In this paper, we introduce DobLIX, a dual-objective learned index specifically designed for Log-Structured Merge (LSM) tree-based key-value stores. Traditional learned indexes focus exclusively on optimizing index lookups and often overlook the impact of data access from storage, resulting in performance bottlenecks. DobLIX addresses this by incorporating a second objective, data access optimization, into the learned index training process. This dual-objective approach minimizes both index lookup cost and data access cost, leading to significant improvements in read performance while maintaining write efficiency in real-world LSM-tree systems. Additionally, DobLIX features a reinforcement learning agent that dynamically tunes the system parameters, allowing it to adapt to varying workloads in real time. Experimental results using real-world datasets demonstrate that DobLIX reduces indexing overhead and improves throughput by 1.19 to 2.21 times compared to state-of-the-art methods within RocksDB, a widely used LSM-tree-based storage engine.
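To make the idea of a dual objective concrete, the following is a minimal toy sketch and not DobLIX's actual training procedure, which the abstract does not specify. It assumes a piecewise-linear learned index built with a greedy shrinking-cone pass, and a hypothetical cost that adds an index lookup term (binary search over segments) to a data access term (records read around the predicted position). The names `segment`, `combined_cost`, `pick_eps`, and the weights `w_index` and `w_data` are all illustrative assumptions.

```python
import math


def _mid(lo, hi):
    # A single-point segment has an unconstrained slope; use 0.0 for it.
    return 0.0 if math.isinf(lo) or math.isinf(hi) else (lo + hi) / 2.0


def segment(keys, eps):
    """Greedy shrinking-cone segmentation (illustrative, not DobLIX's method).

    Each segment is a line anchored at its first (key, position) pair that
    predicts every covered position within +/- eps.
    Keys must be sorted and strictly increasing.
    Returns a list of (first_key, first_pos, slope) tuples.
    """
    segments = []
    k0, p0 = keys[0], 0
    lo, hi = -math.inf, math.inf
    for pos in range(1, len(keys)):
        dx = keys[pos] - k0
        # Slopes that keep this point's prediction within eps of pos.
        new_lo = max(lo, (pos - p0 - eps) / dx)
        new_hi = min(hi, (pos - p0 + eps) / dx)
        if new_lo > new_hi:
            # No slope fits all points: close the segment, start a new one.
            segments.append((k0, p0, _mid(lo, hi)))
            k0, p0 = keys[pos], pos
            lo, hi = -math.inf, math.inf
        else:
            lo, hi = new_lo, new_hi
    segments.append((k0, p0, _mid(lo, hi)))
    return segments


def combined_cost(keys, eps, w_index=1.0, w_data=1.0):
    """Hypothetical dual-objective cost for a given error bound eps."""
    n_seg = len(segment(keys, eps))
    index_cost = math.log2(n_seg + 1)  # binary search over segments
    data_cost = 2 * eps + 1            # records read around the prediction
    return w_index * index_cost + w_data * data_cost


def pick_eps(keys, candidates, w_index=1.0, w_data=1.0):
    """Return the candidate eps with the lowest combined cost."""
    return min(candidates,
               key=lambda e: combined_cost(keys, e, w_index, w_data))
```

Calling `pick_eps(keys, [1, 4, 16, 64])` on a sorted key list returns the error bound with the lowest combined cost under these assumed weights. A small error bound yields more segments (a costlier index lookup) but fewer records read from storage, and a large one does the opposite; an index-only objective would ignore the second term.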

 
