One post tagged with "optimization"

OrioleDB fastpath search

July 18, 2025 · 4 min read

Creator of OrioleDB

When you optimize the CPU time of a transactional database management system, it comes down to one question: how fast can you read a page without breaking consistency? In this post, we explore how OrioleDB avoids locks, trims memory copies, and — starting with beta12 — even bypasses both copying and tuple deforming altogether for fixed-length types during intra-page search. This means that not only are memory copies skipped, but the overhead of reconstructing tuples is also eliminated. The result: an even faster read path, with no manual tuning required.

Why page copies matter … and why they hurt

Every time a PostgreSQL backend descends an OrioleDB B-tree, it needs a consistent view of the target page. Instead of locking the whole page, OrioleDB keeps a 32-bit state word in the page header.

The low bits represent a change count that increments with every data modification; the high bits hold lightweight flags for "exclusive" and "read-blocked" states. A reader copies the necessary bytes, then re-reads the state, and retries if the counter has changed without using locks, yet achieving perfect consistency.

The following pseudo-code illustrates how to copy a consistent page image using a state variable for synchronization.

def read_page(page, image):
  while true:
    state = read_state(page)
    if state_is_blocked(state):
      continue

    copy_data(image, page)

    newState = read_state(page)
    if state_is_blocked(state) or get_change_count(newState) != get_change_count(state):
      continue

    return

Why page copies matter … and why they hurt​

Why page copies matter … and why they hurt