BTree

This is the data structure that databases most commonly use to store indexes.

Limitations of BTree

BTree structure diagram

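Requires more space, because every node stores key (PK) : value (Row ID) pairs, so fewer entries fit in each page.
👉🏻 more nodes == more pages == more I/O required => ⚠️ Slower!
Range-based queries are slow because of random access.
Even though the PKs are sorted, reading them in order means jumping between nodes on different levels, which causes a lot of scattered disk I/O.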
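To make the space argument concrete, here is a rough back-of-the-envelope sketch in Python. All sizes are assumptions chosen for illustration (8KB page, 8-byte key, 8-byte Row ID, 8-byte child pointer, 100 million keys), and page headers and fill factor are ignored. It compares an entry that carries key + Row ID + child pointer with an entry that carries only key + child pointer.

```python
# Assumed sizes for illustration only; real DBMS layouts differ.
PAGE_SIZE = 8 * 1024   # 8KB page
KEY_SIZE = 8           # e.g. a 64-bit integer PK
ROW_ID_SIZE = 8        # pointer to the row
CHILD_PTR_SIZE = 8     # pointer to a child page


def entries_per_page(page_size: int, entry_size: int) -> int:
    """How many fixed-size entries fit in one page (headers ignored)."""
    return page_size // entry_size


def tree_height(fanout: int, total_keys: int) -> int:
    """Smallest number of levels h such that fanout ** h >= total_keys."""
    height, reachable = 1, fanout
    while reachable < total_keys:
        reachable *= fanout
        height += 1
    return height


# Entry with key + Row ID + child pointer (value stored in every node).
with_values = entries_per_page(PAGE_SIZE, KEY_SIZE + ROW_ID_SIZE + CHILD_PTR_SIZE)
# Entry with key + child pointer only.
keys_only = entries_per_page(PAGE_SIZE, KEY_SIZE + CHILD_PTR_SIZE)

total_keys = 100_000_000
print(with_values, tree_height(with_values, total_keys))  # 341 4
print(keys_only, tree_height(keys_only, total_keys))      # 512 3
```

Under these assumptions, dropping the value from each entry raises the fan-out from 341 to 512, which is enough to remove one level from the tree, and each level removed is roughly one page read saved per lookup.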

These limitations are the reason the B+Tree was introduced

B+Tree structure diagram

In practice, each node holds multiple key-value entries. Recall that the page size is 8KB or 16KB, depending on the DBMS.

The root and internal nodes hold only keys (plus pointers to child nodes).
👉🏻 more entries per page => fewer pages => less I/O => Faster!
Each leaf node holds key-value pairs.
The value is a pointer to the row (Row ID).
Leaf nodes are linked to each other.
Once a leaf node is found, the scan can move on to the next leaf without going back to the tree root.
👉🏻 This makes range-based queries faster.
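The points above can be sketched in a few lines of Python. This is a minimal in-memory illustration, not how any particular DBMS implements it: the class names `Leaf` and `Internal`, the helper functions, and the tiny hand-built tree are all hypothetical. It descends from the root once to find the starting leaf, then follows the `next` links between leaves.

```python
from bisect import bisect_left, bisect_right
from dataclasses import dataclass
from typing import List, Optional, Tuple, Union


@dataclass
class Leaf:
    keys: List[int]
    row_ids: List[str]               # value = pointer to the row (Row ID)
    next: Optional["Leaf"] = None    # link to the next leaf


@dataclass
class Internal:
    keys: List[int]                  # separator keys only, no values
    children: List[Union["Internal", Leaf]]


def find_leaf(node: Union[Internal, Leaf], key: int) -> Leaf:
    """Descend from the root to the leaf that could contain `key`."""
    while isinstance(node, Internal):
        # Keys >= a separator live in the child to its right.
        node = node.children[bisect_right(node.keys, key)]
    return node


def range_scan(root: Union[Internal, Leaf], low: int, high: int) -> List[Tuple[int, str]]:
    """Return (key, row_id) pairs with low <= key <= high."""
    leaf: Optional[Leaf] = find_leaf(root, low)   # one descent from the root
    results: List[Tuple[int, str]] = []
    while leaf is not None:
        start = bisect_left(leaf.keys, low)
        for key, row_id in zip(leaf.keys[start:], leaf.row_ids[start:]):
            if key > high:
                return results
            results.append((key, row_id))
        leaf = leaf.next                          # follow the leaf link
    return results


# Hand-built example tree: three linked leaves under one root.
leaf_c = Leaf([9, 11], ["r9", "r11"])
leaf_b = Leaf([5, 7], ["r5", "r7"], next=leaf_c)
leaf_a = Leaf([1, 3], ["r1", "r3"], next=leaf_b)
root = Internal(keys=[5, 9], children=[leaf_a, leaf_b, leaf_c])

print(range_scan(root, 4, 10))  # [(5, 'r5'), (7, 'r7'), (9, 'r9')]
```

The root is visited only once, in `find_leaf`. After that the scan touches leaves in key order, which is the access pattern that lets a range query avoid repeated trips through the upper levels.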

Range-based query in B+Tree

This post is licensed under CC BY 4.0 by the author.