LMDB

Viewing Revision #1 from 2018-04-10 14:35 View Current

LMBD is a light-weight, high-speed embedded database for key-value data and it uses B+ trees to manage data. LMDB is inspired from Berkeley DB, essentially LMDB is made light-weight by removing and rewriting most of the BDB code. Few features in Berkeley DB like the 2-level locking(deadlocks and overhead avoided) and the additional 2-level caching (DB cache and backend cache in addition to file system cache) have been discarded in LMDB. LMDB is fully transactional and it complies with the ACID properties. The key features of LMDB is that it uses a single-level store, i.e, it uses memory map to store the database, which means that the OS is responsible for managing the pages (like caching frequently uses pages), this also means that the database can be larger than the memory and can grow upto the size of the virtual memory. Given that this embedded database system relinquishes most of the control of what to keep in memory and what to flush to disk, larger than memory databases may not perform well for certain workloads. MVCC concurrency model is enforced using a single writer(serialized) and the copy-on-write semantics to handle writes to the database. Readers don't block writers and writers don't block readers, and since there is single writer to the database there won't be any deadlocks. LMDB encourages read-only memory map, although write maps are supported in fully in-memory databases. LMDB maintains only two versions of data, i.e. once committed all the previous verions are discarded. A free list (another B+ tree apart from the one for the user pages) is maintained to track and reuse freed pages instead of going for new pages everytime, hence keeping the overall size usage optimized.[01][02][03][04]

Developer: Howard Chu
Start Year: 2011 [09]
Project Type: Open Source
Supported Languages: C++
Derived From: Berkeley DB
Operating Systems: AIX, Android, BSD, iOS, Linux, macOS, Solaris, Windows

Developer: Howard Chu
Start Year: 2011 [09]
Project Type: Open Source
Supported Languages: C++
Derived From: Berkeley DB
Operating Systems: AIX, Android, BSD, iOS, Linux, macOS, Solaris, Windows

Derivative Systems

Siaqodb

FineDB LI libmdbx

View All (4)

Embeddings

XTDB

UnisonDB

Quadrable

View All (17)

LMDB

Viewing Revision #1 from 2018-04-10 14:35 View Current

History[05][04]

LMDB is currently used in OpenLDAP (Berkeley DB was previously used since 1999). OpenLDAP is an open-source implementation of the Lightweight Directory Access Protocol (LDAP). As mentioned in the description, LMDB was mainly inspired from Berkeley DB, and discards many heavy modules from BDB. The object code size is around 40 KB (which is very light), can comfortably fit into modern CPU caches. The code for LMDB is mainly developed and maintained by the Symas Corporation.

Checkpoints

In the event of a crash, the database starts of from where it was left, the OS takes care of writing data to disk and the database here doesn't need to take any snapshots. The on-disk representation is similar to the in-memory representation, there is no provision for compressing the data, due the memory map constraints.

Concurrency Control[03]

Multi-version Concurrency Control (MVCC)

Locking overhead avoided by using MVCC, readers don't block at all and writers don't block readers. Deleted versions are reclaimed by the free space management module of LMDB (essentially stored into a B+ tree for later use).

Data Model

Key/Value

This embedded database is a key-value in the backend, which is stored in the memory-map. The keys are indexed in a B+ tree. LMDB provides transactional guarantees on top of this key-value store. It is not a relational database.

Indexes[06][03]

B+Tree

LMDB uses a modified design of Append-only B+ Tree and it uses 2 B+ trees : one for maintaining the regular user data pages and one for maintaining the free pages obtained after deletes. LMDB is optimized for short read transactions, long lived read transactions will keep older pages longer in the table and hence blocks write operations. If the workload has too many writes along with long lived read transactions, the performance would be very low.

Isolation Levels[03]

Serializable

LMDB provides Serializable isolation with MVCC, this is possible because of the single-writer semantics. Only a single write transaction can can be alive at a single point of time, hence no races among multiple writers modifying the database.

Joins

Not Supported

Logging[07]

No logging procedures are implemented here, using copy-on-write semantics (with shadow paging) provides durability without any need for logging. Shadow paging allows new writes to a different location and not directly replace the existing pages, hence avoids data-corruption. Also the shadow page reference update is atomic, hence avoids need for logging.

Query Compilation

Not Supported

Query Execution

Tuple-at-a-Time Model

There is no query planning or query execution options as this is an embedded database, since we operate at individual key level, the closest we can classify it is under tuple-at-a-time. The user can program custom querying models on top this embeddded database, which can support other query execution options.

Query Interface[08]

Custom API

LMDB has no SQL layer but applications can directly access the database using API calls provided by LMDB. API support is not just in C but many wrappers for other languages have been developed by open-source contributors. All key-value store operations can be performed using these API calls.

Storage Architecture

In-Memory

LMDB uses mmap, hence it reliquishes most of the caching control to the OS. Memory map allows zero-copies for read/write and no additional buffers for the transaction control. Supports larger-than memory databases, it is bounded by the size of the virtual memory since they use a memory map.

Storage Model

Custom

They use a memory-map to store the database with copy-on-write semantics, hence no specific storage model but the semantics are left to the operating system. The on-disk representation is similar to the memory representation of the database.

Stored Procedures

Not Supported

System Architecture

Shared-Memory

LMDB uses shared-memory model i.e. it handles the memory as a single address space and all the threads access this in parallel. It uses copy-on-write semantics.

Views

Not Supported

Derivative Systems

Siaqodb

FineDB LI libmdbx

View All (4)

Embeddings

XTDB

UnisonDB

Quadrable

View All (17)

Citations

9 sources

LMDB | symas symas.com Accessed: 2026-06-14
lmdb — lmdb 1.4.0 documentation readthedocs.io Modified: 2022-12-06 Accessed: 2026-06-14
LMDB: Lightning Memory-Mapped Database Manager (LMDB) lmdb.tech Modified: 2026-06-04 Accessed: 2026-06-14
The Lightning Memory-Mapped Database (Howard Chu) - Carnegie Mellon Database Group cmu.edu Modified: 2026-06-14 Accessed: 2026-06-14
OpenLDAP, Main Page openldap.org Modified: 2026-03-09 Accessed: 2026-06-14
http://www.bzero.se/ldapd/btree.html bzero.se Dead — Check Archive Accessed: 2026-06-14
https://www.symas.com/products/lightning-memory-mapped-database symas.com Dead — Check Archive Accessed: 2026-06-14
https://www.symas.com/products/lightning-memory-mapped-database/wrappers symas.com Dead — Check Archive Accessed: 2026-06-14
Lightning Memory-Mapped Database - Wikipedia wikipedia.org Modified: 2026-04-14 Accessed: 2026-06-14

Revision #1 Last Updated: 2018-04-10 10:35