Derby

Viewing Revision #57 from 2018-12-12 00:07 View Current

Derby is a lightweight embedded relational database implemented completely in Java. It is an embedded database for any Java applications.[05][06]

Logo Versions

Website: https://db.apache.org/derby/[01]
Source Code: https://github.com/apache/derby[02] Accessed: Jun 4, 2026 Last Commit: Aug 18, 2019
Tech Docs: https://db.apache.org/derby/manuals/index.html[03]
Developer: Cloudscape Inc.
Country of Origin: US
Start Year: 1997 [07]
Former Names: JBMS, Cloudscape, Java DB
Acquired By: Cloudscape Inc.[07]
Project Type: Open Source
Written in: Java
Supported Languages: Java
Operating System: All OS with Java VM
License: Apache v2
Wikipedia: https://en.wikipedia.org/wiki/Apache_Derby[04]

Logo Versions

Website: https://db.apache.org/derby/[01]
Source Code: https://github.com/apache/derby[02] Accessed: Jun 4, 2026 Last Commit: Aug 18, 2019
Tech Docs: https://db.apache.org/derby/manuals/index.html[03]
Developer: Cloudscape Inc.
Country of Origin: US
Start Year: 1997 [07]
Former Names: JBMS, Cloudscape, Java DB
Acquired By: Cloudscape Inc.[07]
Project Type: Open Source
Written in: Java
Supported Languages: Java
Operating System: All OS with Java VM
License: Apache v2
Wikipedia: https://en.wikipedia.org/wiki/Apache_Derby[04]

Derivative Systems

Splice Machine

Derby

Viewing Revision #57 from 2018-12-12 00:07 View Current

Derby is a lightweight embedded relational database implemented completely in Java. It is an embedded database for any Java applications.[05][06]

History[07][08][09]

In 1997, Cloudscape Inc., a start-up in Oakland, California, developed a database engine called JBMS, which was later renamed as Cloudscape. From 1999 to 2001, Cloudscape was acquired first by Informix Software, and by IBM, and its name was changed to IBM Cloudscape. In 2004, IBM contributed the code to the Apache Software Foundation as Derby. The Apache DB project, supported by Apache Software Foundation, aims at creating and maintaining open-sourced, high-quality databases. In 2005, Derby exited the incubator and became a Apache DB subproject.

Checkpoints[10]

Non-Blocking Fuzzy

Derby supports fuzzy checkpointing, with slight variances from the ARIES implementation. Instead of storing active transaction table and dirty page table in checkpoint, it instead stores a few timestamps, which include the checkpoint start time and the earliest start time of ongoing transaction when the checkpoint starts. For example, if transaction T1, T2 and T3 are not finished when the checkpoint starts, then the earliest start time of the three will be recorded in the checkpoint.

When it comes to recovery, the system will first find the nearest checkpoint. Using the earliest start time of ongoing transactions, it will iterate through the log and find all the active transactions and dirty pages at the checkpoint, and redo or undo accordingly.

Compression[11]

Derby does not support data compression, but it supports the function "syscs_util.syscs_compress_table" which is used to claim unused space after there is deletion of large amount of data.

Concurrency Control[12][13]

Two-Phase Locking (Deadlock Detection)

There are two scopes of locking (table-level and row-level), three types of locks (exclusive, shared and update) and four different types of transaction isolation levels. The locking strategies for different combination of scopes, lock types and isolation levels are different.

In general, although not explicitly stated, Derby implements strict two-phase locking. Exclusive locks will be held until a transaction aborts or commits; shared lock, instead, will be released after the reading of the rows finish (except for specific isolation levels) Derby also supports deadlock detection. When a deadlock is detected, the transaction that holds the least number of locks will be aborted.

Data Model[09][14][01]

Relational

Derby is a relational database that supports SQL syntax.

Foreign Keys[15]

Supported

Foreign key is implemented as one of the CONSTRAINT clauses. There are two levels of CONSTRAINTS, column level and table level. Foreign key constraint in a column level enforces that the values in the column must corresponds to the values in the referenced column marked as primary key or unique key. Table level constraint works similarly, but it is for multiple columns.

Insert, update or delete instructions will return an error if the foreign key constraint is violated. The constraint check can be at statement execution or commit depending on the constraint mode.

Indexes[16]

B+Tree

Derby implements standard B+ Tree algorithms with a few features:

It only uses exclusive latches on pages regardless of reading or modification of the page;
Node split is always left to right;
The system holds at most 2 latches simultaneously. During insertion, if there is no space for node splitting, all latches will be released, and Derby will do a split pass from top to bottom. After the split pass, Derby will redo the Insertion operation again.

Isolation Levels[12][17]

Read Uncommitted Read Committed Serializable Repeatable Read

Derby supports four level of isolation: serializable, repeatable read, read committed and read uncommitted for both table-level and row-level locking. The isolation levels can be set by either JDBC methods or SQL statement. One thing to highlight is that for both repeatable read and serializable, the entire table will be locked by either shared or exclusive lock depending on the statement, and the lock will only be released at the end of the transaction. Therefore, there is no phantom read under repeatable read.

Joins[18]

Nested Loop Join Hash Join

Derby provides two types of join strategies -- nested loop and hash join. Nested loop join is more preferable in most cases. Hash join is preferred when inner table values are unique and outer table have many qualifying rows. Also, when the system estimates that the amount of memory required for hash join exceeds the amount available, nested loop will be used.

Logging[19][16]

Physical Logging

Derby implements Write Ahead Logging (WAL) similar to the ARIES design. One of the differences is that instead of saving Log Sequence Number (LSN) in the page data, it saves the page version number in both the page data and the log record, and compare them during recovery.

Derby implements page-level physical logging. For queries that involves more than one pages, the operation will first be converted to loggable actions for each page involved. Then the loggable actions will be used to generate physical logging on that page.

Query Compilation[20][21]

Code Generation JIT Compilation

Derby parses the prepared statement using Javacc and generates the Java binary code directly. JIT complier is supported, so that after several executions, JIT compiler will compile it to native code for performance improvement.

Using ij, Derby can also run ad-hoc statements. The exact compilation process is unclear.

Query Execution[22]

Materialized Model

Subqueries can only be materialized if they not correlate with outer queries, and return one row. For subqueries that cannot be flattened (DISTINCT), optimization can be made on subqueries such as using Hash Join.

Query Interface[23][06]

SQL

Derby support core subset of SQL-92, and some features of SQL-99.

Storage Architecture[24][25][13]

Hybrid

Derby mainly support on-disk database. It also provides in-memory database for testing and developing applications. By following backup procedures, in-memory database can be stored and be used as either an in-memory database or normal on-disk database at a later time.

Storage Model[26]

N-ary Storage Model (Row/Record)

Derby implements row-based storage model. Rows corresponds to records in data pages.

Storage Organization[27]

Heaps

Derby stores data and index in containers, which has a one-to-one mapping with files. Within each container, there will be three types of pages -- header page, data page and allocation page. Data pages hold data in row order.

Stored Procedures[28]

Supported

Derby support Java stored procedures.

Views[29]

Virtual Views

Derivative Systems

Splice Machine

Citations

29 sources

Apache Derby apache.org Modified: 2025-11-04 Accessed: 2026-07-16
GitHub - apache/derby: Mirror of Apache Derby · GitHub github.com Accessed: 2026-06-04
Apache Derby: Documentation apache.org Modified: 2023-11-09 Accessed: 2026-06-05
Apache Derby - Wikipedia wikipedia.org Modified: 2026-01-05 Accessed: 2026-06-04
https://wiki.apache.org/db-derby/ apache.org Dead — Check Archive Accessed: 2026-05-23
Apache Derby apache.org Modified: 2025-11-04 Accessed: 2026-06-07
Apache Derby Project Charter apache.org Modified: 2023-11-09 Accessed: 2026-06-07
Welcome to the Apache DB Project! apache.org Modified: 2025-11-05 Accessed: 2026-06-07
Proposal for Derby: an Apache Database Sub-Project apache.org Modified: 2023-11-09 Accessed: 2026-06-07
Derby Logging and Recovery apache.org Modified: 2023-11-09 Accessed: 2026-06-07
SYSCS_UTIL.SYSCS_COMPRESS_TABLE system procedure apache.org Modified: 2017-10-14 Accessed: 2026-06-07
Types and Scope of Locks in Derby Systems apache.org Modified: 2013-01-24 Accessed: 2026-06-07
Derby Developer's Guide apache.org Modified: 2017-10-14 Accessed: 2026-06-07
https://builds.apache.org/job/Derby-docs/lastSuccessfulBuild/artifact/trunk/out/ref/refderby.pdf apache.org Dead — Check Archive Accessed: 2026-05-24
CONSTRAINT clause apache.org Modified: 2017-10-14 Accessed: 2026-06-07
org.apache.derby.impl.store.access.btree apache.org Modified: 2023-11-09 Accessed: 2026-06-07
Isolation levels and concurrency apache.org Modified: 2017-10-14 Accessed: 2026-06-07
Join strategies apache.org Modified: 2015-10-10 Accessed: 2026-06-07
Derby Logging and Recovery apache.org Modified: 2023-11-09 Accessed: 2026-06-07
https://db.apache.org/derby/binaries/ApacheDerbyInternals_1_1.pdf apache.org Modified: 2005-01-04 Accessed: 2026-06-07
Derby Engine Architecture Overview apache.org Modified: 2023-11-09 Accessed: 2026-06-07
Tuning Derby apache.org Modified: 2018-05-03 Accessed: 2026-06-07
Derby Reference Manual apache.org Modified: 2013-01-24 Accessed: 2026-06-07
https://builds.apache.org/job/Derby-docs/lastSuccessfulBuild/artifact/trunk/out/getstart/getstartderby.pdf apache.org Dead — Check Archive Accessed: 2026-05-23
https://builds.apache.org/job/Derby-docs/lastSuccessfulBuild/artifact/trunk/out/devguide/derbydev.pdf apache.org Dead — Check Archive Accessed: 2026-05-23
Derby On Disk Page Format apache.org Modified: 2023-11-09 Accessed: 2026-06-07
Derby On Disk Page Format apache.org Modified: 2023-11-09 Accessed: 2026-06-07
CREATE PROCEDURE statement apache.org Modified: 2017-10-14 Accessed: 2026-06-07
CREATE VIEW statement apache.org Modified: 2013-01-24 Accessed: 2026-06-07

Revision #57 Last Updated: 2018-12-11 19:07