DeepDB

DeepDB (also sometimes called DeepSQL near the end of the project) was a proprietary MySQL storage engine designed for OLAP and OLTP workloads. Intended to scale MySQL to large scale data operations, it utilizes adaptive data structures and machine learning algorithms to optimize transactional workloads at big data scale. Different from classic B-Tree and LSM-Tree based storage engines, DeepDB is built on top of a new tree structure, called CASSI (Continuous Adaptive Sequential Summarization of Info), which dynamically configures the database during runtime to adapt to new hardware deployments. CASSI keeps running the three steps of analysis, adaption and optimization for high efficiency. Therefore, this storage engine allows enterprises to utilize MySQL without manual configuration under new hardware settings.

History

Deep Information Sciences was founded in 2010 based on research conducted at the University of New Hampshire. After the company went under in 2017, the source code of the DeepDB engine was released as open-source as part of a new Deep Software Foundation holding. A large portion of the source code of the system was a custom C++ implementation of the Java Development Kit software and not related to the DBMS itself.

Storage Organization

Copy-on-Write / Shadow Paging

A shadow copy is maintained for recovery on system crush. Disk snapshots are maintained to roll forward and back in data history.

System Architecture

Shared-Everything

DeepDB storage engine is designed to replace MySQL's native storage engines, including InnoDB and MyISAM. It transforms MySQL into adaptive, self-tuning and highly performant database with full ACID compliance and brings additional machine-learning metrics to MySQL. The system is architected for complex environments and supports HTAP(Hybrid Transactional Processing).

Storage Architecture

Disk-oriented

DeepSQL turns MySQL into a cloud-ready, perpetually adapting database.

Storage Model

N-ary Storage Model (Row/Record)

DeepDB uses N-ary storage model.

DeepDB Logo
Source Code

https://github.com/DeepFound/deep_engine

Developer

Deep Information Sciences, Inc.

Country of Origin

US

Start Year

2010

End Year

2017

Former Name

DeepSQL

Project Type

Commercial, Open Source

Written in

C++

Derived From

MySQL

Licenses

AGPL v3, Proprietary