Elliptics

Viewing Revision #26 from 2018-12-20 01:17

Elliptics network is a fault-tolerant, distributed key/value (non-relational) database system. It implements hash-table object storage: under the default key generation policy, each key is hashed with the SHA-512 algorithm into a 512-bit ID. The key hashing algorithm can be overridden, but in practice this is not used.[05][01]
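The default key-to-ID mapping described above can be illustrated with Python's standard hashlib module. This is a minimal sketch, not the Elliptics client API: the function name key_to_id and the sample key are hypothetical.

```python
import hashlib


def key_to_id(key: str) -> bytes:
    """Hash an application key into a 512-bit (64-byte) ID with SHA-512."""
    return hashlib.sha512(key.encode("utf-8")).digest()


# Hypothetical sample key, used only for illustration.
object_id = key_to_id("user:42:avatar")
print(len(object_id) * 8)      # 512
print(object_id.hex()[:16])    # first 16 hex digits of the ID
```

The same key always yields the same ID, which is what allows any client to locate an object without a central directory.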

Website: http://reverbrain.com/elliptics[01]
Source Code: https://github.com/reverbrain/elliptics[02] Accessed: Jun 4, 2026 Last Commit: Nov 13, 2019
Tech Docs: http://doc.reverbrain.com/elliptics:elliptics[03]
Developer: Reverbrain
Country of Origin: RU
Start Year: 2009
Project Types: Commercial, Open Source
Written in: C++, Python
Supported Languages: C, C++, Python
Operating System: Linux
License: LGPL v3
Wikipedia: https://en.wikipedia.org/wiki/Elliptics[04]

History[04][06]

Elliptics was initially created in 2007 as part of POHMELFS v1. POHMELFS stands for Parallel Optimized Host Message Exchange Layered File System, a cache-coherent distributed file system developed by the Russian Linux hacker Evgeny Polyakov. It can be viewed as a protocol for sharing files between file systems on computers over a LAN. In 2009 Elliptics was separated from POHMELFS and later became a consistent distributed storage system in its own right. As of 2014, Elliptics was used in Yandex Maps, Disk, Music, Photos, and some infrastructure.

Concurrency Control[07]

Deterministic Concurrency Control

Elliptics supports a basic lock operation on key-value pairs. The lock is granted on the key ID, which makes an operation on a key atomic within a single group. However, there are no locks across groups to synchronize them.

There is no system-level deadlock detection. It is the user's responsibility to prevent deadlocks when developing a client program.
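Since deadlock prevention is left to the client, one common client-side technique is to always acquire locks in a fixed global order, for example ascending key ID. The sketch below is an in-process illustration under stated assumptions: the lock table, lock_keys, and transfer are hypothetical names, and threading.Lock stands in for the per-key lock; none of this is the Elliptics API.

```python
import hashlib
import threading
from collections import defaultdict
from contextlib import ExitStack

# Hypothetical in-process stand-in for per-key locks (not the Elliptics API).
_locks = defaultdict(threading.Lock)
_locks_guard = threading.Lock()


def _lock_for(key_id: bytes) -> threading.Lock:
    # Guard the table so two threads never create different locks for one ID.
    with _locks_guard:
        return _locks[key_id]


def lock_keys(keys):
    """Acquire the locks for all keys in ascending ID order.

    Returns an ExitStack that releases every lock when it is closed.
    """
    ids = sorted({hashlib.sha512(k.encode("utf-8")).digest() for k in keys})
    stack = ExitStack()
    for key_id in ids:
        stack.enter_context(_lock_for(key_id))
    return stack


def transfer(store, src, dst, amount):
    """Update two keys atomically with respect to other lock_keys callers."""
    with lock_keys([src, dst]):
        store[src] -= amount
        store[dst] += amount
```

Because every caller takes the locks in the same order, two clients that touch the same pair of keys in opposite directions cannot end up waiting on each other.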

Elliptics uses an eventual consistency model to maintain data replicas: the copies of the data in a group may not be identical at any given moment, but they will eventually be synchronized with the others.
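The eventual consistency behavior can be illustrated with a toy model. The sketch assumes a last-writer-wins rule based on timestamps, which is an assumption made for this example rather than a documented Elliptics policy; Replica and sync are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Toy replica holding key -> (timestamp, value)."""
    data: dict = field(default_factory=dict)

    def write(self, key, value, ts):
        # Assumption for this sketch: the newest timestamp wins.
        current = self.data.get(key)
        if current is None or ts > current[0]:
            self.data[key] = (ts, value)

    def read(self, key):
        entry = self.data.get(key)
        return entry[1] if entry else None


def sync(replicas):
    """Propagate the newest version of every key to all replicas."""
    newest = {}
    for replica in replicas:
        for key, (ts, value) in replica.data.items():
            if key not in newest or ts > newest[key][0]:
                newest[key] = (ts, value)
    for replica in replicas:
        for key, (ts, value) in newest.items():
            replica.write(key, value, ts)


a, b = Replica(), Replica()
for r in (a, b):
    r.write("k", "old", ts=1)
a.write("k", "new", ts=2)          # the update has reached only replica a
print(a.read("k"), b.read("k"))    # new old
sync([a, b])
print(a.read("k"), b.read("k"))    # new new
```

Between the write and the sync pass, a reader may see either version depending on which replica it contacts.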

Data Model[08]

Key/Value

Elliptics' own storage uses a key-value data model. It can also support relational data storage, for example on top of an SQL database, if a custom backend is written for it.

Indexes[09][10][11]

Red-Black Tree

The index structure exposed to the client is called secondary indexes. It is implemented with the STL std::map template in C++, which is typically implemented as a red-black tree.
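An ordered map of this kind can be sketched in Python. The standard library has no red-black tree, so this example swaps in a sorted list maintained with bisect; it reproduces the ordered-lookup behavior of std::map but not its logarithmic insertion cost. The class OrderedIndex and the layout (index name mapped to a set of object keys) are assumptions for illustration, not the actual Elliptics data layout.

```python
import bisect


class OrderedIndex:
    """Ordered map from index name to the set of object keys tagged with it."""

    def __init__(self):
        self._names = []   # index names, kept sorted
        self._keys = {}    # index name -> set of object keys

    def add(self, index_name, object_key):
        if index_name not in self._keys:
            bisect.insort(self._names, index_name)
            self._keys[index_name] = set()
        self._keys[index_name].add(object_key)

    def find(self, index_name):
        """Object keys stored under one index name, in sorted order."""
        return sorted(self._keys.get(index_name, ()))

    def range(self, lo, hi):
        """Index names in the half-open interval [lo, hi), in sorted order."""
        start = bisect.bisect_left(self._names, lo)
        end = bisect.bisect_left(self._names, hi)
        return self._names[start:end]


idx = OrderedIndex()
idx.add("color:red", "obj1")
idx.add("color:blue", "obj2")
idx.add("color:red", "obj3")
print(idx.find("color:red"))        # ['obj1', 'obj3']
```

Keeping the names sorted is what makes range scans over index names possible, which a plain hash table would not offer.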

Isolation Levels[12][13][08][14]

Read Committed

Isolations are used in the Elliptics core to spawn slaves/workers in a controlled environment. Each worker is a separate process, which can be interpreted as a transaction. Resources accessed by workers can be limited at the process level or at the control group (cgroup) level, which leads to two types of isolation in Elliptics: Process and CGroup. Process isolation cannot be configured, while CGroup isolation takes system configuration as an argument.

Readers and writers behave differently with respect to isolation. Because locking is controlled manually by the clients, a writer can sneak in when a read is performed without a lock, and readers may then return either the old or the new content. Because Elliptics updates replicas in parallel without keeping a single replica log, readers that read from multiple replicas may disagree: one reader can receive the old content while another already sees the new one. Writers always receive a completion status for every replica. If transactions must be atomic across physically distributed replicas, clients can implement a central entry point that holds the lock and updates it only after all completion statuses have been received. Transaction rollback can be implemented in this way.
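The central entry point idea can be sketched as follows. This is a hypothetical client-side coordinator, not part of Elliptics: ReplicaStub, Coordinator, and write_all are names invented for the example, and replicas are modeled as in-memory dictionaries that report a boolean completion status.

```python
import threading


class ReplicaStub:
    """In-memory stand-in for one replica; 'fail' simulates a failed write."""

    def __init__(self, fail=False):
        self.data = {}
        self.fail = fail

    def write(self, key, value):
        """Return the completion status: True on success, False on failure."""
        if self.fail:
            return False
        self.data[key] = value
        return True


class Coordinator:
    """Hypothetical central entry point that serializes multi-replica writes."""

    def __init__(self, replicas):
        self.replicas = replicas
        self._lock = threading.Lock()

    def write_all(self, key, value):
        with self._lock:
            # Remember the previous state so a partial write can be undone.
            previous = [(r, key in r.data, r.data.get(key)) for r in self.replicas]
            statuses = [r.write(key, value) for r in self.replicas]
            if all(statuses):
                return True
            # At least one replica failed: restore the old state everywhere.
            for replica, existed, old_value in previous:
                if existed:
                    replica.data[key] = old_value
                else:
                    replica.data.pop(key, None)
            return False
```

The write counts as committed only when every replica reports success; otherwise the replicas that did accept it are rolled back while the lock is still held, so no other client going through the coordinator observes the partial state.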

Joins[15]

Not Supported

The API documentation shows that Elliptics does not provide join methods that can be called from the client, because it is not table-oriented and the only guaranteed atomicity level is per object in a single replica. This decision was made for performance and scalability reasons.

Inside the backends, however, merges and joins may be available, depending on the backend; this is controlled by the query execution core. For example, joins are supported when a relational database is used as the backend, but not when LevelDB is used.

Logging[16][17][18]

Physical Logging

Elliptics has used replication to ensure data availability from the beginning of its design. To use the replication features, an administrator logically binds a group of servers together, and these servers contain the same data, i.e. each server keeps a replica of the data held by the other nodes.

For recovery, Elliptics uses a route table to record the set of nodes and the range of key IDs that each node is responsible for. A route table contains at least the following fields: timestamp, group ID, hex ID strings associated with the group number, and address.
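A route table lookup can be sketched as a search over sorted ID range starts. The example assumes that each entry's hex string marks where that node's ID range begins and that a node owns all IDs up to the next entry in the same group, wrapping around at the end; the Route type, the field name start_id, and the sample addresses are hypothetical.

```python
import bisect
import hashlib
from typing import NamedTuple


class Route(NamedTuple):
    timestamp: float
    group_id: int
    start_id: str   # assumed: 128-digit hex string where the node's range begins
    address: str


def build_ring(routes, group_id):
    """Sorted (start_id, address) pairs for one group."""
    return sorted((r.start_id, r.address) for r in routes if r.group_id == group_id)


def lookup(ring, key):
    """Return the address of the node whose range contains the key's SHA-512 ID."""
    key_id = hashlib.sha512(key.encode("utf-8")).hexdigest()
    starts = [start for start, _ in ring]
    # Last entry with start_id <= key_id; index -1 wraps around to the final node.
    pos = bisect.bisect_right(starts, key_id) - 1
    return ring[pos][1]


# Hypothetical two-node group that splits the ID space in half.
routes = [
    Route(1545264000.0, 1, "0" * 128, "10.0.0.1:1025"),
    Route(1545264000.0, 1, "8" + "0" * 127, "10.0.0.2:1025"),
]
ring = build_ring(routes, group_id=1)
print(lookup(ring, "photo-1"))
```

Lowercase hex strings of equal length sort in the same order as the numbers they encode, so plain string comparison is enough here.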

Elliptics supports three types of recovery: hash ring recovery (merge), deep hash ring recovery (deep_merge), and replica recovery (dc). Merge recovers data within a single group by moving it from one node to another according to the route tables. Replica recovery is designed to ensure durability across groups. Elliptics can also perform manual recovery.
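The idea behind merge recovery, moving data to the node that the route table says should hold it, can be sketched with small integer IDs. This is a simplified illustration rather than the actual recovery tool: owner, merge_recovery, and the data layout are assumptions, and a real implementation would also compare timestamps before overwriting a copy on the target node.

```python
import bisect


def owner(ring, key_id):
    """Address of the node whose range contains key_id (wraps around)."""
    starts = [start for start, _ in ring]
    return ring[bisect.bisect_right(starts, key_id) - 1][1]


def merge_recovery(ring, nodes):
    """Move every misplaced key to the node that owns its ID range.

    ring:  sorted list of (range_start, address) for one group
    nodes: address -> {key_id: value}
    Returns the number of keys that were moved.
    """
    moved = 0
    for address, store in nodes.items():
        for key_id in list(store):      # copy: the store is modified in the loop
            target = owner(ring, key_id)
            if target != address:
                nodes[target][key_id] = store.pop(key_id)
                moved += 1
    return moved


ring = [(0, "A"), (100, "B")]
nodes = {"A": {5: "x", 150: "y"}, "B": {120: "z"}}
print(merge_recovery(ring, nodes))   # 1
print(nodes)                         # {'A': {5: 'x'}, 'B': {120: 'z', 150: 'y'}}
```

Key 150 falls in the range that starts at 100, so it is moved from node A to node B; the other keys are already in place.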

Query Execution

Elliptics uses Cocaine to execute code. Cocaine lets Elliptics create a pool of cgroups-bound processes that execute externally loaded code. This code can access external elements.

Query Interface[19][20][21]

Custom API Command-line / Shell

The API supports C, C++, and Python.

In the current version of the Elliptics documentation, only the link to the Python API works. The links to the C API and C++ API documents are broken, but archived versions dated May 11, 2016 can be retrieved from archive.org.

The Python API is designed to configure the Elliptics client and includes Logger, Config, Node, Session, etc. The C++ API also configures the client side of Elliptics, but offers fewer interfaces than the Python API library. Node is the main controlling structure of Elliptics.

The C API can be used to configure both client and server, with functionality for creation, configuration, server-side processing, cache, and backends. On the client side, everything is built on an asynchronous API model, while the server side allows both synchronous and asynchronous calls.

Storage Architecture[22][23][08]

Hybrid

Storage engines are called backends in Elliptics. Elliptics has three low-level backends: filesystem (written objects are stored as files), Eblob (fast append-only storage), and Smack (small compressible objects stored in sorted tables).
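The append-only approach can be illustrated with a toy in-memory store. This sketch shows the general technique only and does not reflect Eblob's on-disk format; AppendOnlyBlob and its methods are hypothetical names.

```python
class AppendOnlyBlob:
    """Toy append-only store: records are only appended, never modified."""

    def __init__(self):
        self._blob = bytearray()
        self._index = {}   # key -> (offset, length) of the newest record

    def write(self, key, value: bytes):
        # An overwrite appends a new record and repoints the index;
        # the old bytes stay in the blob until some later compaction.
        offset = len(self._blob)
        self._blob += value
        self._index[key] = (offset, len(value))

    def read(self, key) -> bytes:
        offset, length = self._index[key]
        return bytes(self._blob[offset:offset + length])

    def size(self) -> int:
        return len(self._blob)


blob = AppendOnlyBlob()
blob.write("k", b"v1")
blob.write("k", b"v22")
print(blob.read("k"))   # b'v22'
print(blob.size())      # 5
```

Writes are fast because they only touch the end of the blob; the price is the stale data left behind by overwrites.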

Moreover, Elliptics implements both a generic storage protocol and its own specific protocol, so data stored in other services can be routed to Elliptics. For example, Elliptics can connect to MySQL servers and trigger special commands to read/write data into Elliptics.

System Architecture[24]

Shared-Disk

Elliptics consists of three layers: frontends, the Elliptics core, and backends. Frontends connect multiple clients to the core, which performs query execution. Backends are the data storage layer and provide different storage systems.

Views

Not Supported

Citations

24 sources

[01] http://reverbrain.com/elliptics (reverbrain.com). Dead link, check archive. Accessed: 2026-05-25
[02] GitHub - reverbrain/elliptics: Distributed storage for medium and large objects with streaming support (github.com). Accessed: 2026-06-04
[03] http://doc.reverbrain.com/elliptics:elliptics (reverbrain.com). Dead link, check archive. Accessed: 2026-06-05
[04] Elliptics - Wikipedia (wikipedia.org). Modified: 2025-10-22. Accessed: 2026-06-04
[05] http://doc.reverbrain.com/elliptics:api-python#ellipticsid (reverbrain.com). Dead link, check archive. Accessed: 2026-05-24
[06] POHMELFS - Wikipedia, Russian edition (wikipedia.org). Modified: 2025-11-15. Accessed: 2026-06-07
[07] http://doc.reverbrain.com/elliptics:atomic-operations (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22
[08] http://doc.reverbrain.com/elliptics:what (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22
[09] https://github.com/reverbrain/elliptics/blob/master/bindings/cpp/session_indexes.cpp (github.com). Dead link, check archive. Accessed: 2026-05-22
[10] http://doc.reverbrain.com/elliptics:secondary-indexes (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22
[11] std::map - cppreference.com (cppreference.com). Modified: 2026-05-19. Accessed: 2026-06-07
[12] GitHub - cocaine/cocaine-core: An open platform to build your own PaaS clouds. (github.com). Accessed: 2026-05-24
[13] http://doc.reverbrain.com/elliptics:api-cc (reverbrain.com). Dead link, check archive. Accessed: 2026-05-29
[14] Profile - cocaine/cocaine-core Wiki (github.com). Accessed: 2026-05-22
[15] http://doc.reverbrain.com/elliptics:api-python#ellipticssession (reverbrain.com). Dead link, check archive. Accessed: 2026-05-24
[16] http://doc.reverbrain.com/elliptics:replication (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22
[17] http://doc.reverbrain.com/blueprints:elliptics:recovery (reverbrain.com). Dead link, check archive. Accessed: 2026-05-24
[18] http://doc.reverbrain.com/elliptics:route (reverbrain.com). Dead link, check archive. Accessed: 2026-05-24
[19] http://doc.reverbrain.com:80/elliptics:api-cm (reverbrain.com). Dead link, check archive. Accessed: 2026-05-24
[20] http://doc.reverbrain.com:80/elliptics:api-cc (reverbrain.com). Dead link, check archive. Accessed: 2026-05-24
[21] http://doc.reverbrain.com/elliptics:api-python (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22
[22] http://doc.reverbrain.com/elliptics:serverside (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22
[23] Architecture - cocaine/cocaine-core Wiki (github.com). Accessed: 2026-05-22
[24] http://doc.reverbrain.com/elliptics:architecture-scheme (reverbrain.com). Dead link, check archive. Accessed: 2026-05-22

Revision #26 Last Updated: 2018-12-19 20:17