MammothDB

Viewing Revision #7 from 2024-10-31 21:24

MammothDB is a distributed, relational OLAP DBMS that runs on the Apache Hadoop Distributed File System. It is primarily used for business analytics and data warehousing. After its company was acquired by MariaDB, which provides a DBMS product supporting both analytical and transactional queries, MammothDB was discontinued.[03][04]

Website: https://mariadb.com[01]
Developer: MariaDB Corporation
Country of Origin: BG
Start Year: 2012 [04]
End Year: 2018 [04]
Acquired By: MariaDB Corporation (2018) [11]
Project Type: Commercial
Supported Languages: PL/SQL
License: Proprietary
Twitter: @mammothdb[02]

Tags: Acquired Company, OLAP

History[04][05]

MammothDB was founded in 2012 in Sofia, Bulgaria by Steve Keil, Alex Aldev and Angel Mitev. Its mission was to "democratize" enterprise-level data warehousing and analytics tools by making them cheaper and more accessible. The company was acquired by MariaDB on March 27, 2018, after which MammothDB was discontinued.

Checkpoints[06]

Not Supported

MammothDB has no custom checkpointing mechanism, but users can enable the fault-tolerant HA block storage option on the Hadoop storage nodes for added safety.

Compression[01]

For columnar storage, MammothDB's Hadoop storage can use any SQL-compliant storage engine, so the compression capabilities depend on the engine a client chooses.

Concurrency Control[06][07]

Not Supported

MammothDB does not have a concurrency control protocol. It runs a single worker thread per core on each of its nodes, so no two tasks ever share the same worker thread. The protocol is omitted primarily because the system is meant for low-contention OLAP workloads rather than high-contention OLTP workloads. Because of MySQL's limit on the number of concurrent client sessions, MammothDB can serve at most 5000 concurrent client sessions.

Data Model[08][07]

Relational

MammothDB is a relational database, in part so that clients can easily expand from their initial analytics storage engines (like Microsoft Excel).

Indexes[09]

Not Supported

MammothDB does not support indexes since they result in an additional storage and computation overhead.

Joins[06]

MammothDB supports joins.

Logging[07]

Not Supported

MammothDB does not support logging. However, copies of the data are stored on multiple nodes, so a query can continue to run if a node fails.

Query Compilation[06][07]

Not Supported

Query Interface[07]

SQL

Data can currently be loaded either with a SQL LOAD DATA query or through a data-loading API. For now, queries can only be written in SQL.

Storage Architecture[07]

Disk-oriented

Once the data has been distributed, each Hadoop storage node keeps its share primarily on disk.

Storage Model[10][06]

Decomposition Storage Model (Columnar)

MammothDB stores the data on each of its Hadoop storage nodes in a columnar data store.
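To illustrate what a decomposition (columnar) layout means in general, here is a minimal Python sketch. It is not MammothDB's actual storage code; the sample rows, the column names and the `to_columns` helper are all hypothetical.

```python
# Hypothetical sample rows; the column names are illustrative only.
rows = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 75.5},
    {"order_id": 3, "region": "EU", "amount": 42.0},
]


def to_columns(rows):
    """Decompose row records into one list per attribute (columnar layout)."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# An aggregate over one attribute only needs to read that attribute's column.
total_amount = sum(columns["amount"])
```

Analytical queries typically touch a few attributes across many rows, which is why a layout that keeps each attribute's values together tends to suit OLAP workloads.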

Storage Organization[06]

Heaps

MammothDB stores data on each Hadoop node using block storage. Neither the blocks nor the entries within a block are kept in any particular order.

Stored Procedures[06]

Not Supported

MammothDB does not support stored procedures as of September 2015. The purpose is to allow for generic-language queries in the future.

System Architecture[01]

Shared-Nothing

Data is loaded into tables and the tables are distributed across Hadoop storage nodes. Tables small enough to fit on each node (dimension tables) are replicated onto every node. Tables that are too large to fit on a node (fact tables) are partitioned by rows and each subset of rows is stored on a separate node, with every node getting around the same number of rows.
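The placement rule described above can be sketched in Python as follows. This is an illustration under stated assumptions, not MammothDB's implementation: the `distribute` function and its `node_capacity` threshold are hypothetical, and round-robin assignment is just one simple way to give every node roughly the same number of rows.

```python
def distribute(table_rows, num_nodes, node_capacity):
    """Return one row list per node.

    Tables with at most node_capacity rows are replicated to every node
    (dimension tables). Larger tables are split round-robin, so node row
    counts differ by at most one (fact tables). Round-robin is an assumed
    policy, chosen here only for simplicity.
    """
    if len(table_rows) <= node_capacity:
        return [list(table_rows) for _ in range(num_nodes)]
    return [table_rows[i::num_nodes] for i in range(num_nodes)]


# A small dimension table is copied to all three nodes.
dimension_placement = distribute(["EU", "US"], num_nodes=3, node_capacity=4)

# A larger fact table is partitioned by rows across the three nodes.
fact_placement = distribute(list(range(10)), num_nodes=3, node_capacity=4)
```

Replicating the small tables means a join between a fact partition and a dimension table can be evaluated on each node without moving data between nodes.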

Views[01]

Virtual Views

Each query is rewritten into three components: a query that runs on each node's local data, a map-reduce specification that combines the results from the nodes, and a final query that runs on MySQL. The final query accesses a virtual view (stored on the pluggable storage engine) that is defined by the first two components and populated once the map-reduce has aggregated the per-node query results.
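The three-component flow can be sketched in plain Python. This is only a conceptual model: the partitions, the function names and the example aggregate (a sum per region) are hypothetical, and in MammothDB the last step would be a SQL query on MySQL rather than a Python function.

```python
from collections import defaultdict

# Hypothetical fact-table partitions, one per storage node: (region, amount).
node_partitions = [
    [("EU", 10), ("US", 5)],
    [("EU", 7), ("APAC", 3)],
    [("US", 8)],
]


def local_query(partition):
    """Component 1: aggregate a node's local rows (partial sum per region)."""
    partial = defaultdict(int)
    for region, amount in partition:
        partial[region] += amount
    return dict(partial)


def merge_partials(partials):
    """Component 2: combine the per-node results, map-reduce style."""
    merged = defaultdict(int)
    for partial in partials:
        for region, subtotal in partial.items():
            merged[region] += subtotal
    return dict(merged)


def final_query(view, min_total):
    """Component 3: run the final query over the merged view."""
    return sorted((r, t) for r, t in view.items() if t >= min_total)


# The merged result plays the role of the virtual view read by the final query.
view = merge_partials(local_query(p) for p in node_partitions)
result = final_query(view, min_total=10)
```

Here `view` stands in for the virtual view: it exists only once the per-node results have been merged, and the final query reads nothing else.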

Citations

11 sources

[01] MariaDB Enterprise Open Source Database | MariaDB. mariadb.com. Accessed: 2026-06-19.
[02] https://twitter.com/mammothdb. twitter.com.
[03] https://mapr.com/partners/partner/mammothdb/. mapr.com. Dead link, check archive. Accessed: 2026-05-26.
[04] https://www.bloomberg.com/research/stocks/private/snapshot.asp?privcapid=301634953. bloomberg.com. Accessed: 2026-05-26.
[05] http://www.closingcircle.com/mammothdb-a-big-data-storage-and-analytics-company-secures-e1-6m-from-3ts-and-empower/. closingcircle.com. Dead link, check archive. Accessed: 2026-05-26.
[06] B-PoC/Documentation/MammothDB Checklist.pdf at master · AMilkov/B-PoC · GitHub. github.com. Accessed: 2026-05-27.
[07] MariaDB Enterprise Open Source Database | MariaDB. mammothdb.com. Dead link, check archive. Accessed: 2026-06-07.
[08] TechTarget - Global Network of Information Technology Websites and Contributors. b-eye-network.com. Dead link, check archive. Modified: 2026-05-26. Accessed: 2026-06-07.
[09] Handbook of Research on Big Data Storage and Visualization Techniques - Google Books. google.com. Accessed: 2026-06-07.
[10] https://cdn2.hubspot.net/hub/310074/file-1448479989-pdf/BD50_Feedzai.pdf. hubspot.net. Modified: 2014-08-07. Accessed: 2026-06-07.
[11] MariaDB acquires big analytics company MammothDB | TechCrunch. techcrunch.com. Accessed: 2026-06-02.

Revision #7 Last Updated: 2024-10-31 17:24