Vertica

Viewing Revision #14 from 2018-12-12 19:42 View Current

Vertica is a distributed analytics DBMS. It can be deployed on various platforms like AWS,GCP,Azure...[03]

Website: http://www.vertica.com[01]
Tech Docs: https://www.vertica.com/docs/9.1.x/HTML/index.htm[02]
Developer: Hewlett-Packard Company
Country of Origin: US
Start Year: 2005 [03]
Acquired By: Hewlett-Packard Company [03]
Project Type: Commercial
Written in: C++
Supported Languages: C++, Java, Perl, Python, R
Derived From: PostgreSQL
Inspired By: C-Store
Operating System: Linux
License: PostgreSQL License
Wikipedia: https://en.wikipedia.org/wiki/Vertica[03]

Vertica is designed to a achieve a high performance on OLAP compared with DBMSs for large workloads. High availability and good scalability can be achieved as well on commodity hardware. Also, it provides good integration with Hadoop, Spark, Kafka, which makes users select where they would like to analyze their data freely.

Database Entry

Vertica

Viewing Revision #14 from 2018-12-12 19:42 View Current

Vertica is a distributed analytics DBMS. It can be deployed on various platforms like AWS,GCP,Azure... Vertica is designed to a achieve a high performance on OLAP compared with DBMSs for large workloads. High availability and good scalability can be achieved as well on commodity hardware. Also, it provides good integration with Hadoop, Spark, Kafka, which makes users select where they would like to analyze their data freely.[03]

History[03]

Vetica was founded by Michael Stonebraker and Andrew Palmer in 2005. It is derived from C-Store, which is a prototype developed by MIT, Brown, and few other universities in 2005. It was acquired by Hewlett Packard in 2011 and joined Micro Focus in 2017 due to the merger between Micro Focus and HP.

Checkpoints[04][05]

Blocking Fuzzy

In Vertica, each node maintains checkpoints and transaction logs separately. The synchronization duration can be tuned by users as well. For a single-node failure, it can be recovered from other nodes. If the entire cluster fails, it can be recovered up to the earliest checkpoints when all nodes are good. New transaction log cannot be appended when a new checkpoint begins.

Compression[06]

Delta Encoding Run-Length Encoding

Both Run-Length Encoding and Delta encoding are used in Vertica. RLE encoding is only used when the number of repetitions is large. Delta encoding works for INTEGER/DATE/TIME/TIMESTAMP/INTERVAL type, where the variations from the smallest value are stored instead of the real values to save more space.

Concurrency Control[07][08]

Multi-version Concurrency Control (MVCC)

Vertica supports MVCC to achieve data consistency. Both current and previous status are stored and visible to transactions. Transaction isolations can be achieved since no conflict between the read and write operations exist. A shared-nothing MPP (Massively Parallel Processing) architecture is used in Vertica, which can avoid the overheads caused by locks.

Data Model[09]

Column Family / Wide-Column

Columnar store is used in Vertica to improve the performance of sequential access by sacrificing the performance of single access. Compared with row-oriented databases which has to scan the whole table, only few needed columns are retrieved based on given queries in Vertica, which can improve throughput by reducing disk I/O costs.

Foreign Keys[10]

Supported

Vertica allows users to use foreign key constraints. Foreign keys should be defined when tables are created or "ALTER TABLE" is used.

Indexes[11]

Not Supported

Indexes are not support in Vertica. Projections are used to improve query performance in Vertica.

Isolation Levels[12]

Read Uncommitted Read Committed Serializable Repeatable Read

Read Committed and Serializable are used in Vertica. Read Committed is the default isolation level. Read Uncommitted and Repeatable Read are treated automatically as Read Committed and Serializable respectively in vertica.

Joins[13]

Hash Join Sort-Merge Join

Both merge join and hash join are supported in Vertica. Merge join is faster in general and requires less memory, but data is required to be sorted before. Hash join requires more memory, but it is faster if the inner table can fit in the memory.

Logging[14]

Logical Logging

There are two kinds of logs at each node in Vertica cluster. The dblog is used to keep track of the status only when the database boots. The other type called vertica.log is used to record the status of the node and the status of the whole cluster when Vertica runs.

Query Compilation

Not Supported

Query Execution[15]

Materialized Model

Projections in Vertica have been used for query execution. Query optimizer is responsible for designing and selecting the suitable projections based on the given query plan. Various projections have different influence on query performance in terms of memory, CPU utilization, I/O, Network bandwidth ...

Query Interface[16]

Custom API SQL

Vertica supports query via SQL and its custom API. Vertica has good integrations with Hadoop, Spark, Kafka, and users could make queries via their interfaces. Moreover, Vertica also supports C++, Java, Python, R SDK.

Storage Architecture[17][18]

Hybrid

Hybrid data store are chosen in Vertica. Write Optimized Store(WOS) is about storing data in memory, which does not support compression and indexing. WOS is mainly designed for OLAP. Read Optimized Store(ROS) is about storing data on disk, where data is sorted and indexed, and compressed.

Storage Model[17]

Decomposition Storage Model (Columnar)

Data is stored in Vertica in columnar format to improve query performance, since a lot of disk I/O can be avoided.

Storage Organization[19]

Different from other DBMS, containers are used in Vertica to store data,WOS (Write Optimized Storage) and ROS (Read Optimized Storage) are two existing types. Tuple Mover is used to move data between WOS and ROS.

Stored Procedures[20]

Not Supported

Stored Procedures are not support in Vertica. External Procedures such as R,C++ can be used in Vertica.

System Architecture[09][21]

Shared-Nothing

Shared-nothing architecture is used in Vertica, where all nodes don't share anything in terms of memory and disk storage. Shared-nothing architecture are easier to scale, since there is no race or contention caused by locks. Moreover, Massively MPP(Massive Parallel Processing) architecture is used in Vertica, which can improve query performance such as increasing the throughput of joins when multiple machines are involved.

Views[05]

Materialized Views

The projections in Vertica are similar to materialized view in other DBMSs. Various projections can be created on the same table so that some optimizations such as sorting data can be done for some specific queries in advance.

Citations

21 sources

http://www.vertica.com vertica.com Spam — Check Archive Accessed: 2026-06-05
Vertica® Documentation vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Vertica - Wikipedia wikipedia.org Modified: 2026-03-18 Accessed: 2026-06-04
Troubleshooting Tips for the Vertica Catalog vertica.com Dead — Check Archive Modified: 2024-06-22 Accessed: 2026-06-07
https://blogs.opentext.com opentext.com Dead — Check Archive Accessed: 2026-06-05
Encoding Types vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
https://db-engines.com/en/system/MemSQL;Vertica db-engines.com Accessed: 2026-05-24
https://www.vertica.com/blog/concurrency-workload-management vertica.com Modified: 2024-03-13 Accessed: 2026-06-05
Advanced Analytics for Data Warehouse & Data Lakehouse vertica.com Dead — Check Archive Modified: 2019-06-17 Accessed: 2026-05-24
Foreign Key Constraints vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Can I create an index on a table in Vertica? - Stack Overflow stackoverflow.com Dead — Check Archive Modified: 2026-02-01 Accessed: 2026-06-07
Transactions vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Hash Joins Versus Merge Joins vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Tips and Tricks on Working With vertica.log – DB Jungle dbjungle.com Dead — Check Archive Accessed: 2026-06-07
Redesigning Projections for Query Optimization vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Hadoop Interfaces vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Vertica Cluster Architecture vertica.com Modified: 2024-06-22 Accessed: 2026-06-07
Understanding ROS and WOS: A Hybrid Data Storage Model - OpenText Blogs opentext.com Modified: 2024-09-06 Accessed: 2026-06-05
http://www.aodba.com/understanding-vertica-storage-mechanism/ aodba.com Dead — Check Archive Accessed: 2026-05-24
sql - How to create external procedures in Vertica - Stack Overflow stackoverflow.com Dead — Check Archive Modified: 2025-10-21 Accessed: 2026-06-07
http://www.aodba.com/vertica-architecture-type/ aodba.com Dead — Check Archive Accessed: 2026-05-24

Revision #14 Last Updated: 2018-12-12 14:42