Snowflake

Viewing Revision #8 from 2018-11-19 14:10 View Current

Snowflake is a cloud-based database and is currently offered as a pay-as-you-go service in the Amazon cloud. It is developed by Snowflake Computing.

Logo Versions

Website: https://www.snowflake.net[01]
Tech Docs: https://docs.snowflake.com/en/index[02]
Developer: Snowflake Inc.
Country of Origin: US
Start Year: 2013
Project Type: Commercial
Written in: C++, Java
Derived From: FoundationDB
Operating System: Hosted
License: Proprietary
Wikipedia: https://en.wikipedia.org/wiki/Snowflake_Computing[03]

Snowflake adopts a shared-nothing architecture. It uses Amazon S3 for its underlying data storage. It performs query execution within in elastic clusters of virtual machines, called virtual warehouse. The Cloud Service layer stores the collection of services that manage computation clusters, queries, transactions, and all the metadata like database catalogs, access control information and ect. in a key-value store (FoundationDB).

Database Entry

Snowflake

Viewing Revision #8 from 2018-11-19 14:10 View Current

Snowflake is a cloud-based database and is currently offered as a pay-as-you-go service in the Amazon cloud. It is developed by Snowflake Computing. Snowflake adopts a shared-nothing architecture. It uses Amazon S3 for its underlying data storage. It performs query execution within in elastic clusters of virtual machines, called virtual warehouse. The Cloud Service layer stores the collection of services that manage computation clusters, queries, transactions, and all the metadata like database catalogs, access control information and ect. in a key-value store (FoundationDB).

History

Implementation of Snowflake began in late 2012 and has been generally available since June 2015.

Concurrency Control[04]

Multi-version Concurrency Control (MVCC)

Snowflake supports MVCC. As Snowflake's underlying data storage is done by Amazon S3, each write operation instead of performing writes in place, it creates a new entire file including the changes. The stale version of data is replaced by the newly created file, but is not deleted immediately. Snowflake allows users to define how long the stale version will be kept in S3, which is up to 90 days. Based on MVCC, Snowflake also supports time travel query.

Data Model[04]

Relational Document / XML

Snowflake is relational as it supports ANSI SQL and ACID transactions. It offers built-in functions and SQL extensions for traversing, flattening, and nesting of semi-structured data, with support for popular formats such as JSON and Avro. When storing semi-structured data, Snowflake can perform automatic type inference to find the most common types and store them using the same compressed columnar format as native relational data. Thus it can accelerate query execution on them.

Foreign Keys

Supported

Snowflake supports defining and maintaining constraints, but does not enforce them, except for NOT NULL constraints, which are always enforced including foreign key constraint.

Indexes[04]

Not Supported

Snowflake does not support index, as maintaining index is expensive due to its architecture. Snowflake uses min-max based pruning, and other techniques to accelerate data access.

Isolation Levels[05][06][07]

Snapshot Isolation

According to their paper and talk, Snowflake supports Snapshot Isolation. However, according to their documentation, it is said that Read Committed is the only Isolation level that is supported.

Joins[04]

Hash Join

Query Compilation

Not Supported

Query Execution[04]

Vectorized Model

Snowflake processes data in pipelined fashion, in batches of a few thousand rows in columnar format. It also uses a push instead of pull model as the relational operators push the intermediate results to their downstream operators.

Query Interface[08]

SQL

Storage Architecture[04]

Disk-oriented

Snowflake's data storage is done via Amazon S3 service. Upon query execution, the responsible work nodes uses HTTP -based interface to read/write data. The worker node also uses its local disk as a cache.

Storage Model[04]

Hybrid

Snowflake horizontally partitions data into large immutable files which are equivalent to blocks or pages in a traditional database system. Within each file, the values of each attribute or column are grouped together and heav- ily compressed, a well-known scheme called PAX or hybrid columnar. Each table file has a header which, among other metadata, contains the offsets of each column within the file.

Stored Procedures

Not Supported

System Architecture[04]

Shared-Disk

It uses Amazon S3 for its underlying data storage. It performs query execution within in elastic clusters of virtual machines, called virtual warehouse. Upon query execution, virtual warehouse use HTTP-based interface to read/write data from S3. The Cloud Service layer stores the collection of services that manage computation clusters, queries, transactions, and all the metadata like database catalogs and access control information, in FoundationDB.

Views

Virtual Views

Citations

8 sources

https://www.snowflake.net snowflake.net Dead — Check Archive Accessed: 2026-06-05
Welcome to Snowflake Documentation | Snowflake Documentation snowflake.com Accessed: 2026-06-05
Snowflake Inc. - Wikipedia wikipedia.org Modified: 2026-05-28 Accessed: 2026-06-04
http://info.snowflake.net/rs/252-RFO-227/images/Snowflake_SIGMOD.pdf snowflake.net Dead — Check Archive Accessed: 2026-05-21
https://www.semanticscholar.org/paper/The-Myria-Big-Data-Management-and-Analytics-System-Wang-Baker/712f78db141a106bd21c24667bc967fd0576aa1f?p2df= semanticscholar.org Accessed: 2026-05-21
CMU Advanced Database Systems - 25 Ashish Motivala [Snowflake] (Spring 2018) - YouTube youtube.com Accessed: 2026-06-07
Transactions | Snowflake Documentation snowflake.com Accessed: 2026-06-02
SQL command reference | Snowflake Documentation snowflake.com Accessed: 2026-06-02

Revision #8 Last Updated: 2018-11-19 09:10