Drill

Drill is a database system designed for Big Data exploration. It is an open-source, distributed SQL query system based on Google's Dremel query system, and it features a columnar execution engine. Drill is the only distributed SQL engine in the world that does not require schemas. It was designed from the ground up, and it supports many NoSQL databases and file systems, with the ability for a single query to join data from multiple datastores.

History

In 2010, Google published a paper titled "Dremel: Interactive Analysis of Web-Scale Datasets" that described a scalable database system designed for "interactive analysis of nested data". The Dremel system is available today under Google's BigQuery system. Development of Apache Drill began in 2012, with the goal of replicating the capabilities of Dremel. Initial goals of the system included support for multiple storage systems, file formats, query languages, and data sources, as well as the ability to scale over 10,000 servers and process petabytes of data in seconds.

Query Interface

SQL

Checkpoints

Not Supported

Drill adopts optimistic query execution, which assumes that failures occur rarely during queries. Therefore, it does not take checkpoints. With its pipelined query execution model, single queries are simply reran when they fail.

Data Model

Object-Oriented

Drill features a JSON self-describing data model that supports language independence and loosely defined, weak data typing.

System Architecture

Shared-Disk

Concurrency Control

Optimistic Concurrency Control (OCC)

Drill supports Optimistic Concurrency Control. It plans queries in fragments, assuming that all of the fragments can be completed in parallel. Larger fragments are broken into smaller fragments, which are run in clusters until the whole fragment is complete.

Drill Logo
Website

https://drill.apache.org

Source Code

https://github.com/apache/drill

Tech Docs

https://drill.apache.org/docs/

Developer

Apache Foundation

Country of Origin

US

Start Year

2012

Project Type

Open Source

Written in

Java

Supported languages

SQL

Inspired By

BigQuery

Compatible With

HBase, MongoDB

Operating Systems

Linux, OS X, Windows

Licenses

Apache v2

Wikipedia

https://en.wikipedia.org/wiki/Apache_Drill