BlinkDB is an extension of Spark SQL that supports Approximate Query Processing (AQP). It builds a few stratified samples offline and executes queries against those samples instead of the full data.
- Website
- http://blinkdb.org [01]
- Source Code
- https://github.com/sameeragarwal/blinkdb [02]
- Country of Origin
- US
- Start Year
- 2012
- End Year
- 2014
- Project Types
- Academic, Open Source
- Derived From
- Spark SQL
- Operating System
- All OS with Java VM
- License
- Apache v2
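The idea of answering queries from offline stratified samples can be illustrated with a minimal Python sketch. This is not BlinkDB's code: the function names, the per-stratum `cap` parameter, and the row-weighting scheme are assumptions chosen for illustration. Each stratum (one distinct value of the chosen column) keeps at most `cap` rows, and every kept row carries a weight equal to the stratum size divided by the number of rows kept, so weighted aggregates estimate the full-table answer.

```python
import random
from collections import defaultdict

def build_stratified_sample(rows, key, cap, seed=0):
    """Keep at most `cap` rows per distinct value of `key` (hypothetical helper).

    Returns a list of (row, weight) pairs, where weight is the number of
    original rows that each sampled row stands for.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for row in rows:
        strata[row[key]].append(row)

    sample = []
    for members in strata.values():
        # Small strata are kept whole; large ones are sampled uniformly.
        chosen = members if len(members) <= cap else rng.sample(members, cap)
        weight = len(members) / len(chosen)
        sample.extend((row, weight) for row in chosen)
    return sample

def approx_count(sample, predicate=lambda row: True):
    """Estimate COUNT(*) over the full table from the weighted sample."""
    return sum(weight for row, weight in sample if predicate(row))

def approx_sum(sample, column, predicate=lambda row: True):
    """Estimate SUM(column) over the full table from the weighted sample."""
    return sum(row[column] * weight for row, weight in sample if predicate(row))
```

Because small strata are kept whole, rare groups still appear in the sample, which a uniform sample of the same size could easily miss.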
Concurrency Control
From Spark SQL:
Spark SQL does not support concurrency control because it is designed for read-only OLAP workloads. It provides an interface for saving DataFrames to data sources such as Parquet, or for persisting them as tables. Save modes specify how existing data is handled if present, but these save modes do not use any locking and are not atomic.
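To make the save-mode behavior concrete, here is a toy Python model. It is not Spark's implementation or API: the `save` function and its use of a plain text file are assumptions for illustration. Only the four mode names (`errorifexists`, `append`, `overwrite`, `ignore`) mirror Spark SQL's save modes. The sketch shows why a check followed by a write, with no lock in between, is not atomic.

```python
import os

VALID_MODES = {"errorifexists", "append", "overwrite", "ignore"}

def save(lines, path, mode="errorifexists"):
    """Toy save-mode model (hypothetical). Returns True if data was written."""
    if mode not in VALID_MODES:
        raise ValueError(f"unknown save mode: {mode}")

    # Step 1: check whether data already exists at the target.
    exists = os.path.exists(path)
    if exists and mode == "errorifexists":
        raise FileExistsError(path)
    if exists and mode == "ignore":
        return False  # leave existing data untouched, write nothing

    # Step 2: write. No lock is held between the check and the write,
    # so a concurrent writer could create or change `path` in between.
    with open(path, "a" if mode == "append" else "w") as f:
        f.writelines(line + "\n" for line in lines)
    return True
```

In this model, two writers using `errorifexists` could both pass the existence check before either writes, which is the kind of race that the lack of locking leaves open.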
Citations
2 sources