ZooKeeper

ZooKeeper is an open-source distributed coordination service from Apache. Distributed applications build on it to implement higher-level services for synchronization, configuration maintenance, and groups and naming. It is often used as fault-tolerant storage for metadata in large-scale distributed systems. ZooKeeper is used by companies including Yelp, Rackspace, Yahoo!, Reddit, Facebook, and Twitter.[05][04]

Website: https://zookeeper.apache.org[01]
Source Code: https://github.com/apache/zookeeper[02] Accessed: Jul 22, 2026 Last Commit: Jul 17, 2026
Tech Docs: https://zookeeper.apache.org/doc/current/[03]
Developer: Apache Software Foundation
Governance: Apache Software Foundation
Country of Origin: US
Start Year: 2008 [19]
Project Type: Open Source
Written in: Java
Operating System: All OS with Java VM
License: Apache v2
Wikipedia: https://en.wikipedia.org/wiki/Apache_ZooKeeper[04]

History[04]

ZooKeeper was originally developed at Yahoo! to streamline the processes running on its big-data clusters and to address the bugs that commonly arose when deploying distributed big-data applications. It started as a sub-project of Hadoop and became a standalone Apache Software Foundation project in 2008.

Checkpoints[06]

Fuzzy

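ZooKeeper stores fuzzy snapshots of the data tree in its data directory. A snapshot is fuzzy because it is taken while the server continues to apply updates, so it may not match the data tree at any single point in time; replaying the transaction log on top of the snapshot restores a consistent state.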
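To illustrate why a fuzzy snapshot is still safe to recover from, here is a minimal Python sketch. It is a toy model, not ZooKeeper's code or on-disk format: the names `apply_txn`, `recover`, and `write`, and the representation of a transaction as a `(zxid, path, value)` tuple, are assumptions made for this example. The key assumption is that every logged transaction is idempotent (it sets an absolute value), so replaying one that the snapshot already reflects is harmless.

```python
# Toy model of fuzzy snapshots; names and data layout are hypothetical.

def apply_txn(tree, txn):
    """Apply one idempotent transaction: set an absolute value at a path."""
    _zxid, path, value = txn
    tree[path] = value

def recover(snapshot, snapshot_zxid, log):
    """Rebuild state from a fuzzy snapshot plus all later logged transactions."""
    tree = dict(snapshot)
    for txn in log:
        if txn[0] > snapshot_zxid:
            apply_txn(tree, txn)
    return tree

live, log = {}, []

def write(zxid, path, value):
    """Log the transaction, then apply it to the live in-memory tree."""
    txn = (zxid, path, value)
    log.append(txn)
    apply_txn(live, txn)

write(1, "/a", "a1")
write(2, "/b", "b1")

# The snapshot starts after zxid 2, but writes keep arriving while it runs.
snapshot_zxid = 2
snapshot = {}
snapshot["/a"] = live["/a"]   # copied before txn 3 is applied
write(3, "/a", "a2")
write(4, "/b", "b2")
snapshot["/b"] = live["/b"]   # copied after txn 4 is applied

# The snapshot mixes old and new values: this exact state never existed.
recovered = recover(snapshot, snapshot_zxid, log)
```

In this run the snapshot holds the old value of `/a` next to the new value of `/b`, yet replaying transactions 3 and 4 brings the recovered tree back in line with the live tree.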

Concurrency Control[07][08]

Not Supported

ZooKeeper guarantees sequential consistency and atomicity. It does not allow concurrent writes: updates are applied one at a time, in a single order.

Data Model[09][10]

Hierarchical

The namespace in ZooKeeper is similar to that of a standard file system. A name is a sequence of path elements separated by a slash, and every node in ZooKeeper's namespace is identified by a path. Each node can have data associated with it as well as children. The term znode refers to ZooKeeper data nodes.

Znodes include version numbers for data changes; every time a znode's data changes the version number increases. The data stored in each znode is read and written atomically.
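The hierarchical namespace and per-znode versioning described above can be modeled in a few lines. This is a minimal Python sketch, not ZooKeeper's implementation: the `Znode` and `DataTree` classes and their method names are hypothetical. The convention that an expected version of `-1` matches any version mirrors the behavior of the real client API's conditional updates, but everything else here is a simplification.

```python
# Toy in-memory znode tree; class and method names are hypothetical.

class Znode:
    def __init__(self, data=b""):
        self.data = data
        self.version = 0      # incremented on every data change
        self.children = {}    # child name -> Znode

class DataTree:
    def __init__(self):
        self.root = Znode()

    def _lookup(self, path):
        """Walk the slash-separated path from the root; KeyError if missing."""
        node = self.root
        for part in [p for p in path.split("/") if p]:
            node = node.children[part]
        return node

    def create(self, path, data=b""):
        parent_path, _, name = path.rpartition("/")
        parent = self._lookup(parent_path)
        if name in parent.children:
            raise KeyError(f"node exists: {path}")
        parent.children[name] = Znode(data)

    def get(self, path):
        node = self._lookup(path)
        return node.data, node.version

    def set(self, path, data, expected_version=-1):
        """Replace the data only if the caller saw the current version."""
        node = self._lookup(path)
        if expected_version not in (-1, node.version):
            raise ValueError("bad version")
        node.data = data
        node.version += 1
        return node.version

    def children(self, path):
        return sorted(self._lookup(path).children)

tree = DataTree()
tree.create("/app")
tree.create("/app/config", b"v1")

data, version = tree.get("/app/config")
tree.set("/app/config", b"v2", expected_version=version)

# A second writer still holding the old version is rejected.
stale_rejected = False
try:
    tree.set("/app/config", b"v3", expected_version=version)
except ValueError:
    stale_rejected = True
```

The version check is what lets clients build optimistic read-modify-write updates on top of atomic per-znode reads and writes.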

Isolation Levels[11][12]

Not Supported

Logging[13]

Currently, ZooKeeper uses SLF4J as an abstraction layer for logging, with Apache log4j 1.2 as the underlying logging implementation. Future plans include allowing the end user to choose the logging implementation.

Query Interface[14]

Command-line / Shell

The ZooKeeper command-line interface (CLI) connects to a ZooKeeper ensemble and performs simple, file-like operations, mainly for debugging.

Storage Architecture[15][16]

In-Memory

Each ZooKeeper server keeps its state machine in memory and writes every mutation to a durable write-ahead log (WAL) on storage media. If a server crashes, it can replay the WAL to recover its previous state.
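The pattern of logging each mutation before applying it in memory can be sketched as follows. This is a minimal Python illustration under stated assumptions, not ZooKeeper's log format or code: the `Server` class, the JSON-lines log file, and the flat path-to-data dictionary are all hypothetical simplifications.

```python
# Toy write-ahead log; the Server class and JSON-lines format are hypothetical.
import json
import os
import tempfile

class Server:
    def __init__(self, wal_path):
        self.wal_path = wal_path
        self.state = {}   # in-memory state, lost on a crash
        self._replay()    # rebuild it from the durable log

    def _replay(self):
        if not os.path.exists(self.wal_path):
            return
        with open(self.wal_path) as wal:
            for line in wal:
                record = json.loads(line)
                self.state[record["path"]] = record["data"]

    def write(self, path, data):
        # Persist the mutation first, then apply it in memory.
        with open(self.wal_path, "a") as wal:
            wal.write(json.dumps({"path": path, "data": data}) + "\n")
            wal.flush()
            os.fsync(wal.fileno())
        self.state[path] = data

wal_path = os.path.join(tempfile.mkdtemp(), "wal.log")

server = Server(wal_path)
server.write("/config", "v1")
server.write("/config", "v2")
server.write("/lock", "held")

# Simulate a crash: a fresh process starts with empty memory and replays the log.
restarted = Server(wal_path)
```

Because the log record is flushed to disk before the in-memory update, any mutation visible in memory before the crash is also present in the log after it.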

Stored Procedures[17]

Not Supported

System Architecture[18]

Shared-Nothing

ZooKeeper follows a client-server model where clients are the nodes (machines) that make use of the service, and servers are the nodes that provide the service. The client library is responsible for the interaction between clients and ZooKeeper servers.

ZooKeeper servers run in one of two modes: standalone or quorum. In standalone mode there is a single server, and the ZooKeeper state is not replicated. In quorum mode, a group of ZooKeeper servers replicates the state and serves client requests. Each server maintains an in-memory database containing the entire data tree, along with a transaction log and snapshots stored persistently. This group of servers is called an ensemble. As long as a majority of the ensemble is up, the service remains available.
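The majority rule can be made concrete with a little arithmetic. The helper names below (`quorum_size`, `tolerated_failures`, `is_available`) are hypothetical and exist only for this Python sketch; they encode nothing more than "strictly more than half of the ensemble must be up".

```python
# Majority arithmetic for an ensemble; function names are hypothetical.

def quorum_size(ensemble_size):
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1

def tolerated_failures(ensemble_size):
    """How many servers can be down while a majority is still up."""
    return ensemble_size - quorum_size(ensemble_size)

def is_available(ensemble_size, servers_up):
    return servers_up >= quorum_size(ensemble_size)

for n in (3, 4, 5, 6):
    print(n, quorum_size(n), tolerated_failures(n))
```

Under this rule an ensemble of 2f + 1 servers tolerates f failures, and adding one server to an odd-sized ensemble does not increase the number of failures it can survive, which is one reason odd ensemble sizes are a common choice.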

Each client imports the client library. Every ZooKeeper client is connected to one ZooKeeper server at a time, and each server can handle multiple client connections simultaneously. The client periodically sends pings to the server it is connected to, and the server acknowledges each ping to indicate that it is alive. If the client does not receive an acknowledgment within the specified time, it connects to another server in the ensemble, and the client session is transferred to the new server.
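The ping-and-failover behavior can be sketched with a small simulation. This is a minimal Python illustration, not the real client library: `FakeServer`, `Client`, and `heartbeat` are hypothetical names, and a dead server is modeled as a ping that returns `False` immediately rather than as a real network timeout.

```python
# Toy simulation of client pings and failover; all names are hypothetical.

class FakeServer:
    def __init__(self, name):
        self.name = name
        self.alive = True

    def ping(self):
        # A real client would wait for an acknowledgment up to a timeout.
        return self.alive

class Client:
    def __init__(self, servers, session_id):
        self.servers = servers
        self.session_id = session_id   # kept across reconnects
        self.current = servers[0]

    def heartbeat(self):
        """Ping the current server; on failure, move to another live one."""
        if self.current.ping():
            return self.current.name
        for candidate in self.servers:
            if candidate is not self.current and candidate.ping():
                self.current = candidate
                return self.current.name
        raise ConnectionError("no server in the ensemble is reachable")

zk1, zk2, zk3 = FakeServer("zk1"), FakeServer("zk2"), FakeServer("zk3")
client = Client([zk1, zk2, zk3], session_id=42)

before = client.heartbeat()   # zk1 acknowledges the ping
zk1.alive = False             # zk1 stops responding
after = client.heartbeat()    # the client moves to the next live server
```

The session identifier is carried over unchanged when the client switches servers, which is the part of the sketch that corresponds to the session being transferred.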

Embeddings

ZippyDB

Venice

SpaceTime


Citations

19 sources

[01] Apache ZooKeeper apache.org Modified: 2026-07-15 Accessed: 2026-07-19
[02] GitHub - apache/zookeeper: Apache ZooKeeper · GitHub github.com Accessed: 2026-06-05
[03] ZooKeeper: Because Coordinating Distributed Systems is a Zoo apache.org Modified: 2026-03-06 Accessed: 2026-06-05
[04] Apache ZooKeeper - Wikipedia wikipedia.org Modified: 2026-03-31 Accessed: 2026-06-04
[05] ZooKeeper: Because Coordinating Distributed Systems is a Zoo apache.org Modified: 2026-03-06 Accessed: 2026-06-07
[06] ZooKeeper Administrator's Guide apache.org Modified: 2026-02-12 Accessed: 2026-06-07
[07] https://medium.com/@ben2460/about-apache-zookeeper-distributed-lock-1a990315e05c medium.com (dead link, check archive) Accessed: 2026-05-31
[08] ZooKeeper in 15 Minutes dzone.com Modified: 2020-01-16 Accessed: 2026-06-07
[09] ZooKeeper: Because Coordinating Distributed Systems is a Zoo apache.org Modified: 2026-03-06 Accessed: 2026-06-07
[10] https://data-flair.training/blogs/zookeeper-architecture data-flair.training (dead link, check archive) Accessed: 2026-05-30
[11] https://pdos.csail.mit.edu/6.824/papers/zookeeper-faq.txt mit.edu Modified: 2026-01-27 Accessed: 2026-06-07
[12] https://medium.com/abrai/demystifying-consistency-and-isolation-for-a-distributed-systems-engineer-64a064c52f6e medium.com (dead link, check archive) Accessed: 2026-05-30
[13] ZooKeeper: Because Coordinating Distributed Systems is a Zoo apache.org Modified: 2026-03-06 Accessed: 2026-06-07
[14] http://www.corejavaguru.com/bigdata/zookeeper/cli corejavaguru.com (dead link, check archive) Accessed: 2026-05-30
[15] https://www.usenix.org/legacy/events/atc10/tech/full_papers/Hunt.pdf usenix.org (dead link, check archive) Modified: 2010-05-12 Accessed: 2026-06-07
[16] Running ZooKeeper, A Distributed System Coordinator | Kubernetes kubernetes.io Accessed: 2026-06-07
[17] Extensible Distributed Coordination – the morning paper acolyer.org Accessed: 2026-06-07
[18] http://www.corejavaguru.com/bigdata/zookeeper/architecture corejavaguru.com (dead link, check archive) Accessed: 2026-05-29
[19] https://medium.com/@markobonaci/the-history-of-hadoop-68984a11704 medium.com (dead link, check archive) Accessed: 2026-05-27

Revision #10 Last Updated: 2026-06-02 11:29