Solr

Viewing Revision #7 from 2018-05-11 18:19

Solr is an open source NoSQL enterprise search platform built on Apache Lucene. Supporting distributed search and index replication, Solr is highly reliable, scalable and fault tolerant. It runs as a standalone full-text search server with a REST-like API and provides features including hit highlighting, faceted search, near real-time indexing, dynamic clustering, database integration, and geospatial search. Currently, Solr powers some of the highest-traffic websites and applications in the world.

Website: https://lucene.apache.org/solr[01]
Source Code: https://github.com/apache/lucene-solr[02] Accessed: Jun 4, 2026 Last Commit: Nov 24, 2021
Tech Docs: https://solr.apache.org/7_3_0/index.html[03]
Developer: Apache Software Foundation
Country of Origin: US
Start Year: 2004
Project Type: Open Source
Written in: Java
Supported Languages: C#, C++, Clojure, Go, Java, JavaScript, Lua, Perl, PHP, Python, R, Ruby, Rust, Scala
Operating Systems: Linux, macOS, Windows
License: Apache v2

History

In 2004, CNET Networks, an American media company, started Solr to support search capability, and in 2006 it donated the code to the Apache Software Foundation as an open-source project. After graduating as a top-level project (TLP) in 2007, Solr grew steadily, gaining features and powering several popular websites. In 2010, Solr was merged with Lucene as a subproject, and its version number jumped from 1.4 to 3.1 to match Lucene's.

Checkpoints[04][05][06]

Consistent

In standalone mode, Solr supports checkpoints through the replication handler, which backs up the index as of the latest commit point. Backups can be triggered manually, or configured to run automatically after each commit or at startup. In SolrCloud mode, Solr uses the Collections API, which backs up the indexes and configurations to a shared filesystem and covers all shards of the collection. A restore creates a new collection with the same number of shards and preserves the shard structure, including routing information.
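
As a rough illustration of the two backup paths, the sketch below only builds the request URLs and sends nothing. The base URL, core and collection names, and backup location are assumptions made for the example; the parameter names follow the replication handler and Collections API as documented in the cited reference guide.

```python
from urllib.parse import urlencode

# Assumed address of a local Solr node; adjust for a real deployment.
SOLR_BASE = "http://localhost:8983/solr"


def standalone_backup_url(core, location, name):
    """Backup request for a single core via the replication handler."""
    params = urlencode({"command": "backup", "location": location, "name": name})
    return f"{SOLR_BASE}/{core}/replication?{params}"


def cloud_backup_url(collection, location, name):
    """Backup request for a whole collection via the Collections API."""
    params = urlencode(
        {"action": "BACKUP", "name": name, "collection": collection, "location": location}
    )
    return f"{SOLR_BASE}/admin/collections?{params}"


def cloud_restore_url(new_collection, location, name):
    """Restore a named backup into a newly created collection."""
    params = urlencode(
        {"action": "RESTORE", "name": name, "collection": new_collection, "location": location}
    )
    return f"{SOLR_BASE}/admin/collections?{params}"


# Hypothetical names, used only to show the resulting URLs.
print(standalone_backup_url("techproducts", "/backups", "nightly"))
print(cloud_backup_url("books", "/backups", "nightly"))
print(cloud_restore_url("books_restored", "/backups", "nightly"))
```

In SolrCloud the location would need to be a path that every node can reach, which matches the shared-filesystem requirement described above.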

Concurrency Control[07]

Optimistic Concurrency Control (OCC)

Solr uses Optimistic Concurrency Control to ensure that client applications do not silently overwrite each other's concurrent changes to a document. Every document is assigned a version field. To update a document, a client reads it together with its current version, modifies it locally, and resubmits it with that version. If the stored version no longer matches, the update fails with a version conflict and the client should redo the read-modify-write cycle.
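
The read-modify-resubmit cycle can be sketched with a toy in-memory store. This is not Solr's implementation: `ToyStore`, `VersionConflict`, and `increment_with_retry` are hypothetical names, and only the `_version_` field name is borrowed from Solr. A real client would send the document over HTTP and treat a 409 response as the conflict.

```python
import itertools


class VersionConflict(Exception):
    """Raised when the caller's version is stale (Solr answers HTTP 409)."""


class ToyStore:
    """In-memory stand-in for an index that versions every document."""

    def __init__(self):
        self._docs = {}
        self._clock = itertools.count(1)  # source of increasing versions

    def get(self, doc_id):
        return dict(self._docs[doc_id])

    def put(self, doc, expected_version=None):
        current = self._docs.get(doc["id"])
        if expected_version is not None:
            # Reject the write unless the stored version is the one we read.
            if current is None or current["_version_"] != expected_version:
                raise VersionConflict(doc["id"])
        stored = dict(doc, _version_=next(self._clock))
        self._docs[doc["id"]] = stored
        return stored["_version_"]


def increment_with_retry(store, doc_id, field, attempts=3):
    """Read, modify locally, resubmit; redo the cycle on a conflict."""
    for _ in range(attempts):
        doc = store.get(doc_id)
        version = doc["_version_"]
        doc[field] += 1
        try:
            return store.put(doc, expected_version=version)
        except VersionConflict:
            continue
    raise VersionConflict(doc_id)
```

The retry loop is the client's responsibility in this model: the store only detects the conflict, it does not resolve it.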

Data Model[08][09]

Document / XML

Solr stores data as documents, each consisting of fields. Each field holds a specific piece of information about the document and has its own data type. Solr indexes these documents to provide efficient search.
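
A document can be pictured as a flat mapping from field names to typed values. The field names and values below are invented for illustration and would have to match the collection's schema in a real deployment; the sketch serializes a list of documents as JSON, one of the formats Solr's update handler accepts.

```python
import json

# Hypothetical document: every key is a field, every value has a type.
doc = {
    "id": "book-1",                        # unique key (string)
    "title": "An Example Title",           # text field
    "author": ["A. Writer", "B. Editor"],  # multi-valued field
    "price": 39.95,                        # floating-point field
    "in_stock": True,                      # boolean field
}


def to_update_payload(docs):
    """Serialize documents as a JSON array for an update request body."""
    return json.dumps(docs)


payload = to_update_payload([doc])
print(payload)
```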

Foreign Keys[10][11]

Solr can express relationships between documents through a "join", but it differs from a join in a relational database. A Solr join does not combine the matching documents; it returns fields from only one side, which makes it closer to a nested query. Solr does not support foreign key constraints. The programmer must specify the from and to fields explicitly to join two sets of documents with the join query parser.
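
The sketch below shows the join query parser's local-params syntax and a toy emulation of its semantics over Python dictionaries. The field names (`manu_id`, `cat`) and the helper functions are assumptions made for the example; the emulation is only meant to show that the result contains documents from the "to" side alone.

```python
def join_query(from_field, to_field, inner_query):
    """Build a query string for Solr's join query parser."""
    return f"{{!join from={from_field} to={to_field}}}{inner_query}"


def toy_join(docs, from_field, to_field, matches):
    """Emulate the join: collect from_field values of matching documents,
    then return the documents whose to_field holds one of those values."""
    keys = {d[from_field] for d in docs if from_field in d and matches(d)}
    return [d for d in docs if d.get(to_field) in keys]


# Hypothetical data: two manufacturers and two products that reference them.
docs = [
    {"id": "m1", "name": "Acme"},
    {"id": "m2", "name": "Globex"},
    {"id": "p1", "manu_id": "m1", "cat": "electronics"},
    {"id": "p2", "manu_id": "m2", "cat": "books"},
]

print(join_query("manu_id", "id", "cat:electronics"))
# Manufacturers of electronics products; no product fields are attached.
print(toy_join(docs, "manu_id", "id", lambda d: d.get("cat") == "electronics"))
```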

Indexes[12][13][14]

Solr uses an inverted index, which lists every term found in the documents, each with the list of documents it appears in and its number of occurrences. The inverted index speeds up query processing. Solr also supports DocValues, a column-oriented structure that maps documents to field values to support features such as sorting and faceting.
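
The two structures point in opposite directions, which a small sketch makes concrete. This is a toy model, not Lucene's on-disk format: tokenization is a plain whitespace split, and the document ids, text, and prices are invented.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each term to {doc_id: number of occurrences in that document}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term][doc_id] = index[term].get(doc_id, 0) + 1
    return index


# Hypothetical corpus.
docs = {
    1: "solr builds on lucene",
    2: "lucene powers solr and lucene powers search",
}
index = build_inverted_index(docs)
print(index["lucene"])  # term -> documents and counts

# DocValues-style column: document -> value, convenient for sorting.
price_column = {1: 30.0, 2: 10.0}
print(sorted(price_column, key=price_column.get))  # doc ids by ascending price
```

Looking up a term touches only that term's postings, while sorting by price reads one column without consulting the terms at all.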

Isolation Levels[15][16][17]

Read Committed

In Solr, an update is not visible to searchers until it has been committed to the index. The default commit is a "hard commit", which writes all affected index segments to disk. Solr also supports a "soft commit", which only applies the changes to the in-memory Lucene data structures and thereby enables Near Real Time (NRT) search. With NRT, documents become searchable soon after being indexed, without waiting for various background tasks to finish.
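
The difference between visibility and durability can be modeled with two watermarks over a list of added documents. `ToyIndex` is a hypothetical class written for this illustration; the `open_searcher` flag mirrors Solr's `openSearcher` commit option, and everything else is a simplification.

```python
class ToyIndex:
    """Tracks which added documents are searchable and which are durable."""

    def __init__(self):
        self._docs = []
        self._visible_upto = 0  # documents before this position are searchable
        self._durable_upto = 0  # documents before this position are on "disk"

    def add(self, doc):
        self._docs.append(doc)  # indexed, but neither visible nor durable yet

    def soft_commit(self):
        # Makes changes searchable without flushing segments to disk.
        self._visible_upto = len(self._docs)

    def hard_commit(self, open_searcher=True):
        # Flushes segments; visibility changes only if a searcher is opened.
        self._durable_upto = len(self._docs)
        if open_searcher:
            self._visible_upto = len(self._docs)

    def search(self):
        return self._docs[: self._visible_upto]

    def durable(self):
        return self._docs[: self._durable_upto]
```

A reader calling `search()` never observes an uncommitted document, which is the behavior the Read Committed label refers to.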

Joins[18][19][20][21]

Hash Join, Sort-Merge Join

Solr supports joins like those of typical relational database systems through Streaming Expressions, which are designed for parallel computation in SolrCloud. Stream decorator expressions wrap two streams and provide two families of joins: hash joins (hashJoin and outerHashJoin), and innerJoin and leftOuterJoin, which are implemented using the idea of a sort-merge join.
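
The two join strategies can be sketched over lists of dictionaries standing in for tuple streams. These functions are generic textbook versions written for this entry, not Solr's code; the sample tuples are invented, and the merge join assumes both inputs are already sorted on the join key.

```python
from collections import defaultdict


def hash_join(stream, hashed, on):
    """Inner hash join: load `hashed` into a table, then probe with `stream`."""
    table = defaultdict(list)
    for row in hashed:
        table[row[on]].append(row)
    for row in stream:
        for match in table.get(row[on], []):
            yield {**match, **row}


def merge_join(left, right, on):
    """Inner sort-merge join of two lists already sorted by `on`."""
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][on], right[j][on]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of equal keys on the right, then pair it with
            # every left tuple that carries the same key.
            j_end = j
            while j_end < len(right) and right[j_end][on] == lk:
                j_end += 1
            while i < len(left) and left[i][on] == lk:
                for r in right[j:j_end]:
                    out.append({**left[i], **r})
                i += 1
            j = j_end
    return out


# Hypothetical tuple streams, both sorted by "id".
people = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}, {"id": 2, "name": "c"}]
pets = [{"id": 2, "pet": "x"}, {"id": 3, "pet": "y"}]

print(list(hash_join(people, pets, "id")))
print(merge_join(people, pets, "id"))
```

The hash join needs one input to fit in memory but accepts any ordering, while the merge join needs sorted inputs but holds only a small window of tuples at a time.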

Logging[16][22][23]

Physical Logging

On each update, Solr writes the entire document to a transaction log (tlog) on disk to achieve reliability and consistency. On a soft commit, Solr only changes the Lucene data structures and writes the tlog. On a hard commit, Solr truncates the tlog and starts a new one, since all previously affected index segments have been written to disk. During recovery, Solr replays the tlog to redo updates that have not been fsync'd (hard committed).
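
A toy model of this recovery path is shown below. `ToyCore` and its attributes are hypothetical; the "disk" is simulated with ordinary dictionaries and lists, and a crash is modeled as losing the in-memory state while keeping the segments and the log.

```python
class ToyCore:
    """Simulates logging full documents and replaying them after a crash."""

    def __init__(self):
        self.tlog = []      # simulated durable log of entire documents
        self.segments = {}  # simulated durable index segments
        self.memory = {}    # in-memory index state, lost on a crash

    def update(self, doc):
        self.tlog.append(dict(doc))  # log the whole document first
        self.memory[doc["id"]] = dict(doc)

    def hard_commit(self):
        self.segments.update(self.memory)  # segments reach the disk
        self.tlog = []                     # old log is no longer needed

    def crash_and_recover(self):
        # Start from the last hard-committed state and redo logged updates.
        self.memory = dict(self.segments)
        for doc in self.tlog:
            self.memory[doc["id"]] = dict(doc)
```

Because each log entry holds a complete document, replay only needs to overwrite by id and does not depend on the state the update was originally applied to.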

Query Compilation

Not Supported

Query Execution[24]

Tuple-at-a-Time Model

Solr processes queries by examining one document at a time.
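
One way to picture this is a loop that fully scores each candidate document before moving to the next, keeping only the best k seen so far. The scoring here (a plain sum of term frequencies) is a deliberate simplification and not Lucene's scoring; the function name and postings data are invented for the example.

```python
import heapq


def search_doc_at_a_time(postings, query_terms, k=2):
    """Score candidates one document at a time and keep the top k.

    `postings` maps term -> {doc_id: term_frequency}.
    Returns (score, doc_id) pairs, best first.
    """
    candidates = sorted(set().union(*(postings.get(t, {}) for t in query_terms)))
    heap = []  # min-heap holding the k best (score, doc_id) pairs so far
    for doc_id in candidates:
        # The document is scored against every query term before moving on.
        score = sum(postings.get(t, {}).get(doc_id, 0) for t in query_terms)
        heapq.heappush(heap, (score, doc_id))
        if len(heap) > k:
            heapq.heappop(heap)  # drop the current lowest-scoring document
    return sorted(heap, reverse=True)


# Hypothetical postings lists.
postings = {"solr": {1: 2, 2: 1}, "search": {2: 3, 3: 1}}
print(search_doc_at_a_time(postings, ["solr", "search"]))
```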

Query Interface[25][26]

Custom API

Solr's REST-like interfaces make it easy to integrate with many languages. Applications issue HTTP requests to send commands to Solr and receive responses through client APIs. These client APIs handle the work of sending requests and parsing responses.
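
A thin client boils down to building a request URL and decoding the response. The sketch below does both without contacting a server: the base URL and collection name are assumptions, and `SAMPLE_BODY` is a hand-written stand-in shaped like a Solr JSON response rather than captured output.

```python
import json
from urllib.parse import urlencode


def select_url(base, collection, query, rows=10):
    """Build a search request against a collection's /select handler."""
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return f"{base}/{collection}/select?{params}"


def parse_docs(body):
    """Return (total hit count, matching documents) from a JSON response."""
    payload = json.loads(body)
    response = payload["response"]
    return response["numFound"], response["docs"]


# Hand-written sample response, for illustration only.
SAMPLE_BODY = (
    '{"responseHeader": {"status": 0},'
    ' "response": {"numFound": 1, "start": 0, "docs": [{"id": "book-1"}]}}'
)

print(select_url("http://localhost:8983/solr", "books", "title:solr", rows=5))
print(parse_docs(SAMPLE_BODY))
```

In practice the URL would be fetched with an HTTP library and the response body passed to `parse_docs`; language-specific Solr clients wrap these same two steps.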

Storage Architecture[27][28]

Disk-oriented

Solr stores its index files on disk. Uncommitted updates are held in a small in-memory buffer.

Storage Model[29]

Decomposition Storage Model (Columnar)

Solr stores data in an inverted index, which places the list of documents containing a given term contiguously in one block of data. This is an efficient storage model for OLAP workloads, since a search can be answered quickly by scanning the blocks of only a few terms.

Stored Procedures

Not Supported

System Architecture[30]

Shared-Nothing

Solr can split an index across multiple shards to distribute data, forming a shared-nothing system architecture. Each shard is a partition of the collection and contains a subset of its documents. Each document is stored in exactly one shard. SolrCloud also distributes both documents and queries automatically, and ZooKeeper provides load balancing.
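
The property that every document lands in exactly one shard follows from routing on a hash of the document id, which the sketch below illustrates. It is an approximation under stated assumptions: `zlib.crc32` stands in for Solr's actual hash function, the 32-bit hash space is cut into equal ranges, and the function names are hypothetical.

```python
import zlib


def shard_for(doc_id, num_shards):
    """Pick a shard by splitting the 32-bit hash space into equal ranges."""
    h = zlib.crc32(doc_id.encode("utf-8"))  # stand-in hash, 0 .. 2**32 - 1
    return (h * num_shards) >> 32


def route(doc_ids, num_shards):
    """Group document ids by the shard that owns them."""
    shards = {n: [] for n in range(num_shards)}
    for doc_id in doc_ids:
        shards[shard_for(doc_id, num_shards)].append(doc_id)
    return shards


# Hypothetical document ids.
ids = [f"doc-{n}" for n in range(20)]
placement = route(ids, 4)
for shard, members in placement.items():
    print(shard, members)
```

Since the shard is a pure function of the id, any node can compute where a document belongs without consulting the others.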

Views

Not Supported

Compatible Systems

GraphDB

Derivative Systems

Chronix

Embeddings

CloudSearch

Citations

30 sources

Welcome to Apache Solr - Apache Solr apache.org Modified: 2026-05-20 Accessed: 2026-06-05
GitHub - apache/lucene-solr: Apache Lucene and Solr open-source search software · GitHub github.com Accessed: 2026-06-05
Apache Solr 7.3.0 Documentation apache.org Modified: 2018-04-04 Accessed: 2026-06-05
Making and Restoring Backups | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Index Replication | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Collections API | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Updating Parts of Documents | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
http://www.solrtutorial.com/basic-solr-concepts.html solrtutorial.com Dead — Check Archive Accessed: 2026-06-02
Overview of Documents, Fields, and Schema Design | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
https://wiki.apache.org/solr/Join apache.org Dead — Check Archive Accessed: 2026-05-21
Other Parsers | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
org.apache.lucene.codecs.lucene70 (Lucene 7.3.0 API) apache.org Modified: 2018-04-04 Accessed: 2026-06-07
Solr Glossary | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
DocValues | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Near Real Time Searching | Apache Solr Reference Guide 7.2 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
https://lucidworks.com/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud lucidworks.com Dead — Check Archive Accessed: 2026-05-21
UpdateHandlers in SolrConfig | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Stream Decorator Reference | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Stream Decorator Reference | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Stream Decorator Reference | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Stream Decorator Reference | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
UpdateHandlers in SolrConfig | Apache Solr Reference Guide 6.6 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Read and Write Side Fault Tolerance | Apache Solr Reference Guide 6.6 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
org.apache.lucene.search (Lucene 4.6.0 API) apache.org Modified: 2013-12-02 Accessed: 2026-06-07
https://wiki.apache.org/solr/IntegratingSolr apache.org Dead — Check Archive Accessed: 2026-05-21
Introduction to Client APIs | Apache Solr Reference Guide 7.3 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
DataDir and DirectoryFactory in SolrConfig | Apache Solr Reference Guide 6.6 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
https://wiki.apache.org/solr/SolrPerformanceProblems apache.org Dead — Check Archive Accessed: 2026-05-21
Solr Glossary | Apache Solr Reference Guide 6.6 apache.org Modified: 2025-04-22 Accessed: 2026-06-07
Shards and Indexing Data in SolrCloud | Apache Solr Reference Guide 6.6 apache.org Modified: 2025-04-22 Accessed: 2026-06-07

Revision #7 Last Updated: 2018-05-11 14:19