Apache Lucene Search Library
lucene.apache.org
3
Leaving SiteNav
External Link Disclaimer
You are about to visit lucene.apache.org. This website is not operated by us. We are not responsible for its content or privacy practices.
About this website
Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java, originally created by Doug Cutting in 1999 and now a top-level Apache Software Foundation project with over 2,500 stars as of 2026. Lucene is the foundation for Elasticsearch, Apache Solr, and countless enterprise search applications worldwide. Key features include: powerful indexing (inverted index data structure with term dictionaries, frequency tables, position information, and payload support, enabling sub-millisecond query latency on billions of documents), analyzer framework (pluggable text analysis pipeline with tokenizers, token filters, and character filters for multilingual support including Chinese, Japanese, Arabic, and European languages via ICU4J), query language (rich query DSL supporting boolean, phrase, wildcard, fuzzy, proximity, range, boost, and span queries with Lucene's classic query parser and flexible query parser), scoring and relevance (TF-IDF and BM25 similarity models with customizable scoring algorithms, function queries, and learning-to-rank plugin), field types (text, string, integer, long, float, double, binary, numeric, and spatial fields with doc values for sorting and faceting), near-real-time search (searching newly indexed documents within milliseconds of indexing via NRT reader reopen), highlighter (highlighting query terms in result snippets with configurable fragment sizes), spell checking and suggest (Levenshtein distance-based spell checking and auto-complete suggestions via the suggest module), faceted search (multi-dimensional categorization and counting via the facet module), spatial search (geospatial queries with point, polygon, and distance filtering via the spatial-extras module), Lucene segment architecture (immutable segments with background merging for write efficiency), and high availability (replication via SolrCloud or Elasticsearch cluster).
Statistics
3
Views
0
Clicks
0
Like
0
Dislike