Partitioned Elias–Fano indexes

Partitioned Elias–Fano (PEF) indexes are a compressed data structure designed for efficiently representing sorted integer sequences, notably inverted indexes in information retrieval. Introduced by Giuseppe Ottaviano and Rossano Venturini in 2014, PEF indexes enhance classic Elias–Fano encoding by dividing sequences into partitions or chunks to leverage local clustering, thus achieving superior compression without sacrificing query speed.

Background

In search engines and information retrieval systems, an inverted index maps terms to "posting lists"—sorted sequences of document IDs where a specific term appears. Efficiently compressing these monotonically increasing sequences is critical for reducing memory consumption and improving query response times.

Elias–Fano encoding, proposed independently by Peter Elias and Robert Fano in the 1970s, provides a "quasi-succinct" method for this compression, using close to the theoretical minimum number of bits required. It operates by splitting each integer into high and low bits. The low bits are stored explicitly in a bit vector, while the high bits are encoded in unary. A major advantage of Elias-Fano over other delta-encoding schemes (like variable-byte or Golomb coding) is that it supports constant-time random access and fast skipping without needing to decompress the entire list. This makes it especially suitable for intersection and merging operations during boolean AND queries.

Partitioned Elias–Fano technique

While classic Elias-Fano encoding provides excellent query performance, it fails to exploit the local clustering often found in real-world inverted lists (i.e., long subsequences of documents with close IDs). The partitioned Elias–Fano method extends the standard encoding by dividing integer sequences into smaller sublists or chunks. Each chunk is compressed individually, which allows the encoding to better adapt to the local statistics of the data.

This two-level data structure consists of:

Top-level sequence: Encodes chunk descriptors (metadata and endpoints) using standard Elias–Fano encoding. This acts as a routing layer to allow quick navigation between chunks.
Chunk-level sequences: Compresses each chunk individually, choosing the most optimal encoding strategy based on the local structure of the data in that specific chunk.

Optimal Partitioning

Chunk partitioning can be implemented with fixed-length blocks or variable-length blocks. Variable-length partitioning provides significantly superior compression by adaptively setting chunk boundaries based on the data distribution. Because finding the absolute optimal partitioning configuration can be computationally expensive, Ottaviano and Venturini introduced a linear-time optimization algorithm. It reduces the partition problem to finding the shortest path on a specific directed acyclic graph (DAG), allowing the system to efficiently identify the minimum-space partition up to an arbitrarily small approximation factor.

Performance

In their 2014 study, Ottaviano and Venturini reported that partitioned Elias–Fano indexes significantly improved compression over standard Elias–Fano indexes, achieving up to double the compression efficiency on highly clustered datasets. Their benchmarks demonstrated that PEF maintained competitive query performance, particularly for intersection and skip-heavy queries common in web search applications. When compared to other state-of-the-art compressed encodings at the time—such as gamma-delta-Golomb coding and frame-of-reference encoding (PForDelta)—partitioned Elias–Fano exhibited the best trade-off between compression size and query speed.

Applications and adoption

Partitioned Elias–Fano indexes and their variations have become highly influential in modern search engine architecture, being cited by over 170 academic papers in the field of information retrieval. Academic implementations include the PISA toolkit (Performant Indexes and Search for Academia), an open-source search engine designed for high-performance information retrieval research.

Commercially, standard and partitioned Elias-Fano encoding concepts influence systems like Apache Lucene (which underpins popular enterprise search platforms like Elasticsearch and Solr) and Facebook’s Graph Search system (Unicorn), which utilized Elias–Fano encoding for rapid graph queries at scale.

Further developments in this space include clustered Elias–Fano indexes, which improve upon PEF by exploiting redundancy across multiple sequences, and dynamic Elias-Fano representations that allow for append-only operations while maintaining optimal space boundaries.

External links

🪦 Wikipedia History

10 monthsage

7editors

14edits

Archive Provenance

Created: June 22, 2025

Wikipedia title: Partitioned Elias–Fano indexes

Original author: Tomlovesfar

Original author ID: 47319163

Last editor: Citation bot

Last editor ID: 7903804

Last edit: April 20, 2026

Deleted: May 5, 2026

Article size: 7.5 KB

Technical Metadata

Wikipedia page ID: 80265783

Last revision ID: 1350201409

SHA-1 hash: 69l03v1zhdfur6ey0qszlot15qwsf0o

Metadata captured: April 20, 2026 8:17 PM

Metadata updated: April 22, 2026 6:10 PM

📊 Wikipedia Stats

Views before deletion: 1,746

Wikipedia pages linked here: 1

Wikidata: Q135225822 (Partitioned Elias–Fano indexes)

All Access: 873 views · 11 months · June 2025 to April 2026

Desktop: 731 views · 11 months · June 2025 to April 2026

Mobile App: 9 views · 11 months · June 2025 to April 2026

Mobile Web: 133 views · 11 months · June 2025 to April 2026

View monthly pageviews (44)

April 2026 · All Access · 25 views

April 2026 · Desktop · 22 views

April 2026 · Mobile Web · 3 views

April 2026 · Mobile App · 0 views

March 2026 · All Access · 99 views

March 2026 · Desktop · 95 views

March 2026 · Mobile Web · 4 views

March 2026 · Mobile App · 0 views

February 2026 · All Access · 105 views

February 2026 · Desktop · 96 views

February 2026 · Mobile Web · 9 views

February 2026 · Mobile App · 0 views

January 2026 · All Access · 88 views

January 2026 · Desktop · 75 views

January 2026 · Mobile Web · 13 views

January 2026 · Mobile App · 0 views

December 2025 · All Access · 53 views

December 2025 · Desktop · 48 views

December 2025 · Mobile Web · 5 views

December 2025 · Mobile App · 0 views

November 2025 · All Access · 46 views

November 2025 · Desktop · 40 views

November 2025 · Mobile Web · 6 views

November 2025 · Mobile App · 0 views

October 2025 · All Access · 75 views

October 2025 · Desktop · 64 views

October 2025 · Mobile Web · 10 views

October 2025 · Mobile App · 1 views

September 2025 · All Access · 41 views

September 2025 · Desktop · 29 views

September 2025 · Mobile Web · 11 views

September 2025 · Mobile App · 1 views

August 2025 · All Access · 109 views

August 2025 · Desktop · 71 views

August 2025 · Mobile Web · 37 views

August 2025 · Mobile App · 1 views

July 2025 · All Access · 109 views

July 2025 · Desktop · 81 views

July 2025 · Mobile Web · 22 views

July 2025 · Mobile App · 6 views

June 2025 · All Access · 123 views

June 2025 · Desktop · 110 views

June 2025 · Mobile Web · 13 views

June 2025 · Mobile App · 0 views

Subject Tags

All orphaned articlesData structuresInformation retrievalOrphaned articles from June 2025Speedy deletion candidates with talk pages

Maintenance Categories

View maintenance categories (3)

Articles for deletionCandidates for speedy deletionCandidates for speedy deletion as unreviewed LLM-generated content

Top Contributors

View full contributor metadata (7)

Citation bot · 4 edits · user ID 7903804 · first Jun 22, 2025 · last Apr 20, 2026

Tomlovesfar · 4 edits · user ID 47319163 · first Jun 22, 2025 · last Apr 20, 2026

OzmoOzmo · 2 edits · user ID 51872824 · first Apr 20, 2026 · last Apr 20, 2026

Antured · 1 edit · user ID 41454490 · first Jun 22, 2025 · last Jun 22, 2025

Cinder painter · 1 edit · user ID 47713826 · first Jun 22, 2025 · last Jun 22, 2025

Folkezoft · 1 edit · user ID 48035214 · first Jun 22, 2025 · last Jun 22, 2025

Jumpytoo · 1 edit · user ID 9949312 · first Apr 20, 2026 · last Apr 20, 2026

Also Known As

Partitioned Elias-Fano indexes

View redirect record details

Partitioned Elias-Fano indexes · page ID 80266641 · Article

Why Deleted

Speedy

by Spartaz

Articles for deletion/Partitioned Elias–Fano indexes (XFDcloser)

Sources

https://doi.org/10.1145%2F2600428.2609615

https://doi.org/10.14778%2F2536222.2536239

https://doi.org/10.4230%2FLIPIcs.CPM.2017.30

Additional preserved links are available in the archive details below.

Archive Inventory

View stored source record counts

Revision rows stored: 14

Outgoing links stored: 11

External links stored: 15

Templates stored: 82

Talk exports stored: 1

AfD exports stored: 1

Raw API payloads stored: 13

Image records stored: 0

View full source metadata

Outgoing Wikipedia links (11)

Apache LuceneApache SolrData structureDirected acyclic graphDoi (identifier)ElasticsearchFacebookInverted indexISBN (identifier)Peter EliasRobert Fano

Backlinks (1)

Partitioned Elias-Fano indexes

External links (15)

https://doi.org/10.1145%2F2600428.2609615

https://doi.org/10.14778%2F2536222.2536239

https://doi.org/10.4230%2FLIPIcs.CPM.2017.30

github.com/...

https://github.com/pisa-engine/pisa

https://lucene.apache.org/core/

lucene.apache.org/...

scholar.google.com/...

Templates (82)

AmboxArticle for deletion/datedArticle for deletion/switchCat handlerCategory handlerCite conferenceCite journalCite webDb-g15Db-llmDb-metaDraft otherEncodefirstEnumFind sources mainspaceHang on/notice3If emptyIn5Indent 5Main otherMboxModule:ArgumentsModule:Category handlerModule:Category handler/blacklistModule:Category handler/configModule:Category handler/dataModule:Category handler/sharedModule:Check for unknown parametersModule:Citation/CS1Module:Citation/CS1/COinSModule:Citation/CS1/ConfigurationModule:Citation/CS1/Date validationModule:Citation/CS1/IdentifiersModule:Citation/CS1/styles.cssModule:Citation/CS1/UtilitiesModule:Citation/CS1/WhitelistModule:Disambiguation/templatesModule:Find sourcesModule:Find sources/configModule:Find sources/linksModule:Find sources/templates/Find sources mainspaceModule:If emptyModule:In5Module:Message boxModule:Message box/ambox.cssModule:Message box/configurationModule:MultiReplaceModule:Namespace detectModule:Namespace detect/configModule:Namespace detect/dataModule:PagetypeModule:Pagetype/configModule:Pagetype/disambiguationModule:Pagetype/rfdModule:Pagetype/setindexModule:Pagetype/softredirectModule:Separated entriesModule:TableToolsModule:TextModule:Text/dataModule:Time agoModule:ToolbarModule:UnsubstModule:Wikitext ParsingModule:YesnoMonthyearMonthyear-1Namespace detectNOINDEXOrphanPAGENAMEUPagetypeReflistReflist/styles.cssREVISIONUSER2Separated entriesTalk otherTerminate sentenceTime agoToolbarYesnoYn

📊 Wikidata

Subject Facts

QID: Q135225822

Label: Partitioned Elias–Fano indexes

Wikidata Archive Data

Captured: April 20, 2026 8:49 PM

View raw claims JSON

Stored properties: 0

[]

Partitioned Elias–Fano indexes

Background

Partitioned Elias–Fano technique

Optimal Partitioning

Performance

Applications and adoption

External links

See Also

GLIMPSE

UNICE global brain project

Harvester42

Dictionary of Algorithms and Data Structures

Bucket (computing)