Compression of inverted indexes for fast query evaluation

F Scholer, HE Williams, J Yiannis, J Zobel

SIGIR Forum (ACM Special Interest Group on Information Retrieval) | Published: 2002


Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper, we revisit the compression of inverted lists of document postings that store the position and frequency of indexed terms, considering two approaches to improving retrieval efficiency: better implementation and better choice of integer compression schemes. First, we propose several simple optimisations to well-known integer compression schemes, and show experimentally that these lead to significant reductions in time. Second, we explore the impact of choice of compression scheme on retrieval efficiency. In experiments on large collections of data, we show two surprising results: use of simple ..
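As an illustration of what an integer compression scheme for inverted lists can look like, the sketch below stores a sorted list of document numbers as gaps between successive entries and encodes each gap with a byte-aligned variable-length code. This is a generic textbook technique chosen for illustration only; it is not taken from the paper, and the function names (vbyte_encode, encode_postings, decode_postings) and the byte layout (7 data bits per byte, low-order bits first, high bit marking the final byte) are assumptions.

```python
def vbyte_encode(n):
    """Encode one non-negative integer with a variable-byte code.

    Assumed layout: 7 data bits per byte, low-order bits first,
    with the high bit set only on the final byte of the integer.
    """
    if n < 0:
        raise ValueError("variable-byte coding needs a non-negative integer")
    out = bytearray()
    while n >= 128:
        out.append(n & 0x7F)   # continuation byte: high bit clear
        n >>= 7
    out.append(n | 0x80)       # terminating byte: high bit set
    return bytes(out)


def encode_postings(doc_ids):
    """Encode a strictly increasing list of document numbers as gaps."""
    out = bytearray()
    prev = 0
    for doc_id in doc_ids:
        if doc_id <= prev and out:
            raise ValueError("document numbers must be strictly increasing")
        out += vbyte_encode(doc_id - prev)  # store the gap, not the number
        prev = doc_id
    return bytes(out)


def decode_postings(data):
    """Invert encode_postings: rebuild document numbers from encoded gaps."""
    doc_ids = []
    value, shift, prev = 0, 0, 0
    for byte in data:
        if byte & 0x80:                      # final byte of this gap
            value |= (byte & 0x7F) << shift
            prev += value                    # running sum restores the number
            doc_ids.append(prev)
            value, shift = 0, 0
        else:                                # more bytes follow
            value |= byte << shift
            shift += 7
    return doc_ids


if __name__ == "__main__":
    postings = [3, 7, 8, 300, 100000]
    encoded = encode_postings(postings)
    print(len(encoded), "bytes for", len(postings), "postings")
    print(decode_postings(encoded) == postings)
```

Because small gaps fit in a single byte, lists for frequent terms shrink considerably under this kind of coding, while decoding needs only shifts and masks on whole bytes. Bitwise codes are typically more compact but need more work per decoded integer, which is the kind of trade-off between index size and query time that the abstract refers to.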

