Journal article

In situ generation of compressed inverted files

A Moffat, TAH Bell

Journal of the American Society for Information Science | Published : 1995

Abstract

An inverted index stores, for each term that appears in a collection of documents, a list of document numbers containing that term. Such an index is indispensable when Boolean or informal ranked queries are to be answered. Construction of the index is, however, a nontrivial task. Simple methods using in‐memory data structures cannot be used for large collections because they require too much random access storage, and traditional disk‐based methods require large amounts of temporary file space. This paper describes a new indexing algorithm designed to create large compressed inverted indexes in situ. It makes use of simple compression codes for the positive integers and an in‐place external ..

View full abstract

University of Melbourne Researchers