Conference Proceedings

A cost model for long-term compressed data retention

K Liao, A Moffat, M Petri, A Wirth

Proceedings of the Tenth ACM International Conference on Web Search and Data Mining | Association for Computing Machinery (ACM) | Published: 2017


© 2017 ACM. Vast amounts of data are collected and stored every day, as part of corporate knowledge bases and as a response to legislative compliance requirements. To reduce the cost of retaining such data, compression tools are often applied. But simply seeking the best compression ratio is not necessarily the most economical choice, and other factors also come into play, including compression and decompression throughput, the main memory required to support a given level of ongoing access to the stored data, and the types of storage available. Here we develop a model for the total retention cost (TRC) of a data archiving regime, and by applying the charging rates associated with a cloud ..
