Journal article

The compression-error trade-off for large gridded data sets

JD Silver, CS Zender

Geoscientific Model Development | COPERNICUS GESELLSCHAFT MBH | Published : 2017

Abstract

The netCDF-4 format is widely used for large gridded scientific data sets and includes several compression methods: lossy linear scaling and the non-lossy deflate and shuffle algorithms. Many multidimensional geoscientific data sets exhibit considerable variation over one or several spatial dimensions (e.g., vertically) with less variation in the remaining dimensions (e.g., horizontally). On such data sets, linear scaling with a single pair of scale and offset parameters often entails considerable loss of precision. We introduce an alternative compression method called "layer-packing" that simultaneously exploits lossy linear scaling and lossless compression. Layer-packing stores arrays (ins..

View full abstract

University of Melbourne Researchers

Grants

Awarded by NASA


Awarded by DOE


Funding Acknowledgements

The work of Jeremy D. Silver was funded by the University of Melbourne's McKenzie Postdoctoral Fellowship program. The work of Charles S. Zender was funded by NASA ACCESS NNX12AF48A and NNX14AH55A and by DOE ACME DE-SC0012998. We thank Peter J. Rayner (University of Melbourne) for useful discussions. Three anonymous reviewers provided constructive comments and suggestions on the manuscript.