Journal article
The compression-error trade-off for large gridded data sets
JD Silver, CS Zender
Geoscientific Model Development | COPERNICUS GESELLSCHAFT MBH | Published : 2017
Abstract
The netCDF-4 format is widely used for large gridded scientific data sets and includes several compression methods: lossy linear scaling and the non-lossy deflate and shuffle algorithms. Many multidimensional geoscientific data sets exhibit considerable variation over one or several spatial dimensions (e.g., vertically) with less variation in the remaining dimensions (e.g., horizontally). On such data sets, linear scaling with a single pair of scale and offset parameters often entails considerable loss of precision. We introduce an alternative compression method called "layer-packing" that simultaneously exploits lossy linear scaling and lossless compression. Layer-packing stores arrays (ins..
View full abstractRelated Projects (1)
Grants
Awarded by NASA
Awarded by DOE
Funding Acknowledgements
The work of Jeremy D. Silver was funded by the University of Melbourne's McKenzie Postdoctoral Fellowship program. The work of Charles S. Zender was funded by NASA ACCESS NNX12AF48A and NNX14AH55A and by DOE ACME DE-SC0012998. We thank Peter J. Rayner (University of Melbourne) for useful discussions. Three anonymous reviewers provided constructive comments and suggestions on the manuscript.