Journal article
Data compression and histograms
B Yu, TP Speed
Probability Theory and Related Fields | SPRINGER VERLAG | Published : 1992
DOI: 10.1007/BF01194921
Abstract
In this paper, the relationship between code length and the selection of the number of bins for a histogram density is considered for a sequence of iid observations on [0,1]. First, we use a shortest code length criterion to select the number of bins for a histogram. A uniform almost sure asymptotic expansion for the code length is given and it is used to prove the asymptotic optimality of the selection rule. In addition, the selection rule is consistent if the true density is uniform [0,1]. Secondly, we deal with the problem: what is the "best" achievable average code length with underlying density function f? Minimax lower bounds are derived for the average code length over certain smooth ..
View full abstract