Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2016
A histogram estimate of the Radon-Nikodym derivative of a probability measure with respect to a dominating measure is developed for binary sequences in {0, 1} N. A necessary and sufficient condition for the consistency of the estimate in the mean-square sense is given. It is noted that the product topology on {0, 1} N and the corresponding dominating product measure pose considerable restrictions on the rate of sampling required for the requisite convergence.
Statistics & Probability Letters, 2013
A histogram estimate of the Radon-Nikodym derivative of a probability measure with respect to a dominating measure is developed for binary sequences in {0, 1} N. A necessary and sufficient condition for the consistency of the estimate in the mean-square sense is given. It is noted that the product topology on {0, 1} N and the corresponding dominating product measure pose considerable restrictions on the rate of sampling required for the requisite convergence.
arXiv: Probability, 2015
We calculate the so-called Rademacher's Grand Lebesgue Space norm for a centered (shifted) indicator (Bernoulli's, binary) random variable. This norm is optimal for the centered and bounded random variables (r.v.). Using this result we derive a very simple bilateral sharp exponential tail estimates for sums of these variables, not necessary to be identical distributed, under non-standard norming, and give some examples to show the exactness of our estimates.
1994
The LIand L2-errors of the histogram estimate of a density f from a sample X l r X 2 ,. .. , X , using a cubic partition are shown to be asymptotically normal without any unnecessary conditions imposed on the density f. The asymptotic variances are shown to depend on f only through the corresponding norm off. From this follows the asymptotic null distribution of a goodness-of-fit test based on the total variation distance, introduced by Gyorfi and van der Meulen (1991). This note uses the idea of partial inversion for obtaining characteristic functions of conditional distributions, which goes back at least to Bartlett (1938). RESUME On s'interesse aux erreurs. suivant les normes L , et L2, inherentes a I'estimateur de type histogramme pour une fonction de densite f. ce dernier &ant obtenu i partir d'un Cchantillon X I , X 2 ,. .. ,X,, en utilisant une partition cubique. I1 est dtmontre que ces erreurs poss&dent une distribution asymptotique gaussienne, sans avoir a imposer des conditions superflues sur la fonction de densite f. Les variances asymptotiques ne dependent de f que par l'intermkdiaire de sa norme correspondante. Ceci nous permet d'obtenir la distribution asymptotique, sous I'hypothese nulle, d'une statistique de validitt de I'ajustement fondee sur la variation totale, introduite par Gyorfi et van der Meulen (1991). Cet article fait appel la notion d'inversion partielle, remontant au moins a Bartlett (1 938), afin d'obtenir les fonctions caracttristiques de distributions conditionnelles.
Mathematics and Statistics, 2019
On the probabilistic space (Ω, F, P) we consider a given two-component stationary (in the narrow sense) sequence { } 1 ≥ i i i X ξ , where { } 1 i i ξ ≥ (: i ξ Ω → Ξ) is the controlling sequence and the members : i X R Ω → of the sequence { } 1 i i X ≥ are the observations of some random variable X which are used in the construction of kernel estimates of Rosenblatt-Parzen type for an unknown density () f x of the variable X. The cases of conditional independence and chain dependence of these observations are considered. The upper bounds are established for mathematical expectations of the square of deviation of the obtained estimates from () f x .
Journal of Multivariate Analysis, 1981
Communicated by M. Rosenblatt The almost sure convergence of the kernel-type density estimate is proved for a strictly stationary ergodic sample. Let PI,,-co < n < co} be a sequence of identically distributed random vectors taking values in .@ and having the common density f. Consider the density estimate introduced by Wolverton and Wagner [4]: where The function h is called kernel. Let 3, be the conditional probability measure of qn given q"=,' 4 {q,, i < n-I}, i.e., for each Bore1 set A let
Journal of Mathematical Analysis and Applications, 2005
We introduce a generalization of the notion of sequence of finite variation, using asymptotic density of sets of positive integers. Some approximation results about approaching any sequence by sequences of finite statistical variation are given. 2005 Elsevier Inc. All rights reserved.
Image Analysis & Stereology, 2014
Many real phenomena may be modelled as random closed sets in ℝd, of different Hausdorff dimensions. The problem of the estimation of pointwise mean densities of absolutely continuous, and spatially inhomogeneous, random sets with Hausdorff dimension n < d, has been the subject of extended mathematical analysis by the authors. In particular, two different kinds of estimators have been recently proposed, the first one is based on the notion of Minkowski content, the second one is a kernel-type estimator generalizing the well-known kernel density estimator for random variables. The specific aim of the present paper is to validate the theoretical results on statistical properties of those estimators by numerical experiments. We provide a set of simulations which illustrates their valuable properties via typical examples of lower dimensional random sets.
Applied Mathematical Sciences, 2014
A comparison theorem for two weighted series is proved. As a consequence, a new result concerning the weighted densities is given.
Annals of the Institute of Statistical Mathematics, 1989
In this paper we investigate the limiting behaviour of the measures of information due to Csisz~ir, R6nyi and Fisher. Conditions for convergence of measures of information and for convergence of Radon-Nikodym derivatives are obtained. Our results extend the results of
ESAIM: Probability and Statistics, 2011
Our aim is to estimate the joint distribution of a finite sequence of independent categorical variables. We consider the collection of partitions into dyadic intervals and the associated histograms, and we select from the data the best histogram by minimizing a penalized least-squares criterion. The choice of the collection of partitions is inspired from approximation results due to DeVore and Yu. Our estimator satisfies a nonasymptotic oracle-type inequality and adaptivity properties in the minimax sense. Moreover, its computational complexity is only linear in the length of the sequence. We also use that estimator during the preliminary stage of a hybrid procedure for detecting multiple change-points in the joint distribution of the sequence. That second procedure still satisfies adaptivity properties and can be implemented efficiently. We provide a simulation study and apply the hybrid procedure to the segmentation of a DNA sequence.
2006
We consider quantization of signals in probabilistic framework. In practice, signals (or random processes) are observed at sampling points. We study probabilistic models for run-length encoding (RLE) method. This method is characterized by the compression efficiency coefficient (or quantization rate) and is widely used, for example, in digital signal and image compression. Some properties of RLE quantization rate are investigated. Statistical inference for mean RLE quantization rate is considered. In particular, the asymptotical normality of mean RLE quantization rate estimates is studied. Numerical experiments demonstrating the rate of convergence in the obtained asymptotical results are presented.
Annals of The Institute of Statistical Mathematics, 1995
A b s t r a c t . Let {X~,n _> 1} be a strictly stationary sequence of associated random variables defined on a probability space (f~, B, P) with probability density function f(x) and failure rate function r(x) for X1. Let f,~(x) be a kerneltype estimator of f(x) based on X1,..., X~. Properties of f,~(x) are studied. Pointwise strong consistency and strong uniform consistency are established under a certain set of conditions. An estimator rn(x) of r(x) based on fn(X) and P,~(x), the empirical survival function, is proposed. The estimator rn(x) is shown to be pointwise strongly consistent as well as uniformly strongly consistent over some sets.
2018
Recalling recent results on the characterization of threshold-based sampling as quasi-isometric mapping, mathematical implications on the metric and topological structure of the space of event sequences are derived. In this context, the space of event sequences is extended to a normed space equipped with Hermann Weyl's discrepancy measure. Sequences of finite discrepancy norm are characterized by a Jordan decomposition property. Its dual norm turns out to be the norm of total variation. As a by-product a measure for the lack of monotonicity of sequences is obtained. A further result refers to an inequality between the discrepancy norm and total variation which resembles Heisenberg's uncertainty relation.
IEEE Transactions on Information Theory, 1992
The problem of the nonparametric estimation of a probability distribution is considered from three viewpoints: the consistency in total variation, the consistency in information divergence, and consistency in reversed order information divergence. These types of consistencies are relatively strong criteria of convergence, and a probability distribution cannot he consistently estimated in either type of convergence without any restrictions on the class of probability distributions allowed. Histogram-based estimators of distribution are presented which, under certain conditions, converge in total variation, in information divergence, and in reversed order information divergence to the unknown probability distribution. Some a priori information about the true probability distribution is assumed in each case. As the concept of consistency in information divergence is stronger than that of convergence in total variation, additional assumptions are imposed in the cases of informational divergences. Index Tens-Consistent distribution estimation, total variation, information divergence, reversed order information divergence, histogram-based estimate. E CONSIDER the problem of estimating an un-W known probability distribution p, defined on an arbitrary measurable space ( X , 231, based on independent, identically distributed (i.i.d.) observations XI ,.*e, X,, from p. Here, X could be the real line [w or the Euclidean space Rd, d 2 1, in which case % is the collection of Bore1 sets. As generic notation for the distribution estimate of a set A we use = P . , * ( A X l , X * , . . . > X , , ) > (1 .1> where p.,* is a measurable function of its arguments. In this paper, we examine density estimation problems and related distribution estimation problems that are mo-Manuscript received March 6, 1990. A. R.
Applied Algebra and Number Theory
Large families of binary sequences of the same length are considered and a new measure, the cross-correlation measure of order k is introduced to study the connection between the sequences belonging to the family. It is shown that this new measure is related to certain other important properties of families of binary sequences. Then the size of the cross-correlation measure is studied. Finally, the cross-correlation measures of two important families of pseudorandom binary sequences are estimated.
Statistics & Probability Letters, 1990
Rate of convergence for density estimators based on Haar series are derived under very mild condition: the unknown density has to be of bounded variation. These estimators are histograms on dyadic intervals.
arXiv: Statistics Theory, 2020
In this paper we develop rate--optimal estimation procedures in the problem of estimating the $L_p$--norm, $p\in (0, \infty)$ of a probability density from independent observations. The density is assumed to be defined on $R^d$, $d\geq 1$ and to belong to a ball in the anisotropic Nikolskii space. We adopt the minimax approach and construct rate--optimal estimators in the case of integer $p\geq 2$. We demonstrate that, depending on parameters of Nikolskii's class and the norm index $p$, the risk asymptotics ranges from inconsistency to $\sqrt{n}$--estimation. The results in this paper complement the minimax lower bounds derived in the companion paper \cite{gl20}.
1999
In this paper, we present some general results determining minimax bounds on statistical risk for density estimation based on certain information-theoretic considerations. These bounds depend only on metric entropy conditions and are used to identify the minimax rates of convergence. This work was supported by NSF Grants ECS-9410760 and DMS-9505168. AMS 1991 subject classi cations. Primary 62G07; secondary 62B10, 62C20, 94A29. Key words and phrases. minimax risk, density estimation, metric entropy, Kullback-Leibler distance. of -packing sets is called the packing -entropy or Kolmogorov capacity of S with distance function d and is denoted M d ( ).
Applied Mathematical Sciences, 2013
In this paper, the estimation of a multivariate probability density f of mixing sequences, using wavelet method is considered. We investigate the rate of the L 2-almost sure convergence of wavelet estimators. Optimal rate, up to a logarithm, of convergence of estimators when f belongs to the Sobolev space H s 2 (R d) with s > 0 is established.
Journal of Multivariate Analysis, 2014
Many real phenomena may be modelled as random closed sets in R d , of different Hausdorff dimensions. Of particular interest are cases in which their Hausdorff dimension, say n, is strictly less than d, such as fiber processes, boundaries of germ-grain models, and n-facets of random tessellations. A crucial problem is the estimation of pointwise mean densities of absolutely continuous, and spatially inhomogeneous random sets, as defined by the authors in a series of recent papers. While the case n = 0 (random vectors, point processes, etc.) has been, and still is, the subject of extensive literature, in this paper we face the general case of any n < d; pointwise density estimators which extend the notion of kernel density estimators for random vectors are analyzed, together with a previously proposed estimator based on the notion of Minkowski content. In a series of papers, the authors have established the mathematical framework for obtaining suitable approximations of such mean densities. Here we study the unbiasedness and consistency properties, and identify optimal bandwidths for all proposed estimators, under sufficient regularity conditions. We show how some known results in literature follow as particular cases. A series of examples throughout the paper, both non-stationary, and stationary, are provided to illustrate various relevant situations.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.