Adaptive Noisy Clustering

The problem of adaptive noisy clustering is investigated. Given a set of noisy observations $Z_i=X_i+\epsilon_i$, $i=1, \ldots, n$, the goal is to design clusters associated with the law of $X_i$'s, with unknown density $f$ with respect to the Lebesgue measure. Since we observe a corrupted sample, a direct approach as the popular $k$-means is not suitable in this case. In this paper, we propose a noisy $k$-means minimization, which is based on the $k$-means loss function and a deconvolution estimator of the density $f$. In particular, this approach suffers from the dependance on a bandwidth involved in the deconvolution kernel. Fast rates of convergence for the excess risk are proposed for a particular choice of the bandwidth, which depends on the smoothness of the density $f$. Then, we turn out into the main issue of the paper: the data-driven choice of the bandwidth. We state an adaptive upper bound for a new selection rule, called ERC (Empirical Risk Comparison). This selection rule is based on the Lepski's principle, where empirical risks associated with different bandwidths are compared. Finally, we illustrate that this adaptive rule can be used in many statistical problems of $M$-estimation where the empirical risk depends on a nuisance parameter.

Data and Resources

Additional Info

Field Value
Source https://hal.science/hal-00831672
Author Chichignoud, Michael, Loustau, Sébastien
Maintainer CCSD
Last Updated May 10, 2026, 19:43 (UTC)
Created May 10, 2026, 19:43 (UTC)
Identifier hal-00831672
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire d'Analyse, Topologie, Probabilités (LATP) ; Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS)
creator Chichignoud, Michael
date 2013-06-10T00:00:00
harvest_object_id d1e60a5e-d67f-4408-9943-9d01435d25ba
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2024-05-03T00:00:00
set_spec type:UNDEFINED