Nearest neighbor classification in infinite dimension

Let $X$ be a random element in a metric space $(\calF,d)$, and let $Y$ be a random variable with value $0$ or $1$. $Y$ is called the class, or the label, of $X$. Assume $n$ i.i.d. copies $(X_i,Y_i)_1\leqi\leqn$. The problem of classification is to predict the label of a new random element $X$. The $k$-nearest neighbor classifier consists in the simple following rule : look at the $k$ nearest neighbors of $X$ and choose $0$ or $1$ for its label according to the majority vote. If $(\calF,d)=(R^d,||.||)$, Stone has proved in 1977 the universal consistency of this classifier : its probability of error converges to the Bayes error, whatever the distribution of $(X,Y)$. We show in this paper that this result is no more valid in general metric spaces. However, if $(\calF,d)$ is separable and if a regularity condition is assumed, then the $k$-nearest neighbor classifier is weakly consistent.

Data and Resources

Additional Info

Field Value
Source https://inria.hal.science/inria-00070470
Author Cérou, Frédéric, Guyader, Arnaud
Maintainer CCSD
Last Updated May 16, 2026, 01:34 (UTC)
Created May 16, 2026, 01:34 (UTC)
Identifier Report N°: RR-5536
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Applications of interacting particle systems to statistics (ASPI) ; Université de Rennes (UR)-Centre Inria de l'Université de Rennes ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)
creator Cérou, Frédéric
date 2005-05-16T00:00:00
harvest_object_id 0d2bb7b6-85ea-4845-a9f1-6fcd9b327a53
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-11-17T00:00:00
set_spec type:REPORT