Analysis of a Random Forests Model

Random forests are a scheme proposed by Leo Breiman in the 2000's for building a predictor ensemble with a set of decision trees that grow in randomly selected subspaces of data. Despite growing interest and practical use, there has been little exploration of the statistical properties of random forests, and little is known about the mathematical forces driving the algorithm. In this paper, we offer an in-depth analysis of a random forests model suggested by Breiman in \cite{Bre04}, which is very close to the original algorithm. We show in particular that the procedure is consistent and adapts to sparsity, in the sense that its rate of convergence depends only on the number of strong features and not on how many noise variables are present.

Data and Resources

Additional Info

Field Value
Source https://hal.science/hal-00476545
Author Biau, Gérard
Maintainer CCSD
Last Updated May 23, 2026, 13:29 (UTC)
Created May 23, 2026, 13:29 (UTC)
Identifier hal-00476545
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire de Statistique Théorique et Appliquée (LSTA) ; Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
creator Biau, Gérard
date 2010-04-26T00:00:00
harvest_object_id d94fec4a-a552-4d4e-97bd-4dbdfb91d051
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-11-20T00:00:00
relation info:eu-repo/semantics/altIdentifier/arxiv/1005.0208
set_spec type:UNDEFINED