Estimation and model selection for model-based clustering with the conditional classification likelihood

The Integrated Completed Likelihood (ICL) criterion has been proposed by Biernacki et al. (2000) in the model-based clustering framework to select a relevant number of classes and has been used by statisticians in various application areas. A theoretical study of this criterion is proposed. A contrast related to the clustering objective is introduced: the conditional classification likelihood. This yields an estimator and a model selection criteria class. The properties of these new procedures are studied and ICL is proved to be an approximation of one of these criteria. We oppose these results to the current leading point of view about ICL, that it would not be consistent. Moreover these results give insights into the class notion underlying ICL and feed a reflection on the class notion in clustering. General results on penalized minimum contrast criteria and on mixture models are derived, which are interesting in their own right.

Data and Resources

Additional Info

Field Value
Source ISSN: 1935-7524
Author Baudry, Jean-Patrick
Maintainer CCSD
Last Updated May 16, 2026, 12:33 (UTC)
Created May 16, 2026, 12:33 (UTC)
Identifier hal-00699578
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire de Statistique Théorique et Appliquée (LSTA) ; Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
creator Baudry, Jean-Patrick
date 2015-05-16T00:00:00
harvest_object_id a6b4b9d9-e1f4-4b69-9b21-b9783bbacd6e
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-08-12T00:00:00
relation info:eu-repo/semantics/altIdentifier/doi/10.1214/15-EJS1026
set_spec type:ART