Impact of the distance choice on clustering gene expression data using graph decompositions

The study of gene interactions is an important research area in biology, and grouping genes with similar expression profiles into clusters is a first step towards a better understanding of their functional relationships. In Kaba et al. 2007, a new clustering approach was presented that models this data with gene interaction graphs and decomposes the graphs by means of clique minimal separators. A clique separator is a clique whose removal increases the number of connected components of the graph; the decomposition is obtained by repeatedly copying a clique separator into the components it defines, until only subgraphs with no clique separators are left: these subgraphs are our clusters. The advantage of this approach is that the decomposition can be computed efficiently, is unique, and yields overlapping clusters. To build the graphs, the similarity between each pair of genes is estimated by a distance function; a family of gene interaction graphs is then constructed by choosing several thresholds, with an edge added between two genes whenever their distance is below the threshold. Thus both the choice of the distance function and the choice of the threshold influence the construction of the gene interaction graphs. In Kaba et al. 2007, several criteria were developed to select appropriate thresholds. Here we discuss the impact of the choice of the distance function; our results suggest that this choice does not affect the final decomposition of the gene interaction graphs into clusters.
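
The graph construction and the notion of a clique separator described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the Euclidean distance, the toy expression profiles, the threshold value, and all function names are assumptions chosen for illustration. It shows only a single decomposition step and does not check that a separator is minimal, which the full clique minimal separator decomposition requires.

```python
from itertools import combinations
from math import dist  # Euclidean distance (an assumed choice of distance function)


def threshold_graph(profiles, threshold):
    """Adjacency sets: two genes are joined when their distance is below the threshold."""
    adj = {gene: set() for gene in profiles}
    for a, b in combinations(profiles, 2):
        if dist(profiles[a], profiles[b]) < threshold:
            adj[a].add(b)
            adj[b].add(a)
    return adj


def components(adj, removed=frozenset()):
    """Connected components of the graph after deleting the vertices in `removed`."""
    seen, comps = set(removed), []
    for start in adj:
        if start in seen:
            continue
        comp, stack = set(), [start]
        seen.add(start)
        while stack:
            v = stack.pop()
            comp.add(v)
            for w in adj[v]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
        comps.append(comp)
    return comps


def is_clique(adj, vertices):
    """True if every pair of the given vertices is adjacent."""
    return all(b in adj[a] for a, b in combinations(vertices, 2))


def is_clique_separator(adj, vertices):
    """A clique whose removal increases the number of connected components."""
    return is_clique(adj, vertices) and len(components(adj, vertices)) > len(components(adj))


def decomposition_step(adj, separator):
    """Copy the separator into each component it defines (one step only)."""
    return [comp | set(separator) for comp in components(adj, separator)]


# Hypothetical toy profiles: four genes measured under two conditions.
profiles = {"g1": (0.0, 0.0), "g2": (1.0, 0.0), "g3": (2.0, 0.0), "g4": (3.0, 0.0)}
adj = threshold_graph(profiles, threshold=1.5)  # yields the path g1 - g2 - g3 - g4

pieces = decomposition_step(adj, {"g2"})
print(is_clique_separator(adj, {"g2"}))  # True
print([sorted(p) for p in pieces])       # [['g1', 'g2'], ['g2', 'g3', 'g4']]
```

In this toy graph, g2 appears in both resulting subgraphs, which is how copying a separator yields overlapping clusters. Swapping `dist` for another distance function and comparing the resulting pieces is the kind of experiment the abstract refers to.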

Additional Info

Field Value
Source https://hal.science/hal-00679279
Author Favre, Marie C.F., Pogorelcnik, Romain, Wagler, Annegret K., Berry, Anne
Maintainer CCSD
Last Updated May 30, 2026, 12:58 (UTC)
Created May 30, 2026, 12:58 (UTC)
Identifier hal-00679279
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire d'Informatique, de Modélisation et d'optimisation des Systèmes (LIMOS) ; Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-Université d'Auvergne - Clermont-Ferrand I (UdA)-SIGMA Clermont (SIGMA Clermont)-Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Centre National de la Recherche Scientifique (CNRS)
creator Favre, Marie C.F.
date 2012-03-15T00:00:00
harvest_object_id 1d9ef3d1-fef5-417b-8ff1-b952e4813362
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2023-04-18T00:00:00
set_spec type:REPORT