OUTLIERS EMPHASIS ON CLUSTER ANALYSIS The use of squared Euclidean distance and fuzzy clustering to detect outliers in a dataset

Outlier is the term that indicates in statistics an anomalous observation, aberrant, clearly distant from others collected observations. The outliers are the subject to animated discussions in various contexts with regard to be or not to be considered in the average evaluations. Outliers can become a precious source of information, on condition that be able to accurately identify the presence in the reference datasets. The need to identify the presence of clustered outliers in a dataset not previously treated could argue for a fuzzy clustering, emphasized by using the quadratic Euclidean distance as similarity measure. For interesting and useful results, it should be inclined a possibilistic clustering approach, where the term "possibilistic" means, always in mathematical rigor, a component of interpretation of values that point out anomalous cases. The crisp method does not allow it, the fuzzy method introduce it, the possibilistic one use it. This is a very simple paper with divulgative purposes, addressed especially to students, but not only.

Data and Resources

Additional Info

Field Value
Source https://hal.science/hal-00962248
Author Rosso, Gianluca
Maintainer CCSD
Last Updated May 5, 2026, 23:07 (UTC)
Created May 5, 2026, 23:07 (UTC)
Identifier hal-00962248
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor SIS Società Italiana di Statistica (SIS) ; SIS
creator Rosso, Gianluca
date 2014-03-20T00:00:00
harvest_object_id f53bb1be-0f61-4c96-8e9d-81986f387af9
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2014-03-24T00:00:00
set_spec type:UNDEFINED