Simultaneous data pre-processing and SVM classification model selection based on a parallel genetic algorithm applied to spectroscopic data of olive oils

Classification is an important task in chemometrics. For several years now, support vector machines (SVMs) have proven to be powerful for infrared spectral data classification. However such methods require optimisation of parameters in order to control the risk of overfitting and the complexity of the boundary. Furthermore, it is established that the prediction ability of classification models can be improved using pre-processing in order to remove unwanted variance in the spectra. In this paper we propose a new methodology based on genetic algorithm (GA) for the simultaneous optimisation of SVM parameters and pre-processing (GENOPT-SVM). The method has been tested for the discrimination of the geographical origin of Italian olive oil (Ligurian and non-Ligurian) on the basis of near infrared (NIR) or mid infrared (FTIR) spectra. Different classification models (PLS-DA, SVM with mean centre data, GENOPT-SVM) have been tested and statistically compared using McNemar's statistical test. For the two datasets, SVM with optimised pre-processing give models with higher accuracy than the one obtained with PLS-DA on pre-processed data. In the case of the NIR dataset, most of this accuracy improvement (86.3% compared with 82.8% for PLS-DA) occurred using only a single pre-processing step. For the FTIR dataset, three optimised pre-processing steps are required to obtain SVM model with significant accuracy improvement (82.2%) compared to the one obtained with PLS-DA (78.6%). Furthermore, this study demonstrates that even SVM models have to be developed on the basis of well-corrected spectral data in order to obtain higher classification rates.

Data and Resources

Additional Info

Field Value
Source ISSN: 0308-8146
Author Devos, O., Downey, G., Duponchel, L.
Maintainer CCSD
Last Updated May 7, 2026, 03:42 (UTC)
Created May 7, 2026, 03:42 (UTC)
Identifier hal-00940875
Language en
contributor Laboratoire Avancé de Spectroscopie pour les Intéractions la Réactivité et l'Environnement - UMR 8516 (LASIRE) ; Institut de Chimie - CNRS Chimie (INC-CNRS)-Université de Lille-Centre National de la Recherche Scientifique (CNRS)
creator Devos, O.
date 2014-05-07T00:00:00
harvest_object_id bb999cb9-d3a8-48df-8cbe-673f126d017a
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-02-26T00:00:00
relation info:eu-repo/semantics/altIdentifier/doi/10.1016/j.foodchem.2013.10.020
set_spec type:ART