Automatic speech quality evaluation and diagnostic from hybrid indicators

With increasing development of new technologies (RTC, RNIS, GSM, VoIP), tele-communication services are becoming more and more diversified. To this end, telecommunication operators need to supervise in real-time the speech quality of the services they offer. Speech quality is usually evaluated from subjective experiments.. Nevertheless, such experiments are time consuming and do not allow any supervisory control. So, accurate objective models are useful to estimate the speech quality.This thesis proposes a non-intrusive model for diagnosing and evaluating speech quality using information available at the measurement point: the DESQHI model (Diagnostic and Evaluation of Speech Quality using Hybrid Indicators). It differs from existing models in terms in two main characteristics. The first one concerns the structure of the model. It is shown that speech quality can be represented as a multidimensional phenomenon incorporating three perceptual dimensions related to noisiness, speech codec and continuity. This multidimensional structure allows for a diagnostic of speech quality based on identifying the principal features affecting speech qual-ity. The second characteristic concerns the nature of indicators (signal-based and parametric) used to represent the three perceptual dimensions. Signal-based indicators use numeric information to represent the characteristics of the signal, for example, the loudness of the speech signal. Parametric indicators are obtained from the network statistics, for example, the percentage of packet loss, which gives information about the level of the discontinuity in the speech signal. This work proposes hybrid indicators (using both signal-based and parametric metrics). It is shown that they are better speech quality predictors than existing models, either parametric only (e.g. ITU-T Recommendation G.107, also known as the E-model) or signal-based only (e.g. ITU-T Recommendation P.563 model).

Data and Resources

Additional Info

Field Value
Source https://theses.hal.science/tel-00679705
Author Leman, Adrien
Maintainer CCSD
Last Updated May 24, 2026, 11:39 (UTC)
Created May 24, 2026, 11:39 (UTC)
Identifier NNT: 2011ISAL0053
Language fr
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire Vibrations Acoustique (LVA) ; Institut National des Sciences Appliquées de Lyon (INSA Lyon) ; Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)
creator Leman, Adrien
date 2011-06-07T00:00:00
harvest_object_id e87ba828-902c-4d8e-9cfb-543982e0ad9d
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-03-30T00:00:00
set_spec type:THESE