Automatic speech quality evaluation and diagnostic from hybrid indicators

With increasing development of new technologies (RTC, RNIS, GSM, VoIP), tele-communication services are becoming more and more diversified. To this end, telecommunication operators need to supervise in real-time the speech quality of the services they offer. Speech quality is usually evaluated from subjective experiments.. Nevertheless, such experiments are time consuming and do not allow any supervisory control. So, accurate objective models are useful to estimate the speech quality.This thesis proposes a non-intrusive model for diagnosing and evaluating speech quality using information available at the measurement point: the DESQHI model (Diagnostic and Evaluation of Speech Quality using Hybrid Indicators). It differs from existing models in terms in two main characteristics. The first one concerns the structure of the model. It is shown that speech quality can be represented as a multidimensional phenomenon incorporating three perceptual dimensions related to noisiness, speech codec and continuity. This multidimensional structure allows for a diagnostic of speech quality based on identifying the principal features affecting speech qual-ity. The second characteristic concerns the nature of indicators (signal-based and parametric) used to represent the three perceptual dimensions. Signal-based indicators use numeric information to represent the characteristics of the signal, for example, the loudness of the speech signal. Parametric indicators are obtained from the network statistics, for example, the percentage of packet loss, which gives information about the level of the discontinuity in the speech signal. This work proposes hybrid indicators (using both signal-based and parametric metrics). It is shown that they are better speech quality predictors than existing models, either parametric only (e.g. ITU-T Recommendation G.107, also known as the E-model) or signal-based only (e.g. ITU-T Recommendation P.563 model).

Data and Resources

Automatic speech quality evaluation and...HTML
Explore
- More information
- Go to resource

Additional Info

Field	Value
Source	https://theses.hal.science/tel-00679705
Author	Leman, Adrien
Maintainer	CCSD
Last Updated	May 24, 2026, 11:39 (UTC)
Created	May 24, 2026, 11:39 (UTC)
Identifier	NNT: 2011ISAL0053
Language	fr
Rights	https://about.hal.science/hal-authorisation-v1/
contributor	Laboratoire Vibrations Acoustique (LVA) ; Institut National des Sciences Appliquées de Lyon (INSA Lyon) ; Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)
creator	Leman, Adrien
date	2011-06-07T00:00:00
harvest_object_id	e87ba828-902c-4d8e-9cfb-543982e0ad9d
harvest_source_id	3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title	test moissonnage SELUNE
metadata_modified	2026-03-30T00:00:00
set_spec	type:THESE