Algorithmes de traitement de flux XML : masses de données, mémoire externe et performances extensibles

Many modern applications require processing of massive streams of XML data, creating difficult technical challenges. Among these, there is the design and implementation of applications to optimize the processing of XPath queries and to provide an accurate cost estimation for these queries processed on a massive steam of XML data. In this thesis, we propose a novel performance prediction model which a priori estimates the cost (in terms of space used and time spent) for any structural query belonging to Forward XPath. In doing so, we perform an experimental study to confirm the linear relationship between stream-processing and data-access resources. Therefore, we introduce a mathematical model (linear regression functions) to predict the cost for a given XPath query. Moreover, we introduce a new selectivity estimation technique. It consists of two elements. The first one is the path tree structure synopsis: a concise, accurate, and convenient summary of the structure of an XML document. The second one is the selectivity estimation algorithm: an efficient stream-querying algorithm to traverse the path tree synopsis for estimating the values of cost-parameters. Those parameters are used by the mathematical model to determine the cost of a given XPath query. We compare the performance of our model with existing approaches. Furthermore, we present a use case for an online stream-querying system. The system uses our performance predicate model to estimate the cost for a given XPath query in terms of time/memory. Moreover, it provides an accurate answer for the query's sender. This use case illustrates the practical advantages of performance management with our techniques

Data and Resources

Additional Info

Field Value
Source https://theses.hal.science/tel-00779309
Author Alrammal, Muath
Maintainer CCSD
Last Updated May 15, 2026, 00:41 (UTC)
Created May 15, 2026, 00:41 (UTC)
Identifier NNT: 2011PEST1002
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire d'Algorithmique Complexité et Logique (LACL) ; Université Paris-Est Créteil Val-de-Marne - Paris 12 (UPEC UP12)-Centre National de la Recherche Scientifique (CNRS)
creator Alrammal, Muath
date 2011-05-16T00:00:00
harvest_object_id ff7c3350-5fa8-4295-bec6-32a497d22478
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-03-31T00:00:00
set_spec type:THESE