Création d'un multi-arbre à partir d'un texte balisé

This study focuses on automatic analysis of annotated transcribed speech. The annotation system considered has been recently introduced to address the several limitations of classical syntactic annotations when faced to natural speech transcriptions. It introduces many different components such as embedding, piles, kernels, pre-kernels, discursive markers etc.. All those components are tightly coupled in a complex tree structure and can hardly be considered separately because of their close intrication. Hence, a joint analysis is required but no analysis tool to handle them all together was available yet. In this study, we introduce such an automatic parser of annotated transcriptions of speech and present the corresponding framework based on multi-trees. This framework permits to jointly handle separate aspects of speech such as macro and micro syntactic levels, which are traditionnaly considered separately. Several applications are proposed, including analysis of the transcribed speech by classical parsers designed for written language

Data and Resources

Additional Info

Field Value
Source Proceedings of the Joint Conference JEP-TALN-RECITAL 2012
Author Beliao, Julie
Maintainer CCSD
Last Updated May 9, 2026, 11:45 (UTC)
Created May 9, 2026, 11:45 (UTC)
Identifier halshs-00869863
Language fr
Rights https://about.hal.science/hal-authorisation-v1/
contributor Modèles, Dynamiques, Corpus (MoDyCo) ; Université Paris Nanterre (UPN)-Centre National de la Recherche Scientifique (CNRS)
coverage Grenoble, France
creator Beliao, Julie
date 2012-06-04T00:00:00
harvest_object_id dbd2878b-c092-48e6-bc29-f7756d94222b
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-05-28T00:00:00
set_spec type:COMM