La ressource ANNODIS, un corpus enrichi d'annotations discursives

This paper describes the ANNODIS ressource, a corpus of written French enriched with several markups, including a manual annotation of discourse structures. The resource is original in that it offers a diversified corpus representing several text types, and two annotations based on different approaches to discourse organisation. As well as a description of the ressource - annotated objects, composition of the corpus - the paper presents the theoretical underpinnings of the annotation models and the methodological choices underlying corpus preparation and annotation. It also sketches the potential contribution of such a resource for linguistics and NLP, and describes initial results of its exploitation.

Data and Resources

Additional Info

Field Value
Source ISSN: 1248-9433
Author Péry-Woodley, Marie-Paule, Afantenos, Stergos, Ho-Dac, Lydia-Mai, Asher, Nicholas
Maintainer CCSD
Last Updated May 7, 2026, 07:11 (UTC)
Created May 7, 2026, 07:11 (UTC)
Identifier halshs-00935201
Language fr
Rights https://about.hal.science/hal-authorisation-v1/
contributor Cognition, Langues, Langage, Ergonomie (CLLE-ERSS) ; École Pratique des Hautes Études (EPHE) ; Université Paris Sciences et Lettres (PSL)-Université Paris Sciences et Lettres (PSL)-Université Toulouse - Jean Jaurès (UT2J) ; Communauté d'universités et établissements de Toulouse (Comue de Toulouse)-Communauté d'universités et établissements de Toulouse (Comue de Toulouse)-Université Bordeaux Montaigne (UBM)-Centre National de la Recherche Scientifique (CNRS)
creator Péry-Woodley, Marie-Paule
date 2011-05-07T00:00:00
harvest_object_id f4051d37-d552-4219-9aee-2d978fe4f4c7
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-10-22T00:00:00
set_spec type:ART