A generic and open framework for multiword expressions treatment : from acquisition to applications

This thesis presents an open and flexible methodological framework for the automatic acquisition of multiword expressions (MWEs) from monolingual textual corpora. This research is motivated by the importance of MWEs for NLP applications. After briefly presenting the modules of the framework, the work reports extrinsic evaluation results considering two applications: computer-aided lexicography and statistical machine translation. Both applications can benefit from automatic MWE acquisition and the expressions acquired automatically from corpora can both speed up and improve their quality. The promising results of our experiments encourage further investigation about the optimal way to integrate MWE treatment into these and many other applications.

Data and Resources

Additional Info

Field Value
Source https://theses.hal.science/tel-00741147
Author Ramisch, Carlos
Maintainer CCSD
Last Updated May 9, 2026, 19:46 (UTC)
Created May 9, 2026, 19:46 (UTC)
Identifier NNT: 2012GRENM059
Language fr
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire d'Informatique de Grenoble (LIG) ; Université Pierre Mendès France - Grenoble 2 (UPMF)-Université Joseph Fourier - Grenoble 1 (UJF)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Institut National Polytechnique de Grenoble (INPG)-Centre National de la Recherche Scientifique (CNRS)
creator Ramisch, Carlos
date 2012-09-11T00:00:00
harvest_object_id ea6d5d59-cc14-44dc-8ada-139ccdd8d024
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-03-30T00:00:00
set_spec type:THESE