Constitution of a semantic resource for the Arabic language from multilingual aligned corpora

This study aims at the implementation and evaluation of techniques for extracting semantic relations from a multilingual aligned corpus. Firstly, our observations will focus on the semantic comparison of translational equivalents in multilingual aligned corpus. From these equivalences, we will try to extract "cliques", which ara maximum complete related sub-graphs, where all units are interrelated because of a probable semantic intersection. These cliques have the advantage of giving information on both the synonymy and polysemy of units, and providing a form of semantic disambiguation. Secondly, we attempt to link these cliques with a semantic lexicon (like WordNet) in order to assess the possibility of recovering, for the Arabic units, a semantic relationships already defined for English, French or Spanish units. These relations would automatically build a semantic resource which would be useful for different applications of NLP, such as Question Answering systems, machine translation, alignment systems, Information Retrieval…etc.

Data and Resources

Additional Info

Field Value
Source https://theses.hal.science/tel-00836764
Author Abdulhay, Authoul
Maintainer CCSD
Last Updated May 10, 2026, 15:16 (UTC)
Created May 10, 2026, 15:16 (UTC)
Identifier NNT: 2012GRENL003
Language fr
Rights https://about.hal.science/hal-authorisation-v1/
contributor LInguistique et DIdactique des Langues Étrangères et Maternelles (LIDILEM) ; Université Stendhal - Grenoble 3
creator Abdulhay, Authoul
date 2012-11-23T00:00:00
harvest_object_id 86568691-5ffa-4815-8b39-8fe9415c47cb
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-03-31T00:00:00
set_spec type:THESE