Constitution of a semantic resource for the Arabic language from multilingual aligned corpora

This study aims at the implementation and evaluation of techniques for extracting semantic relations from a multilingual aligned corpus. Firstly, our observations will focus on the semantic comparison of translational equivalents in multilingual aligned corpus. From these equivalences, we will try to extract "cliques", which ara maximum complete related sub-graphs, where all units are interrelated because of a probable semantic intersection. These cliques have the advantage of giving information on both the synonymy and polysemy of units, and providing a form of semantic disambiguation. Secondly, we attempt to link these cliques with a semantic lexicon (like WordNet) in order to assess the possibility of recovering, for the Arabic units, a semantic relationships already defined for English, French or Spanish units. These relations would automatically build a semantic resource which would be useful for different applications of NLP, such as Question Answering systems, machine translation, alignment systems, Information Retrieval…etc.

Data and Resources

Constitution of a semantic resource for the...HTML
Explore
- More information
- Go to resource

Additional Info

Field	Value
Source	https://theses.hal.science/tel-00836764
Author	Abdulhay, Authoul
Maintainer	CCSD
Last Updated	May 10, 2026, 15:16 (UTC)
Created	May 10, 2026, 15:16 (UTC)
Identifier	NNT: 2012GRENL003
Language	fr
Rights	https://about.hal.science/hal-authorisation-v1/
contributor	LInguistique et DIdactique des Langues Étrangères et Maternelles (LIDILEM) ; Université Stendhal - Grenoble 3
creator	Abdulhay, Authoul
date	2012-11-23T00:00:00
harvest_object_id	86568691-5ffa-4815-8b39-8fe9415c47cb
harvest_source_id	3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title	test moissonnage SELUNE
metadata_modified	2026-03-31T00:00:00
set_spec	type:THESE