An efficient equi-semi-join algorithm for distributed architectures

Semi-joins is the most used technique to optimize the treatment of complex relational queries on distributed architectures. However the overcost related to semi-joins computation can be very high due to data skew and to the high cost of communication in distributed architectures. In this paper we present a parallel equi-semi-join algorithm for shared nothing machines. The performance of this algorithm is analyzed using the BSP cost model and is proved to have asymptotic optimal complexity and perfect load balancing even for highly skewed data. This guarantees unlimited scalability in all situations for this key algorithm.

Data and Resources

Additional Info

Field Value
Source Proceedings of the International Conference on Computational Science (ICCS'2005)
Author Bamha, Mostafa, Hains, Gaetan
Maintainer CCSD
Last Updated May 11, 2026, 11:57 (UTC)
Created May 11, 2026, 11:57 (UTC)
Identifier hal-00081352
Language en
contributor Laboratoire d'Informatique Fondamentale d'Orléans (LIFO) ; Université d'Orléans (UO)-Ecole Nationale Supérieure d'Ingénieurs de Bourges
creator Bamha, Mostafa
date 2005-05-11T00:00:00
harvest_object_id f552a9d2-4f74-43db-8ed8-c8266737b449
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-02-27T00:00:00
set_spec type:COMM