Visual multi-target tracking by partitioning detections: application to the construction of face albums

This report describes my thesis work, conducted within the ComSee (Computers That See) team of the ISPR axis (ImageS, Perception Systems and Robotics) of Institut Pascal. It was financed by the Vesalis company through a CIFRE (Research Training in Industry Convention) agreement with Institut Pascal, with public funding from ANRT (National Association of Research and Technology). The thesis was motivated by the problem of automating the video analysis carried out during police investigations. The theoretical research is applied to the automatic creation of a photo album summarizing the people who appear in a CCTV sequence. Starting from the output of a face detector, the aim is to group by identity all the faces detected throughout the video sequence. Because facial recognition remains unreliable in unconstrained environments, we focused instead on global multi-target tracking based on detections. This type of tracking is relatively recent. It relies on an object detector and on global processing of the video, as opposed to the sequential processing commonly used. The problem is formulated as a Maximum A Posteriori (MAP) probabilistic model. To find an optimal solution of the MAP formulation, we use a graph-based network flow approach built upon third-party research. The study concentrates on defining the similarities between detections, which correspond to the likelihood term of the model. Multiple similarity metrics based on different cues (time, position in the image, appearance and local movement) were tested. An original method for estimating these similarities was developed to merge these cues and adapt to the situation encountered. Several experiments were carried out on challenging real-world situations of the kind found in CCTV footage. Although the quality of the generated albums is not yet sufficient for practical use, the detection clustering system developed in this thesis provides a good initial solution.
Because this thesis treats tracking as a data clustering problem, the proposed detection-based multi-target tracking can be transferred easily to other tracking domains.

Additional Info

Field Value
Source https://theses.hal.science/tel-00919425
Author Schwab, Siméon
Maintainer CCSD
Last Updated May 7, 2026, 19:02 (UTC)
Created May 7, 2026, 19:02 (UTC)
Identifier NNT: 2013CLF22366
Language fr
Rights https://about.hal.science/hal-authorisation-v1/
contributor Laboratoire des Adaptations Métaboliques à l'Exercice en Conditions Physiologiques et Pathologiques (AME2P) ; Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-UFR Sciences et Techniques des Activités Physiques et Sportives - Clermont-Ferrand (UFR STAPS - UBP) ; Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-Université Blaise Pascal - Clermont-Ferrand 2 (UBP)
creator Schwab, Siméon
date 2013-07-08T00:00:00
harvest_object_id b43780a0-911d-4c59-b2f9-1592a6070be9
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2026-03-31T00:00:00
set_spec type:THESE