Multiple-View Constrained Clustering For Unsupervised Face Identification In TV-Broadcast - Aix-Marseille Université
Conference paper, Year: 2014

Multiple-View Constrained Clustering For Unsupervised Face Identification In TV-Broadcast

Abstract

Our goal is to automatically identify faces in TV broadcasts without a predefined dictionary of identities. Most methods rely on identity detection (from OCR and ASR) and require a propagation strategy based on visual clustering. In TV content, people appear under many variations, which makes face clustering difficult. In this case, speaker clustering can serve as a reliable link for face clustering. Multi-modal clustering methods assume a bipartite mapping between modalities. In this paper, we propose to automatically build an incomplete speaker-face mapping based on local evidence from OCR and lip-activity links. We then propose schemes for propagating speaker constraints to the face constrained-clustering problem. Experiments on the REPERE corpus show that propagating names to face clusters improves face identification (+3.7% F-measure compared to the baseline).
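The idea of propagating speaker constraints to face clustering can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the propagation schemes, so the rule used here (same speaker cluster gives a must-link, different speaker clusters give a cannot-link), the average-linkage merging, the function names, the toy distances and the threshold are all assumptions made for illustration.

```python
from itertools import combinations


def constraints_from_speakers(face_to_speaker):
    """Derive pairwise constraints from a partial face-track -> speaker-cluster map.

    Assumed rule for this sketch: two face tracks linked to the same speaker
    cluster must be merged; two linked to different speaker clusters must not.
    Face tracks absent from the map (the mapping is incomplete) stay free.
    """
    must, cannot = set(), set()
    for a, b in combinations(sorted(face_to_speaker), 2):
        if face_to_speaker[a] == face_to_speaker[b]:
            must.add((a, b))
        else:
            cannot.add((a, b))
    return must, cannot


def constrained_clustering(n, dist, must, cannot, threshold):
    """Average-linkage agglomerative clustering under pairwise constraints."""
    clusters = [{i} for i in range(n)]

    def violates(c1, c2):
        # True if merging c1 and c2 would join a cannot-link pair.
        return any((min(a, b), max(a, b)) in cannot for a in c1 for b in c2)

    def linkage(c1, c2):
        return sum(dist[a][b] for a in c1 for b in c2) / (len(c1) * len(c2))

    # Step 1: apply must-link constraints regardless of visual distance.
    for a, b in sorted(must):
        ca = next(c for c in clusters if a in c)
        cb = next(c for c in clusters if b in c)
        if ca is not cb:
            clusters.remove(cb)
            ca |= cb

    # Step 2: greedily merge the closest admissible pair until the threshold.
    while True:
        candidates = [
            (linkage(c1, c2), i, j)
            for (i, c1), (j, c2) in combinations(enumerate(clusters), 2)
            if not violates(c1, c2)
        ]
        if not candidates:
            break
        d, i, j = min(candidates)
        if d > threshold:
            break
        clusters[i] |= clusters[j]
        del clusters[j]
    return sorted(sorted(c) for c in clusters)


# Toy data (hypothetical): 5 face tracks with pairwise visual distances.
pairs = {
    (0, 1): 0.2, (0, 2): 0.3, (0, 3): 0.9, (0, 4): 0.8, (1, 2): 0.7,
    (1, 3): 0.8, (1, 4): 0.25, (2, 3): 0.5, (2, 4): 0.9, (3, 4): 0.9,
}
n = 5
dist = [[0.0] * n for _ in range(n)]
for (a, b), d in pairs.items():
    dist[a][b] = dist[b][a] = d

# Incomplete speaker-face mapping: tracks 2 and 4 have no speaker link.
face_to_speaker = {0: "spk_A", 3: "spk_A", 1: "spk_B"}
must, cannot = constraints_from_speakers(face_to_speaker)
result = constrained_clustering(n, dist, must, cannot, threshold=0.5)
print(result)  # [[0, 2, 3], [1, 4]]
```

In this toy example, tracks 0 and 3 are visually far apart but end up together because they share a speaker cluster, while tracks 0 and 1 are visually close but stay separate because they are linked to different speakers.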

Dates and versions

hal-01194240, version 1 (05-09-2015)

Cite

Meriem Bendris, Delphine Charlet, Benoit Favre, Géraldine Damnati, Rémi Auguste. Multiple-View Constrained Clustering For Unsupervised Face Identification In TV-Broadcast. ICASSP2014 - Image, Video, and Multidimensional Signal Processing (ICASSP2014 - IVMSP), May 2014, Florence, Italy. pp. 494-498, ⟨10.1109/ICASSP.2014.6853645⟩. ⟨hal-01194240⟩