Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris

Kévin Réby; Anaïs Guillem; Livio De Luca

Conference Papers Year : 2023

Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris

(1) , (1) , (1)

Kévin Réby

Function : Author
PersonId : 1304946
IdHAL : kevin-reby
ORCID : 0009-0000-6823-280X

Modèles et simulations pour l'Architecture et le Patrimoine

Anaïs Guillem

Function : Author
PersonId : 1259682
IdHAL : anais-guillem
ORCID : 0000-0002-1473-7594

Modèles et simulations pour l'Architecture et le Patrimoine

Livio De Luca

Function : Author
PersonId : 1164075
IdHAL : livio-de-luca
ORCID : 0000-0003-0656-3165
IdRef : 115945512

Modèles et simulations pour l'Architecture et le Patrimoine

Abstract

The zero-shot performance of foundation models has captured a lot of attention. Specifically, the Segment Anything Model (SAM) has gained popularity in computer vision due to its label-free segmentation capabilities. Our study proposes using SAM on cultural heritage data, specifically images of Notre-Dame de Paris, with a controlled vocabulary. SAM can successfully identify objects within the cathedral. To further improve segmentation, we utilized Grounding DINO to detect objects and CLIP to automatically add labels from the segmentation masks generated by SAM. Our study demonstrates the usefulness of foundation models for zero-shot semantic segmentation of cultural heritage data.

Domains

Computer Vision and Pattern Recognition [cs.CV] Machine Learning [stat.ML]

Fichier principal

ICCV workshop paper.pdf (1.38 Mo)

Origin : Publisher files allowed on an open archive

Ariane Néroulidis : Connect in order to contact the contributor

https://hal.science/hal-04275484

Submitted on : Wednesday, November 8, 2023-3:20:42 PM

Last modification on : Friday, April 5, 2024-10:25:46 AM

Dates and versions

hal-04275484 , version 1 (08-11-2023)

Identifiers

HAL Id : hal-04275484 , version 1

Cite

Kévin Réby, Anaïs Guillem, Livio De Luca. Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris. 4th ICCV Workshop on Electronic Cultural Heritage, Computer Vision Foundation, Oct 2023, Paris, France. https://openaccess.thecvf.com/content/ICCV2023W/e-Heritage/html/Reby_Semantic_Segmentation_Using_Foundation_Models_for_Cultural_Heritage_an_Experimental_ICCVW_2023_paper.html. ⟨hal-04275484⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS MAP CHANTIER-SCIENTIFIQUE-NDP UPR-MAP

53 View

30 Download

Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris

Abstract

Domains

Dates and versions

Identifiers

Cite

Relations

Export

Collections

Share