Unbiased sampling of network ensembles - Aix-Marseille Université Accéder directement au contenu
Article Dans Une Revue New Journal of Physics Année : 2015

Unbiased sampling of network ensembles

Résumé

Sampling random graphs with given properties is a key step in the analysis of networks, as random ensembles represent basic null models required to identify patterns such as communities and motifs. An important requirement is that the sampling process is unbiased and efficient. The main approaches are microcanonical, i.e. they sample graphs that match the enforced constraints exactly. Unfortunately, when applied to strongly heterogeneous networks (like most real-world examples), the majority of these approaches become biased and/or time-consuming. Moreover, the algorithms defined in the simplest cases, such as binary graphs with given degrees, are not easily generalizable to more complicated ensembles. Here we propose a solution to the problem via the introduction of a 'Maximize and Sample' ('Max & Sam' for short) method to correctly sample ensembles of networks where the constraints are 'soft', i.e. realized as ensemble averages. Our method is based on exact maximum-entropy distributions and is therefore unbiased by construction, even for strongly heterogeneous networks. It is also more computationally efficient than most microcanonical alternatives. Finally, it works for both binary and weighted networks with a variety of constraints, including combined degree-strength sequences and full reciprocity structure, for which no alternative method exists. Our canonical approach can in principle be turned into an unbiased microcanonical one, via a restriction to the relevant subset. Importantly, the analysis of the fluctuations of the constraints suggests that the microcanonical and canonical versions of all the ensembles considered here are not equivalent. We show various real-world applications and provide a code implementing all our algorithms.
Fichier principal
Vignette du fichier
pdf.pdf (2.09 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01219776 , version 1 (23-10-2015)

Identifiants

Citer

Tiziano Squartini, Rossana Mastrandrea, Diego Garlaschelli. Unbiased sampling of network ensembles. New Journal of Physics, 2015, 17, pp.023052. ⟨10.1088/1367-2630/17/2/023052⟩. ⟨hal-01219776⟩
243 Consultations
664 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More