Internal Data Imputation in Data Warehouse Dimensions - IRIT - Université Toulouse Jean Jaurès Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Internal Data Imputation in Data Warehouse Dimensions

Yuzhao Yang
Fatma Abdelhedi
Franck Ravat
Olivier Teste

Résumé

Missing values occur commonly in the multidimensional data warehouses. They may generate problems of usefulness of data since the analysis performed on a multidimensional data warehouse is through different dimensions with hierarchies where we can roll up or drill down to the different parameters of analysis. Therefore, it's essential to complete these missing values in order to carry out a better analysis. There are existing data imputation methods which are suitable for numeric data, so they can be applied for fact tables but not for dimension tables. Some other data imputation methods need extra time and effort costs. As consequence, we propose in this article an internal data imputation method for multidimensional data warehouse based on the existing data and considering the intra-dimension and inter-dimension relationships.
Fichier principal
Vignette du fichier
dexa2021.pdf (148.97 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03265060 , version 1 (01-10-2021)

Licence

Paternité

Identifiants

Citer

Yuzhao Yang, Fatma Abdelhedi, Jérôme Darmont, Franck Ravat, Olivier Teste. Internal Data Imputation in Data Warehouse Dimensions. 32nd International Conference on Database and Expert Systems Applications (DEXA 2021), Sep 2021, Linz, Austria. pp.237-244, ⟨10.1007/978-3-030-86472-9_22⟩. ⟨hal-03265060⟩
146 Consultations
62 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More