Approximated Summarization of Data Provenance
Type de document :
Communication dans un congrès avec actes
Titre :
Approximated Summarization of Data Provenance
Auteur(s) :
Eleanor, Ainy [Auteur]
School of Computer Science [TAU-CS]
Bourhis, Pierre [Auteur]
Linking Dynamic Data [LINKS]
Davidson, Susan [Auteur]
Department of Computer and Information Science [Pennsylvania] [CIS]
Deutch, Daniel [Auteur]
School of Computer Science [TAU-CS]
Milo, Tova [Auteur]
School of Computer Science [TAU-CS]
School of Computer Science [TAU-CS]
Bourhis, Pierre [Auteur]

Linking Dynamic Data [LINKS]
Davidson, Susan [Auteur]
Department of Computer and Information Science [Pennsylvania] [CIS]
Deutch, Daniel [Auteur]
School of Computer Science [TAU-CS]
Milo, Tova [Auteur]
School of Computer Science [TAU-CS]
Titre de la manifestation scientifique :
CIKM
Ville :
Melbourn
Pays :
Australie
Date de début de la manifestation scientifique :
2015-10-19
Date de publication :
2015-10-19
Discipline(s) HAL :
Informatique [cs]/Base de données [cs.DB]
Résumé en anglais : [en]
Many modern applications involve collecting large amounts of data from multiple sources, and then aggregating and manipulating it in intricate ways. The complexity of such applications, combined with the size of the collected ...
Lire la suite >Many modern applications involve collecting large amounts of data from multiple sources, and then aggregating and manipulating it in intricate ways. The complexity of such applications, combined with the size of the collected data, makes it difficult to understand how the resulting information was derived. Data provenance has proven helpful in this respect, however, maintaining and presenting the full and exact provenance information may be infeasible due to its size and complexity. We therefore introduce the notion of approximated summarized provenance, which provides a compact representation of the provenance at the possible cost of information loss. Based on this notion, we present a novel provenance summarization algorithm which, based on the semantics of the underlying data and the intended use of provenance, outputs a summary of the input provenance. Experiments measure the conciseness and accuracy of the resulting provenance summaries, and improvement in provenance usage time.Lire moins >
Lire la suite >Many modern applications involve collecting large amounts of data from multiple sources, and then aggregating and manipulating it in intricate ways. The complexity of such applications, combined with the size of the collected data, makes it difficult to understand how the resulting information was derived. Data provenance has proven helpful in this respect, however, maintaining and presenting the full and exact provenance information may be infeasible due to its size and complexity. We therefore introduce the notion of approximated summarized provenance, which provides a compact representation of the provenance at the possible cost of information loss. Based on this notion, we present a novel provenance summarization algorithm which, based on the semantics of the underlying data and the intended use of provenance, outputs a summary of the input provenance. Experiments measure the conciseness and accuracy of the resulting provenance summaries, and improvement in provenance usage time.Lire moins >
Langue :
Anglais
Comité de lecture :
Oui
Audience :
Internationale
Vulgarisation :
Non
Collections :
Source :