Artykuł w czasopiśmie
Brak miniatury
Licencja
A Metadata Diagnostic Framework for a New Approximate Query Engine Working with Granulated Data Summaries
dc.abstract.en | This paper refers to a new database engine that acquires and utilizes granulated data summaries for the purposes of fast approximate execution of analytical SQL statements. We focus on the task of creation of a relational metadata repository which enables the engine developers and users to investigate the collected data summaries independently from the engine itself. We discuss how the design of the considered repository evolved over time from both conceptual and software engineering perspectives, addressing the challenges of conversion and accessibility of the internal engine contents that can represent hundreds of terabytes of the original data. We show some scenarios of a usage of the obtained metadata repository for both diagnostic and analytical purposes. We pay a particular attention to the relationships of the discussed scenarios with the principles of rough sets – one of the theories that hugely influenced the presented solutions. We also report some empirical results obtained for relatively small fragments (100×216 rows each) of data sets coming from two organizations that use the considered new engine. |
dc.affiliation | Uniwersytet Warszawski |
dc.conference.country | Polska |
dc.conference.datefinish | 2017-07-07 |
dc.conference.datestart | 2017-07-03 |
dc.conference.place | Olsztyn |
dc.conference.series | International Joint Conference on Rough Sets |
dc.conference.series | International Joint Conference on Rough Sets |
dc.conference.seriesshortcut | IJCRS (was RSCTC) |
dc.conference.shortcut | IJCRS 2017 |
dc.contributor.author | Ślęzak, Dominik |
dc.contributor.author | Stawicki, Sebastian |
dc.contributor.author | Chądzyńska-Krasowska, Agnieszka |
dc.date.accessioned | 2024-01-24T17:54:40Z |
dc.date.available | 2024-01-24T17:54:40Z |
dc.date.issued | 2017 |
dc.description.finance | Nie dotyczy |
dc.identifier.doi | 10.1007/978-3-319-60837-2_50 |
dc.identifier.uri | https://repozytorium.uw.edu.pl//handle/item/101659 |
dc.identifier.weblink | https://link.springer.com/chapter/10.1007%2F978-3-319-60837-2_50 |
dc.language | eng |
dc.pbn.affiliation | computer and information sciences |
dc.relation.pages | 623-643 |
dc.rights | ClosedAccess |
dc.sciencecloud | nosend |
dc.subject.en | Big data Approximate query Data granulation Metadata Data visualization Software tools Business analytics |
dc.title | A Metadata Diagnostic Framework for a New Approximate Query Engine Working with Granulated Data Summaries |
dc.type | JournalArticle |
dspace.entity.type | Publication |