Artykuł w czasopiśmie
Brak miniatury
Licencja

ClosedAccessDostęp zamknięty
 

Approximate Decision Tree Induction over Approximately Engineered Data Features

dc.abstract.enWe propose a simple SQL-based decision tree induction algorithm which makes its heuristic choices how to split the data basing on the results of automatically generated analytical queries. We run this algorithm using standard SQL and the approximate SQL engine which works on granulated data summaries. We compare the accuracy of trees obtained in these two modes on the real-world dataset provided to participants of the Suspicious Network Event Recognition competition organized at IEEE BigData 2019. We investigate whether trees induced using approximate SQL queries – although execution of such queries is incomparably faster – may yield poorer accuracy than in the standard scenario. Next, we investigate features – inputs to the decision tree induction algorithm – derived using SQL from a bigger associated data table which was provided in the aforementioned competition too. As before, we run standard and approximate SQL, although again, that latter mode needs to be checked with respect to the accuracy of trees learnt over the data with approximately extracted features.
dc.affiliationUniwersytet Warszawski
dc.conference.countryWęgry
dc.conference.datefinish2019-06-21
dc.conference.datestart2019-06-17
dc.conference.placeDebrecen
dc.conference.seriesInternational Joint Conference on Rough Sets
dc.conference.seriesInternational Joint Conference on Rough Sets
dc.conference.seriesshortcutIJCRS (was RSCTC)
dc.conference.shortcutIJCRS 2019
dc.conference.weblinkhttp://ijcrs.cujae.edu.cu/
dc.contributor.authorŚlęzak, Dominik
dc.contributor.authorChądzyńska-Krasowska, Agnieszka
dc.date.accessioned2024-01-24T16:55:09Z
dc.date.available2024-01-24T16:55:09Z
dc.date.issued2020
dc.description.financePublikacja bezkosztowa
dc.identifier.doi10.1007/978-3-030-52705-1_28
dc.identifier.urihttps://repozytorium.uw.edu.pl//handle/item/101044
dc.identifier.weblinkhttp://link.springer.com/content/pdf/10.1007/978-3-030-52705-1_28
dc.languageeng
dc.pbn.affiliationcomputer and information sciences
dc.relation.pages376-384
dc.rightsClosedAccess
dc.sciencecloudnosend
dc.subject.enSQL-based decision tree induction
dc.subject.enSQL-based feature engineering
dc.subject.enApproximate SQL engines
dc.subject.enGranulated data summarization
dc.subject.enBig data analytics
dc.subject.enCybersecurity analytics
dc.titleApproximate Decision Tree Induction over Approximately Engineered Data Features
dc.typeJournalArticle
dspace.entity.typePublication