Artykuł w czasopiśmie
Brak miniatury
Licencja

ClosedAccessDostęp zamknięty

Efficient method for updating class association rules in dynamic datasets with record deletion

Autor
NGUYEN, NGOC THANH
Nguyen, Hung Son
Nguyen, Loan T.
Vo, Bay
Data publikacji
2018
Abstrakt (EN)

Association rule mining is an important topic in data mining. The problem is to discover all (or almost all) associations among items in the transaction database that satisfy some user-specified constraints. Usually, the constraints are related to minimal support and minimal confidence. Class association rules (CARs) are a special type of association rules that can be applied for classification problem. Previous research showed that classification based on association rules has higher accuracy than can be achieved with an inductive learning algorithm or C4.5. As such, many methods have been proposed for mining CARs, although these use batch processing. However, datasets are often changed, with records added or/and deleted, and consequently updating CARs is a challenging problem. This paper proposes an efficient method for updating CARs when records are deleted. First, we use an MECR-tree to store nodes for the original dataset. The information in the nodes of this tree are updated based on the deleted records. Second, the concept of pre-large itemsets is used to avoid rescanning the original dataset. Finally, we propose an algorithm to efficiently update and generate CARs. We also analyze the time complexity to show the efficiency of our proposed algorithm. The experimental results show that the proposed method outperforms mining CARs from the dataset after record deletion.

Słowa kluczowe PL
eksploracja danych
Dyscyplina PBN
informatyka
Czasopismo
Applied Intelligence
Tom
48
Zeszyt
6
Strony od-do
1491–1505
ISSN
0924-669X
Licencja otwartego dostępu
Dostęp zamknięty