Efficient algorithms for mining frequent weighted itemsets from weighted items databases Le B., Nguyen H., Vo B. Faculty of Information Technology, University of Science, Ho Chi Minh, Viet Nam; Faculty of Information Technology, Saigon University, Ho Chi Minh, Viet Nam; Faculty of Information Technology, Ho Chi Minh City University of Technology, Viet Nam Abstract: In this paper, we propose algorithms for mining Frequent Weighted Itemsets (FWIs) from weighted items transaction databases. Firstly, we introduce the WIT-tree data structure for mining high utility itemsets in the work of Le et al. (2009) and modify it for mining FWIs. Next, some theorems are proposed. Based on these theorems and the WIT-tree, we propose an algorithm for mining FWIs. Finally, Diffset for fast computing the weighted support of itemsets and saving memory are also discussed. We test the proposed algorithms in many databases and experimental results show that they are very efficient in comparison with Apriori-based approach. ©2010 IEEE. Author Keywords: Frequent weighted itemsets; Frequent weighted support; Weighted items transaction databases; WIT-FWIs; WIT-tree Index Keywords: Frequent weighted support; Item sets; Weighted items transaction databases; WIT-FWIs; WIT-tree; Algorithms; Data structures; Database systems; Innovation; Trees (mathematics) Year: 2010 Source title: 2010 IEEE-RIVF International Conference on Computing and Communication Technologies: Research, Innovation and Vision for the Future, RIVF 2010 Art. No.: 5632814 Link: Scorpus Link Correspondence Address: Le, B.; Faculty of Information Technology, University of Science, Ho Chi Minh, Viet Nam; email:
[email protected] Conference name: 2010 8th IEEE-RIVF International Conference on Computing and Communication Technologies: Research, Innovation and Vision for the Future, RIVF 2010 Conference date: 1 November 2010 through 4 November 2010 Conference location: Hanoi Conference code: 83982 ISBN: 9.78142E+12 DOI: 10.1109/RIVF.2010.5632814 Language of Original Document: English Abbreviated Source Title: 2010 IEEE-RIVF International Conference on Computing and Communication Technologies: Research, Innovation and Vision for the Future, RIVF 2010 Document Type: Conference Paper Source: Scopus Authors with affiliations:
1. Le, B., Faculty of Information Technology, University of Science, Ho Chi Minh, Viet Nam 2. Nguyen, H., Faculty of Information Technology, Saigon University, Ho Chi Minh, Viet Nam 3. Vo, B., Faculty of Information Technology, Ho Chi Minh City University of Technology, Viet Nam
References: 1.
Agrawal, R., Imielinski, T., Swami, A., Mining association rules between sets of items in large databases (1993) Proceedings of the 1993 ACM SIGMOD Conference Washington DC, USA, May 1993, pp. 207-216
2.
Agrawal, R., Srikant, R., Fast algorithms for mining association rules (1994) VLDB'94 (1994), pp. 487-499
3.
Cai, C.H., Fu, A.W., Cheng, C.H., Kwong, W.W., Mining Association Rules with Weighted Items (1998) Proceedings of International Database Engineering and Applications Symposium (IDEAS 98), pp. 68-77
4.
Ganter, B., Wille, R., (1999) Formal Concept Analysis, , In Springer-Verlag
5.
Khan, M.S., Muyeba, M., Coenen, F., Fuzzy Weighted Association Rule Mining with Weighted Support and Confidence Framework (2008) Proc. 1st Int Workshop on Algorithms for Large-Scale Information Processing in Knowledge Discovery (ALSIP 2008), Held in Conjunction with PAKDD 2008 (Japan), pp. 52-64
6.
Le, B., Nguyen, H., Cao, T.A., Vo, B., A Novel Algorithm for Mining High Utility Itemsets (2009) Asian Conference on Intelligent Information and Database Systems, pp. 13-16. , the first published by IEEE
7.
Ramkumar, G.D., Ranka, S., Tsur, S., Weighted Association Rules: Model and Algorithm (1998) SIGKDD, pp. 661-666
8.
Tao, F., Murtagh, F., Farid, M., Weighted Association Rule Mining using Weighted Support and Significance Framework (2003) SIGKDD, pp. 661-666
9.
Wang, W., Yang, J., Yu, P.S., Efficient Mining of Weighted Association Rules (2003) SIGKDD 2003, pp. 270-274
10. Zaki, M.J., Hsiao, C.J., Efficient Algorithms for Mining Closed Itemsets and Their Lattice Structure (2005) IEEE Transactions on Knowledge and Data Engineering, 17 (4), pp. 462-478. , April 2005 11. Zaki, M.J., (2004) Mining Non-Redundant Association Rules, Data Mining and Knowledge Discovery, pp. 223-248. , Kluwer Academic Publishers. Manufactured in the Netherlands 12. Zaki, M.J., Gouda, K., Fast Vertical Mining Using Diffsets (2003) Proc. of Ninth ACM SIGKDD Int'l Conf., pp. 326-335