An Efficient GA-Based Algorithm for Mining ... - Semantic Scholar
Recommend Documents
are gathered automatically by Web servers and collected in access log files. ..... managed. This is performed by navigating through the tree in a downward or ...
are gathered automatically by Web servers and collected in access log files. .... best of our knowledge, current Web mining systems do not support such capabili-.
disk-resident datasets and whose I/O cost corresponds to the cost of sequentially ... outliers in very large datasets. ACM Trans. Knowl. Discov. Data. 3, 1, Article 4 ...
This paper presents efficient methods for automatic detection and ... sels and optic disc (OD), both of which play important role for ..... On a Pentium-4 PC.
Dec 26, 2008 - pattern-tree (FP-Growth) structure is proposed in [9]. The FP-Growth was used ... In [9], Han, Pei et al. proprosed a data structure called FP-tree ...
Index TermsâMining Association Rules, Large Itemsets. I. INTRODUCTION. Number of data mining algorithms have been recently developed that greatly ...
... in Large Databases. Dr.Hussam Al-Shorman, Dr.Yosef Hasan Jbara ..... Zaki Computer Science Department, Rensselaer Polytechnic. Institute, Troy NY 12180 ...
algorithms for incremental mining are being investigated. Their primary aim is to avoid or minimize scans of the older database by using the intermediate data ...
*GICTS College of Professional Education Gwalior. Abstract: Frequent pattern mining is a heavily researched area in the field of data mining with wide range of.
hash-based algorithm for the candidate set generation. Explicitly, the number .... one pass over the database of all transactions for each iteration. Note that as k ...
other methods such as SPADE(Sequential PAttern. Discovery using Equivalence classes)(Zaki 2001) and. SPAM(Sequential PAttern Mining)(Jay et al. 2002).
Yang Feng, Haitao Mi, Yang Liu and Qun Liu. Key Laboratory of Intelligent Information Processing. Institute of Computing Technology. Chinese Academy of ...
Jul 1, 2014 - Email addresses: [email protected] (Md. Hanif Seddiqui), ...... iMap system [5] addresses block matching and matches between relational ...
Dougu Nam · Seunghyun Seo · Sangsoo Kim ... network search algorithms have been developed for a noise-free case and for a .... Noise-free Boolean networks.
Dougu Nam · Seunghyun Seo · Sangsoo Kim. Received: 8 March 2004 / Revised: 1 September 2005 / Accepted: 24 April 2006 / Published online: 30 June. 2006. Springer .... Boolean networks for noisy data and the best fit extension problem.
loop-crossing, irregularly shaped, and often broken by the myocardium, it is ... developing a catheter simulator using a 3D heart model to improve doctors' skills.
requires a large number of members in the initial population. It is difficult to apply the original mGA to a highly dimensional problem. A fast messy GA (Goldberg.
IEEE TRANSACTIONS ON COMPUTERS, VOL. ... degree of parallelism than the QR-algorithm. ... (Cooordinate Rotation DIgital Computer), well suited for.
In addition, a t à t matrix N (called a TLC matrix) is maintained, where t is the number of edges that do .... is O(dvb), where dv represents the outdegree of v.
The problem of scheduling UET jobs with precedence constraints has ... for the minimization of total completion time for jobs with release dates; this is a major ...
Optimization algorithm (RPO), is defined to assign specific subcarriers to ..... 1150 â 1158. [6]. Myeon-gyun Cho, Woohyun Seo, Youngsoo Kim, Daesik Hong. "A.
Apr 9, 2013 - means state Xt is in an attractor, which can be located as much as j ..... Thomas R (1991) Regulatory networks seen as asynchronous automata: ... Heidel J, Maloney J, Farrow C, Rogers J (2003) Finding cycles in synchronous.
Aug 7, 2006 - may be obtained by representing subspaces as single exterior products [J.C. Alexander, R. Sachs, Linear instability of solitary waves of a.
finding multiple roots of polynomials. ... Finite-Difference Frequency-Domain methods (FDFD) require solution of large .... and eo is the free-space permittivity.
An Efficient GA-Based Algorithm for Mining ... - Semantic Scholar
as SPADE (Sequential PAttern Discovery using Equivalence classes)[12] and SPAM. (Sequential PAttern Mining)[4], are also widely used in researches.
An Efficient GA-Based Algorithm for Mining Negative Sequential Patterns Zhigang Zheng1, Yanchang Zhao1,2 , Ziye Zuo1 , and Longbing Cao1 1
Data Sciences & Knowledge Discovery Research Lab Centre for Quantum Computation and Intelligent Systems Faculty of Engineering & IT, University of Technology, Sydney, Australia {zgzheng,zzuo,lbcao}@it.uts.edu.au 2 Centrelink, Australia [email protected]
Abstract. Negative sequential pattern mining has attracted increasing concerns in recent data mining research because it considers negative relationships between itemsets, which are ignored by positive sequential pattern mining. However, the search space for mining negative patterns is much bigger than that for positive ones. When the support threshold is low, in particular, there will be huge amounts of negative candidates. This paper proposes a Genetic Algorithm (GA) based algorithm to find negative sequential patterns with novel crossover and mutation operations, which are efficient at passing good genes on to next generations without generating candidates. An effective dynamic fitness function and a pruning method are also provided to improve performance. The results of extensive experiments show that the proposed method can find negative patterns efficiently and has remarkable performance compared with some other algorithms of negative pattern mining. Keywords: Negative Sequential Pattern, Genetic Algorithm, Sequence Mining, Data Mining.
1 Introduction The concept of discovering sequential patterns was firstly introduced in 1995 [1], and aimed at discovering frequent subsequences as patterns in a sequence database, given a user-specified minimum support threshold. Some popular algorithms in sequential pattern mining include AprioriAll [1], Generalized Sequential Patterns (GSP) [10] and PrefixSpan [8]. GSP and AprioriAll are both Apriori-like methods based on breadthfirst search, while PrefixSpan is based on depth-first search. Some other methods, such as SPADE (Sequential PAttern Discovery using Equivalence classes)[12] and SPAM (Sequential PAttern Mining)[4], are also widely used in researches. In contrast to traditional positive sequential patterns, negative sequential patterns focus on negative relationships between itemsets, in which, absent items are taken into consideration. We give a simple example to illustrate the difference: suppose p1 = is a positive sequential pattern; p2 = is a negative sequential pattern; and each item, a, b, c, d and e, stands for a claim item code in the customer claim database M.J. Zaki et al. (Eds.): PAKDD 2010, Part I, LNAI 6118, pp. 262–273, 2010. c Springer-Verlag Berlin Heidelberg 2010