Using Subgroup Discovery Metrics to Mine Interesting Subgraphs

Recommend Documents

tional Model Mining for knowledge discovery in databases, where the task is to locate interesting ..... a single form by applying proper statistical models. ... larger number of parameters are commonly seen and differing from each other. There- .....

Using Unsupervised Link Discovery Methods to Find Interesting Facts ...

connections in the data? To do this we developed a set of unsupervised link discovery methods ..... an email address that belongs to that organization. A person.

Relational Subgroup Discovery for Descriptive

Descriptive Analysis of Microarray Data ... biology accounting for genes with correlated experimental patterns. .... 4 http://labe.felk.cvut.cz/Ëzelezny/rsd/rsd.pdf ...

Classification Rule Learning Using Subgroup Discovery of ... - ECE UNM

Cross-Domain Attributes Responsible for Design-Silicon Mismatch. Nicholas Callegari1 .... combination with subgroup discovery, to find rules describing sub-.

Semantic Subgroup Discovery: Using Ontologies in Microarray ... - IJS

capabilities, technologies that help analyzing and extracting useful information from ... ology, Vecna pot 111, Ljubljana, Slovenia, {helena.motaln, marko.petek ...

Novel Subgroup Discovery Over Scientific Datasets Using Bitmap

which identifies interesting subgroups with respect to a property of interest. While there is .... numeric attributes in array data, we discuss the quality function we choose and subgroup .... group discovery algorithms operate on, one key difference

Discovery of Interesting Regions in Spatial Data Sets Using

The goal of spatial data mining is to automate the extraction of interesting and ... This paper centers on discovering interesting regions in spatial datasets; in.

CRYPTOSYSTEMS USING SUBGROUP DISTORTION

Oct 24, 2016 - CRYPTOSYSTEMS USING SUBGROUP DISTORTION. INDIRA CHATTERJI, DELARAM KAHROBAEI, AND NI YEN LU. Abstract. In this paper ...

Evolutionary algorithms for subgroup discovery ... - Semantic Scholar

C.J. Carmona, P. GonzÃ¡lez, M.J. del Jesus. Department of Computer Science. University of Jaen. Jaen, Spain. {ccarmona, pglez, mjjesus}@ujaen.es. C. Romero ...

Subgroup Discovery Techniques and Applications - Semantic Scholar

Nova Gorica Polytechnic, Vipavska 13, 5000 Nova Gorica, Slovenia. Abstract. This paper presents the advances in subgroup discovery and the ways to use ...

Multiobjective Evolutionary Induction of Subgroup Discovery Fuzzy ...

Discovery Fuzzy Rules: A Case Study in Marketing. Francisco Berlanga1 ... Keywords: Data mining, descriptive induction, multiobjective evolutionary algorithms ...

Multiobjective Genetic Algorithm for Extracting Subgroup Discovery ...

Jul 7, 2010 - Spanish Ministry of Science and Technology and by the European Fund. ... M.J. Del Jesus is with the Department of Computer Science, ...

Multiobjective Evolutionary Induction of Subgroup Discovery Fuzzy

The multiobjective algorithm proposed in this paper defines three objectives. ... discover interesting properties of subgroups obtaining simple rules (i.e. with an .... to P't+1. As the size of P't+1 must be exactly the number of individuals to store

Subgroup Discovery Meets Bayesian NetworksâAn Exceptional ...

To illustrate the use of interdependencies in a Bayesian network as a relatively complex target concept, we consider the research performed by Robert T. Paine ...

Propositionalization-based relational subgroup discovery with RSD

Feb 24, 2003 - implementation of relational subgroup discovery algorithm RSD, ...... http://www.cs.bris.ac.uk/home/rawles/sinus/. ..... Slovenian-Czech bilateral project Knowledge Management in Medicine and Health Care (2004â2005).

Subgroup Discovery for Test Selection - People

Abstract. We propose a new approach to test selection based on the discovery of subgroups of patients sharing the same optimal test, and present its application ...

An Efficient Algorithm to Automated Discovery of Interesting Positive

Interesting Positive and Negative Association Rules. Ahmed Abdul-Wahab Al- ... correlation of data gives meaning full extraction process. For the discovering ...

interesting spatio-temporal region discovery ... - ISPRS Annals

Jul 15, 2015 - Discovery of interesting paths and regions in spatio-temporal data sets is ..... Land Cover Facility, University of Maryland, College Park,.

Bisociative Discovery of Interesting Relations Between Domains

operations. Thereby it additionally enables the search for interesting patterns on ... As a preliminary, we assume that the available knowledge is integrated into a.

Subgroup Identification Using the personalized

Sep 21, 2018 - inverse weighting, individualized treatment rules, precision medicine. 1. ...... âPerformance Guarantees for Individualized Treatment Rules.â.

Robust Distributed Estimation Using the Embedded Subgraphs ...

communication capabilities to solve the linear system (2) for a loopy GM in a distributed, energy-efficient way. If the un- derlying GM is loop-free, then the sensor ...

Interactive and Iterative Discovery of Entity Network Subgraphs

Aug 12, 2016 - In addition, on the hard drive of Shamzi's laptop computer was a record of a US and British passports that Shamzi had ... tive data exploration and knowledge discovery over graphs. 4. Using the results and a use .... used to recover co

Discovery of Top-k Dense Subgraphs in Dynamic Graph Collections

First, we remove the subgraphs of Gold from the answer set. Assume that k ..... the stream-based algorithm, the advanced one manages to keep storage require-.

Using Metrics to Improve Sales Results - RainToday

Mar 27, 2011 - Using Metrics to Improve Sales Results - RainToday. Page 1 of 2 ... Number of days with sales outstanding

Using Subgroup Discovery Metrics to Mine Interesting Subgraphs

Download PDF

0 downloads 0 Views 604KB Size Report

Comment

Russ Neely. Department of Computer Science. Tennessee Technological University. Cookeville, TN USA [email protected]. Zach Cleghern.

Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference

Using Subgroup Discovery Metrics to Mine Interesting Subgraphs Douglas A. Talbert

Russ Neely

Zach Cleghern

Department of Computer Science Tennessee Technological University Cookeville, TN USA [email protected]

Department of Computer Science Tennessee Technological University Cookeville, TN USA [email protected]

Department of Computer Science Tennessee Technological University Cookeville, TN USA [email protected]

terestingness metrics with an existing subgraph discovery algorithm, SUBDUE (Cook, Holder 2000). The next section presents relevant background information about subgroup discovery, subgraph mining, and SUBDUE. Then, we describe our experimental methodology, including the data used in our analysis along with our algorithm modifications. After that, we present our results followed by conclusions and ideas for future work.

Abstract While extensive work has been done in both graph mining and subgroup discovery, the potential benefits of combining the two fields have not been well studied. We propose, implement, and evaluate an adaption of an existing subgroup discovery algorithm to mine graph data. Our experiments use two different metrics from the subgroup discovery literature to demonstrate value in using such metrics to guide subgraph discovery and to build a foundation to support further studies combining subgroup discovery and graph mining.

Background Subgroup Discovery

Introduction

Given a population of objects and a property of those objects in which we are interested, subgroup discovery seeks to “discover the subgroups of the population that are statistically ‘most interesting,’ i.e. are as large as possible and have the most unusual statistical (distributional) characteristics with respect to the property of interest.” (Wrobel 2001). The following rule illustrates a traditional subgroup definition:

The structural relationships revealed in graph representations of data enables discovery of knowledge that may not be as easily mined from other data representations. As part of our long-term goal of using knowledge discovery to improve healthcare delivery, we seek to apply graph mining to healthcare utilization data. Specifically, we want to discover graph-based patterns that are associated with high-quality/low-cost care. Subgraph discovery, however, typically searches for dominant (e.g., most frequent or largest) patterns. We desired a richer palette of heuristics to guide subgraph identification. The subgroup discovery literature presents numerous heuristics for discovering interesting patterns in data (Herrara, et al 2011). This paper seeks to demonstrate the utility of applying heuristics from subgroup discovery to subgraph mining by adapting an existing subgroup discovery algorithm to search for subgraphs instead of subgroups. We do not claim that this is the best approach, but instead, we use it to illustrate value in using ideas from subgroup discovery to guide the identification of subgraphs. Subgroup discovery seeks interesting subgroups. Numerous metrics have been used to quantify the interestingness of discovered subgroups. This exploratory study compares subgraph discovery using two specific subgroup in-

(Atti=Vali,j) ∧(Attk