Parallel Knowledge Discovery Using Domain ... - CiteSeerX

Recommend Documents

Using Domain Knowledge in Knowledge Discovery - Semantic Scholar

lect relevant domain knowledge without an exhaustive search of all ... We believe the availability of rel- .... where Ai, 15 i

Fragment-based drug discovery using a multi-domain, parallel MD ...

Fragment-based drug discovery; Molecular dynamics; MM/PBSA; N5-CAIR .... The python script, MMPBSA.py, included in AMBER11 was used to perform the ...

knowledge discovery using the knowminer framework - CiteSeerX

semi-automatic identification of previously unknown, potential new knowledge ... framework must be well documented, easy usable for application developers and has ..... Conference on Computer Systems 2006, ACM, New York, NY, USA, pp.

Automatic Keyword Extraction Using Domain Knowledge - CiteSeerX

cally organised domain speci c thesaurus as a second knowledge source ... lection automatically extract a list of potential keywords. ..... Name of document a12.

Using Domain Knowledge on Population Dynamics ... - CiteSeerX

or a very limited portion of the available domain background knowledge about ... Equation discovery systems search through the space of possible equation struc- .... finally, we use the predicate feeds on(domain name, predator population,.

Using Domain Knowledge on Population Dynamics ... - CiteSeerX

in domains where a limited amount of knowledge is expressed in the form of ... the domain of interest within the process of automated modeling of population.

Domain-Driven, Actionable Knowledge Discovery - ISI - VAST

that targets automated hidden knowledge discovery. Re- .... Process. Data mining is an automated process ... methods. In CRM applications, we can build.

Software Expert Discovery via Knowledge Domain

Oct 26, 2018 - Chaoran Huanga,ââ, Lina Yaoa, Xianzhi Wangb, Boualem Benatallaha, Xiang Zhanga. aUNSW Sydney, NSW ... 2Data used in this work are from âStack Overflow public dumpâ at ... Our method is test and evaluated with a relative large s

Terrorism Knowledge Discovery Project: A Knowledge ... - CiteSeerX

ism specialist at the Central Intelligence Agency, used bibliometric, citation and ..... in a Multilingual World: An Experiment on the Chinese Business Intelligence.

Discovery of protein interactions using parallel ...

Dec 12, 2013 - Cancer Institute (Boston, Massachusetts). This work was supported in part by a grant from the Leona M. and Harry B. Helmsley Charitable Trust.

Pattern Discovery in Melanoma Domain Using ...

[16] D. Randall Wilson and Tony R. Martinez. Improved heterogeneous distance functions. Journal of. Artificial Intelligence Research (JAIR), 1:1â34, 1997.

Using Rough Sets as Tools for Knowledge Discovery - CiteSeerX

Ford Escort. EFI. Medium. 876. 6. High yes high auto medium. Dodge Shadow. EFI. Medium. 1100. 6. High no medium menu medium. Ford Festive. EF! Medium.

Using a Knowledge Cache for Interactive Discovery of ... - CiteSeerX

all. The cache is organized in terms of buckets where the kth bucket stores only k-itemsets. Thus during pass k of Apriori, .... Find the bucket with the greatest value of Brepl(j) if inserting .... algorithms the hit time increases with cache size.

Using domain-specific and generic knowledge to support ... - CiteSeerX

4820&volume=14&issue=1&spage=79. Using just-in-time information to support discovery learning about geometrical optics in a computer-based simulation.

Using Domain Knowledge for Text Summarization in ... - CiteSeerX

Index Termsâ Text summarization, Domain-specific features, Novel medical ..... But person names, organization names, which are normally not available in.

Domain Knowledge Integration in Data Mining using ... - CiteSeerX

Keywords: domain knowledge, data mining, churn, decision tables. ... regression to predict churn, using the result from the best performing model in the ...

Using domain knowledge to select solutions in abductive ... - CiteSeerX

This paper presents a novel extension to abductive reasoning in causal nets, namely the use of domain knowledge to select among alternative diagnoses.

Using Knowledge Discovery Technology in ... - Semantic Scholar

software organizations. In this ... Keywords: Experience Management, Case-Based Reasoning, Software ...... [Tautz 2000] Tautz, Carsten: Customizing Software.

Kid website assessment using knowledge discovery - International ...

kids, families need to verify whether specific kid website is an appropriate ... faster and easier, there is no restriction on type and amount of information that put in it. .... w3c recommends many technologies that build ontology graphs and make.

Knowledge Discovery Using Bayesian Network Framework for ...

put enormous pressure on contemporary Network Management Systems .... Bayesian reasoning based software agents have been used to achieve intelligent .... number in the range (0-252) with 0 being the lowest priority (best effort service).

Knowledge Discovery in Learning Management System Using ...

Sep 23, 2016 - 1Department of Computer Science, United Institute of Technology, ... Recent developments in database technology have seen a wide variety of data being ... data. Therefore the open challenge in big data is big data analysis ...

Geobrowsing: creative thinking and knowledge discovery using ...

the creator of the map designs the map, selecting elements .... ways of aiding the map maker/user to see emergent ..... Graphics Newsletter 2000; 11: 13 â 19.

Geobrowsing: creative thinking and knowledge discovery using ...

cognitively use spatial knowledge.2â4 The modern mapping environment has been influenced ... we are in the mall, which subway train to take, and how to find our way ..... as a line or set of lines together with some battlefield locations. However .

Improving the knowledge discovery process using ...

Improving the Knowledge Discovery Process Using ... branch of the French national health care system (CAF: Caisse ..... For instance âHome Locationâ.

Parallel Knowledge Discovery Using Domain ... - CiteSeerX

Download PDF

11 downloads 8255 Views 200KB Size Report

Comment

example, a CH for the Location attribute in a sales database is shown in Fig- ... the name represents the same partition of the domain in the underlying CHs.

Parallel Knowledge Discovery Using Domain Generalization Graphs Robert J. Hilderman, Howard J. Hamilton, Robert J. Kowalchuk, and Nick Cercone Department of Computer Science University of Regina Regina, Saskatchewan, Canada, S4S 0A2 fhilder,hamilton,kowalc,[email protected]

Abstract. Multi-Attribute Generalization is an algorithm for attribute-

oriented induction in relational databases using domain generalization graphs. Each node in a domain generalization graph represents a dierent way of summarizing the domain values associated with an attribute. When generalizing a set of attributes, we show how a serial implementation of the algorithm generates all possible combinations of nodes from the domain generalization graphs associated with the attributes, resulting in the presentation of all possible generalized relations for the set. We then show how the inherent parallelism in domain generalization graphs is exploited by a parallel implementation of the algorithm. Signi cant speedups were obtained using our approach when large discovery tasks were partitioned across multiple processors. The results of our work enable a database analyst to quickly and eciently analyze the contents of a relational database from many dierent perspectives.

1 Introduction Knowledge discovery from database (KDD) algorithms can be broadly classi ed into two general areas: summarization and anomaly detection. Summarization algorithms nd concise descriptions of data, such as partitioning the data into disjoint groups. Anomaly detection algorithms identify unusual features of data, such as combinations that occur with greater or lesser frequency than expected. Attribute-oriented induction (AOI) [7, 8, 9] is a summarization algorithm that has been eective for KDD. AOI summarizes the information in a relational database by repeatedly replacing speci c attribute values with more general concepts according to user-de ned concept hierarchies (CHs). A concept hierarchy associated with an attribute in a database is represented as a tree where leaf nodes correspond to actual domain values in the database, intermediate nodes correspond to a more general representation of the domain values, and the root node corresponds to the most general representation of the domain values. For example, a CH for the Location attribute in a sales database is shown in Figure 1(a). Knowledge about the higher level concepts can be learned through generalization of the sales data at each node.

ANY

ANY

West

Vancouver

Office1

Office2

East

L.A.

Office3

Office4

Toronto

Office5 (a)

Office6

Division

New York

Office7

Office8

City

Office9

Office (b)

Fig. 1. A concept hierarchy (a) and a domain generalization graph (b) As the result of recent research, AOI methods are considered among the most ecient of KDD methods for knowledge discovery from databases [1, 2, 3, 4, 7, 10]. In particular, algorithms for generalizing relational databases are presented in [1] that run in O(n) time, where n is the number of tuples in the input relation, and require O(p) space, where p is the number of tuples in the generalized relation (typically p