A Clustering Method Based on the Maximum ... - Semantic Scholar

Recommend Documents

A Novelty-based Clustering Method for On-line ... - Semantic Scholar

3 Center for Computational Sciences. University of Tsukuba .... The focus of our research is to cluster what we call time-series documents, which are documents continually ...... puting (STOC), El Paso, Texas, USA, May 4-6, pp. 626â635 (1997).

A Corpus-based Conceptual Clustering Method for ... - Semantic Scholar

travel now admits motor-bike as adjunct after the by preposition and the frame of to drive now admits train as object, examples which did not occurs as such.

A new multimembership clustering method - Semantic Scholar

Clustering method is one of the most important tools in statistics. In a graph ... been developed and implemented in popular statistical software packages. A gen-.

A new clustering algorithm based on the chemical ... - Semantic Scholar

Thus, each ant has its own view of its colony odor at a given time, and updates it continuously. By this way, an ant preserves its nest from being attacked by ...

Grammar Acquisition Based on Clustering ... - Semantic Scholar

Experiments using Wall Street Journal data show that our ..... for the 97th Meeting of the Acoustical Society of America (D.H. Klatt and J.J. Wolf, eds.), pp.

A clustering-based prefetching scheme on a Web ... - Semantic Scholar

images to a rich assortment of dynamic and interactive services, such as video/audio conferencing, e-com- merce, and distance learning. The explosive growth ...

Model-Based Method for Projective Clustering - Semantic Scholar

that indicate the spam category in the email collections, such as cash and sex of Email1431, advertisement and unsolicit for LingSpam, and play and hot for ...

Speeding-up the k-means clustering method: A ... - Semantic Scholar

P. S. Bradley, U. Fayyad, and C. Raina. Scaling clustering ... V. Suresh Babu and P. Viswanath. Rough-fuzzy weighted ... P. Viswanath and V. Suresh Babu.

QUASI-NEWTON METHOD FOR MAXIMUM ... - Semantic Scholar

QUASI-NEWTON METHOD FOR MAXIMUM LIKELIHOOD ESTIMATION OF HIDDEN. MARKOV MODELS. Olivier CappÃ©, Vincent Buchoux, Eric Moulines.

A Novel Clustering Algorithm based on Attraction - Semantic Scholar

Some algorithms such as K-means require a ... cluster. Hierarchical clustering algorithms start with each data ... CURE [7] starts by using a constant number of.

Relational Clustering Based on a New Robust ... - Semantic Scholar

tion with the Web information space based on information about him/her. It can be .... MDE [12] is a new robust estimator that is free of any presuppositions about ...

Towards a Simple Clustering Criterion Based on ... - Semantic Scholar

Marcus-Christopher Ludl1 and Gerhard Widmer1;2 ..... h0(x) ln g(x) + h(x) g0(x) g(x) ... Figure 1 The Ripley-dataset: a two-dimensional two-class problem. -0.2. 0.

A Gas Dynamics Method Based on the Spectral ... - Semantic Scholar

American Journal of Computational Mathematics, 2011, 1, 303-317 ...... Dynamics,â SIAM Journal on Scientific Computing, Vol. 6, No. 1, 1985, pp. 104-117.

A Semantic Caching Method Based on Linear ... - Semantic Scholar

research field, especially in client-server databases and dis- ... DBMS that supports SQL queries. Suppose that a ... null (this is true for this example), it has to be sent to the server. ... query. The best case is achieved when a semantic region.

Clustering of web search results based on the ... - Semantic Scholar

In general, web clustering engines work as meta-search engines and collect .... outperforms Genetic Algorithm, Particle Swarm Optimization, and Differential ...

Knowledge-based Semantic Clustering - Semantic Scholar

importance relative to the content of the post, e.g. blog postings rapidly fade in importance as time ...... [7] Apple - iTunes - iTunes Store - Podcasts - Technical.

Mobile Localizaton Method Based on ... - Semantic Scholar

Qun Wan, Yong-Jie Luo, Wan-Lin Yang. Jia Xu, Jun Tang, Ying-Ning Peng. Department of Electronic Engineering. Department of Electronic Engineering.

Category Decomposition Method Based on ... - Semantic Scholar

AVIRIS imagery data of Ivanpah playa in California, USA. IV. CONCLUSION. Category decomposition method based on matched filter for un-mixing of mixed ...

Crossed Clustering method on Symbolic Data tables - Semantic Scholar

Abstract. In this paper we propose a crossed clustering algorithm in order to partition a set of symbolic objects in a predefined number of classes and to de-.

WSNs clustering based on semantic neighborhood ... - Semantic Scholar

Jan 31, 2012 - ios of damage with different data aggregation methods. We also ...... mable flash memory, 512 kB external serial flash, 4 kB EE-. PROM.

A model-based clustering method to detect

Nov 13, 2017 - and Laboratory Medicine, Western University, London, Ontario, Canada, ...... (PDF). Acknowledgments. We wish to acknowledge Dr. Simon D. W. Frost .... 2010; 5(3):e9490. https://doi.org/10.1371/journal.pone.0009490 PMID:.

Clustering on a hypercube multicomputer - Semantic Scholar

Stepl: [Cluster Reassignment] ... minimum squared distance, recompute cluster centers. The last ..... develop algorithms for the cluster reassignment (Step 1) and.

A DIVISIVE HIERARCHICAL CLUSTERING- BASED METHOD FOR ...

Many clustering algorithms are proposed by now, but only few research studies are ... CURE[14] for partitioning the data space in a hierarchical structure.

A Principal Components-Based Clustering Method to

Mar 10, 2011 - tween tag SNPs in the promoter region of hepatocyte nu- clear factor 4- .... Color key. 0 0.2 0.4 0.6 0.8 1.0. 0 0.2 0.4 0.6 0.8 1.0 a b. Co lo. r v e rsio n av a ila b le o n ..... teman JC, Yarnell JW, Zeggini E, Zelenika D,. Zetheli

A Clustering Method Based on the Maximum ... - Semantic Scholar

Download PDF

9 downloads 0 Views 848KB Size Report

Comment

Jan 7, 2015 - a set of k centroids (one for each cluster) and associating each ..... (4) Train the CM with Xi to obtain a clustering solution denoted as Yi. ..... In Proceedings of the 10th International Conference on Data Mining (ICDM), Sydney,.

Entropy 2015, 17, 151-180; doi:10.3390/e17010151

OPEN ACCESS

entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Article

A Clustering Method Based on the Maximum Entropy Principle Edwin Aldana-Bobadilla 1, * and Angel Kuri-Morales 2 1

Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Ciudad Universitaria, 04510 Ciudad de México, Mexico 2 Instituto Tecnológico Autónomo de México, Río Hondo 1. Col. Progreso Tizapán, 01080 Ciudad de México, Mexico; E-Mail: [email protected] * Author to whom correspondence should be addressed; E-Mail: [email protected]; Tel.: +52-55-515-912-93. Academic Editor: Kevin H. Knuth Received: 6 October 2014 / Accepted: 26 December 2014 / Published: 7 January 2015

Abstract: Clustering is an unsupervised process to determine which unlabeled objects in a set share interesting properties. The objects are grouped into k subsets (clusters) whose elements optimize a proximity measure. Methods based on information theory have proven to be feasible alternatives. They are based on the assumption that a cluster is one subset with the minimal possible degree of “disorder”. They attempt to minimize the entropy of each cluster. We propose a clustering method based on the maximum entropy principle. Such a method explores the space of all possible probability distributions of the data to find one that maximizes the entropy subject to extra conditions based on prior information about the clusters. The prior information is based on the assumption that the elements of a cluster are “similar” to each other in accordance with some statistical measure. As a consequence of such a principle, those distributions of high entropy that satisfy the conditions are favored over others. Searching the space to find the optimal distribution of object in the clusters represents a hard combinatorial problem, which disallows the use of traditional optimization techniques. Genetic algorithms are a good alternative to solve this problem. We benchmark our method relative to the best theoretical performance, which is given by the Bayes classifier when data are normally distributed, and a multilayer perceptron network, which offers the best practical performance when data are not normal. In general, a supervised classification method will outperform a non-supervised one, since, in the first case, the elements of the classes are known a priori. In what follows, we show that our method’s effectiveness is comparable to a supervised one. This clearly exhibits the superiority of our method.

Entropy 2015, 17

152

Keywords: clustering; Shannon’s entropy; genetic algorithms

1. Introduction Pattern recognition is a scientific discipline whose methods allow us to describe and classify objects. The descriptive process involves the symbolic representation of these objects through a numerical vector ~x: ~x = [x1 , x2 , . . . xn ] ∈