IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 2, NO. 2, APRIL 1991


Clustering on a Hypercube Multicomputer

Sanjay Ranka, Member, IEEE, and Sartaj Sahni, Fellow, IEEE

Abstract- In this paper, squared error clustering algorithms for SIMD hypercubes are presented. These algorithms are asymptotically faster than previously known algorithms and require less memory per PE. For a clustering problem with N patterns, M features per pattern, and K clusters, our algorithms complete in O(K + log NM) steps on NM processor hypercubes. This is optimal up to a constant factor. We extend these results to the case when NMK processors are available. Experimental results from an MIMD medium grain hypercube are also presented.

Index Terms- Clustering, feature vector, hypercube multicomputer, pattern recognition, MIMD, SIMD.

I. INTRODUCTION

Feature vector is a basic notion of pattern recognition. A feature vector v is a set of measurements (v_1, v_2, ..., v_M) which maps the important properties of an image into a Euclidean space of dimension M [1]. Clustering partitions a set of feature vectors into groups. It is a valuable tool in exploratory pattern analysis and helps in making hypotheses about the structure of the data. It is important in syntactic pattern recognition, image segmentation, and registration. There are many methods for clustering feature vectors [1], [3], [5], [6], [12], [13]. One popular technique is squared error clustering. Let N represent the number of patterns which are to be partitioned and let M represent the number of features per pattern. Let F[0..N-1, 0..M-1] be the feature matrix such that F[i, j] denotes the value of the jth feature in the ith pattern. Let S_0, S_1, ..., S_{K-1} be K clusters. Each pattern belongs to exactly one of the clusters. Let C[i] represent the cluster to which pattern i belongs. Thus, we can define S_k as

S_k = { i | C[i] = k }, 0 ≤ k ≤ K - 1.

Further, |S_k| is the cardinality, or size, of the partition S_k. The center of cluster k is a 1 × M vector defined as

center[k, j] = (1 / |S_k|) Σ_{i ∈ S_k} F[i, j], 0 ≤ j < M.

The squared distance d² between pattern i and cluster k is

d²[i, k] = Σ_{j=0}^{M-1} (F[i, j] - center[k, j])².

The squared error for the kth cluster is defined as

E 2 [ k ]=

d2[2,k]

05k

II
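To make these definitions concrete, here is a small sequential sketch in Python/NumPy. The paper's algorithms are for SIMD hypercubes; this code only illustrates the quantities center, d², and E², and all function names are chosen here for illustration:

```python
import numpy as np

def cluster_centers(F, C, K):
    """center[k, j] = mean of feature j over the patterns in S_k."""
    N, M = F.shape
    centers = np.zeros((K, M))
    for k in range(K):
        members = F[C == k]              # patterns i with C[i] == k, i.e., S_k
        if len(members) > 0:
            centers[k] = members.mean(axis=0)
    return centers

def squared_error(F, C, K):
    """E2[k] = sum of d2[i, k] over all patterns i in S_k."""
    centers = cluster_centers(F, C, K)
    E2 = np.zeros(K)
    for k in range(K):
        diffs = F[C == k] - centers[k]   # F[i, j] - center[k, j]
        E2[k] = (diffs ** 2).sum()       # sum over i in S_k and over j
    return E2

# Example: N = 6 patterns, M = 2 features, K = 2 clusters.
F = np.array([[0., 0.], [1., 0.], [0., 1.], [9., 9.], [10., 9.], [9., 10.]])
C = np.array([0, 0, 0, 1, 1, 1])
print(squared_error(F, C, 2))            # per-cluster squared error E2[k]
```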

[Fig. 10. O(1) memory cluster assignment.]
[Fig. 11. O(1) memory cluster assignment (continued).]
[Fig. 12. Complexity analysis of Figs. 10 and 11.]

FeatureSum(i, j) = Σ_{q ∈ S_i} F[q, j], 0 ≤ i < K, 0 ≤ j < M, and Number(i, j) = |S_i|. The algorithm to update the cluster centers is given in Fig. 13. Steps 1 and 2 are performed in K × M windows. The (i, j) PE in each such window computes the change in FeatureSum(i, j) and Number(i, j) contributed by the patterns in this window. These two steps can be restricted to PEs for which NewCluster(i, j) ≠ Cluster(i, j). In Steps 3 and 4 the topmost window accumulates the sum of these changes. Steps 5-8 update the clustering data. The complexity analysis is provided in Fig. 14. A total of O(log²K + log(N/K)) unit routes are used.

Overall Complexity: The total number of unit routes used by our algorithms for one pass of Fig. 1 is 4K + O(log²K) + O(log NMK), regardless of whether the amount of memory available per PE is O(K) or O(1). This improves on the algorithm of Li and Fang [8], which requires O(K log NM) unit routes and O(K) memory per PE.
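The incremental idea behind Fig. 13 can be illustrated sequentially: when a pattern moves to a new cluster, only the FeatureSum and Number entries of its old and new clusters change. Below is a minimal Python sketch of that delta update, assuming the aggregate definitions above; it does not model the window-based hypercube routing, and its helper names are hypothetical:

```python
import numpy as np

def update_centers(F, cluster, new_cluster, feature_sum, number):
    """Apply delta updates: only patterns with new_cluster[q] != cluster[q]
    contribute changes to FeatureSum and Number (cf. Steps 1-2 of Fig. 13)."""
    for q in np.where(new_cluster != cluster)[0]:
        old, new = cluster[q], new_cluster[q]
        feature_sum[old] -= F[q]     # remove pattern q from its old cluster's sum
        feature_sum[new] += F[q]     # add it to the new cluster's sum
        number[old] -= 1
        number[new] += 1
    cluster[:] = new_cluster         # commit the reassignment (cf. Steps 5-8)
    # center[k, j] = FeatureSum(k, j) / Number(k); guard against empty clusters
    return feature_sum / np.maximum(number, 1)[:, None]
```

Sequentially, each moved pattern costs O(M) work; the point of the paper's algorithm is to perform the corresponding accumulation across NM PEs in O(log²K + log(N/K)) unit routes.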
