Parallel Computing System for e cient computation of ... - Google Sites
Recommend Documents
Parallel programming considerations. ⢠The general process is executed over the CPU. â Host Code: C++. ⢠The strin
This paper describes e cient scalable parallel algorithms for the planar point location ..... Compute the new endpoints for the edges that are intersected by the dividing ... it su ces to look at the two points, one in P and the other one in Q, that
the smallest enclosing rectangle with sides parallel to the coordinate axes. (isothetic); ... lar and square measures and the diameter bipartitioning problem under the L1 and L1 ... to design e cient parallel algorithms for the other partitioning pro
5], and Karp 5] is based on minimizing communication. This work ..... of chart M(fk) coresponding to v and v0 are di erent (k = 1;2). 1. Let S. (k) .... rows, i.e. f0; 1gp, into equivalence classes E1;...; El such .... E2 is represented by bdd. (1).
This paper deals with e cient parallel implementations of reconstruction methods in 3D tomography. Depending on the method, we use two main approaches to ...
Mar 24, 1998 - degree algebraic numbers for geometric applications. Such computations fre ..... First, we approximate only algebraic numbers with rectangular.
Section 4 which fully describes the escape time clas- .... Each IFS T describes a non-empty compact set ..... Figure 7: The smashed gasket whose IFS has an.
Linda is a coordination language invented by David. Gelernter at Yale University 7], which, when com- bined with a computation language (like C) yields a.
A multirate synchronous data- ow graph, in which nodes operate at distinct ... data ow allows for the speci cation of multirate signal processing algo- ..... elements equal to the minimum of the delay elements of the combined edges (note that any ...
tree construction and comparison processes for a set of molecules. Fur- thermore, in .... The abstractions and virtualiz
(v; v0) is also called a child edge of v0. v0 is an ancestor of v i there is a directed path from v to v0 in T. v is also called a descendent of v0. A subgraph of (V; E) is ...
The main goal of a good mapping is to minimize the execution time of the whole .... by choosing its best \neighbor" (neighborhood is de ned for instance by ...
of process migration in a non-dedicated heterogeneous environment. ...... The scheduler could be either a server process implemented on a dedicated machine ...
Marc Feeley and James S. Miller. Brandeis University. Waltham, MA 02254-9110. Abstract. Programs compiled by Gambit, our Scheme compiler, achieve.
11, 13, 15 propose an approach di erent from the one ad- dressed above. ... Experimental results close the paper section 5 . 2 Basic de .... 1 g be the function which maps v 2 f0 1gp to the index j of the class K k j to which it ..... E2 = f010 011 1
We design the rst e cient parallel algorithm for computing the min- imal elimination .... for the Gaussian elimination on sparse symmetric matrices (cf. Ro 73]).
are propagated through the procedure call graph (PCG), whose nodes represent ..... ow-sensitive analysis, < u;a > holds at S3, S4, and. S8, and < u; b > holds at ...... ings of the Sigplan '88 Conference on Programming. Language Design and ...
Jan 29, 1995 - E-mail: [email protected]. Abstract. Recently, Yen and Laih 1] proposed a algorithm to quickly compute LUC digital signatures. This signature ...
Gleb Naumovich, George S. Avrunin, and Lori A. Clarke. Laboratory for Advanced ...... 7] R. K. Keller, M. Cameron, R. N. Taylor, and D. B. Troup. User interface ...
possible to de ne a class C in one computation, pass C to an independent computation that has never ... consistent multiple assignment, that is, assignments must be consistent with each other. We keep ..... of nil then A. [] X|Ls then {Sum Ls A+X} ..
Download Best Book Parallel Computing for Data Science: With Examples in R, .... type of hardware and can be used from m
ing the merge step of a divide-and-conquer algorithm, as in 38]) reduces this .... cally, it can ship a recursive serial function call to an idle processor in order to ...
of the technique used by Barnard's spectral bisection algo- rithm on the Cray T3D ..... To test portability, we used three parallel architectures: a loosely-coupled ...
be found in LP85]. Basically, we use a set of inequalities to represent a number of di erent clock ... inequalities are isomorphic, if they have the same solution set.
Parallel Computing System for e cient computation of ... - Google Sites
Host Code: C++. The string comparison process is made in parallel. Device Code: CUDA for C. Raul Torres. Parallel Comput
Methodology Results Conclusions More information
Parallel Computing System for ecient computation of Molecular Similarity based on Negative Electrostatic Potential: First results Raul Torres1 1 Grupo de Química Teórica - Universidad Nacional de Colombia
Research Seminar, 2009
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
TARIS Method
Figure: A. Isopotential Surface Size B. Isopotencial value
Raul Torres
Each node saves:
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Data set
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Classication
The similarity matrix obtained with the GPU computing process (CUDA) was analyzed by means of hierarchical clustering using the average linkage method(R Statistical Package).
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
General Representation of molecules Every node is represented by [] characters When a node has children, each child is established inside the [] of the parent: [ [][] ] If a weight is associated to a non-leaf node, this value is written after the rst [: [45,889[78,76[][]][987,5[][]] The leaf nodes have no weight associated We propose a canonical representation The sub-trees with more nodes are translated rst Next the sub-trees with more levels are listed rst Next the sub-trees with greater weight are listed rst
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Proposed Kernel Simple kernel:
k (x , y ) = A more complex kernel:
∑ nums (x )nums (y )
s ∈B
∑s ∈B nums (x )nums (y )ws k (x , y ) ws is 1 if wy and wx are 0 (wy and wx are the respective weights of x and y trees) ws = wwyx if wx ≤ wy Otherwise, ws = wwyx k w (x , y ) =
B is the set of balanced sub-string [...] Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Proposed Kernel
The weights can be calculated in 9 dierent ways The process is reduced to nd the number of balanced sub-strings founded in both molecules;in other words, a sub-tree. The leaf nodes are not counted as sub-trees The whole string is considered a sub-string
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Proposed Kernel
Tree representation: The red and green circles denote sub-trees that appears in both trees. In this case, the simple kernel is 2 Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Proposed Kernel
String representation: The gray area is the same green circle in the previous gure. The red square is related to the red circle too.
Raul Torres
Parallel Computing System for ecient computation of Mo
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Parallel programming considerations
The general process is executed over the CPU Host Code: C++
The string comparison process is made in parallel Device Code: CUDA for C
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experimental congurations Variables (to construct the weight): Isopotential surface size: ISS Isosurface value: IV Factorial design (32 experiments), three states (N)Don't use (A)Accumulated: the summation of all the values of each node gives the weight for the sub-tree (S)Simple: the value of the root node of the sub-tree gives the weight for the sub-tree
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experimental congurations
W0: There are no weights. Only the structure is important. (N)ISS x (N) IV W1: Accumulated isopotential surface size. (A)ISS x (N) IV W2: Accumulated isopotential surface size times Accumulated isosurface value. (A)ISS x (A) IV W3: Simple isopotential surface size. (S)ISS x (N) IV W4: Accumulated isosurface value. (N)ISS x (A) IV
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experimental congurations
W5: Simple isosurface value. (N)ISS x (S) IV W6: Accumulated isopotential surface size times Simple isosurface value. (A)ISS x (S) IV W7: Simple isopotential surface size times Simple isosurface value. (S)ISS x (S) IV W8: Simple isopotential surface size times Accumulated isosurface value. (S)ISS x (A) IV
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experiment W3
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experiment W8
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experiment W7
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experiment W2
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Experiment W0
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Execution time
In general, the execution time is approximately 2 seconds.
Raul Torres
Parallel Computing System for ecient computation of Mo
Methodology Results Conclusions More information
Conclusions We don´t need to accumulate the sizes of the children In general terms, the kernel method used achieves a good classication The pre-assumptions that a string kernel can be applied to tree-like structured data was veried The next eorts of this research will be focused in the application of a kernel that uses the co-rooted tree and a more robust representation of strings named sux tree. The use of CUDA as a programming environment has allow us to perform a several concurrent operations in a fast way than in the serial paradigm Without the need of a cluster of computers, GPU Computing oers a tremendous computational power at low cost. Raul Torres
Parallel Computing System for ecient computation of Mo
Appendix
For Further Reading
For Further Reading I
Raul Torres
Parallel Computing System for ecient computation of Mo