IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
1
Dealing with Multiple Requirements in Geometric Arrangements Erick Gomez-Nieto, Student Member, IEEE, Wallace Casaca, Danilo Motta, Ivar Hartmann, Gabriel Taubin, Fellow, IEEE, and Luis Gustavo Nonato, Member, IEEE Abstract—Existing algorithms for building layouts from geometric primitives are typically designed to cope with requirements such as orthogonal alignment, overlap removal, optimal area usage, hierarchical organization, among others. However, most techniques are able to tackle just a few of those requirements simultaneously, impairing their use and flexibility. In this work we propose a novel methodology for building layouts from geometric primitives that concurrently addresses a wider range of requirements. Relying on multidimensional projection and mixed integer optimization, our approach arranges geometric objects in the visual space so as to generate well structured layouts that preserve the semantic relation among objects while still making an efficient use of display area. Moreover, scalability is handled through a hierarchical representation scheme combined with navigation tools. A comprehensive set of quantitative comparisons against existing geometry-based layouts and applications on text, image, and video data set visualization prove the effectiveness of our approach. Index Terms—Overlap Removal, Similarity Preserving, Structured Layouts, Area Optimization.
F
1
I NTRODUCTION
RRANGING geometric primitives such as boxes and discs in a two-dimensional layout is a nontrivial task that inherently appears in important visualization applications, as for example, word cloud construction [1], [2], small-multiples arrangements [3], [4], [5], and visual boards [6], [7], [8]. The difficulty in building layouts made up of dozens of geometric objects rests in the set of requirements to be handled simultaneously, e.g., readability, overlaps, object size, semantic proximity and area usage. Moreover, the number of data instances represented as geometric entities is typically much larger than the visualization area, demanding the use of clustering, hierarchies, and navigation resources to assist the visualization. Although significant advances have been made towards building meaningful layouts from geometric primitives, existing techniques are formulated to deal with a limited number of requirements simultaneously, restricting their use to specific applications. For instance, techniques such as visual boards and small multiples provide well structured layouts which are easily readable, but they pay the price of scalability. Hierarchical methods such as Treemaps [9] mitigate the issue of scalability while making an efficient use of display area. However, readability and semantic organization of data are aspects not so easily handled by those methods. Overlap-free semantic preserving techniques such
A
• E. Gomez-Nieto, W. Casaca, D. Motta and L.G. Nonato are with Instituto de Ciências Matemáticas e de Computação (ICMC), Universidade de São Paulo, São Carlos, Brazil. Email:
[email protected], {wallace,daniloam,gnonato}@icmc.usp.br • I. Hartmann is with Universidade Estadual de Rio de Janeiro, Rio de Janeiro, Brazil. Email:
[email protected] • G. Taubin is with Brown University, School of Engineering, Providence, USA. Email:
[email protected]
as RWordles [10] and ProjCloud [11] generate somewhat structured layouts and keep instances with similar content close to each other. Nevertheless, they are not designed to make an efficient use of display area and also suffer from scalability. Handling many requirements is not straightforward because distinct requirements can compete with each other during layout construction. For instance, to facilitate readability, layouts should be built with as large as possible geometric entities. However, large objects easily fill up the display area, thus limiting the number of instances that can be visualized. Therefore, finding an optimal balance among multiple concurrent requirements is a challenging task, which has not been properly tackled by existing methods. In this work we present a novel methodology for building layouts from geometric primitives which is able to deal with a wide range of requirements simultaneously. Relying on multidimensional projection, density-based adaptive grids, and mixed integer optimization, our approach is semantically aware, makes an efficient use of display area, and generates well structured grid-like layouts. Moreover, the formulation intrinsically impose a hierarchy on the data, enabling alternatives for the scalability issue. The proposed optimization scheme arranges geometric entities (boxes) with varying sizes so as to avoid overlaps while preserving the neighborhood structure of the underlying data (semantics). The area of each geometric primitive is also included in the optimization process to ensure that the display area will be efficiently occupied. In fact, supported by the adaptive grid, our formulation is able to scale elements with different sizes using only one variable, thus rendering the optimization procedure as simple as possible. A comprehensive set of quantitative comparisons against existing geometry-based layouts shows the effectiveness of our approach. The usefulness of our
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
methodology in visualizing text, image, and video data sets is also confirmed in several practical applications. The main contributions of this work can be summarized as follows: • A novel methodology to generate layouts made up of geometric primitives which deals with multiple requirements such as grid-like structure, semantic relation among objects, display area usage, and overlap removal. The methodology naturally imposes an hierarchy on the visual representation in order to handle a large number of data instances. • A new optimization scheme that relies on a reduced number of unknowns to generate well structured layouts while making a good use of display area. • A comprehensive set of comparisons against existing methods and several practical applications to confirm the effectiveness and usefulness of our methodology. One of the positive aspects in managing multiple requirements during layout construction is that we can play with properties such as object size and neighborhood to highlight relevant portions of the layout without losing the semantic relation among objects. By handling additional requisites such as overlap-free and optimal usage of display area we can reduce visual clutter while improving readability. As far as we known, no other technique devoted to build layouts from geometric primitives is able to deal with so many concurrent requirements to generate meaningful layouts.
2
R ELATED W ORK
We focus the following discussion on techniques that rely on the arrangement of simple geometric objects such as boxes, discs, and polygons to visualize data information. We group existing methods in three main categories, structured arrangements, hierarchy-based, and nodedisplacement techniques. It is worth mentioning that our goal is just to provide an overview of the main classes of methods that make use of simple geometric objects to build visualizations, pointing out their strengths and weaknesses in order to better contextualize our contribution. There are a huge amount of work in the literature and a comprehensive survey about all those methods is beyond the scope of this paper. Structured arrangements comprise the set of techniques that tile the visual space with geometric primitives so as to produce a well structured layout. IncBoard [6], for instance, uses boxes or hexagons to represent highdimensional instances which are arranged as a regular grid in the visual space. The arrangement is done such that proximity among objects reflects similarity, that is, neighbor cells in the grid tend to represent similar instances of data high-dimensional data as grid according to so as to keep making an analogy with representation analogous to a chessboard where rows and columns determines cells to represent high dimensional data on 2D space. Self-Sorting Map [7], [8] is another approach that relies on regular
2
grids to perform visualization. Items are arranged based on a sorting mechanism that keeps similar instances close to each other by using a hierarchical swapping process which aims to maximize normalized cross correlation between the input data and the positions of a structured grid. Small multiples [3], [4] is another class of techniques that rely on structured arrangement of geometric primitives to build visualization layouts. Small multiples is typically used to create different views of a data set [5] or to enable the visual analysis of multiple complex data such as time series [12], [13]. One of the main advantages of structured layouts is the ease of interpretation of the resulting visualization. In fact, van den Elzen et al. [5] showed in a user study that visualizations made up of small multiples are more effective than other methods to visualize different views of the same data. A similar conclusion has been reached by Javed et al. [12] in the context of time series visual analytics. Despite the advantages, structured layouts do not scale easily, which impairs their use in applications involving large data sets. Hierarchy-based methods are designed to make an efficient use of display area while enabling dynamic navigation throughout different levels of a hierarchical representation [14]. Treemap [15] and its variants [16], [9], [17], [18] are examples of hierarchy-based techniques tailored to visualize data organized as a tree structure. The visualization is performed by recursively splitting the space in rectangular boxes whose size and orientation reflect the extent of nodes in the hierarchy. Another example is Pedvis [19], a technique that builds upon H-tree layout and rectangular boxes to depict pedigree information structured as deep hierarchies. In contrast to the hierarchical methods described above, Voronoi Treemap [20] uses polygonal objects rather than rectangular boxes towards better representing the importance of nodes in the hierarchy and further differentiate sibling and non-sibling nodes. Despite the efficiency in space occupation, most tree-base layouts are not devised to place similar instances close to each other, making them unsuitable for applications involving similarity-based data exploration. Aiming at addressing this issue, some authors [21], [22] have proposed ordering mechanisms that consistently arranges tree nodes according to some similarity measure. Nocaj and Brandes [23] combined multidimensional scaling, Voronoi Treemap, and a set of visualization resources to highlight the similarity among data instances while enabling navigation throughout the hierarchy. Hierarchy-based methods scale well and they make good use of display area. However, in contrast to structured arrangements, the layout resulting from those methods are not so easy to read as similar instances are not necessarily placed close to each other. Node-displacement techniques aim at arranging geometric objects in the visual space so as to preserve similarity relations while avoiding overlaps. Overlap-free techniques vary greatly as to mathematical formulation and they can
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
be classified as physical-base, heuristic-base, and optimalbased models. Physical-based models rely on force schemes [24], [25] or spring systems [26], [27] to position geometric entities in the visual space. The lack of guarantees in terms of convergence and the untidiness of the resulting layout are the main weaknesses of physical-based approaches. Heuristic methods typically scan the visual space looking for the best location for each instance. Recent approaches such as RWordles [10] take into account similarity information to build semantically aware layouts. The main advantage of heuristic techniques is their low computational cost that makes possible to build large layouts quickly. Uneconomical use of display area is, however, a major drawback. Optimal approaches arrange objects by minimizing an energy functional subject to a set of constraints. Dwyer et al. [28] and Marriott et al. [29], proposed energy functionals derived from intersection tests, using quadratic optimization to find an optimal solution. In order to improve readability some techniques impose constraints to the energy functional towards generating grid-like arrangements [30], [31]. ProjSnippet [32] is a two-step approach that first projects data onto the visual space and then builds an energy functional that accounts for object overlap as well as neighborhood preservation. The technique called MIOLA [33] also makes use of neighborhood information provided by multidimensional projection to formulate a mixed integer optimization that results in almost orthogonal layouts. Heuristic and optimal methods perform well in terms of overlap removal and similarity preservation. However, those methods do not make an efficient use of display area, mainly when dealing with geometric objects with different sizes. Scalability is another hurdle for those techniques, since, with a few exceptions [34], node-displacement methods typically do not operate hierarchically. The method proposed in this work gathers a set of traits not presented in other existing approaches, as it has been designed to account for multiple concurrent requirements such as object scale, neighborhood and aspect ratio preservation among cells, optimal use of display area, and overlap-free grid-like cell arrangement (see Figure 1 bottom). Moreover, the proposed technique naturally imposes a hierarchy on the data, thus being able to handle large data sets. Proximity between similar entities is also enforced during layout construction and navigation. In summary, the proposed method bears a set of properties not present together in any other geometry-based layout technique.
3
P ROPOSED M ETHODOLOGY
As illustrated in Figure 1, the proposed methodology comprises three main steps: multidimensional projection, density-based adaptive grid generation, and layout optimization. In the first step, high-dimensional data is mapped to the visual space so as to preserve distance among data instances. In our implementation we use the Least Square Projection (LSP) method [35], which preserves distances
3
Fig. 1. Pipeline used comprises three main steps: (i) data is mapped into visual space using a multidimensional projection technique, (ii) adaptive grid is built from projected points and (iii) an optimization algorithm is performed to reach an optimal area usage.
nicely during the mapping process, thus enforcing that neighbor points in the visual space correspond to similar/close instances in the original space. The use of a distance preserving mapping to place instances in the visual space provides the neighborhood structure that must be “mimicked” by the final layout. An adaptive grid is then constructed from projected points. In contrast to traditional adaptive grid generation schemes, our approach refines the grid in less dense (but not empty) regions of the visual space, thus placing larger grid cells in denser regions (see Figure 1 top right). The rationale is to allow users easily identify dense regions by visually recognizing large geometric objects in the layout. The number of refinement levels is a user defined parameter. The adaptive grid produces a size varying tiling of the visual space. Moreover, grid cells inherit the semantic relation of the project instances, that is, neighbor grid cells tend to encompass similar instances. Although well structured, the cell arrangement resulting from the refinement is typically spread, making an inefficient use of display area. Therefore, in the third step of the proposed pipeline, cells are rearranged in the visual space to optimize the area usage. The optimization is formulated to account for object scale, overlapping, and grid-like arrangement while preserving neighborhood relationships and the aspect ratio among cells (see Figure 1 bottom). The rationale here is to build layouts as readable as possible while still dragging users attention to denser regions that should be further explored. The following subsections detail the second and third step of our pipeline, which correspond to the major technical contributions of our approach. 3.1
Adaptive Grid generation
Let P = {p1 , p2 , . . . , pq } ⊂ R2 be the projection of a set of instances into the visual space and G be a n × m regular
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
(a)
(b)
pointing out where users should nail down their exploration. Second, the proposed refinement scheme allows for resizing all grid cells by controlling only one parameter. In other words, if one wants to scale all cells preserving their relative size, the only parameter to be tuned is the length δ of coarser cells. In fact, the length of the cells in each refinement level is given by (1/2k )δ , where k = 0, 1, · · · is the level of refinement. This last property will be exploited during optimization to find the value of δ that results in the best use of display area, as detailed below. 3.2
(c)
(d)
Fig. 2. Density based refinement process. (a) An initial regular grid is defined in the visual space. Such an initial grid provides the coarser refinement level (first level, as depicted in (b)). (c) An adaptive grid is built from density information. Denser areas are represented by larger cells. (d) Active (not empty) cells are then ready to be optimized in the next step of the proposed framework.
grid which discretizes the bounding box of P. The number of cells in each orthogonal direction is defined such that each grid cell gi j has a square shape, that is, given the number of subdivision in one direction, for instance m, the H number of subdivision in the other direction is n = d W me (the boundary of the bounding box can be displaced to accommodate the number of cells), where H and W account for the height and width of the bounding box of P, respectively. The regular mesh G is the coarser grid level for the adaptive process. The refinement is driven by density information computed from P in each grid cell. The density can be estimated in different ways, for instance by integrating a kernel density estimator over each grid cell. As our application does not demand highly accurate density estimation, we opt to approximate the density in each cell by simply counting the number of points pi contained in the cell. Given the density information, the refinement is carried out as follows: let di j be the density of the grid cell gi j and dmax = max{di j } be the largest density value in G. A grid cell gi j undergoes one level of refinement if di j ≤ 0.5dmax , two levels of refinement if di j ≤ 0.25dmax , three levels if di j ≤ 0.125dmax , and so on. Figure 2 illustrates the refinement process with three levels. Although simple, the grid generation scheme described above has two helpful properties. First, dense regions in the visual space can easily be identified from the larger cells,
4
Optimization
Let G˜ = {g1 , g2 , ..., gN } be the set of non-empty cells in G, that is, G˜ comprises the cells of G with projected points in their interior. Each cell gi is a square box described by the vector gi = (xi , yi , wi ) ∈ R3 , where (xi , yi ) accounts for the center of the box and wi > 0 is the edge length of gi . The cells gi should be rearranged and resized inside a W ×H display area so as to make an efficient use of display area while preserving grid alignment and neighborhood structures. As described in Section 3.1, wi = αi δ , where αi = 1/2k , k corresponding to the level of refinement of gi . Therefore, δ is a parameter to be optimized (initially set as the length δ0 of the coarser cell) from which the size of each cell is derived, as illustrated in Figure 3(a). 3.2.1
Rearranging Cells with Area Resizing
The problem of rearranging cells gi so as to generate layouts that optimize use of area while presenting a gridlike structure is an NP-hard problem [36] that appears in contexts such as cutting optimization and rectangle packing [37]. In order to get an approximate solution, we formulate the problem as a computationally tractable quadratic optimization problem. The optimization is formulated as in Equation (1) below: minimize E(z) = Ecomp (z) + Erezise (z), subject to Az ≤ b,
z = [x y r δ ]> ,
x = (x1 , x2 , ..., xN )> ∈ RN y = (y1 , y2 , ..., yN )> ∈ RN r = (r12 , ..., r1N , r23 , ..., r2N , ..., rN−1N )> , ri j ∈ {0, 1} δ0 ≤ δ ≤ min (W, H),
(1)
where z is the sought solution; y and x correspond to the coordinates of the centroids of the cells; δ is the scaling factor; A and b hold the constraints imposed on the optimization problem. The unknowns ri j are control variables used to properly avoid overlaps. The energy components Ecomp (z) and Eresize (z) control the proximity between cells and the area increase, respectively. The former term accounts for overlaps and neighborhood preservation and the second term is designed to scale the box to fill up as much area as possible (see illustration in Figure 4). Compactness Energy Term The first term in the energy functional, Ecomp , controls the relative position among cells and its goal is to keep the layout compact. The energy Ecomp
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
W
δ δ0
α3 δ 0 α3 δ 0
δ0 δ0
W
H δ δ0
α2 δ 0
5
H
α2 δ 0
(a)
(a)
(b)
(c)
(d)
(b)
Fig. 3. (a) The input layout with δ0 being the length of the coarser cell. The size of the remaining cells are defined according to the initial parameter δ0 and their scale parameters αi . (b) The result of the optimization when δ > δ0 .
is a quadratic function that simply takes into account the centers of the boxes, as shown in Equation (2): Ecomp (z) = C ∑ (xi − x j )2 + (yi − y j )2 ,
(2)
(i, j)
pairs where (i, j) = ( j, i) represents the N(N−1) 2 (gi , g j ), i, j, ∈ {1, ..., N} and C = 1/ min (W, H) · N(N−1) 2 is a normalization factor. In less mathematical terms, Ecomp forces boxes to be close to each other and the normalization factor C ensures that the term Ecomp contributes in the same amount as Eresize in the total energy E, thus enabling a good balance between compactness and area usage. Area Usage Energy Term The energy term Eresize controls the amount boxes should be scaled to optimize the use of display area. Since the area of any cell depends only on the length of the largest cells, that is, the area of each cell gi is given by Ai = (αi δ )2 , αi = 1/2k , where k is the refinement level of gi , we can define the resize energy as a quadratic function as follows: Eresize (z) = (δ − min (W, H))2 ,
x p1 ≤ x p2 ≤ ... ≤ x pn ⇒ x pi − x pi+1 ≤ 0 yq1 ≤ yq2 ≤ ... ≤ yqn ⇒ yqi − yqi+1 ≤ 0
Optimization Constraints
As described in Equation (1), constraints are gathered in matrix A and vector b and they are imposed to properly control overlaps and the relative position of cells. Given the initial position (xi , yi ) and length αi δ of each cell, the relative order (also called orthogonal order) of the boxes is given by:
(4)
where p, q : {1, 2, ..., N} → {1, 2, ..., N} are permutations of indices generated by sorting the coordinates xi and yi . Inequalities (4) allows for preserving the relative order of the cells, but it does not account for overlap. Overlaps can be handled by forcing non-overlap conditions as follows: (αi + α j ) (αi + α j ) δ or |y j − yi | ≥ δ , (5) 2 2 From Equation (4) we have that if xi ≤ x j and yi ≤ y j then |x j − xi | = x j − xi and |y j − yi | = y j − yi , which give rise to the following linear system of inequality: |x j − xi | ≥
(3)
where min (W, H) is the minimum between the width and height of the display area (see Fig. 4(c)). It is easy to see that if no constraints are imposed to the unknowns, the minimum of Ecomp is reached when all cells have the same center and the minimum of Eresize takes place when the length of the larger cell is equal to min (W, H). Therefore, without constraints the optimization process will stack cells on over the other. To avoid such unsuitable output, we must settle constraints as discussed next. 3.2.2
Fig. 4. Minimizing each term of the energy functional E in an illustrative layout. (a) Original layout, (b) Ecomp only, (c) Eresize only, (d) the proposed energy functional E = Ecomp + Eresize .
(α +α )
xi − x j ≤ − i 2 j δ , or (α +α ) yi − y j ≤ − i 2 j δ ,
(6)
As proposed in [33], the or condition in (6) can be handle by binary variables ri j ∈ {0, 1} such that: xi −x j ≤ αi j δ +Mri j ⇔ yi −y j ≤ αi j δ +M(1−ri j ),
(7)
(α +α )
where αi j = − i 2 j and M is a very large constant. The role of ri j is to ensure that if one of the inequalities in Equation (6) holds, its counterpart is set aside. For instance, if xi −x j ≤ αi j δ , then ri j is equal to zero, thus yi −y j ≤ αi j δ +M(1−ri j ) is naturally satisfied for any values of y. The linear inequalities, ri j unknowns, and δ make up the linear system [A|b]. The left and right bounds for the x coordinate of the center of the cells must also be imposed in [A|b], that is, 0 ≤ xi −
αi αi δ and xi + δ < W, i = 1, 2, ..., N, 2 2
(8)
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
6
0,7 0,6
Area Usage (A) better
0,5 0,4 0,3 0,2 0,1 0
Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 RWordle
PRISM
VPSC
MIOLA
ProjSnippet
Our Approach
0,8
Orthogonal Ordering (O) better
0,7 0,6 0,5 0,4 0,3 0,2 0,1 0
Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 RWordle
RWordle
PRISM
VPSC
PRISM MIOLA
VPSC
ProjSnippet
MIOLA
Our Approach
RWordle
0,95
0,90 0,85 0,80
0,75 0,70 0,65
PRISM
VPSC
MIOLA
Our Approach ProjSnippet
Our Approach
1,7000
Orthogonal Alignment (L) better
Neighborhood Preservation (K) better
1,00
ProjSnippet
1,6000
1,5000 1,4000 1,3000
1,2000 1,1000
1,000 10%
20%
30%
40%
50%
10%
20%
Percentage of Neighbors
30% Percentage of Neighbors
40%
50%
Fig. 5. Comparison against overlap removal techniques for the three initial classes of experiments.
Similar inequalities are set to limit the upper and lower bounds of the y coordinates. Constraints Relaxation The optimization problem described in Equation (1) provides a flexible and simple mechanism to relax constraints and generate distinct layout arrangements. This is performed by adding relaxation parameters (scalars) in the inequalities to allow for some degree of overlap, orthogonal disorder and so on. 3.2.3
Computational Aspects
The formulation (1) is a Mixed Integer Quadratic Programming (MIQP) Problem, which is minimized by computing an extension of the so-called Branch-and-Bound optimization scheme, where the original MIQP problem is converted
to a Linear Programming Problem in order to be properly handled and solved [38]. In terms of usage, our code was implemented using the solvers and optimization routines provided by Gurobi Optimization Package, which is available at http://www. gurobi.com/.
4
R ESULTS
AND
C OMPARISONS
The performance of the proposed optimization scheme is assessed through a set of comparisons against well established geometry based layout construction techniques. More precisely, we employ four distinct metrics to quantitatively measure the quality of layouts produced by two distinct classes of techniques, namely: overlap removal methods
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
Orthogonal Ordering (O): measures the number of changes in both horizontal and vertical order of geometric entities [39]. More precisely, positions are sorted in increasing horizontal and vertical order and the number of inversions in the lists provides the quality measure. In mathematical terms: O=
∑
(t)
∑ invs (i, j),
(9)
s∈{x,y} i< j
better
Ar ea Usage ( A)
0,3 0,2
0
Area Usage (A): let r be the longer diagonal of the bounding box of the layout produced by a given method and Ai the area of the i-th box in the layout. The area usage metric is defined similarly as [40]: A=
2πr2 ∑i Ai
.
(11)
This metric gauges layout compactness and values close to π/2 are better. Orthogonal Alignment (L): let {z1 , z2 , . . . , zr } be points in Z × Z discretizing the interior of a disk, as illustrated in the inline figure. More14 15 10 over, let bi be the center of a square 16 2 5 13 6 in the layout and bi j , j = 1, . . . , k be 9 1 the center of the squares in the k- 11 3 nearest neighbor of bi . The orthogonal 17 7 4 8 20 alignment metric is given by: 12 18 19 < zs , bi j − bi > 1 n L= (12) ∑ ∑ max kzs kkbi j − bi k , s nk i=1 bi j where n is the number of geometric objects in the layout and < ·, · > accounts for the dot product. This metric gauges how much a layout deviates from a regular grid, the closer to 1 the better. Metrics above have been chosen because they measure important properties a layout must hold, namely, similarity preserving, compactness, and orthogonal alignment.
IsoMatch
Our Approach
Incboard
SSM
IsoMatch
Our Approach
0,7 0,6 0,5 0,4 0,3
0,2 0,1 0
IncBoard
SSM
IsoMatch
Our Approach
1
0,9 0,8
better
Neighborhood Preservation (K): computes the average percentage of the k-nearest neighbors of the initial configuration that are preserved in the final layouts [35]. More specifically, for each geometric entity we verify the percentage of its neighbors in the final layout that were also neighbors in the initial layout before optimization. Values close to 1 indicates 100% of preservation.
SSM
0,8
1, 0, otherwise
(10) where i, j are indices of the sorted lists before (t − 1) and (t) after (t) layout arrangement. invy (i, j) is defined analo(t) gously to invx (i, j). Small values indicate better behavior.
Incboard
0,9
Neighborhood Preservation (K)
(t)
invx (i, j) =
(t) (t) (t−1) (t−1) if(xi − x j )(xi −xj ) < 0
0,4
0,1
0,7 0,6 0,5 0,4
10%
20%
30%
40%
50%
Percentage of Neighbors Incboard
SSM
IsoMatch
Our Approach
1,0045 1,0040 1,0035
1,0030
better
(
0,5
better
Metrics and Data Sets
0,6
Orthogonal Alignment ( L)
4.1
0,7
Orthogonal Order ing ( O)
and visual board techniques. Before presenting the comparison, we briefly describe the metrics as well as the data sets used in our tests.
7
1,0025 1,0020
1,0015 1,0010 1,0005
1,00
10%
20%
30%
40%
50%
Percentage of Neighbors
Fig. 6. Comparison with visual board techniques for the last class (taken from Fig. 8).
Data sets and Test Settings We divided the experiments in four groups: the first three groups of tests assess overlap removal methods while the last group evaluates the performance of our optimization scheme against visual board techniques. In the first test we randomly draw points inside a rectangular two-dimensional domain, which represents the display area. The rectangle is discretized as a regular grid with dimensions 1600 × 1200. Cells of the grid that do not contain any point are discarded. The remaining cells define the boxes to be arranged in the layout. As no overlap exists at this point, we enforce the cells to overlap by uniformly increasing their area in 10%. Forcing overlap is mandatory since most overlap removal methods can not start up from an overlap free arrangement. Notice that this is not the case for our approach, which also operates on overlap free layouts in order to optimize area usage and
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
DT2
DT3
DT4
DT5
2
3
PRISM
VPSC
RWordle-C
MIOLA
ProjSnippet
Our
Original
DT1
8
Fig. 7. Layouts produced by our approach, ProjSnippet, MIOLA, RWordle-C, VPSC and PRISM in 5 datasets.
structure. Fifteen configurations, with fifty random points each, make up the first group of tests (called class 1 in
Figure 5). The second experiment is similar to the first one, except we use a three level adaptive grid to generate the
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
4.2
Comparisons
The metrics describe above are used to assess the effectiveness of our approach when compared against well known overlap removal techniques. Precisely, VPSC [28], PRISM [24], RWordle-C [10], MIOLA [33], and ProjSnippet [32] are used as basis for comparisons. These methods have been chosen either because they are widely used or due to their good performance reported in the literature. We also compare our approach against three visual board techniques, namely, Incboard [6], SSM [8] and IsoMatch [41]. Figure 5 shows quantitative results obtained from the metrics described above. Notice that our approach clearly outperforms most techniques as to area usage (top row) and orthogonal alignment (bottom row right). Regarding orthogonal ordering (middle row), our approach turns out to be competitive, performing better for class 1. In terms of neighborhood preservation (bottom row left), our approach also performed well, being surpassed only by ProjSnippet and PRISM, which are known to preserve neighborhoods well. Figure 6 brings quantitative comparison against visual board techniques. Notice that the proposed method outperforms IncBoard and behaves nicely even when compared to SSM and IsoMatch, which have optimal area usage (notice the vanish box plots) and cell alignment. Figure 7 and 8 depict qualitative results comparing our approach with overlap removal and visual board techniques, respectively. Notice that the proposed method gives rise to well structured layouts where neighborhoods (indicated by color map) are nicely preserved. Moreover, our approach makes a better use of display area, thus improving readability and content analysis.
25 Boxes
36 Boxes
49 Boxes
SSM
IncBoard
Our
Original
16 Boxes
IsoMatch
geometric primitives (class 2 in Figure 5). In the third experiment (class 3 in Figure 5) we use real data set containing high-dimensional instances which are mapped to the visual space using the LSP multidimensional projection scheme. The layout is built from the projected points using a three level adaptive grid. Five data sets, taken from a public repository (available at http://infoserver.lcad.icmc.usp.br/infovis2/DataSets), have been employed in the third experiment. One data set contains a collection of snippets retrieved from a web search using "Scientific Visualization" as query expression; two data sets contain news from BBC, CNN, and REUTERS; the fourth data set corresponds to abstracts from IEEE InfoVis 2004 conference; and the last data set contains metadata from a scientific paper collection. Bag-of-words were extracted from textual content and used to define the similarity between instances. Those distances are used as input to the LSP projection method [35], which maps instances to the visual space. Finally, in the fourth group of tests we have randomly generated squares in a 2D square domain, as depicted in Figure 8. Those layouts are used as input to existing visual board techniques.
9
Fig. 8. Qualitative comparison of the proposed optimization scheme against visual board techniques Incboard [6], SSM [8] when taking the best result from SSM w.r.t. their objective function after 1000 executions and IsoMatch [41] .
5
A PPLICATIONS
In this section we present three different applications of our technique, namely, image gallery construction, textual documents analysis, and video data set visualization. In those applications, effective data exploration is enabled if requirements such as object size, semantic and overlapfree arrangements, scalability, and optimal area usage are combined in an optimal manner, showing the relevance of our methodology to applications as the ones presented in this section. The image gallery application, depicted in Figure 9, aims at visualizing image data sets such that highly similar images, which tend to be project close to each other in the visual space, are summarized in large icons, while images with discriminative feature are represented in small icons. To build the image gallery we extract 96 features from color attribute. More specifically, each image is split in 16 nonoverlapping regular regions covering the image from which the first and second statistical moments for each R,G,B channel is extracted. Feature vectors representing the images are projected in the visual space using the Least Square Projection (LSP) [35] method. A three level adaptive grid is constructed and the proposed layout optimization scheme is triggered in order to arrange the cells in a structured overlap free layout. Large grid cells containing several similar images give rise to iconic representations that summarizes the underlying images. Such iconic representation is built
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
10
(a)
(b)
(c)
deros pier
alto south
palo head hilton
bier
william fishkin
assess
evid
link enhanc spatial
cartograph
layout
order maryland
market
select
coordin high
springer-verlag
cleveland
cone
coordin
inselberg mcdonald
layout
social balanc
andamp
uncertainti
planar
ratio note
buja
parallel adjust
fishey
disc
distort edg
layer embed ead transit cluster feng straight-lin
convex
network automat
feng
tamassia
plane
symp browser
berlin
force-direct
multilevel
cell
sketch note
hyperbol
cluster anneal
munzner
direct
space
abello
korfhag
librari
ead
algebra
koren
simul undirect
optim
multilevel
plot specif
rel
statist
multivari
gene
multivari
node statist
layout note
tamassia gd ead
planar lnc cluster di symp
central multilevel
m ult iv ar i
coordin geograph
robertstart
layout
n-dimension
log
cluster clutter set
tolli
preserv
force-direct
suggest
maceachren
network
map
c olo r
parallel
product
m as t er
vir t ual
geograph cartographi
geos c ie nc
weizmann
expert
placement
facilit
cartograph shape cartogram maceachren gi cartographi statist tempor spatial
associ
java
cartographi problem aesthet tempor
exploratori howard
nice lnc ac
harel
christensen
vir t ual
cartographi
cartogram
anneal
method
park
shneiderman abstract
hierarchi
pixel
chart
hierarch
convei
comp
uncertainti
munzner
xmdvtool parallel reduct
layout
abello
fluid
flow
alex
eick
traffic
network massiv
hierarchi
pang
vrml
arg
ward
multivari
cluster
hyperbol mgv navig will
octob
scott
wireless
parallel
traffic
mbone
dimension
dimens
non-linear apperlei
dept hierarch sugiyama louisiana
rundenstein
uncertainti
statist
context
brush bound olston
small
multidimension
holmquist
magnif
hierarchi
write
hierarchi
keim radial
multidimension
focu fishey chi uist
hyperbol
bar
coordin rundenstein
hierarchi
furna
edward
ward
structure-bas
björk spenc
keahei
richard
layout
space-fil
francisco
sarkar
zoom
allen
joe
creat
ead
detail
uist
wong
comprehens
nonlinear robertson
boston
reach
stasko
zoom
fishey
fracchia mental
fast anneal multi-scal
19th
knowledg
m in e
product mark
text
multidimension autom geograph
silicon
construct
discoveri sun collabor geograph area maceachren java geoscienc htm
map
syst nonlinear
mcgraw-hil
f r eem an
label vrml placement map
maceachren
(b)
hawaii
halasz
memori diagram louisiana notecard myer alto palo myer area
nest
disk
memori
internet acm-siam editor
roth
secur
san
kriegel
c onklin
iii
space jock
amsterdam autom goldstein
wattenberg
tree-map rectangl
huge
chapel
write
pier transpar
toi
ca
moran
stephen jeff
jai mendelzon conklin
bergeron
usag
label
der
spatial remot
paradigm
hypermedia problem
adapt ny architectur
gi
name
geograph
scale
linear imag percept
perform suggest author introduc colour
bradmatti
netherland
hierarch
space-fil stock turo
wijk
brushga
adjust
ibm
marc
medic
map
immers
form
chart
hypertext code understand
tamassia
cartograph
simul
tran
shape
project sphere siggraph multidimension
orlean
multicast jeffrei
cartograph metaphor hypertext
virtual
implicit
color desir
similar
curv
surfac
found
observ grai
celebr interdepend
demonstr
sort
geograph sens cartographi
optim
proxim circuit book
davi
educ kei
method
gross
carolina
springer-verlag network
son standard wilei john alex java scale placement gener
kamada
transform
navig
realiti
web
sedgewick control
benford alamito lo octob
multimedia devic allow upper
w eb-bas
featur
multipl simul
dan manipul
individu
lorensen
van
numer increment significantli
normal
toi
brad
bobrow
vitter extern
site wavelet multiresolut discoveri
taylor
blinn
contour
mass carolina trigg hill
williamson antonio control stone texa write input buxton
ben
fishey
dialogu
sigir card
boston
editor
dynam log spie
video
plan
wavelet
chapel trigg
hypertext hypermedia
27-mai
ca
jan
c
treemap colleg
discret
isbn
imag
virtual
w ork
cartogram
tempor
jr
oper
variabl psycholog
specif
error
dynam
domain
evalu mark
effici
time effect afip
subject multidimension
rep sept
jon
inf
current
area
kei
detect
web
fluid stasko glasgow
site simul
lnc
basic molecular
high-dimension
c
text
evalu path tango
pattern measur
field
shapepitkow
shade
rai
volum
object-ori
visualis
pitkow
world plant pak
distanc witkingrammar scale lorensen segment psychometrikaann content siggraph editor arrang
cach splat trace imag curv
shape
vision
test
nelson
hypermedia
stuart georg document web mine
tour
marshal hypertext
mackinlai
modular
optim herman
scalar volum
max
render
surfac
implicit utah
recognit
segment
web
stream video
document hypermedia
robert
sugiyama
notecard implicit
gaithersburg
represent manag
rotat
william
perform
case
surfac
pilot
rapid
index point
mine
rule toc
concurr
handbook
catherin frank
man-machin
shneiderman url
notecard
20th august
frank
path toi hypertext halasz
manual
halasz misu
cognit
cartogram continu
target
multimedia
space match
steven object-ori
lo gershon pak sequenti
concept
electron
featur
healei
psychol
winter
van
decemb
brad
evolut
visag
workspac
math primit
facil weight chi
group execut oper imag parallel
whitnei alamito
hendlei
gd
wong
similar textur
rai cube
geometr scene
siggraph render
feng
lnc
problem
ftp
tabular
optim
visibl textur
concurr
timelin
curv solid
color vari
trace densiti
salesin
field
reflect scalar
textur
depth
layer imag
juli
aug
direct
pat
produc
hanrahan
reiss
imag
witkin
log
surfac shade blinn
engin
reliabl distribut order
layout
planar
programm
protocol
straight-lin
ac cluster anneal force-direct
transform
close
koren
harel
distanc roth sequenc
nearest neighbor conf evolutionari
shneiderman
human-comput meet
alto
author
springer-verlag
m ult iv ar i
int conf agraw
vi
parallel
assesslink maceachren
similar
search
book
bier
see-through transpar
north randal
ken
eric
handbook
orthogon
gi
s t at is t
nam e
statist
geograph
light raiimag
dimension
juli nov
featur
camera
render
scatter siggraph
densiti
biologi
maryland
menu
eric ken
maureenstone toni
exploratori
geoscienc
cartograph
illustr shape henri percept levoi andrew
molecular
item
menu
buxton
robertson
real-tim
object
bell
on-lin
relationship
intellig
percept tutori search 22nd accur
boundari session
properti
c ons t r uc t
sequenc
tech
satisfact respons lab
product prec syst perform
output
kurtenbach buxton
w ashington
planar
java
plot
coordin
j ava
sun
collabor
problem
area discoveri
geograph
cartogram maceachren
dimension
constraint dam av
construct
christensen
cartogram
field
ead layer
scott
primit michael dimension placement
rai
brow ser nice simul
leblanc
appr oxim
cartograph
symp
tolli
cartographi
label
color
abello librari
class
barri
parallel parallel coordin multivari
transit network
korfhag cluster layout
vision
render
surfacvolumline
object-ori
tog
light
brush
virginia detect
ead tamassia
m et aphor
geogr aph
hyper m edia
bar
multivari uncertainti n-dimension rundenstein ward
categor
nasa time file
layout orthogon
map
hypertext
lnc
cartographi
cartograph placement
geograph label spatial
di
paradigm
hyperbol
in c r em ent
parallel
sigop simul
hala s z
27-mai
statist
distort
hierarchi
brush
ward keim space-fil radial rundenstein
discnode
hierarchi
turo
fracchia
fishey
non-linear
secur
structure-bas space-fil
hypertext
keahei
holmquist
magnif context
stock wijk
order shneiderman hierarch
björk focu
distribut ga
textur
softw
monitor execut
function curv trace
navig
uncertainti
munzner
kei
multidimension
transpar
jame
eric
palo
author
nest
fishey mental
robertson nonlinear
ken
control
path
toi
marshal
myer reach
zoom
understand
comprehens
planar
circuitpr
oxim distanc
arrang scale psychometrika ann
wilei
sketch robert
son
architectur
web-bas
dan
imag linear observ percept
debugg madison
market
uncertainti
abello
network
cluster
debug event
wattenberg
inselberg
hyperbol
distribut
netherland
wong detail
unger
sigplannotic
louisiana chart
treemap
kenneth
set seri
va
search
binari
oper
lamport
parallel
metric
languag
calgari languag
im plic it
demonstr
brad
mgv flow coordin fluid
jeffrei
parallel
texa
catherin
andri widget
snibb
ablex
translat
detect enn
chung
mackinlai
joe
toi
trigg
stuart
prec
transpar
lab
see-through
hawaii
specif
sigir
web
multimedia
disk dynam
multicast
extern
benford
arg
stasko fluid
pitkow
video
plant
web
pak
gershon
evalu path
web
georg
antonio iii frank
hypertext hypermedia amsterdam
xmdvtool
adapt
multipl
optim
medic
scale
perform
surfac
imag
trace
kurtenbach
int
realiti
w orld simulstandard
curv
grossshape
hil
c ar olin a
robertson
sort mbone
memori
chapel
notecard hypermedia
brad bobrow
hypertext
chapelhalasz
card
vitter
s pat ia l
neighbor
microsoft
transform
planar
project sphere
document
hypertext
ken
maureen
mackinlai
mine site
virtual multiresolut
video w itkin lorensen modular
normal
color grai
global
toc
group
textur
wavelet shape
segment
reliabl
rotat
web
stream
engin
close
distribut imag curv
case
rai
shade
reconstruct nearest
file sequenc
pat
siggraph hanrahan light
alamito
va
render
imag
protocol
visibl trace cach
imag
percept
scalar aug volum
nelson
reiss
wavelet
menu
bier
notecard hypermedia
cognit
sigmod
distanc roth
imag
alto
buxton stone
pier
conf faloutso document
grammar
surfac
manual
shneiderman
searchpoint
mine rule
eric
bier
alto
dialogu
pilot
t r ans f or m
featur
imag c
parallel
render
light
curv
densiti juli
render
surfac line
distribut languag
concurr
plan
perform
distribut
riedl
notic
calgari
event
debugg
monitor sigplan
textur
ln c
biologi
textur
splat
acid
detect
biolog
uist
arc
nov
spreadsheet detect
oper
maryland
human-comput buxton
dynam item
molecular evolut
cell chi
spreadsheet microsoft
parallel
debug
illustr
distancchi
time class
financi
concurr
surfac
visag
categor
nasa
distribut
cell
transpar
sequenc
virginia
motif
brown depend
shape
conf
similar
search
genombiologi
manipul
int
high-dimension
spreadsheet
mechan
index
multimedia
percept
protein
sequenc subsequ
acm-siam
neighbor log
target
healei tutori search
robbin
nearest
buxton
syst
time
product
approxim
bentlei
22nd
andri daniel siggraph
molecular
satisfact
respons afip
neighbor
handbook
perform
friedman
search
simul
hierarchi
inf
search file menu
winter
detect
index
constraint
zeleznik
robbin
detect riedl
toolkit
multi-scal fast
healei
binari
bentlei
rang jon
search detect simul featur
rep rang syst loui
estim vision
psychol
preattentenn
widgethugh
toolkitdam preattent
daniel
public long similar
distribut chi
distanc
languag
zeleznik simul re toolkit hugh preattent
pat card
spreadsheet
(a)
siggraph
ac orthogon document match
hci
mackinlai
financi tabl start
global
cell
oct
manipul
point faloutso
similar
storag
cellarc spreadsheet select microsoft
sigmod
search object
region constrain
metric
conf roth
manag
point
acm-siam
depend
microsoft
rang filecolor
theori
nov mechan
properti
effici
neighbor
search log
binari
nucleic
brown uist fluid
bibliographi
dynam
approxim
reconstruct
search
henikoff long
rang
kolojejchick
loui
import
molecular
genomstring biologi
charact
search
syst
tabl
amino
sequenc
boundari
binari
bentlei
neighbor friedman match lee stanford
simul orient
int index
spreadsheet motif acid
problem
estim
featur search numer
multimedia
subsequ
bentlei
nearest
jon multidimension
align
biolog similar
protein pattern
target attent
psychol preattent
loui
vision
enn re
healei wolf
calif
color
pr oposo vic h yankel
Fig. 9. Image gallery application: (a) an adaptive grid is built from projected images; (b) an initial iconic representation is generated; (c) the layout is optimize to make an optimal use of area while preserving the neighborhood of the grid cells.
battista sugiyama
(c)
Fig. 10. Visualizing a dataset containing 515 documents using word clouds.(a) each document is represented by a point embedded in 2D space where adaptive grid is imposed, (b) a word cloud based on term-frequency of documents contained in each cell of grid is built.(c) shows the optimized layout using the proposed method. Notice the size increasing that allow us to increase the number of words by each cell (colored in lightgray).
by blending a randomly chosen subset of the underlying images using five levels of Laplacian pyramid [42]. In the
example shown in Figure 9, cells containing less then four images are represented by a single representative image,
11
Our last application regards video visualization, as illustrated in Figure 11. We build a collection of 300 videos from Youtube querying six distinct topics, namely, linux, civil war, fifa world cup, hawk, guitar and information visualization. Textual information associated to each video is processed to generate a feature vector that represents the video in a high-dimensional space. The proposed methodology with three level grid refinement is employed to build a layout where larger cells are textured as word clouds while the cells in the lowest refinement level (smaller cells) are textured with a snippet build from a screenshot of a randomly chosen video contained in the cell as well as textual information containing title, description and URL of the video. Word clouds are generated using keywords extracted from the text associated to the video. As in the previous application, font sizes and priority-based opacity are used to highlight the importance of each keyword, which are ranked as in [11]. Notice how our approach was able to semantically organize the layout according to the distinct topics, keeping videos related to “guitar” grouped on the left part of the layout, “linux” and “information visualization” on the top right, “hawk” in the center, “fifa world cup” on the bottom right, and ”civil war” centered on the bottom. Interestingly, the smaller cells in the layout contain mainly videos about civil war and music videos related to the song “Civil War” by Guns N’Roses. Moreover, besides being well structured and semantically organized, the layout resulting from our methodology is compact and easy to read due to its effective use of display area. One interesting fact of all generated layouts is the high level of interactivity for the exploration task i.e. users are allowed to choose a cell summarizing some content (from all levels except the last one) and then further explore that cell in more detail by recursively applying the whole layout construction process only within that cell.
Indian Warriors: The Untold S tory of the... Indian Warriors: The Untold S tory of the Civil War (AMAZING HIS TOR Y DOCUME NTAR Y)...
megan chief dis cover y
rufus
CIVIL WAR / MAR VE L / Andrés Navy E n este video hablaremos sobre CIVIL WAR ! Uno de los acontecimientos más importa...
reconstruction
john channel
amazing
fussball
video
coming
ole
en
mundial
partgoals world war
copa
civil
página quality
sobre
https commentary english más
user
world
events
puppy
based
video
learn
operating free
show
linux os
system complete
top
features
juego
check online
gameplay ea
fifabrazil
world hdfull
team copa
la
final
la
en
song
editor
bit
music official
brazil
world
jugando online
marvel
ola
el
part gettysburg live battlefield youtube
amigos
ly
original
em
history
amazing
face
battle
play
war roses
enjoy
hd
el
dejo
gunsvideo
school
tutorial command version historywindows
fifa goals cup top este
este
syria
videomix
reality
ubuntugnu
install
La Guerra Civil Marvel - Civil War #1 Pa... Nuestra página web : http://goo.gl/QLQL3c CIVIL WAR !!!!!! Uno de los eventos más...
goo gl
uno
play
guitar
american full
july brain
interactive
game series
espn
joe
shocking
Tales of the Gun - E pisode 17: Guns of t... Tales of the Gun - E pisode 17: Guns of the Civil War (History Documentary) Throu...
linguistic
individual
gaming
skater
buenos
computer
science
cup
twitter
images brazil por cup el
Footage of Civil War Veterans at 50yr An... Amazing historical movie footage from 1913 and 1938 of the 50th and 75th anniver...
tony twitter songs
les
skills pickett union targets
primera
gameplay cup fifa
air
july player unio
hawk vez world entop
los
civil lured
The E nglish Civil War Cromwell YouTub... THE GR E AT E NGLIS H R E VOLUTION.
group
tutorials computer mint file systems visit
road
free goo
gl
Guns N' R oses - Civil War (Music Video) Music Video For Guns N' R oses Civil War With 'Breakdown' Intro.
civil
hostile
street guitar war
Civil War - S lash & Myles Kennedy - R are... Civil War R are Acoustic Version S lash & Myles Kennedy @ the MAX S essions 2010 My...
https
argentina germany du
strategy
rednex indy
joined
facebook full tony
skate
park
backyard
Canister S hot From Civil War Cannon.wmv We fired 17 rounds of Canister from an original Civil War Napoleon Field Cannon ...
documentary
performer
mind open
young
How to S witch Operating S ystems: Linux, ... I Need Your Help: http://vid.io/x6R (Please S upport) http://live.pirillo.com - A...
moments fifabit ly
hd
usman
spirit
brutalmoose
facebook
facebook virtualbox
heavyclashes music
picks
Guns N' R oses - Civil War [Live in Paris... Live at Hippodrome de Vincennes, Paris, France, June 6, 1992. Full Concert: http...
cam action official
evan
conan album luca guitarist
Metal Detecting: Civil War R elics https://www.youtube.com/user/aquachigger This is a video I shot of a hunt in a m...
part
top
red operating free
open
video visit hawk team
goals
skateboarding
successful
data visual information work online
brasil
greg
full
fifa
keef
world
star
music
shep
distro
building
music hd riley
edit
system
hawk tony
Wake Up Call 2014: Obama DHS Prepares Fo... Civil unrest coming to America. Make this viral and get the word out! OR IGINAL V...
rednexmusic
watching
itunes
harris words
university fellow
link research
Libyan Loop: Back to brink of civil war ... Libya is on the brink of plunging into yet another bloody civil conflict - that'...
day
harrelson hypertree
trapping
vital
chase social life evensen bull
snake official
subscribe
del
linux
official full song video er
rileyhttps
classical
bmx
footage
profit
album
story reed
watchmojo
counting
topriaz
fellowapple
app ist
download
S yrian Civil War 2013 - Battlefield S yri... S yria Civil War 2013 - Battlefield S yria: Heavy Fighting Clashes And Firefights....
catherine
Guns n R oses Civil war live (Voodoo Chil... stop posting all this "you say jonas brothers I say blabla" shit....thank you. G...
ch
roll
meets flying
check
riffs ted guitar https
American Civil War Documentary - R ally A... This is the first DVD of a 3 DVD documentary box set on the American Civil War. ...
Civil War Combat - Battle of S hiloh - Pa... A documentary of History Channel about the Battle of S hiloh on April 6-7 1862 in...
dawgs
van
today
amazing
confederate
twitter
guitar
time
black
meditationsongs
swirling
yellow
videos
mind
park
ve yearsda
note
eruption chase
le
knowledge korea
contemporary
mundo link
Warriors (Imagine Dragons guitar cover) Cover of the new World Championship theme song by Imagine Dragons S hitty recordi...
The Civil War in Four Minutes: The Battl... Historian Garry Adelman describes the events that took place during the Battle o...
https hd
creepy fails speeds changed mission braveheartdobsoncompilation https
mini caught
piano
firefights
hd
facebook
spanish facebook enjoy youtube
tabs original jung korea
singapore charged
north
playing
sungha guitar video
hope
twitter brain hawk topic
play
gettysburg
time
gig
bit
shopping 2ne1 ewan
skater
red-tailed hawk tony facebook
hits
handshake big
black
awesome youtube
world
fix
feat
hacks
videoimagine
hell
rude
enjoy
special
lesson
challenge
ve
life
mods
dragons coldplay
top
easy
songs awesome
guitar
fingerstyle
learn make
song play
video
note sound playlist instrumental production
monroe
Causes of the Civil War - Popular sovere... 55 From The American Civil War Volume 1 hosted by Dale R eed 1793: Invention of t...
video bmx
hawks
American Civil War Timeline 1861-1865 http://storiesofusa.com/american-civil-wartimeline-battlefields-1854-1865/ - Am...
guitar full house austin produced intro episode
back
candyrat
magic
spastic
cover
relax
performace
ly jung
boy
hawk apple chadwick download alex
acoustic
relaxing music part
field
music
directed
song bit
rubberbandits
children
sungha
album riffs itunesem
making
i-hawk
https
qua
tony roll
guitar
m-tp
monologue
rock
original history
Avengers 2 S POILE R S - Marvel Civil War t... Today, Marvel Phase 3 predictions on how Civil War could be coming soon to the M...
Civil War Veterans on Film, E CHOE S OF TH... Civil War Veterans on film, from the documentary E CHOE S OF THE BLUE & GR AY. Prod...
The Civil War: Gettysburg On July 3, 1863 in Gettysburg, PA the Union lured the Confederate troops out int...
back
wimbledon
Guns N' R oses-Civil War w/Lyrics "What we've got here is failure to communicate. S ome men you just can't reach......
tide
built
documentary
soulja
cover
DE ATH AND THE CIVIL WAR (AMAZING AME R ICA... DE ATH AND THE CIVIL WAR (AMAZING AME R ICAN HIS TOR Y DOCUME NTAR Y) With the coming o...
chief
tatyana part
revolution
Civil War Journal - The Gray Ghost: John... Leading a cavalry of 800 men without interference from ranking Confederacy offic...
facebook full
Civil War Journal - Pickett's Charge Pickett's charge at Gettysburg would be rebuffed, leading to a major defeat for ...
skills
ryzhkova gattar
click di
put
ultimate
song
easy
kid
wyclefgiuliani enjoy performs child boy
long
beautiful
hetfield
prank made guitar ak47
create
room rising center
youtube
james
enjoy
zen
piano
mind
Football Focus - FIFA World Cup 2014 This weeks Football Focus is 'focused' on upcoming FIFA World Cup 2014. Dan Walk...
software
design
maryland
starlight
S tar Wars: Battlefront 2 - Galactic Conq... This is an episode of my S tar Wars Battlefront 2 Galactic Conquest series. This ...
meditation
visualization final info
Battleplan American Civil War Campaign L... Let's Play Battleplan: American Civil War, a PC and mobile real time strategy ga...
tool
R are Civil War Postmortem Photographs R are post-mortem photographs taken during the Civil War. E xcludes images taken d...
life
interactive information visualization inf ovis
flip
ca
editor
university data
movie
hay hours
background
collect ion
finder
các hd
time
sound cách ch ng nh
guitar music
interaction
work
relaxing bài
và
video system
bigplaisant based
therapy
day
therapyguitar
lonely
The second application concerns textual document visualization. Following the conventional tf-idf bag-of-words construction [44] and stemming [45], one can yield feature vectors from which our methodology can straightly be applied. In the application illustrated in Figure 10, we visualize a collection of 515 related papers published in the IEEE VisWeek Conference 2004. Each cell is textured with a word cloud built from keywords extracted from the documents contained in the cell while font size and opacity is used to highlight the most relevant keywords whose ranking is computed using the method described in [11]. Notice that the layout is easy to read, mainly due to the effective use of display area. Moreover, it is not difficult to note that the grid-like structure makes the visual identification of similar documents an easier task.
instrumental
green
cells containing four to eight images gives rise to a four blended image icon, and icon with nine blended images is used to represent cells with nine or more images. Images used in this application were obtained from [43].
park
mam epi jsin t ig shnei out orht sar ode yer vel d h es
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
mundo
simulation coupe
sports
video final official belgium
Fig. 11. Visualizing 300 videos extracted from Youtube querying 6 different topics.
6
D ISCUSSION
AND
L IMITATIONS
Quantitative and qualitative comparisons presented in Section 4 attest the quality of our layout arrangement mechanism. In fact, our approach clearly outperforms most existing techniques with respect to properties such as area usage, orthogonal ordering and layout alignment, in addition to perform well as to neighborhood preservation. Therefore, our approach is able to handle several requirement simultaneously, making it an attractive alternative for several applications. The applications described in Section 5 show the versatility of our approach in scenarios ranging from image gallery construction to word cloud based text visualization. Moreover, the layouts resulting from our methodology turned out to be clean, well structured, and easy to read. Another interesting aspect of our approach is that the only parameter to be set is the number of refinement levels in the adaptive grid. In our tests we use three levels of refinement and we notice that the layout is not so easy to read when more than five levels are used. However, the appropriate number of refinement levels depends on the underlying application and the size of screen. A weakness of our methodology is that dense and tight data cluster may be cut into multiple cells during the grid construction. This problem can be mitigated through density-aware grid generation, an improvement to be incorporated in our methodology. The quality of the semantic relation among grid cells also depends on the effectiveness of the multidimensional projection scheme in preserving neighborhoods during the mapping process. Visually encoding uncertainties as to neighborhood relation can help users during data exploration. However, this is an aspect that has not been properly addressed even in the context of multidimensional projection methods. From our experimental analysis we also notice that it
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
is not necessary to run the optimization procedure until convergence. In other words, good layouts are obtained after a few hundred iteration steps, being unnecessary to wait until a local minimum is reached. This fact makes the proposed methodology also attractive in terms of computational times. For instance, the layouts presented in Section 5 took less than six seconds to be produced. However, the algorithm can take a few minutes to fully converge if a local minimum is sought. To numerically illustrate this fact, we show in Fig. 12 the computational time spent by our methodology to reach a certain number of iterations versus the ratio between the resulting and global minimization energies Et /Eg for each layout from Fig. 7. Note from the y-axis of the graph that how close to 1 the energy rates are, even for the layouts produced with a reduced number of iteration steps. DT1
DT2
DT3
DT4
DT5
[2] [3] [4] [5] [6] [7]
[8] [9] [10]
1,1000
Energy ratio (Et/Eg)
1,0800
[11]
1,0600 1,0400
1,0200
[12]
1,0000
[13]
0,9800 0,00
0,50
1,00
1,50
2,00
2,50
Iteration time (log, in sec.)
Fig. 12. Computational time versus optimization convergence rate for the layouts from Fig. 7.
[14]
[15]
7
C ONCLUSION
We introduced a novel methodology to build geometric layouts to visualize data. In contrast to existing techniques, the proposed method makes use of a novel optimization procedure that is able to handle several requirements simultaneously, yielding structured overlap-free arrangements while still ensuring a semantic relation among neighbor entities. Furthermore, the method makes optimal use of display area, rendering it an effective and flexible tool for applications varying from image gallery construction to video data set visualization. We are currently investigating interactive mechanisms to enable a free navigation throughout the layout as well as dynamic user-driven layout updates.
[16] [17] [18] [19]
[20]
[21]
ACKNOWLEDGMENTS The authors acknowledge the financial support from FAPESP (#2011/22749-8 #2013/00191-0 and #2014/16857-0) and CNPq (#302643/2013-3).
R EFERENCES [1]
K. Koh, B. Lee, B. Kim, and J. Seo, “Maniwordle: Providing flexible control over wordle,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 6, pp. 1190–1197, Nov. 2010.
[22] [23] [24]
12
F. V. Paulovich, F. M. B. Toledo, G. P. Telles, R. Minghim, and L. G. Nonato, “Semantic wordification of document collections,” Comp. Graph. Forum, vol. 31, no. 3pt3, pp. 1145–1153, Jun. 2012. J. Bertin, Graphics and graphic information-processing. de Gruyter (The University of Michigan), 2007. E. Tufte, Envisioning information, ser. Envisioning Information. Graphics Press, 1990, no. 914. S. Van den Elzen and J. J. van Wijk, “Small multiples, large singles: A new approach for visual data exploration,” Computer Graphics Forum, vol. 32, no. 3pt2, pp. 191–200, 2013. R. D. Pinho, M. C. Oliveira, and A. Andrade Lopes, “An incremental space to visualize dynamic data sets,” Multimedia Tools and Applications, vol. 50, no. 3, pp. 533–562, 2010. G. Strong and M. Gong, “Data organization and visualization using self-sorting map,” in Proceedings of Graphics Interface 2011, ser. GI’2011. Canadian Human-Computer Communications Society, 2011, pp. 199–206. ——, “Self-sorting map: An efficient algorithm for presenting multimedia data in structured layouts,” Multimedia, IEEE Transactions on, vol. 16, no. 4, pp. 1045–1058, June 2014. J.-D. Fekete and C. Plaisant, “Interactive information visualization of a million items,” in IEEE Symposium on Information Visualization (INFOVIS 2002). IEEE Computer Society, 2002, pp. 117–124. H. Strobelt, M. Spicker, A. Stoffel, D. Keim, and O. Deussen, “Rolled-out wordles: A heuristic method for overlap removal of 2d data representatives,” Computer Graphics Forum, vol. 31, no. 3pt3, pp. 1135–1144, 2012. F. V. Paulovich, F. Toledo, G. P. Telles, R. Minghim, and L. G. Nonato, “Semantic wordification of document collections,” in Computer Graphics Forum, vol. 31, no. 3pt3. Wiley-Blackwell Online Library, 2012, pp. 1145–1153. W. Javed, B. McDonnel, and N. Elmqvist, “Graphical perception of multiple time series,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 6, pp. 927–934, 2010. J. Kehrer, H. Piringer, W. Berger, and M. E. Groller, “A model for structure-based comparison of many categories in small-multiple displays,” IEEE Transactions on Visualization and Computer Graphics, vol. 19, no. 12, pp. 2287–2296, 2013. R. Blanch and E. Lecolinet, “Browsing zoomable treemaps: Structure-aware multi-scale navigation techniques,” IEEE Transactions on Visualization and Computer Graphics, vol. 13, no. 6, pp. 1248–1253, 2007. B. Johnson and B. Shneiderman, “Tree-maps: A space-filling approach to the visualization of hierarchical information structures,” in Proceedings of the 2Nd Conference on Visualization, ser. VIS’91. Los Alamitos, CA, USA: IEEE Computer Society Press, 1991, pp. 284–291. M. Bruls, K. Huizing, and J. J. Van Wijk, “Squarified treemaps,” in Data Visualization. Springer, 2000, pp. 33–42. B. Shneiderman and M. Wattenberg, “Ordered treemap layouts,” in IEEE Symposium on Information Visualization (INFOVIS 2001). IEEE Computer Society, 2001, pp. 73–73. B. Shneiderman, “Treemaps for space-constrained visualization of hierarchies,” 1998-2013. [Online]. Available: http://www.cs.umd. edu/hcil/treemap-history/ C. Tuttle, L. G. Nonato, and C. Silva, “Pedvis: A structured, spaceefficient technique for pedigree visualization,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 6, pp. 1063–1072, 2010. M. Balzer, O. Deussen, and C. Lewerentz, “Voronoi treemaps for the visualization of software metrics,” in Proceedings of the ACM Symposium on Software Visualization, ser. SoftVis’2005. New York, NY, USA: ACM, 2005, pp. 165–172. B. B. Bederson, B. Shneiderman, and M. Wattenberg, “Ordered and quantum treemaps: Making effective use of 2d space to display hierarchies,” ACM Transactions on Graphics (TOG), vol. 21, no. 4, pp. 833–854, 2002. J. Wood and J. Dykes, “Spatially ordered treemaps,” IEEE Transactions on Visualization and Computer Graphics, vol. 14, no. 6, pp. 1348–1355, 2008. A. Nocaj and U. Brandes, “Organizing search results with a reference map,” IEEE Transactions on Visualization and Computer Graphics, vol. 18, no. 12, pp. 2546–2555, 2012. E. R. Gansner and Y. Hu, “Graph drawing,” I. G. Tollis and M. Patrignani, Eds. Berlin, Heidelberg: Springer-Verlag, 2009, ch. Efficient Node Overlap Removal Using a Proximity Stress Model, pp. 206–217.
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
[25] X. Huang, W. Lai, A. Sajeev, and J. Gao, “A new algorithm for removing node overlapping in graph visualization,” Information Sciences, vol. 177, no. 14, pp. 2821–2844, 2007. [26] C.-C. Lin, H.-C. Yen, and J.-H. Chuang, “Drawing graphs with nonuniform nodes using potential fields,” in Journal of Visual Languages and Computing, vol. 20, no. 6. Springer, 2009, pp. 385–402. [27] D. Harel and Y. Koren, “Drawing graphs with non-uniform vertices,” in Proceedings of the Working Conference on Advanced Visual Interfaces. ACM, 2002, pp. 157–166. [28] T. Dwyer, K. Marriott, and P. J. Stuckey, “Fast node overlap removal,” in Graph Drawing. Springer, 2006, pp. 153–164. [29] K. Marriott, P. Stuckey, V. Tam, and W. He, “Removing node overlapping in graph layout using constrained optimization,” Constraints Journal, vol. 8, no. 2, pp. 143–171, 2003. [30] S. Kieffer, T. Dwyer, K. Marriott, and M. Wybrow, “Incremental grid-like layout using soft and hard constraints,” in Graph Drawing (Lectures Notes in Computer Science), 2013, pp. 448–459. [31] U. Ruegg, S. Kieffer, T. Dwyer, K. Marriott, and M. Wybrow, “Stress-minimizing orthogonal layout of data flow diagrams with ports,” in Graph Drawing, 2014, pp. 1–16. [32] E. Gomez-Nieto, F. S. Roman, P. Pagliosa, W. Casaca, E. S. Helou, M. C. F. de Oliveira, and L. G. Nonato, “Similarity preserving snippet-based visualization of web search results,” IEEE Transactions on Visualization and Computer Graphics, vol. 20, no. 3, pp. 457–470, 2014. [33] E. Gomez-Nieto, W. Casaca, L. G. Nonato, and G. Taubin, “Mixed integer optimization for layout arrangement,” in Proceedings of the 26th Conference on Graphics, Patterns and Images, ser. SIBGRAPI’13. Washington, DC, USA: IEEE Computer Society, 2013, pp. 115–122. [34] T. Dwyer and L. Nachmanson, “Fast edge-routing for large graphs,” in Graph Drawing. Springer, 2010, pp. 147–158. [35] F. V. Paulovich, L. G. Nonato, R. Minghim, and H. Levkowitz, “Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping,” IEEE Transactions on Visualization and Computer Graphics, vol. 14, no. 3, pp. 564–575, 2008. [36] J. Leung, T. Tam, C. Wong, G. Young, and F. Chin, “Packing squares into a square,” Journal of Parallel and Distributed Computing, vol. 10, no. 3, pp. 271–275, 1990. [37] K. Jansen and L. Pradel, How to Maximize the Total Area of Rectangles Packed into a Rectrangle? Inst. fur Informatik, CristianAlbrechts-Univ., Kiev, 2009. [38] J. Lee and S. Leyffer, Mixed Integer Nonlinear Programming (The IMA Volumes in Mathematics and its Applications). Springer, 2012. [39] K. Misue, P. Eades, W. Lai, and K. Sugiyama, “Layout adjustment and the mental map,” Journal of visual languages and computing, vol. 6, no. 2, pp. 183–210, 1995. [40] H. Samet, Foundations of Multidimensional and Metric Data Structures. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2005. [41] O. Fried, S. DiVerdi, M. Halber, E. Sizikova, and A. Finkelstein, “IsoMatch: Creating informative grid layouts,” Computer Graphics Forum (Proc. Eurographics), vol. 34, no. 2, May 2015. [42] M. N. Do and M. Vetterli, “Framing pyramids,” IEEE Transactions on Signal Processing, vol. 51, no. 9, pp. 2329–2342, 2003. [43] L. Fei-Fei, R. Fergus, and P. Perona, “Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories,” 2004. [44] G. Salton, “Developments in automatic text retrieval,” Science, vol. 253, no. 5023, pp. 974–980, 1991. [45] H. P. Luhn, “The automatic creation of literature abstracts,” IBM Journal of research and development, vol. 2, no. 2, pp. 159–165, 1958.
13
Erick Gomez-Nieto obtained a BSc degree (2009) in Informatics Engineering from San Pablo Catholic University - Arequipa, Peru. He holds a MSc degree (2012) in Computer Science from University of São Paulo - SP, Brazil. He was lecturer/researcher at Computer Science School of San Pablo Catholic University where he was awarded by his notable scientific research, in 2012/2013. Currently, he is working toward the PhD degree at University of São Paulo. His research interests are in Interactive Visualization, Visual Analytics and Geometry Processing.
Wallace Casaca obtained the B.Sc. and Master degree in Pure and Applied Mathematics from São Paulo State University (UNESP), Brazil, in 2008 and 2010, respectively. During 2010-2014, he pursued his Ph.D. in Computer Sciences and Applied Mathematics at University of São Paulo (USP), Brazil. As part of his doctoral studies, he also worked at visiting researcher at Brown University, School of Engineering, USA, and in cooperation with MIT, Media Lab, USA. His research interests include image segmentation, image restoration, image retrieval, information visualization, optimization techniques, partial differential equations and numerical analysis.
Danilo Motta received his BSc degree in Computer Science by the Federal University of Mato Grosso do Sul, in 2010, and his MSc degree in Computer Science and Computational Mathematics from University of São Paulo - SP, Brazil, in 2013. Currently he is a PhD student and member of the Visual and Geometry Processing Group - VGPG at ICMC-USP. His current interests include data processing and visualization, optimization and applied machine learning.
Ivar Hartmann is a Professor at FGV Law School in Rio de Janeiro, where he has taught Cyberlaw, Programming for Lawyers and Theory of Justice. He coordinates the "Supreme Court in Numbers" project, which employs computer science to empirical legal research in order to produce data on tens of millions of lawsuits from different Brazilian courts. The project team is composed of nearly a dozen researchers with diverse backgrounds such as Law, Computer Engineering and Statistics. Ivar’s own research and publications cover topics of Constitutional, Private, Enviromental, Procedural and Cyberlaw, as well as Empirical Legal Research. Ivar holds an MsC from the Catholic University of Rio Grande do Sul, Brazil and an LL.M. from Harvard Law School. He is an S.J.D. candidate at the State University of Rio de Janeiro.
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015
Gabriel Taubin earned a Licenciado en Ciencias Matematicas degree from the University of Buenos Aires, Argentina, and a Ph.D. degree in Electrical Engineering from Brown University. In 1990 he joined the IBM Research Division, where he held various positions, including Research Staff Member and Research Manager. In 2003 Taubin joined the Brown University School of Engineering as an Associate Professor of Engineering and Computer Science. During the 2000-2001 academic year, on sabbatical from IBM, he was Visiting Professor of Electrical Engineering at the California Institute of Technology. During the Spring semester of 2010, on sabbatical from Brown, he was Visiting Associate Professor of Media Arts and Sciences at MIT. Prof. Taubin was the Editor-in-Chief of the IEEE Computer Graphics and Applications Magazine from 2010 to 2013, and has served as a member of the Editorial Board of the Geometric Models journal, and as associate editor of the IEEE Transactions of Visualization and Computer Graphics. Prof. Taubin was named IEEE Fellow for His Contributions to the development of three-dimensional geometry compression technology and multimedia standards, won the Eurographics 2002 Günter Enderle Best Paper Award, and was named IBM Master Inventor. He has contributed to the field called Digital Geometry Processing with methods to capture 3D shape, for surface reconstruction, geometric modeling, geometry compression, progressive transmission, signal processing, and display of discrete surfaces. The 3D geometry compression technology that he have developed at IBM was incorporated into the MPEG-4 standard, and became an integral part of IBM products.
Luis Gustavo Nonato received the PhD degree in applied mathematics from the Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro - Brazil, in 1998. He is currently full professor at the Instituto de Ciências Matemáticas e de Computação (ICMC) - Universidade de São Paulo (USP) - Brazil. He spent a sabbatical leave in the Scientific Computing and Imaging Institute at the University of Utah from 2008 to 2010. Besides having served in several program committees, including IEEE Visualization and Eurovis, he was a member of the editorial board of Computer Graphics Forum and the president of the Special Committee on Computer Graphics and Image Processing of Brazilian Computer Society. Currently Dr. Nonato leads the Visual and Geometry Processing Group at ICMC-USP.
14