Dealing with Multiple Requirements in Geometric Arrangements

1 downloads 0 Views 19MB Size Report
A new optimization scheme that relies on a reduced number of ...... xto n eric manual transparalto handbook pier control ken stone toi input book. 20thca robert august ... 27-mai orlean creat stasko ..... Guns n Roses Civil war live (Voodoo Chil.
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

1

Dealing with Multiple Requirements in Geometric Arrangements Erick Gomez-Nieto, Student Member, IEEE, Wallace Casaca, Danilo Motta, Ivar Hartmann, Gabriel Taubin, Fellow, IEEE, and Luis Gustavo Nonato, Member, IEEE Abstract—Existing algorithms for building layouts from geometric primitives are typically designed to cope with requirements such as orthogonal alignment, overlap removal, optimal area usage, hierarchical organization, among others. However, most techniques are able to tackle just a few of those requirements simultaneously, impairing their use and flexibility. In this work we propose a novel methodology for building layouts from geometric primitives that concurrently addresses a wider range of requirements. Relying on multidimensional projection and mixed integer optimization, our approach arranges geometric objects in the visual space so as to generate well structured layouts that preserve the semantic relation among objects while still making an efficient use of display area. Moreover, scalability is handled through a hierarchical representation scheme combined with navigation tools. A comprehensive set of quantitative comparisons against existing geometry-based layouts and applications on text, image, and video data set visualization prove the effectiveness of our approach. Index Terms—Overlap Removal, Similarity Preserving, Structured Layouts, Area Optimization.

F

1

I NTRODUCTION

RRANGING geometric primitives such as boxes and discs in a two-dimensional layout is a nontrivial task that inherently appears in important visualization applications, as for example, word cloud construction [1], [2], small-multiples arrangements [3], [4], [5], and visual boards [6], [7], [8]. The difficulty in building layouts made up of dozens of geometric objects rests in the set of requirements to be handled simultaneously, e.g., readability, overlaps, object size, semantic proximity and area usage. Moreover, the number of data instances represented as geometric entities is typically much larger than the visualization area, demanding the use of clustering, hierarchies, and navigation resources to assist the visualization. Although significant advances have been made towards building meaningful layouts from geometric primitives, existing techniques are formulated to deal with a limited number of requirements simultaneously, restricting their use to specific applications. For instance, techniques such as visual boards and small multiples provide well structured layouts which are easily readable, but they pay the price of scalability. Hierarchical methods such as Treemaps [9] mitigate the issue of scalability while making an efficient use of display area. However, readability and semantic organization of data are aspects not so easily handled by those methods. Overlap-free semantic preserving techniques such

A

• E. Gomez-Nieto, W. Casaca, D. Motta and L.G. Nonato are with Instituto de Ciências Matemáticas e de Computação (ICMC), Universidade de São Paulo, São Carlos, Brazil. Email: [email protected], {wallace,daniloam,gnonato}@icmc.usp.br • I. Hartmann is with Universidade Estadual de Rio de Janeiro, Rio de Janeiro, Brazil. Email: [email protected] • G. Taubin is with Brown University, School of Engineering, Providence, USA. Email: [email protected]

as RWordles [10] and ProjCloud [11] generate somewhat structured layouts and keep instances with similar content close to each other. Nevertheless, they are not designed to make an efficient use of display area and also suffer from scalability. Handling many requirements is not straightforward because distinct requirements can compete with each other during layout construction. For instance, to facilitate readability, layouts should be built with as large as possible geometric entities. However, large objects easily fill up the display area, thus limiting the number of instances that can be visualized. Therefore, finding an optimal balance among multiple concurrent requirements is a challenging task, which has not been properly tackled by existing methods. In this work we present a novel methodology for building layouts from geometric primitives which is able to deal with a wide range of requirements simultaneously. Relying on multidimensional projection, density-based adaptive grids, and mixed integer optimization, our approach is semantically aware, makes an efficient use of display area, and generates well structured grid-like layouts. Moreover, the formulation intrinsically impose a hierarchy on the data, enabling alternatives for the scalability issue. The proposed optimization scheme arranges geometric entities (boxes) with varying sizes so as to avoid overlaps while preserving the neighborhood structure of the underlying data (semantics). The area of each geometric primitive is also included in the optimization process to ensure that the display area will be efficiently occupied. In fact, supported by the adaptive grid, our formulation is able to scale elements with different sizes using only one variable, thus rendering the optimization procedure as simple as possible. A comprehensive set of quantitative comparisons against existing geometry-based layouts shows the effectiveness of our approach. The usefulness of our

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

methodology in visualizing text, image, and video data sets is also confirmed in several practical applications. The main contributions of this work can be summarized as follows: • A novel methodology to generate layouts made up of geometric primitives which deals with multiple requirements such as grid-like structure, semantic relation among objects, display area usage, and overlap removal. The methodology naturally imposes an hierarchy on the visual representation in order to handle a large number of data instances. • A new optimization scheme that relies on a reduced number of unknowns to generate well structured layouts while making a good use of display area. • A comprehensive set of comparisons against existing methods and several practical applications to confirm the effectiveness and usefulness of our methodology. One of the positive aspects in managing multiple requirements during layout construction is that we can play with properties such as object size and neighborhood to highlight relevant portions of the layout without losing the semantic relation among objects. By handling additional requisites such as overlap-free and optimal usage of display area we can reduce visual clutter while improving readability. As far as we known, no other technique devoted to build layouts from geometric primitives is able to deal with so many concurrent requirements to generate meaningful layouts.

2

R ELATED W ORK

We focus the following discussion on techniques that rely on the arrangement of simple geometric objects such as boxes, discs, and polygons to visualize data information. We group existing methods in three main categories, structured arrangements, hierarchy-based, and nodedisplacement techniques. It is worth mentioning that our goal is just to provide an overview of the main classes of methods that make use of simple geometric objects to build visualizations, pointing out their strengths and weaknesses in order to better contextualize our contribution. There are a huge amount of work in the literature and a comprehensive survey about all those methods is beyond the scope of this paper. Structured arrangements comprise the set of techniques that tile the visual space with geometric primitives so as to produce a well structured layout. IncBoard [6], for instance, uses boxes or hexagons to represent highdimensional instances which are arranged as a regular grid in the visual space. The arrangement is done such that proximity among objects reflects similarity, that is, neighbor cells in the grid tend to represent similar instances of data high-dimensional data as grid according to so as to keep making an analogy with representation analogous to a chessboard where rows and columns determines cells to represent high dimensional data on 2D space. Self-Sorting Map [7], [8] is another approach that relies on regular

2

grids to perform visualization. Items are arranged based on a sorting mechanism that keeps similar instances close to each other by using a hierarchical swapping process which aims to maximize normalized cross correlation between the input data and the positions of a structured grid. Small multiples [3], [4] is another class of techniques that rely on structured arrangement of geometric primitives to build visualization layouts. Small multiples is typically used to create different views of a data set [5] or to enable the visual analysis of multiple complex data such as time series [12], [13]. One of the main advantages of structured layouts is the ease of interpretation of the resulting visualization. In fact, van den Elzen et al. [5] showed in a user study that visualizations made up of small multiples are more effective than other methods to visualize different views of the same data. A similar conclusion has been reached by Javed et al. [12] in the context of time series visual analytics. Despite the advantages, structured layouts do not scale easily, which impairs their use in applications involving large data sets. Hierarchy-based methods are designed to make an efficient use of display area while enabling dynamic navigation throughout different levels of a hierarchical representation [14]. Treemap [15] and its variants [16], [9], [17], [18] are examples of hierarchy-based techniques tailored to visualize data organized as a tree structure. The visualization is performed by recursively splitting the space in rectangular boxes whose size and orientation reflect the extent of nodes in the hierarchy. Another example is Pedvis [19], a technique that builds upon H-tree layout and rectangular boxes to depict pedigree information structured as deep hierarchies. In contrast to the hierarchical methods described above, Voronoi Treemap [20] uses polygonal objects rather than rectangular boxes towards better representing the importance of nodes in the hierarchy and further differentiate sibling and non-sibling nodes. Despite the efficiency in space occupation, most tree-base layouts are not devised to place similar instances close to each other, making them unsuitable for applications involving similarity-based data exploration. Aiming at addressing this issue, some authors [21], [22] have proposed ordering mechanisms that consistently arranges tree nodes according to some similarity measure. Nocaj and Brandes [23] combined multidimensional scaling, Voronoi Treemap, and a set of visualization resources to highlight the similarity among data instances while enabling navigation throughout the hierarchy. Hierarchy-based methods scale well and they make good use of display area. However, in contrast to structured arrangements, the layout resulting from those methods are not so easy to read as similar instances are not necessarily placed close to each other. Node-displacement techniques aim at arranging geometric objects in the visual space so as to preserve similarity relations while avoiding overlaps. Overlap-free techniques vary greatly as to mathematical formulation and they can

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

be classified as physical-base, heuristic-base, and optimalbased models. Physical-based models rely on force schemes [24], [25] or spring systems [26], [27] to position geometric entities in the visual space. The lack of guarantees in terms of convergence and the untidiness of the resulting layout are the main weaknesses of physical-based approaches. Heuristic methods typically scan the visual space looking for the best location for each instance. Recent approaches such as RWordles [10] take into account similarity information to build semantically aware layouts. The main advantage of heuristic techniques is their low computational cost that makes possible to build large layouts quickly. Uneconomical use of display area is, however, a major drawback. Optimal approaches arrange objects by minimizing an energy functional subject to a set of constraints. Dwyer et al. [28] and Marriott et al. [29], proposed energy functionals derived from intersection tests, using quadratic optimization to find an optimal solution. In order to improve readability some techniques impose constraints to the energy functional towards generating grid-like arrangements [30], [31]. ProjSnippet [32] is a two-step approach that first projects data onto the visual space and then builds an energy functional that accounts for object overlap as well as neighborhood preservation. The technique called MIOLA [33] also makes use of neighborhood information provided by multidimensional projection to formulate a mixed integer optimization that results in almost orthogonal layouts. Heuristic and optimal methods perform well in terms of overlap removal and similarity preservation. However, those methods do not make an efficient use of display area, mainly when dealing with geometric objects with different sizes. Scalability is another hurdle for those techniques, since, with a few exceptions [34], node-displacement methods typically do not operate hierarchically. The method proposed in this work gathers a set of traits not presented in other existing approaches, as it has been designed to account for multiple concurrent requirements such as object scale, neighborhood and aspect ratio preservation among cells, optimal use of display area, and overlap-free grid-like cell arrangement (see Figure 1 bottom). Moreover, the proposed technique naturally imposes a hierarchy on the data, thus being able to handle large data sets. Proximity between similar entities is also enforced during layout construction and navigation. In summary, the proposed method bears a set of properties not present together in any other geometry-based layout technique.

3

P ROPOSED M ETHODOLOGY

As illustrated in Figure 1, the proposed methodology comprises three main steps: multidimensional projection, density-based adaptive grid generation, and layout optimization. In the first step, high-dimensional data is mapped to the visual space so as to preserve distance among data instances. In our implementation we use the Least Square Projection (LSP) method [35], which preserves distances

3

Fig. 1. Pipeline used comprises three main steps: (i) data is mapped into visual space using a multidimensional projection technique, (ii) adaptive grid is built from projected points and (iii) an optimization algorithm is performed to reach an optimal area usage.

nicely during the mapping process, thus enforcing that neighbor points in the visual space correspond to similar/close instances in the original space. The use of a distance preserving mapping to place instances in the visual space provides the neighborhood structure that must be “mimicked” by the final layout. An adaptive grid is then constructed from projected points. In contrast to traditional adaptive grid generation schemes, our approach refines the grid in less dense (but not empty) regions of the visual space, thus placing larger grid cells in denser regions (see Figure 1 top right). The rationale is to allow users easily identify dense regions by visually recognizing large geometric objects in the layout. The number of refinement levels is a user defined parameter. The adaptive grid produces a size varying tiling of the visual space. Moreover, grid cells inherit the semantic relation of the project instances, that is, neighbor grid cells tend to encompass similar instances. Although well structured, the cell arrangement resulting from the refinement is typically spread, making an inefficient use of display area. Therefore, in the third step of the proposed pipeline, cells are rearranged in the visual space to optimize the area usage. The optimization is formulated to account for object scale, overlapping, and grid-like arrangement while preserving neighborhood relationships and the aspect ratio among cells (see Figure 1 bottom). The rationale here is to build layouts as readable as possible while still dragging users attention to denser regions that should be further explored. The following subsections detail the second and third step of our pipeline, which correspond to the major technical contributions of our approach. 3.1

Adaptive Grid generation

Let P = {p1 , p2 , . . . , pq } ⊂ R2 be the projection of a set of instances into the visual space and G be a n × m regular

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

(a)

(b)

pointing out where users should nail down their exploration. Second, the proposed refinement scheme allows for resizing all grid cells by controlling only one parameter. In other words, if one wants to scale all cells preserving their relative size, the only parameter to be tuned is the length δ of coarser cells. In fact, the length of the cells in each refinement level is given by (1/2k )δ , where k = 0, 1, · · · is the level of refinement. This last property will be exploited during optimization to find the value of δ that results in the best use of display area, as detailed below. 3.2

(c)

(d)

Fig. 2. Density based refinement process. (a) An initial regular grid is defined in the visual space. Such an initial grid provides the coarser refinement level (first level, as depicted in (b)). (c) An adaptive grid is built from density information. Denser areas are represented by larger cells. (d) Active (not empty) cells are then ready to be optimized in the next step of the proposed framework.

grid which discretizes the bounding box of P. The number of cells in each orthogonal direction is defined such that each grid cell gi j has a square shape, that is, given the number of subdivision in one direction, for instance m, the H number of subdivision in the other direction is n = d W me (the boundary of the bounding box can be displaced to accommodate the number of cells), where H and W account for the height and width of the bounding box of P, respectively. The regular mesh G is the coarser grid level for the adaptive process. The refinement is driven by density information computed from P in each grid cell. The density can be estimated in different ways, for instance by integrating a kernel density estimator over each grid cell. As our application does not demand highly accurate density estimation, we opt to approximate the density in each cell by simply counting the number of points pi contained in the cell. Given the density information, the refinement is carried out as follows: let di j be the density of the grid cell gi j and dmax = max{di j } be the largest density value in G. A grid cell gi j undergoes one level of refinement if di j ≤ 0.5dmax , two levels of refinement if di j ≤ 0.25dmax , three levels if di j ≤ 0.125dmax , and so on. Figure 2 illustrates the refinement process with three levels. Although simple, the grid generation scheme described above has two helpful properties. First, dense regions in the visual space can easily be identified from the larger cells,

4

Optimization

Let G˜ = {g1 , g2 , ..., gN } be the set of non-empty cells in G, that is, G˜ comprises the cells of G with projected points in their interior. Each cell gi is a square box described by the vector gi = (xi , yi , wi ) ∈ R3 , where (xi , yi ) accounts for the center of the box and wi > 0 is the edge length of gi . The cells gi should be rearranged and resized inside a W ×H display area so as to make an efficient use of display area while preserving grid alignment and neighborhood structures. As described in Section 3.1, wi = αi δ , where αi = 1/2k , k corresponding to the level of refinement of gi . Therefore, δ is a parameter to be optimized (initially set as the length δ0 of the coarser cell) from which the size of each cell is derived, as illustrated in Figure 3(a). 3.2.1

Rearranging Cells with Area Resizing

The problem of rearranging cells gi so as to generate layouts that optimize use of area while presenting a gridlike structure is an NP-hard problem [36] that appears in contexts such as cutting optimization and rectangle packing [37]. In order to get an approximate solution, we formulate the problem as a computationally tractable quadratic optimization problem. The optimization is formulated as in Equation (1) below: minimize E(z) = Ecomp (z) + Erezise (z), subject to Az ≤ b,

z = [x y r δ ]> ,

x = (x1 , x2 , ..., xN )> ∈ RN y = (y1 , y2 , ..., yN )> ∈ RN r = (r12 , ..., r1N , r23 , ..., r2N , ..., rN−1N )> , ri j ∈ {0, 1} δ0 ≤ δ ≤ min (W, H),

(1)

where z is the sought solution; y and x correspond to the coordinates of the centroids of the cells; δ is the scaling factor; A and b hold the constraints imposed on the optimization problem. The unknowns ri j are control variables used to properly avoid overlaps. The energy components Ecomp (z) and Eresize (z) control the proximity between cells and the area increase, respectively. The former term accounts for overlaps and neighborhood preservation and the second term is designed to scale the box to fill up as much area as possible (see illustration in Figure 4). Compactness Energy Term The first term in the energy functional, Ecomp , controls the relative position among cells and its goal is to keep the layout compact. The energy Ecomp

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

W

δ δ0

α3 δ 0 α3 δ 0

δ0 δ0

W

H δ δ0

α2 δ 0

5

H

α2 δ 0

(a)

(a)

(b)

(c)

(d)

(b)

Fig. 3. (a) The input layout with δ0 being the length of the coarser cell. The size of the remaining cells are defined according to the initial parameter δ0 and their scale parameters αi . (b) The result of the optimization when δ > δ0 .

is a quadratic function that simply takes into account the centers of the boxes, as shown in Equation (2): Ecomp (z) = C ∑ (xi − x j )2 + (yi − y j )2 ,

(2)

(i, j)

pairs where (i, j) = ( j, i) represents  the N(N−1) 2  (gi , g j ), i, j, ∈ {1, ..., N} and C = 1/ min (W, H) · N(N−1) 2 is a normalization factor. In less mathematical terms, Ecomp forces boxes to be close to each other and the normalization factor C ensures that the term Ecomp contributes in the same amount as Eresize in the total energy E, thus enabling a good balance between compactness and area usage. Area Usage Energy Term The energy term Eresize controls the amount boxes should be scaled to optimize the use of display area. Since the area of any cell depends only on the length of the largest cells, that is, the area of each cell gi is given by Ai = (αi δ )2 , αi = 1/2k , where k is the refinement level of gi , we can define the resize energy as a quadratic function as follows: Eresize (z) = (δ − min (W, H))2 ,

x p1 ≤ x p2 ≤ ... ≤ x pn ⇒ x pi − x pi+1 ≤ 0 yq1 ≤ yq2 ≤ ... ≤ yqn ⇒ yqi − yqi+1 ≤ 0

Optimization Constraints

As described in Equation (1), constraints are gathered in matrix A and vector b and they are imposed to properly control overlaps and the relative position of cells. Given the initial position (xi , yi ) and length αi δ of each cell, the relative order (also called orthogonal order) of the boxes is given by:

(4)

where p, q : {1, 2, ..., N} → {1, 2, ..., N} are permutations of indices generated by sorting the coordinates xi and yi . Inequalities (4) allows for preserving the relative order of the cells, but it does not account for overlap. Overlaps can be handled by forcing non-overlap conditions as follows: (αi + α j ) (αi + α j ) δ or |y j − yi | ≥ δ , (5) 2 2 From Equation (4) we have that if xi ≤ x j and yi ≤ y j then |x j − xi | = x j − xi and |y j − yi | = y j − yi , which give rise to the following linear system of inequality: |x j − xi | ≥

(3)

where min (W, H) is the minimum between the width and height of the display area (see Fig. 4(c)). It is easy to see that if no constraints are imposed to the unknowns, the minimum of Ecomp is reached when all cells have the same center and the minimum of Eresize takes place when the length of the larger cell is equal to min (W, H). Therefore, without constraints the optimization process will stack cells on over the other. To avoid such unsuitable output, we must settle constraints as discussed next. 3.2.2

Fig. 4. Minimizing each term of the energy functional E in an illustrative layout. (a) Original layout, (b) Ecomp only, (c) Eresize only, (d) the proposed energy functional E = Ecomp + Eresize .

(α +α )

xi − x j ≤ − i 2 j δ , or (α +α ) yi − y j ≤ − i 2 j δ ,

(6)

As proposed in [33], the or condition in (6) can be handle by binary variables ri j ∈ {0, 1} such that: xi −x j ≤ αi j δ +Mri j ⇔ yi −y j ≤ αi j δ +M(1−ri j ),

(7)

(α +α )

where αi j = − i 2 j and M is a very large constant. The role of ri j is to ensure that if one of the inequalities in Equation (6) holds, its counterpart is set aside. For instance, if xi −x j ≤ αi j δ , then ri j is equal to zero, thus yi −y j ≤ αi j δ +M(1−ri j ) is naturally satisfied for any values of y. The linear inequalities, ri j unknowns, and δ make up the linear system [A|b]. The left and right bounds for the x coordinate of the center of the cells must also be imposed in [A|b], that is, 0 ≤ xi −

αi αi δ and xi + δ < W, i = 1, 2, ..., N, 2 2

(8)

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

6

0,7 0,6

Area Usage (A) better

0,5 0,4 0,3 0,2 0,1 0

Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 RWordle

PRISM

VPSC

MIOLA

ProjSnippet

Our Approach

0,8

Orthogonal Ordering (O) better

0,7 0,6 0,5 0,4 0,3 0,2 0,1 0

Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 Class 1 Class 2 Class 3 RWordle

RWordle

PRISM

VPSC

PRISM MIOLA

VPSC

ProjSnippet

MIOLA

Our Approach

RWordle

0,95

0,90 0,85 0,80

0,75 0,70 0,65

PRISM

VPSC

MIOLA

Our Approach ProjSnippet

Our Approach

1,7000

Orthogonal Alignment (L) better

Neighborhood Preservation (K) better

1,00

ProjSnippet

1,6000

1,5000 1,4000 1,3000

1,2000 1,1000

1,000 10%

20%

30%

40%

50%

10%

20%

Percentage of Neighbors

30% Percentage of Neighbors

40%

50%

Fig. 5. Comparison against overlap removal techniques for the three initial classes of experiments.

Similar inequalities are set to limit the upper and lower bounds of the y coordinates. Constraints Relaxation The optimization problem described in Equation (1) provides a flexible and simple mechanism to relax constraints and generate distinct layout arrangements. This is performed by adding relaxation parameters (scalars) in the inequalities to allow for some degree of overlap, orthogonal disorder and so on. 3.2.3

Computational Aspects

The formulation (1) is a Mixed Integer Quadratic Programming (MIQP) Problem, which is minimized by computing an extension of the so-called Branch-and-Bound optimization scheme, where the original MIQP problem is converted

to a Linear Programming Problem in order to be properly handled and solved [38]. In terms of usage, our code was implemented using the solvers and optimization routines provided by Gurobi Optimization Package, which is available at http://www. gurobi.com/.

4

R ESULTS

AND

C OMPARISONS

The performance of the proposed optimization scheme is assessed through a set of comparisons against well established geometry based layout construction techniques. More precisely, we employ four distinct metrics to quantitatively measure the quality of layouts produced by two distinct classes of techniques, namely: overlap removal methods

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

Orthogonal Ordering (O): measures the number of changes in both horizontal and vertical order of geometric entities [39]. More precisely, positions are sorted in increasing horizontal and vertical order and the number of inversions in the lists provides the quality measure. In mathematical terms: O=



(t)

∑ invs (i, j),

(9)

s∈{x,y} i< j

better

Ar ea Usage ( A)

0,3 0,2

0

Area Usage (A): let r be the longer diagonal of the bounding box of the layout produced by a given method and Ai the area of the i-th box in the layout. The area usage metric is defined similarly as [40]: A=

2πr2 ∑i Ai

.

(11)

This metric gauges layout compactness and values close to π/2 are better. Orthogonal Alignment (L): let {z1 , z2 , . . . , zr } be points in Z × Z discretizing the interior of a disk, as illustrated in the inline figure. More14 15 10 over, let bi be the center of a square 16 2 5 13 6 in the layout and bi j , j = 1, . . . , k be 9 1 the center of the squares in the k- 11 3 nearest neighbor of bi . The orthogonal 17 7 4 8 20 alignment metric is given by: 12 18 19   < zs , bi j − bi > 1 n L= (12) ∑ ∑ max kzs kkbi j − bi k , s nk i=1 bi j where n is the number of geometric objects in the layout and < ·, · > accounts for the dot product. This metric gauges how much a layout deviates from a regular grid, the closer to 1 the better. Metrics above have been chosen because they measure important properties a layout must hold, namely, similarity preserving, compactness, and orthogonal alignment.

IsoMatch

Our Approach

Incboard

SSM

IsoMatch

Our Approach

0,7 0,6 0,5 0,4 0,3

0,2 0,1 0

IncBoard

SSM

IsoMatch

Our Approach

1

0,9 0,8

better

Neighborhood Preservation (K): computes the average percentage of the k-nearest neighbors of the initial configuration that are preserved in the final layouts [35]. More specifically, for each geometric entity we verify the percentage of its neighbors in the final layout that were also neighbors in the initial layout before optimization. Values close to 1 indicates 100% of preservation.

SSM

0,8

1, 0, otherwise

(10) where i, j are indices of the sorted lists before (t − 1) and (t) after (t) layout arrangement. invy (i, j) is defined analo(t) gously to invx (i, j). Small values indicate better behavior.

Incboard

0,9

Neighborhood Preservation (K)

(t)

invx (i, j) =

(t) (t) (t−1) (t−1) if(xi − x j )(xi −xj ) < 0

0,4

0,1

0,7 0,6 0,5 0,4

10%

20%

30%

40%

50%

Percentage of Neighbors Incboard

SSM

IsoMatch

Our Approach

1,0045 1,0040 1,0035

1,0030

better

(

0,5

better

Metrics and Data Sets

0,6

Orthogonal Alignment ( L)

4.1

0,7

Orthogonal Order ing ( O)

and visual board techniques. Before presenting the comparison, we briefly describe the metrics as well as the data sets used in our tests.

7

1,0025 1,0020

1,0015 1,0010 1,0005

1,00

10%

20%

30%

40%

50%

Percentage of Neighbors

Fig. 6. Comparison with visual board techniques for the last class (taken from Fig. 8).

Data sets and Test Settings We divided the experiments in four groups: the first three groups of tests assess overlap removal methods while the last group evaluates the performance of our optimization scheme against visual board techniques. In the first test we randomly draw points inside a rectangular two-dimensional domain, which represents the display area. The rectangle is discretized as a regular grid with dimensions 1600 × 1200. Cells of the grid that do not contain any point are discarded. The remaining cells define the boxes to be arranged in the layout. As no overlap exists at this point, we enforce the cells to overlap by uniformly increasing their area in 10%. Forcing overlap is mandatory since most overlap removal methods can not start up from an overlap free arrangement. Notice that this is not the case for our approach, which also operates on overlap free layouts in order to optimize area usage and

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

DT2

DT3

DT4

DT5

2

3

PRISM

VPSC

RWordle-C

MIOLA

ProjSnippet

Our

Original

DT1

8

Fig. 7. Layouts produced by our approach, ProjSnippet, MIOLA, RWordle-C, VPSC and PRISM in 5 datasets.

structure. Fifteen configurations, with fifty random points each, make up the first group of tests (called class 1 in

Figure 5). The second experiment is similar to the first one, except we use a three level adaptive grid to generate the

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

4.2

Comparisons

The metrics describe above are used to assess the effectiveness of our approach when compared against well known overlap removal techniques. Precisely, VPSC [28], PRISM [24], RWordle-C [10], MIOLA [33], and ProjSnippet [32] are used as basis for comparisons. These methods have been chosen either because they are widely used or due to their good performance reported in the literature. We also compare our approach against three visual board techniques, namely, Incboard [6], SSM [8] and IsoMatch [41]. Figure 5 shows quantitative results obtained from the metrics described above. Notice that our approach clearly outperforms most techniques as to area usage (top row) and orthogonal alignment (bottom row right). Regarding orthogonal ordering (middle row), our approach turns out to be competitive, performing better for class 1. In terms of neighborhood preservation (bottom row left), our approach also performed well, being surpassed only by ProjSnippet and PRISM, which are known to preserve neighborhoods well. Figure 6 brings quantitative comparison against visual board techniques. Notice that the proposed method outperforms IncBoard and behaves nicely even when compared to SSM and IsoMatch, which have optimal area usage (notice the vanish box plots) and cell alignment. Figure 7 and 8 depict qualitative results comparing our approach with overlap removal and visual board techniques, respectively. Notice that the proposed method gives rise to well structured layouts where neighborhoods (indicated by color map) are nicely preserved. Moreover, our approach makes a better use of display area, thus improving readability and content analysis.

25 Boxes

36 Boxes

49 Boxes

SSM

IncBoard

Our

Original

16 Boxes

IsoMatch

geometric primitives (class 2 in Figure 5). In the third experiment (class 3 in Figure 5) we use real data set containing high-dimensional instances which are mapped to the visual space using the LSP multidimensional projection scheme. The layout is built from the projected points using a three level adaptive grid. Five data sets, taken from a public repository (available at http://infoserver.lcad.icmc.usp.br/infovis2/DataSets), have been employed in the third experiment. One data set contains a collection of snippets retrieved from a web search using "Scientific Visualization" as query expression; two data sets contain news from BBC, CNN, and REUTERS; the fourth data set corresponds to abstracts from IEEE InfoVis 2004 conference; and the last data set contains metadata from a scientific paper collection. Bag-of-words were extracted from textual content and used to define the similarity between instances. Those distances are used as input to the LSP projection method [35], which maps instances to the visual space. Finally, in the fourth group of tests we have randomly generated squares in a 2D square domain, as depicted in Figure 8. Those layouts are used as input to existing visual board techniques.

9

Fig. 8. Qualitative comparison of the proposed optimization scheme against visual board techniques Incboard [6], SSM [8] when taking the best result from SSM w.r.t. their objective function after 1000 executions and IsoMatch [41] .

5

A PPLICATIONS

In this section we present three different applications of our technique, namely, image gallery construction, textual documents analysis, and video data set visualization. In those applications, effective data exploration is enabled if requirements such as object size, semantic and overlapfree arrangements, scalability, and optimal area usage are combined in an optimal manner, showing the relevance of our methodology to applications as the ones presented in this section. The image gallery application, depicted in Figure 9, aims at visualizing image data sets such that highly similar images, which tend to be project close to each other in the visual space, are summarized in large icons, while images with discriminative feature are represented in small icons. To build the image gallery we extract 96 features from color attribute. More specifically, each image is split in 16 nonoverlapping regular regions covering the image from which the first and second statistical moments for each R,G,B channel is extracted. Feature vectors representing the images are projected in the visual space using the Least Square Projection (LSP) [35] method. A three level adaptive grid is constructed and the proposed layout optimization scheme is triggered in order to arrange the cells in a structured overlap free layout. Large grid cells containing several similar images give rise to iconic representations that summarizes the underlying images. Such iconic representation is built

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

10

(a)

(b)

(c)

deros pier

alto south

palo head hilton

bier

william fishkin

assess

evid

link enhanc spatial

cartograph

layout

order maryland

market

select

coordin high

springer-verlag

cleveland

cone

coordin

inselberg mcdonald

layout

social balanc

andamp

uncertainti

planar

ratio note

buja

parallel adjust

fishey

disc

distort edg

layer embed ead transit cluster feng straight-lin

convex

network automat

feng

tamassia

plane

symp browser

berlin

force-direct

multilevel

cell

sketch note

hyperbol

cluster anneal

munzner

direct

space

abello

korfhag

librari

ead

algebra

koren

simul undirect

optim

multilevel

plot specif

rel

statist

multivari

gene

multivari

node statist

layout note

tamassia gd ead

planar lnc cluster di symp

central multilevel

m ult iv ar i

coordin geograph

robertstart

layout

n-dimension

log

cluster clutter set

tolli

preserv

force-direct

suggest

maceachren

network

map

c olo r

parallel

product

m as t er

vir t ual

geograph cartographi

geos c ie nc

weizmann

expert

placement

facilit

cartograph shape cartogram maceachren gi cartographi statist tempor spatial

associ

java

cartographi problem aesthet tempor

exploratori howard

nice lnc ac

harel

christensen

vir t ual

cartographi

cartogram

anneal

method

park

shneiderman abstract

hierarchi

pixel

chart

hierarch

convei

comp

uncertainti

munzner

xmdvtool parallel reduct

layout

abello

fluid

flow

alex

eick

traffic

network massiv

hierarchi

pang

vrml

arg

ward

multivari

cluster

hyperbol mgv navig will

octob

scott

wireless

parallel

traffic

mbone

dimension

dimens

non-linear apperlei

dept hierarch sugiyama louisiana

rundenstein

uncertainti

statist

context

brush bound olston

small

multidimension

holmquist

magnif

hierarchi

write

hierarchi

keim radial

multidimension

focu fishey chi uist

hyperbol

bar

coordin rundenstein

hierarchi

furna

edward

ward

structure-bas

björk spenc

keahei

richard

layout

space-fil

francisco

sarkar

zoom

allen

joe

creat

ead

detail

uist

wong

comprehens

nonlinear robertson

boston

reach

stasko

zoom

fishey

fracchia mental

fast anneal multi-scal

19th

knowledg

m in e

product mark

text

multidimension autom geograph

silicon

construct

discoveri sun collabor geograph area maceachren java geoscienc htm

map

syst nonlinear

mcgraw-hil

f r eem an

label vrml placement map

maceachren

(b)

hawaii

halasz

memori diagram louisiana notecard myer alto palo myer area

nest

disk

memori

internet acm-siam editor

roth

secur

san

kriegel

c onklin

iii

space jock

amsterdam autom goldstein

wattenberg

tree-map rectangl

huge

chapel

write

pier transpar

toi

ca

moran

stephen jeff

jai mendelzon conklin

bergeron

usag

label

der

spatial remot

paradigm

hypermedia problem

adapt ny architectur

gi

name

geograph

scale

linear imag percept

perform suggest author introduc colour

bradmatti

netherland

hierarch

space-fil stock turo

wijk

brushga

adjust

ibm

marc

medic

map

immers

form

chart

hypertext code understand

tamassia

cartograph

simul

tran

shape

project sphere siggraph multidimension

orlean

multicast jeffrei

cartograph metaphor hypertext

virtual

implicit

color desir

similar

curv

surfac

found

observ grai

celebr interdepend

demonstr

sort

geograph sens cartographi

optim

proxim circuit book

davi

educ kei

method

gross

carolina

springer-verlag network

son standard wilei john alex java scale placement gener

kamada

transform

navig

realiti

web

sedgewick control

benford alamito lo octob

multimedia devic allow upper

w eb-bas

featur

multipl simul

dan manipul

individu

lorensen

van

numer increment significantli

normal

toi

brad

bobrow

vitter extern

site wavelet multiresolut discoveri

taylor

blinn

contour

mass carolina trigg hill

williamson antonio control stone texa write input buxton

ben

fishey

dialogu

sigir card

boston

editor

dynam log spie

video

plan

wavelet

chapel trigg

hypertext hypermedia

27-mai

ca

jan

c

treemap colleg

discret

isbn

imag

virtual

w ork

cartogram

tempor

jr

oper

variabl psycholog

specif

error

dynam

domain

evalu mark

effici

time effect afip

subject multidimension

rep sept

jon

inf

current

area

kei

detect

web

fluid stasko glasgow

site simul

lnc

basic molecular

high-dimension

c

text

evalu path tango

pattern measur

field

shapepitkow

shade

rai

volum

object-ori

visualis

pitkow

world plant pak

distanc witkingrammar scale lorensen segment psychometrikaann content siggraph editor arrang

cach splat trace imag curv

shape

vision

test

nelson

hypermedia

stuart georg document web mine

tour

marshal hypertext

mackinlai

modular

optim herman

scalar volum

max

render

surfac

implicit utah

recognit

segment

web

stream video

document hypermedia

robert

sugiyama

notecard implicit

gaithersburg

represent manag

rotat

william

perform

case

surfac

pilot

rapid

index point

mine

rule toc

concurr

handbook

catherin frank

man-machin

shneiderman url

notecard

20th august

frank

path toi hypertext halasz

manual

halasz misu

cognit

cartogram continu

target

multimedia

space match

steven object-ori

lo gershon pak sequenti

concept

electron

featur

healei

psychol

winter

van

decemb

brad

evolut

visag

workspac

math primit

facil weight chi

group execut oper imag parallel

whitnei alamito

hendlei

gd

wong

similar textur

rai cube

geometr scene

siggraph render

feng

lnc

problem

ftp

tabular

optim

visibl textur

concurr

timelin

curv solid

color vari

trace densiti

salesin

field

reflect scalar

textur

depth

layer imag

juli

aug

direct

pat

produc

hanrahan

reiss

imag

witkin

log

surfac shade blinn

engin

reliabl distribut order

layout

planar

programm

protocol

straight-lin

ac cluster anneal force-direct

transform

close

koren

harel

distanc roth sequenc

nearest neighbor conf evolutionari

shneiderman

human-comput meet

alto

author

springer-verlag

m ult iv ar i

int conf agraw

vi

parallel

assesslink maceachren

similar

search

book

bier

see-through transpar

north randal

ken

eric

handbook

orthogon

gi

s t at is t

nam e

statist

geograph

light raiimag

dimension

juli nov

featur

camera

render

scatter siggraph

densiti

biologi

maryland

menu

eric ken

maureenstone toni

exploratori

geoscienc

cartograph

illustr shape henri percept levoi andrew

molecular

item

menu

buxton

robertson

real-tim

object

bell

on-lin

relationship

intellig

percept tutori search 22nd accur

boundari session

properti

c ons t r uc t

sequenc

tech

satisfact respons lab

product prec syst perform

output

kurtenbach buxton

w ashington

planar

java

plot

coordin

j ava

sun

collabor

problem

area discoveri

geograph

cartogram maceachren

dimension

constraint dam av

construct

christensen

cartogram

field

ead layer

scott

primit michael dimension placement

rai

brow ser nice simul

leblanc

appr oxim

cartograph

symp

tolli

cartographi

label

color

abello librari

class

barri

parallel parallel coordin multivari

transit network

korfhag cluster layout

vision

render

surfacvolumline

object-ori

tog

light

brush

virginia detect

ead tamassia

m et aphor

geogr aph

hyper m edia

bar

multivari uncertainti n-dimension rundenstein ward

categor

nasa time file

layout orthogon

map

hypertext

lnc

cartographi

cartograph placement

geograph label spatial

di

paradigm

hyperbol

in c r em ent

parallel

sigop simul

hala s z

27-mai

statist

distort

hierarchi

brush

ward keim space-fil radial rundenstein

discnode

hierarchi

turo

fracchia

fishey

non-linear

secur

structure-bas space-fil

hypertext

keahei

holmquist

magnif context

stock wijk

order shneiderman hierarch

björk focu

distribut ga

textur

softw

monitor execut

function curv trace

navig

uncertainti

munzner

kei

multidimension

transpar

jame

eric

palo

author

nest

fishey mental

robertson nonlinear

ken

control

path

toi

marshal

myer reach

zoom

understand

comprehens

planar

circuitpr

oxim distanc

arrang scale psychometrika ann

wilei

sketch robert

son

architectur

web-bas

dan

imag linear observ percept

debugg madison

market

uncertainti

abello

network

cluster

debug event

wattenberg

inselberg

hyperbol

distribut

netherland

wong detail

unger

sigplannotic

louisiana chart

treemap

kenneth

set seri

va

search

binari

oper

lamport

parallel

metric

languag

calgari languag

im plic it

demonstr

brad

mgv flow coordin fluid

jeffrei

parallel

texa

catherin

andri widget

snibb

ablex

translat

detect enn

chung

mackinlai

joe

toi

trigg

stuart

prec

transpar

lab

see-through

hawaii

specif

sigir

web

multimedia

disk dynam

multicast

extern

benford

arg

stasko fluid

pitkow

video

plant

web

pak

gershon

evalu path

web

georg

antonio iii frank

hypertext hypermedia amsterdam

xmdvtool

adapt

multipl

optim

medic

scale

perform

surfac

imag

trace

kurtenbach

int

realiti

w orld simulstandard

curv

grossshape

hil

c ar olin a

robertson

sort mbone

memori

chapel

notecard hypermedia

brad bobrow

hypertext

chapelhalasz

card

vitter

s pat ia l

neighbor

microsoft

transform

planar

project sphere

document

hypertext

ken

maureen

mackinlai

mine site

virtual multiresolut

video w itkin lorensen modular

normal

color grai

global

toc

group

textur

wavelet shape

segment

reliabl

rotat

web

stream

engin

close

distribut imag curv

case

rai

shade

reconstruct nearest

file sequenc

pat

siggraph hanrahan light

alamito

va

render

imag

protocol

visibl trace cach

imag

percept

scalar aug volum

nelson

reiss

wavelet

menu

bier

notecard hypermedia

cognit

sigmod

distanc roth

imag

alto

buxton stone

pier

conf faloutso document

grammar

surfac

manual

shneiderman

searchpoint

mine rule

eric

bier

alto

dialogu

pilot

t r ans f or m

featur

imag c

parallel

render

light

curv

densiti juli

render

surfac line

distribut languag

concurr

plan

perform

distribut

riedl

notic

calgari

event

debugg

monitor sigplan

textur

ln c

biologi

textur

splat

acid

detect

biolog

uist

arc

nov

spreadsheet detect

oper

maryland

human-comput buxton

dynam item

molecular evolut

cell chi

spreadsheet microsoft

parallel

debug

illustr

distancchi

time class

financi

concurr

surfac

visag

categor

nasa

distribut

cell

transpar

sequenc

virginia

motif

brown depend

shape

conf

similar

search

genombiologi

manipul

int

high-dimension

spreadsheet

mechan

index

multimedia

percept

protein

sequenc subsequ

acm-siam

neighbor log

target

healei tutori search

robbin

nearest

buxton

syst

time

product

approxim

bentlei

22nd

andri daniel siggraph

molecular

satisfact

respons afip

neighbor

handbook

perform

friedman

search

simul

hierarchi

inf

search file menu

winter

detect

index

constraint

zeleznik

robbin

detect riedl

toolkit

multi-scal fast

healei

binari

bentlei

rang jon

search detect simul featur

rep rang syst loui

estim vision

psychol

preattentenn

widgethugh

toolkitdam preattent

daniel

public long similar

distribut chi

distanc

languag

zeleznik simul re toolkit hugh preattent

pat card

spreadsheet

(a)

siggraph

ac orthogon document match

hci

mackinlai

financi tabl start

global

cell

oct

manipul

point faloutso

similar

storag

cellarc spreadsheet select microsoft

sigmod

search object

region constrain

metric

conf roth

manag

point

acm-siam

depend

microsoft

rang filecolor

theori

nov mechan

properti

effici

neighbor

search log

binari

nucleic

brown uist fluid

bibliographi

dynam

approxim

reconstruct

search

henikoff long

rang

kolojejchick

loui

import

molecular

genomstring biologi

charact

search

syst

tabl

amino

sequenc

boundari

binari

bentlei

neighbor friedman match lee stanford

simul orient

int index

spreadsheet motif acid

problem

estim

featur search numer

multimedia

subsequ

bentlei

nearest

jon multidimension

align

biolog similar

protein pattern

target attent

psychol preattent

loui

vision

enn re

healei wolf

calif

color

pr oposo vic h yankel

Fig. 9. Image gallery application: (a) an adaptive grid is built from projected images; (b) an initial iconic representation is generated; (c) the layout is optimize to make an optimal use of area while preserving the neighborhood of the grid cells.

battista sugiyama

(c)

Fig. 10. Visualizing a dataset containing 515 documents using word clouds.(a) each document is represented by a point embedded in 2D space where adaptive grid is imposed, (b) a word cloud based on term-frequency of documents contained in each cell of grid is built.(c) shows the optimized layout using the proposed method. Notice the size increasing that allow us to increase the number of words by each cell (colored in lightgray).

by blending a randomly chosen subset of the underlying images using five levels of Laplacian pyramid [42]. In the

example shown in Figure 9, cells containing less then four images are represented by a single representative image,

11

Our last application regards video visualization, as illustrated in Figure 11. We build a collection of 300 videos from Youtube querying six distinct topics, namely, linux, civil war, fifa world cup, hawk, guitar and information visualization. Textual information associated to each video is processed to generate a feature vector that represents the video in a high-dimensional space. The proposed methodology with three level grid refinement is employed to build a layout where larger cells are textured as word clouds while the cells in the lowest refinement level (smaller cells) are textured with a snippet build from a screenshot of a randomly chosen video contained in the cell as well as textual information containing title, description and URL of the video. Word clouds are generated using keywords extracted from the text associated to the video. As in the previous application, font sizes and priority-based opacity are used to highlight the importance of each keyword, which are ranked as in [11]. Notice how our approach was able to semantically organize the layout according to the distinct topics, keeping videos related to “guitar” grouped on the left part of the layout, “linux” and “information visualization” on the top right, “hawk” in the center, “fifa world cup” on the bottom right, and ”civil war” centered on the bottom. Interestingly, the smaller cells in the layout contain mainly videos about civil war and music videos related to the song “Civil War” by Guns N’Roses. Moreover, besides being well structured and semantically organized, the layout resulting from our methodology is compact and easy to read due to its effective use of display area. One interesting fact of all generated layouts is the high level of interactivity for the exploration task i.e. users are allowed to choose a cell summarizing some content (from all levels except the last one) and then further explore that cell in more detail by recursively applying the whole layout construction process only within that cell.

Indian Warriors: The Untold S tory of the... Indian Warriors: The Untold S tory of the Civil War (AMAZING HIS TOR Y DOCUME NTAR Y)...

megan chief dis cover y

rufus

CIVIL WAR / MAR VE L / Andrés Navy E n este video hablaremos sobre CIVIL WAR ! Uno de los acontecimientos más importa...

reconstruction

john channel

amazing

fussball

video

coming

ole

en

mundial

partgoals world war

copa

civil

página quality

sobre

https commentary english más

user

world

events

puppy

based

video

learn

operating free

show

linux os

system complete

top

features

juego

check online

gameplay ea

fifabrazil

world hdfull

team copa

la

final

la

en

song

editor

bit

music official

brazil

world

jugando online

marvel

ola

el

part gettysburg live battlefield youtube

amigos

ly

original

em

history

amazing

face

battle

play

war roses

enjoy

hd

el

dejo

gunsvideo

school

tutorial command version historywindows

fifa goals cup top este

este

syria

videomix

reality

ubuntugnu

install

La Guerra Civil Marvel - Civil War #1 Pa... Nuestra página web : http://goo.gl/QLQL3c CIVIL WAR !!!!!! Uno de los eventos más...

goo gl

uno

play

guitar

american full

july brain

interactive

game series

espn

joe

shocking

Tales of the Gun - E pisode 17: Guns of t... Tales of the Gun - E pisode 17: Guns of the Civil War (History Documentary) Throu...

linguistic

individual

gaming

skater

buenos

computer

science

cup

twitter

images brazil por cup el

Footage of Civil War Veterans at 50yr An... Amazing historical movie footage from 1913 and 1938 of the 50th and 75th anniver...

tony twitter songs

les

skills pickett union targets

primera

gameplay cup fifa

air

july player unio

hawk vez world entop

los

civil lured

The E nglish Civil War Cromwell YouTub... THE GR E AT E NGLIS H R E VOLUTION.

group

tutorials computer mint file systems visit

road

free goo

gl

Guns N' R oses - Civil War (Music Video) Music Video For Guns N' R oses Civil War With 'Breakdown' Intro.

civil

hostile

street guitar war

Civil War - S lash & Myles Kennedy - R are... Civil War R are Acoustic Version S lash & Myles Kennedy @ the MAX S essions 2010 My...

https

argentina germany du

strategy

rednex indy

joined

facebook full tony

skate

park

backyard

Canister S hot From Civil War Cannon.wmv We fired 17 rounds of Canister from an original Civil War Napoleon Field Cannon ...

documentary

performer

mind open

young

How to S witch Operating S ystems: Linux, ... I Need Your Help: http://vid.io/x6R (Please S upport) http://live.pirillo.com - A...

moments fifabit ly

hd

usman

spirit

brutalmoose

facebook

facebook virtualbox

heavyclashes music

picks

Guns N' R oses - Civil War [Live in Paris... Live at Hippodrome de Vincennes, Paris, France, June 6, 1992. Full Concert: http...

cam action official

evan

conan album luca guitarist

Metal Detecting: Civil War R elics https://www.youtube.com/user/aquachigger This is a video I shot of a hunt in a m...

part

top

red operating free

open

video visit hawk team

goals

skateboarding

successful

data visual information work online

brasil

greg

full

fifa

keef

world

star

music

shep

distro

building

music hd riley

edit

system

hawk tony

Wake Up Call 2014: Obama DHS Prepares Fo... Civil unrest coming to America. Make this viral and get the word out! OR IGINAL V...

rednexmusic

watching

itunes

harris words

university fellow

link research

Libyan Loop: Back to brink of civil war ... Libya is on the brink of plunging into yet another bloody civil conflict - that'...

day

harrelson hypertree

trapping

vital

chase social life evensen bull

snake official

subscribe

del

linux

official full song video er

rileyhttps

classical

bmx

footage

profit

album

story reed

watchmojo

counting

topriaz

fellowapple

app ist

download

S yrian Civil War 2013 - Battlefield S yri... S yria Civil War 2013 - Battlefield S yria: Heavy Fighting Clashes And Firefights....

catherine

Guns n R oses Civil war live (Voodoo Chil... stop posting all this "you say jonas brothers I say blabla" shit....thank you. G...

ch

roll

meets flying

check

riffs ted guitar https

American Civil War Documentary - R ally A... This is the first DVD of a 3 DVD documentary box set on the American Civil War. ...

Civil War Combat - Battle of S hiloh - Pa... A documentary of History Channel about the Battle of S hiloh on April 6-7 1862 in...

dawgs

van

today

amazing

confederate

twitter

guitar

time

black

meditationsongs

swirling

yellow

videos

mind

park

ve yearsda

note

eruption chase

le

knowledge korea

contemporary

mundo link

Warriors (Imagine Dragons guitar cover) Cover of the new World Championship theme song by Imagine Dragons S hitty recordi...

The Civil War in Four Minutes: The Battl... Historian Garry Adelman describes the events that took place during the Battle o...

https hd

creepy fails speeds changed mission braveheartdobsoncompilation https

mini caught

piano

firefights

hd

facebook

spanish facebook enjoy youtube

tabs original jung korea

singapore charged

north

playing

sungha guitar video

hope

twitter brain hawk topic

play

gettysburg

time

gig

bit

shopping 2ne1 ewan

skater

red-tailed hawk tony facebook

hits

handshake big

black

awesome youtube

world

fix

feat

hacks

videoimagine

hell

rude

enjoy

special

lesson

challenge

ve

life

mods

dragons coldplay

top

easy

songs awesome

guitar

fingerstyle

learn make

song play

video

note sound playlist instrumental production

monroe

Causes of the Civil War - Popular sovere... 55 From The American Civil War Volume 1 hosted by Dale R eed 1793: Invention of t...

video bmx

hawks

American Civil War Timeline 1861-1865 http://storiesofusa.com/american-civil-wartimeline-battlefields-1854-1865/ - Am...

guitar full house austin produced intro episode

back

candyrat

magic

spastic

cover

relax

performace

ly jung

boy

hawk apple chadwick download alex

acoustic

relaxing music part

field

music

directed

song bit

rubberbandits

children

sungha

album riffs itunesem

making

i-hawk

https

qua

tony roll

guitar

m-tp

monologue

rock

original history

Avengers 2 S POILE R S - Marvel Civil War t... Today, Marvel Phase 3 predictions on how Civil War could be coming soon to the M...

Civil War Veterans on Film, E CHOE S OF TH... Civil War Veterans on film, from the documentary E CHOE S OF THE BLUE & GR AY. Prod...

The Civil War: Gettysburg On July 3, 1863 in Gettysburg, PA the Union lured the Confederate troops out int...

back

wimbledon

Guns N' R oses-Civil War w/Lyrics "What we've got here is failure to communicate. S ome men you just can't reach......

tide

built

documentary

soulja

cover

DE ATH AND THE CIVIL WAR (AMAZING AME R ICA... DE ATH AND THE CIVIL WAR (AMAZING AME R ICAN HIS TOR Y DOCUME NTAR Y) With the coming o...

chief

tatyana part

revolution

Civil War Journal - The Gray Ghost: John... Leading a cavalry of 800 men without interference from ranking Confederacy offic...

facebook full

Civil War Journal - Pickett's Charge Pickett's charge at Gettysburg would be rebuffed, leading to a major defeat for ...

skills

ryzhkova gattar

click di

put

ultimate

song

easy

kid

wyclefgiuliani enjoy performs child boy

long

beautiful

hetfield

prank made guitar ak47

create

room rising center

youtube

james

enjoy

zen

piano

mind

Football Focus - FIFA World Cup 2014 This weeks Football Focus is 'focused' on upcoming FIFA World Cup 2014. Dan Walk...

software

design

maryland

starlight

S tar Wars: Battlefront 2 - Galactic Conq... This is an episode of my S tar Wars Battlefront 2 Galactic Conquest series. This ...

meditation

visualization final info

Battleplan American Civil War Campaign L... Let's Play Battleplan: American Civil War, a PC and mobile real time strategy ga...

tool

R are Civil War Postmortem Photographs R are post-mortem photographs taken during the Civil War. E xcludes images taken d...

life

interactive information visualization inf ovis

flip

ca

editor

university data

movie

hay hours

background

collect ion

finder

các hd

time

sound cách ch ng nh

guitar music

interaction

work

relaxing bài



video system

bigplaisant based

therapy

day

therapyguitar

lonely

The second application concerns textual document visualization. Following the conventional tf-idf bag-of-words construction [44] and stemming [45], one can yield feature vectors from which our methodology can straightly be applied. In the application illustrated in Figure 10, we visualize a collection of 515 related papers published in the IEEE VisWeek Conference 2004. Each cell is textured with a word cloud built from keywords extracted from the documents contained in the cell while font size and opacity is used to highlight the most relevant keywords whose ranking is computed using the method described in [11]. Notice that the layout is easy to read, mainly due to the effective use of display area. Moreover, it is not difficult to note that the grid-like structure makes the visual identification of similar documents an easier task.

instrumental

green

cells containing four to eight images gives rise to a four blended image icon, and icon with nine blended images is used to represent cells with nine or more images. Images used in this application were obtained from [43].

park

mam epi jsin t ig shnei out orht sar ode yer vel d h es

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

mundo

simulation coupe

sports

video final official belgium

Fig. 11. Visualizing 300 videos extracted from Youtube querying 6 different topics.

6

D ISCUSSION

AND

L IMITATIONS

Quantitative and qualitative comparisons presented in Section 4 attest the quality of our layout arrangement mechanism. In fact, our approach clearly outperforms most existing techniques with respect to properties such as area usage, orthogonal ordering and layout alignment, in addition to perform well as to neighborhood preservation. Therefore, our approach is able to handle several requirement simultaneously, making it an attractive alternative for several applications. The applications described in Section 5 show the versatility of our approach in scenarios ranging from image gallery construction to word cloud based text visualization. Moreover, the layouts resulting from our methodology turned out to be clean, well structured, and easy to read. Another interesting aspect of our approach is that the only parameter to be set is the number of refinement levels in the adaptive grid. In our tests we use three levels of refinement and we notice that the layout is not so easy to read when more than five levels are used. However, the appropriate number of refinement levels depends on the underlying application and the size of screen. A weakness of our methodology is that dense and tight data cluster may be cut into multiple cells during the grid construction. This problem can be mitigated through density-aware grid generation, an improvement to be incorporated in our methodology. The quality of the semantic relation among grid cells also depends on the effectiveness of the multidimensional projection scheme in preserving neighborhoods during the mapping process. Visually encoding uncertainties as to neighborhood relation can help users during data exploration. However, this is an aspect that has not been properly addressed even in the context of multidimensional projection methods. From our experimental analysis we also notice that it

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

is not necessary to run the optimization procedure until convergence. In other words, good layouts are obtained after a few hundred iteration steps, being unnecessary to wait until a local minimum is reached. This fact makes the proposed methodology also attractive in terms of computational times. For instance, the layouts presented in Section 5 took less than six seconds to be produced. However, the algorithm can take a few minutes to fully converge if a local minimum is sought. To numerically illustrate this fact, we show in Fig. 12 the computational time spent by our methodology to reach a certain number of iterations versus the ratio between the resulting and global minimization energies Et /Eg for each layout from Fig. 7. Note from the y-axis of the graph that how close to 1 the energy rates are, even for the layouts produced with a reduced number of iteration steps. DT1

DT2

DT3

DT4

DT5

[2] [3] [4] [5] [6] [7]

[8] [9] [10]

1,1000

Energy ratio (Et/Eg)

1,0800

[11]

1,0600 1,0400

1,0200

[12]

1,0000

[13]

0,9800 0,00

0,50

1,00

1,50

2,00

2,50

Iteration time (log, in sec.)

Fig. 12. Computational time versus optimization convergence rate for the layouts from Fig. 7.

[14]

[15]

7

C ONCLUSION

We introduced a novel methodology to build geometric layouts to visualize data. In contrast to existing techniques, the proposed method makes use of a novel optimization procedure that is able to handle several requirements simultaneously, yielding structured overlap-free arrangements while still ensuring a semantic relation among neighbor entities. Furthermore, the method makes optimal use of display area, rendering it an effective and flexible tool for applications varying from image gallery construction to video data set visualization. We are currently investigating interactive mechanisms to enable a free navigation throughout the layout as well as dynamic user-driven layout updates.

[16] [17] [18] [19]

[20]

[21]

ACKNOWLEDGMENTS The authors acknowledge the financial support from FAPESP (#2011/22749-8 #2013/00191-0 and #2014/16857-0) and CNPq (#302643/2013-3).

R EFERENCES [1]

K. Koh, B. Lee, B. Kim, and J. Seo, “Maniwordle: Providing flexible control over wordle,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 6, pp. 1190–1197, Nov. 2010.

[22] [23] [24]

12

F. V. Paulovich, F. M. B. Toledo, G. P. Telles, R. Minghim, and L. G. Nonato, “Semantic wordification of document collections,” Comp. Graph. Forum, vol. 31, no. 3pt3, pp. 1145–1153, Jun. 2012. J. Bertin, Graphics and graphic information-processing. de Gruyter (The University of Michigan), 2007. E. Tufte, Envisioning information, ser. Envisioning Information. Graphics Press, 1990, no. 914. S. Van den Elzen and J. J. van Wijk, “Small multiples, large singles: A new approach for visual data exploration,” Computer Graphics Forum, vol. 32, no. 3pt2, pp. 191–200, 2013. R. D. Pinho, M. C. Oliveira, and A. Andrade Lopes, “An incremental space to visualize dynamic data sets,” Multimedia Tools and Applications, vol. 50, no. 3, pp. 533–562, 2010. G. Strong and M. Gong, “Data organization and visualization using self-sorting map,” in Proceedings of Graphics Interface 2011, ser. GI’2011. Canadian Human-Computer Communications Society, 2011, pp. 199–206. ——, “Self-sorting map: An efficient algorithm for presenting multimedia data in structured layouts,” Multimedia, IEEE Transactions on, vol. 16, no. 4, pp. 1045–1058, June 2014. J.-D. Fekete and C. Plaisant, “Interactive information visualization of a million items,” in IEEE Symposium on Information Visualization (INFOVIS 2002). IEEE Computer Society, 2002, pp. 117–124. H. Strobelt, M. Spicker, A. Stoffel, D. Keim, and O. Deussen, “Rolled-out wordles: A heuristic method for overlap removal of 2d data representatives,” Computer Graphics Forum, vol. 31, no. 3pt3, pp. 1135–1144, 2012. F. V. Paulovich, F. Toledo, G. P. Telles, R. Minghim, and L. G. Nonato, “Semantic wordification of document collections,” in Computer Graphics Forum, vol. 31, no. 3pt3. Wiley-Blackwell Online Library, 2012, pp. 1145–1153. W. Javed, B. McDonnel, and N. Elmqvist, “Graphical perception of multiple time series,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 6, pp. 927–934, 2010. J. Kehrer, H. Piringer, W. Berger, and M. E. Groller, “A model for structure-based comparison of many categories in small-multiple displays,” IEEE Transactions on Visualization and Computer Graphics, vol. 19, no. 12, pp. 2287–2296, 2013. R. Blanch and E. Lecolinet, “Browsing zoomable treemaps: Structure-aware multi-scale navigation techniques,” IEEE Transactions on Visualization and Computer Graphics, vol. 13, no. 6, pp. 1248–1253, 2007. B. Johnson and B. Shneiderman, “Tree-maps: A space-filling approach to the visualization of hierarchical information structures,” in Proceedings of the 2Nd Conference on Visualization, ser. VIS’91. Los Alamitos, CA, USA: IEEE Computer Society Press, 1991, pp. 284–291. M. Bruls, K. Huizing, and J. J. Van Wijk, “Squarified treemaps,” in Data Visualization. Springer, 2000, pp. 33–42. B. Shneiderman and M. Wattenberg, “Ordered treemap layouts,” in IEEE Symposium on Information Visualization (INFOVIS 2001). IEEE Computer Society, 2001, pp. 73–73. B. Shneiderman, “Treemaps for space-constrained visualization of hierarchies,” 1998-2013. [Online]. Available: http://www.cs.umd. edu/hcil/treemap-history/ C. Tuttle, L. G. Nonato, and C. Silva, “Pedvis: A structured, spaceefficient technique for pedigree visualization,” IEEE Transactions on Visualization and Computer Graphics, vol. 16, no. 6, pp. 1063–1072, 2010. M. Balzer, O. Deussen, and C. Lewerentz, “Voronoi treemaps for the visualization of software metrics,” in Proceedings of the ACM Symposium on Software Visualization, ser. SoftVis’2005. New York, NY, USA: ACM, 2005, pp. 165–172. B. B. Bederson, B. Shneiderman, and M. Wattenberg, “Ordered and quantum treemaps: Making effective use of 2d space to display hierarchies,” ACM Transactions on Graphics (TOG), vol. 21, no. 4, pp. 833–854, 2002. J. Wood and J. Dykes, “Spatially ordered treemaps,” IEEE Transactions on Visualization and Computer Graphics, vol. 14, no. 6, pp. 1348–1355, 2008. A. Nocaj and U. Brandes, “Organizing search results with a reference map,” IEEE Transactions on Visualization and Computer Graphics, vol. 18, no. 12, pp. 2546–2555, 2012. E. R. Gansner and Y. Hu, “Graph drawing,” I. G. Tollis and M. Patrignani, Eds. Berlin, Heidelberg: Springer-Verlag, 2009, ch. Efficient Node Overlap Removal Using a Proximity Stress Model, pp. 206–217.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

[25] X. Huang, W. Lai, A. Sajeev, and J. Gao, “A new algorithm for removing node overlapping in graph visualization,” Information Sciences, vol. 177, no. 14, pp. 2821–2844, 2007. [26] C.-C. Lin, H.-C. Yen, and J.-H. Chuang, “Drawing graphs with nonuniform nodes using potential fields,” in Journal of Visual Languages and Computing, vol. 20, no. 6. Springer, 2009, pp. 385–402. [27] D. Harel and Y. Koren, “Drawing graphs with non-uniform vertices,” in Proceedings of the Working Conference on Advanced Visual Interfaces. ACM, 2002, pp. 157–166. [28] T. Dwyer, K. Marriott, and P. J. Stuckey, “Fast node overlap removal,” in Graph Drawing. Springer, 2006, pp. 153–164. [29] K. Marriott, P. Stuckey, V. Tam, and W. He, “Removing node overlapping in graph layout using constrained optimization,” Constraints Journal, vol. 8, no. 2, pp. 143–171, 2003. [30] S. Kieffer, T. Dwyer, K. Marriott, and M. Wybrow, “Incremental grid-like layout using soft and hard constraints,” in Graph Drawing (Lectures Notes in Computer Science), 2013, pp. 448–459. [31] U. Ruegg, S. Kieffer, T. Dwyer, K. Marriott, and M. Wybrow, “Stress-minimizing orthogonal layout of data flow diagrams with ports,” in Graph Drawing, 2014, pp. 1–16. [32] E. Gomez-Nieto, F. S. Roman, P. Pagliosa, W. Casaca, E. S. Helou, M. C. F. de Oliveira, and L. G. Nonato, “Similarity preserving snippet-based visualization of web search results,” IEEE Transactions on Visualization and Computer Graphics, vol. 20, no. 3, pp. 457–470, 2014. [33] E. Gomez-Nieto, W. Casaca, L. G. Nonato, and G. Taubin, “Mixed integer optimization for layout arrangement,” in Proceedings of the 26th Conference on Graphics, Patterns and Images, ser. SIBGRAPI’13. Washington, DC, USA: IEEE Computer Society, 2013, pp. 115–122. [34] T. Dwyer and L. Nachmanson, “Fast edge-routing for large graphs,” in Graph Drawing. Springer, 2010, pp. 147–158. [35] F. V. Paulovich, L. G. Nonato, R. Minghim, and H. Levkowitz, “Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping,” IEEE Transactions on Visualization and Computer Graphics, vol. 14, no. 3, pp. 564–575, 2008. [36] J. Leung, T. Tam, C. Wong, G. Young, and F. Chin, “Packing squares into a square,” Journal of Parallel and Distributed Computing, vol. 10, no. 3, pp. 271–275, 1990. [37] K. Jansen and L. Pradel, How to Maximize the Total Area of Rectangles Packed into a Rectrangle? Inst. fur Informatik, CristianAlbrechts-Univ., Kiev, 2009. [38] J. Lee and S. Leyffer, Mixed Integer Nonlinear Programming (The IMA Volumes in Mathematics and its Applications). Springer, 2012. [39] K. Misue, P. Eades, W. Lai, and K. Sugiyama, “Layout adjustment and the mental map,” Journal of visual languages and computing, vol. 6, no. 2, pp. 183–210, 1995. [40] H. Samet, Foundations of Multidimensional and Metric Data Structures. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2005. [41] O. Fried, S. DiVerdi, M. Halber, E. Sizikova, and A. Finkelstein, “IsoMatch: Creating informative grid layouts,” Computer Graphics Forum (Proc. Eurographics), vol. 34, no. 2, May 2015. [42] M. N. Do and M. Vetterli, “Framing pyramids,” IEEE Transactions on Signal Processing, vol. 51, no. 9, pp. 2329–2342, 2003. [43] L. Fei-Fei, R. Fergus, and P. Perona, “Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories,” 2004. [44] G. Salton, “Developments in automatic text retrieval,” Science, vol. 253, no. 5023, pp. 974–980, 1991. [45] H. P. Luhn, “The automatic creation of literature abstracts,” IBM Journal of research and development, vol. 2, no. 2, pp. 159–165, 1958.

13

Erick Gomez-Nieto obtained a BSc degree (2009) in Informatics Engineering from San Pablo Catholic University - Arequipa, Peru. He holds a MSc degree (2012) in Computer Science from University of São Paulo - SP, Brazil. He was lecturer/researcher at Computer Science School of San Pablo Catholic University where he was awarded by his notable scientific research, in 2012/2013. Currently, he is working toward the PhD degree at University of São Paulo. His research interests are in Interactive Visualization, Visual Analytics and Geometry Processing.

Wallace Casaca obtained the B.Sc. and Master degree in Pure and Applied Mathematics from São Paulo State University (UNESP), Brazil, in 2008 and 2010, respectively. During 2010-2014, he pursued his Ph.D. in Computer Sciences and Applied Mathematics at University of São Paulo (USP), Brazil. As part of his doctoral studies, he also worked at visiting researcher at Brown University, School of Engineering, USA, and in cooperation with MIT, Media Lab, USA. His research interests include image segmentation, image restoration, image retrieval, information visualization, optimization techniques, partial differential equations and numerical analysis.

Danilo Motta received his BSc degree in Computer Science by the Federal University of Mato Grosso do Sul, in 2010, and his MSc degree in Computer Science and Computational Mathematics from University of São Paulo - SP, Brazil, in 2013. Currently he is a PhD student and member of the Visual and Geometry Processing Group - VGPG at ICMC-USP. His current interests include data processing and visualization, optimization and applied machine learning.

Ivar Hartmann is a Professor at FGV Law School in Rio de Janeiro, where he has taught Cyberlaw, Programming for Lawyers and Theory of Justice. He coordinates the "Supreme Court in Numbers" project, which employs computer science to empirical legal research in order to produce data on tens of millions of lawsuits from different Brazilian courts. The project team is composed of nearly a dozen researchers with diverse backgrounds such as Law, Computer Engineering and Statistics. Ivar’s own research and publications cover topics of Constitutional, Private, Enviromental, Procedural and Cyberlaw, as well as Empirical Legal Research. Ivar holds an MsC from the Catholic University of Rio Grande do Sul, Brazil and an LL.M. from Harvard Law School. He is an S.J.D. candidate at the State University of Rio de Janeiro.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, (PREPRINT), 2015

Gabriel Taubin earned a Licenciado en Ciencias Matematicas degree from the University of Buenos Aires, Argentina, and a Ph.D. degree in Electrical Engineering from Brown University. In 1990 he joined the IBM Research Division, where he held various positions, including Research Staff Member and Research Manager. In 2003 Taubin joined the Brown University School of Engineering as an Associate Professor of Engineering and Computer Science. During the 2000-2001 academic year, on sabbatical from IBM, he was Visiting Professor of Electrical Engineering at the California Institute of Technology. During the Spring semester of 2010, on sabbatical from Brown, he was Visiting Associate Professor of Media Arts and Sciences at MIT. Prof. Taubin was the Editor-in-Chief of the IEEE Computer Graphics and Applications Magazine from 2010 to 2013, and has served as a member of the Editorial Board of the Geometric Models journal, and as associate editor of the IEEE Transactions of Visualization and Computer Graphics. Prof. Taubin was named IEEE Fellow for His Contributions to the development of three-dimensional geometry compression technology and multimedia standards, won the Eurographics 2002 Günter Enderle Best Paper Award, and was named IBM Master Inventor. He has contributed to the field called Digital Geometry Processing with methods to capture 3D shape, for surface reconstruction, geometric modeling, geometry compression, progressive transmission, signal processing, and display of discrete surfaces. The 3D geometry compression technology that he have developed at IBM was incorporated into the MPEG-4 standard, and became an integral part of IBM products.

Luis Gustavo Nonato received the PhD degree in applied mathematics from the Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro - Brazil, in 1998. He is currently full professor at the Instituto de Ciências Matemáticas e de Computação (ICMC) - Universidade de São Paulo (USP) - Brazil. He spent a sabbatical leave in the Scientific Computing and Imaging Institute at the University of Utah from 2008 to 2010. Besides having served in several program committees, including IEEE Visualization and Eurovis, he was a member of the editorial board of Computer Graphics Forum and the president of the Special Committee on Computer Graphics and Image Processing of Brazilian Computer Society. Currently Dr. Nonato leads the Visual and Geometry Processing Group at ICMC-USP.

14