Computer
RESEARCH FEATURE
Interconnection Networks: Architectural Challenges for Utility Computing Data Centers

Olav Lysne, Sven-Arne Reinemo, Tor Skeie, Åshild Grønstad Solheim, and Thomas Sødring, Simula Research Laboratory, University of Oslo
Lars Paul Huse and Bjørn Dag Johnsen, Sun Microsystems
Advancing research will enable an interconnection network to support the same seamless virtualization found in other parts of hardware, such as CPUs. Such a network thus poses particular challenges as well as opportunities for a utility computing data center.

Years ago, the vision of a computational grid received considerable attention. The idea that computational power should be as readily available over the Internet as electrical power is over the power grid is appealing, and it inspired researchers worldwide to launch many new research projects and programs addressing different issues. Even though academic interest in the computational grid appears to have peaked, it spawned awareness of a new operational mode for computational data centers, often called utility computing. In this mode, the system assigns compute resources on demand to customers, who request a subset of the resources in a data center for a defined period. Typically, customers pay only for the resources they require and only for the period they require them. To capitalize on this model, vendors have provided various solutions to utility computing, such as Sun’s N1,1 HP’s Adaptive Enterprise,2 and IBM’s E-Business On Demand.3 Recent examples of utility computing services now being offered include the Sun Grid Compute Utility4 and the Amazon Elastic Compute Cloud (www.amazon.com/ec2).

A utility computing data center (UCDC) dynamically creates virtual servers containing a subset of the available resources to fulfill user demands. The jobs or services running in a UCDC typically have diverse characteristics, such as different resource requirements, running times, software quality, security requirements, and importance. In addition to the classic high-performance computing applications, a job might be, for example, an ad hoc Web service set up to host a World Cup football championship for two months.

While present-day solutions to utility computing mainly consist of software running atop existing architectural designs, we argue that to get the full benefit of UCDCs in the future, the underlying architecture should evolve. Recent innovations include the development of virtualization support in hardware, such as in CPUs from major vendors. In research on interconnection networks, the common assumption has been that the system is executing only one job or class of jobs concurrently, and that the interconnection network’s task is to maximize overall performance. A UCDC’s diverse requirements therefore generate a set of problems that researchers have only marginally studied before.
INTERCONNECTION NETWORK RESEARCH

Interconnection networks must meet particularly high demands in terms of bandwidth, delay, and delivery over short distances. Typical application areas include multiprocessors, computing clusters, storage servers, computer I/O devices, processor-memory interconnects, networks-on-a-chip, and
Published by the IEEE Computer Society
0018-9162/08/$25.00 © 2008 IEEE
the internal network of Internet Protocol routers.5

Two decades ago, interconnection networks were mainly bus-based systems. However, such architectures suffer from both serial access and arbitration overheads, and in that respect they scale poorly. To keep pace with computer evolution and the increased burden imposed on data centers, application processing, and enterprise computing, interconnection network technologies have migrated into point-to-point switch-based networks over the past 10 to 15 years. These technologies benefit from the parallelism offered by nonblocking switches capable of forwarding packets at full link speed concurrently between input and output ports if no group of packets competes for the same output port. Today, most state-of-the-art interconnection network technologies like InfiniBand (www.infinibandta.org) and HyperTransport (www.hypertransport.org) support switch-based principles.

Interconnection networks typically use techniques like virtual cut-through or wormhole switching. In both these techniques, when a packet header arrives at a switch, the switch immediately routes it onward, which reduces packet latency in large interconnection networks. The difference between virtual cut-through and wormhole switching is that the former assumes a maximum packet size and will buffer the packet completely in one switch if the requested output port is busy. The latter, on the other hand, does not assume a maximum packet size, and a switch may only have sufficient buffer space for a portion of a packet. This makes a packet stretch out in the network during congestion, causing a movement toward its destination that resembles a worm wriggling through a heap of sand—hence the term wormhole switching.

A distinguishing factor when comparing interconnection networks to traditional computer networks is that no packet loss occurs in an interconnection network. Because the system does not let a switch discard noncorrupted packets, it must exercise flow control on the links. Packets need sufficient credits in the next switch to be forwarded, which guarantees the buffer space to receive them at the other end. Such lossless networks are susceptible to deadlocks because buffer resources of one channel will have direct and indirect dependencies on buffer resources of other channels. As such, the system must choose routing strategies carefully to avoid deadlocks.

Figure 1 depicts a classic example of a small network that can deadlock. If all switches forward their packets counterclockwise, deadlock occurs when all switches simultaneously try to forward streams of packets to the switch located diagonally opposite. The packets will need to make two hops to reach their destination, but are blocked because they need access to the buffers in the next switch. However, packets being blocked for the same reason occupy those buffers, and so forth. Thus, the routing function causes a cyclic dependency between the channels’ buffer resources.

Figure 1. Deadlock configuration example. The packets in switches 1, 2, 3, and 4 are destined for switches 3, 4, 1, and 2, respectively. A lack of buffer space in the next switch blocks each packet in this set and, since each packet awaits another packet in the set to advance, all packets remain blocked.

The first important breakthrough in deadlock avoidance considered only the store-and-forward paradigm. Later, researchers extended this result to wormhole switching and adaptive routing techniques. They based all these results on the notion of channel dependency graphs, which uses conventional graph theory to examine whether a routing function is deadlock-free.

Along with the migration to switch-based networks, most academic research shifted focus to propose scalable regular network topologies—where meshes, hypercubes, tori, and multistage interconnects are most prominent—together with deadlock-free routing algorithms that could benefit from the regular structure of these topologies. In the late 1990s, the high-performance computing community started building cluster computers connected by off-the-shelf switches. Such computer systems might also be structured as irregular topologies, leading
September 2008
to the need for topology-agnostic routing strategies. Academia focused on developing such routing algorithms and, recently, researchers have introduced algorithms that perform almost as well as regular topology-specific algorithms, in addition to the flexibility that generic algorithms provide.

Recently, there has been a shift toward an increased focus on the network’s ability to provide sustained service if a network component fails. A plethora of work has addressed this problem area, most of which relies on a static fault model that assumes the entire network and accompanying applications remain shut down during the network’s reconfiguration. The current trend is to handle fault situations on the fly, assuming a dynamic fault model. This poses new challenges related to the deadlock problem because rerouting packets to a new fault-free path causes new channel dependencies in the resulting routing function.

Researchers have also focused on quality of service (QoS) in interconnection networks. Along with the introduction of new application areas related to multimedia and other time-sensitive applications, it became evident that interconnection networks must offer traffic differentiation, as increases in bandwidth do not perpetuate, and thus developers cannot rely solely on overprovisioning in the future. The research community has taken note of this, and today most interconnection network standards offer QoS mechanisms, while academia has been researching how to best utilize these mechanisms.

Recently, both standardization bodies and academia have studied congestion management. Since interconnection networks use flow control, congestion affects overall performance, severely limiting the network’s ability to meet its QoS guarantees. When congestion occurs, a depletion of credits begins, with the result that buffers start filling up. The situation spreads upstream through the network, building up congestion trees. To remedy this problem, the interconnection networking community has proposed various congestion-management techniques that range from simple forward/backward explicit notifications to more advanced solutions that perform congestion-tree housekeeping.

SYSTEM MODEL

We view a UCDC as a large collection of resources in which each resource falls into one of the following categories:

• compute nodes (CPUs with memory),
• storage nodes (disks or tapes), and
• access nodes (gateways to external networks).

Figure 2. Schematic diagram of a utility computing data center. In this example, a 3 × 3 mesh interconnect links the resources. This simplified figure only illustrates the concept—real UCDCs will be much larger and might use different topologies.

An interconnection network links these resources. In principle, the interconnection technology used is not fixed, but we assume it is based on a point-to-point link technology like InfiniBand. At least one link connects each resource to the interconnection network. Both capacity and resilience can prompt the need for connecting a resource to the interconnection network via more than one link. Figure 2 shows our UCDC model.

End users for such a system simply lease resources as required, which gives them the flexibility to offload management of the physical resources to the UCDC provider. The UCDC provider benefits by sharing a single set of physical resources among several different users.

The UCDC system can group a network’s resources into logical subnetworks to support virtualization. In this case, each logical subnetwork, which has its own virtual interface, is referred to as a virtual server. The benefits of virtualization include increased flexibility, availability, and scalability. The flexibility arises from the ability to change the allocated resources’ setup during runtime. The ability to move the virtual server from one set of physical resources to another, if required, provides availability. Increased capability, such as an increase in available compute power, yields scalability by allowing an allocation of additional resources to the virtual server.

However, virtualization does introduce concerns relating to security, anonymity, and QoS. An allocated partition within the network should not be allowed to interfere with another partition. Interference can take the form of unauthorized access to resources, applications, and data—including interference on the interconnect in the form of malicious or accidental cross-traffic generation. Anonymity can be an important aspect within such a domain as an end user might want to prevent a third party from obtaining knowledge about the traffic traversing both shared and private links. This kind of anonymity also extends to whether the partition is idle or not. Allocated partitions should ensure an implementation of privacy and security where the typical sandbox approach to security might apply. QoS exists on many levels, such as between applications in one operating system instance, between several operating system instances on one CPU, between several CPUs in one node, between nodes in the same partition, and between different partitions.

Traditionally, processor architectures include two modes of operation: user (or unprivileged) and supervisor (or privileged). The operating system uses the supervisor mode, in which the executing code has access to all instructions and control registers. Since the operating system, including device drivers, controls the partition’s operation, the UCDC provider must either trust the operating system on the compute nodes to behave correctly, or the network implementation (including the network interface cards) must be able to enforce critical policies for access control and traffic separation. Hence, in UCDC environments where a job specification may include operating systems and drivers that the UCDC owner does not control, the compute nodes become inherently untrusted, and the requirements on the network implementation become stricter.

Recent developments in CPU technology include multicore CPUs. The use of multicore CPUs will profoundly affect the programming paradigm, which will also be reflected in future UCDCs as application software will be written and rewritten to take advantage of this parallelism. Hypervisors must take into account both multicore CPUs and how virtualization should extend from a node into the UCDC’s interconnection network. As the number of cores increases to hundreds of cores per CPU, we see the problems facing resource management within a UCDC being reflected in resource management within a CPU. Clearly, the number of cores cannot simply increase without a corresponding increase in the available I/O resources. From the interconnect perspective, we believe that the use of multicore computing within a UCDC should be as seamless as single-core computing is today.

A UCDC that participates in a large grid will have a workload consisting of a set of jobs or service requests. These jobs can have a property set that distinguishes them from jobs traditionally run on a big multicomputer:

• Only a subset of the available resources will be required—typically storage nodes, a specified number of compute nodes, and one or more access nodes.
• Multiple jobs, using overlapping sets of resources, will execute simultaneously to ensure effective resource utilization.
• Organizations over which the UCDC provider has little control will develop these jobs.
• Depending on the payment model, these organizations would prefer a predictable service for the execution of their jobs. Ideally, they would like to see the system perform as if there were no other jobs on it, and as if the entire computer consisted of the resources assigned to their job.
• The software’s quality will be highly variable. For each job, typically the customer or a third party, not the data center’s owner, will have developed the software. Even jobs stemming from a debugging phase of development are conceivable.

This list’s properties are not new. All are easily recognizable from the requirements that first faced time-sharing mainframes decades ago and that led to solutions such as CPU scheduling, virtual memory, and disk scheduling. For the system interconnect, however, the preceding list presents the following yet-to-be-addressed challenges.

Flexible partitioning. Because several jobs will run concurrently and the software’s quality in these jobs will vary, a misbehaving job must not consume bandwidth in the interconnect to the extent that other jobs suffer. Moreover, a typical job will come with a set of requirements relating to the number of compute nodes, storage nodes, and access nodes it demands. This requires partitioning the network as a set of, possibly noncontiguous, regions to maximize resource utilization.

Fault tolerance. The effect of a faulty component in the interconnection network should be constrained to as few jobs as possible, and the set of jobs the fault terminates should, to the greatest extent possible, be controlled from a job-importance perspective. A fault-tolerance solution should allow unaffected jobs to run without interruption.

Predictable service. With regard to partitioning and multiple jobs, it should be possible to guarantee a specific portion of the interconnect network capacity to each partition or job. Further, it should be possible to differentiate between jobs based on importance.

RESOURCE PARTITIONING

Partitioning describes the mapping of a request for compute resources to a subset of the physical architecture. Taking a snapshot of an operational UCDC,
Figure 3. Three different approaches to resource partitioning. A new request for compute resources arrives when three jobs are running in the sample UCDC: The first job runs on entities 0, 1, 4, and 5; the second runs on entities 7 and 11; and the third runs on entities 12 through 15. The arrow indicates the number of resource entities requested, and the broken line frames the resulting allocation. (a) Allocations must be rectangular. (b) Allocations may be irregular. (c) Allocations may be irregular and overlapping.
we expect to see a set of jobs running, each utilizing a subset of the assigned resources, and each possibly belonging to different owners. The set of resources assigned to a job forms a partition (or virtual server). In the interconnection network, we only observe communication between two resources if they are part of the same partition. Figure 3 illustrates three different approaches to resource partitioning. In the case where all partitions are disjoint and can be assumed to utilize disjoint parts of the interconnection network, the problem is relatively easy. It is equivalent to the traditional problem of contiguously allocating processors and has been studied extensively, particularly for meshes, but also for tori and hypercube topologies.6 In the two-dimensional case, these topologies are rectangular, and resources (CPUs) have typically been allocated in nonoverlapping rectangular partitions of suitable size. The preferred routing methods for these topologies automatically let the resources in each subrectangle communicate internally without sending packets outside its own partition. Thus, there is no link contention between the separate partitions. This can be advantageous for communication-intensive applications. Fragmentation limits a system’s resource utilization and is a well-known problem for traditional contiguous processor-allocation schemes. External fragmentation means that there are enough resources available to serve an incoming job request, but the shape of the available resources is not as required. Internal fragmentation occurs when some restriction causes the allocation of more resources than requested by a job. One possible restriction could be that the allocated resources must constitute a square in which the length of each side is a power of 2. Noncontiguous allocation methods solve the problem of fragmentation at the expense of increased link contention between the different partitions. 
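The external-fragmentation effect just described can be made concrete with a small sketch. Everything below (the 4 × 4 grid encoding, the job placements taken from the Figure 3 scenario, and the function name) is our own illustration, not code from the cited allocation studies: ten nodes are busy, six are free, yet no fully free rectangle can accommodate a six-node request.

```python
# Toy illustration of external fragmentation under contiguous rectangular
# allocation on a 4 x 4 mesh. Node (r, c) is busy if a running job owns it;
# a new request asks for 6 nodes.

def free_rectangles(busy, rows=4, cols=4, need=6):
    """Yield all (r, c, h, w) placements of h x w sub-rectangles that are
    completely free and contain at least `need` nodes."""
    for h in range(1, rows + 1):
        for w in range(1, cols + 1):
            if h * w < need:
                continue
            for r in range(rows - h + 1):
                for c in range(cols - w + 1):
                    cells = {(r + i, c + j) for i in range(h) for j in range(w)}
                    if not cells & busy:
                        yield r, c, h, w

# Three running jobs occupy 10 of the 16 nodes (cf. Figure 3).
busy = {(0, 0), (0, 1), (1, 0), (1, 1),   # job 1: entities 0, 1, 4, 5
        (1, 3), (2, 3),                   # job 2: entities 7, 11
        (3, 0), (3, 1), (3, 2), (3, 3)}   # job 3: entities 12 through 15

free = {(r, c) for r in range(4) for c in range(4)} - busy
rects = list(free_rectangles(busy))
assert len(free) == 6 and not rects   # 6 free nodes, yet no rectangular fit
```

An irregular allocation, by contrast, can simply pick any six members of `free`, which is exactly the trade of fragmentation against inter-partition link contention that the text describes.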
Researchers have studied and assessed several such allocation algorithms for various communication patterns.7 Hybrid schemes have also been proposed.

For several reasons, however, previous studies of processor allocation did not solve the problems facing a UCDC provider. First, these studies considered only allocation of compute nodes. In addition to these CPUs, a job running on a UCDC typically also requires access to other resource types such as storage and access nodes. Thus, our system model for a UCDC poses new challenges to the resource partitioning problem. Second, traditional approaches to contiguous resource allocation assumed that the resources needed for each partition can be made disjoint. This assumption might not be sustainable for a provider of virtual servers due to resource utilization requirements. Moreover, utilization of storage and access nodes could imply overlapping partitions and shared links. Third, the predominant trend in network-based computing uses multistage topologies like Clos networks and fat trees. These topologies have characteristics that differ from meshes, tori, and hypercubes, making the existing studies inapplicable.

Meeting future challenges requires new studies on resource partitioning, which must take the following into account:

• In addition to the allocation of compute nodes, other resources such as storage nodes and access nodes must be considered.
• Partitions should preferably be as disjoint as possible.
• Communication interference between partitions should be limited as far as possible.
• Multicore CPUs are now common and should be a design consideration.
• Sharing physical resources between partitions might raise security issues that should be carefully considered.
• Currently, the most relevant network topologies are multistage networks; therefore, these topologies should be in focus.
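To illustrate the interference point in the list above, here is a toy example of our own (not taken from the literature surveyed): dimension-order routing on a 2D mesh forwards a packet first along the x dimension and then along the y dimension, so traffic between two nodes of a noncontiguous partition can transit switches owned by other partitions.

```python
# Sketch: dimension-order routing (DOR) on a 2D mesh, and the resulting
# inter-partition interference for a noncontiguous partition.

def dor_path(src, dst):
    """X-then-Y dimension-order route between mesh coordinates."""
    (x, y), (dx, dy) = src, dst
    path = [(x, y)]
    while x != dx:                  # correct the x dimension first
        x += 1 if dx > x else -1
        path.append((x, y))
    while y != dy:                  # then the y dimension
        y += 1 if dy > y else -1
        path.append((x, y))
    return path

# A noncontiguous partition owning two opposite corners of a 4 x 4 mesh.
partition = {(0, 0), (0, 1), (3, 2), (3, 3)}
path = dor_path((0, 1), (3, 2))
foreign = [sw for sw in path if sw not in partition]
assert foreign   # intra-partition traffic transits switches outside the partition
```

Here the route from (0, 1) to (3, 2) crosses three switches the partition does not own, so a misbehaving neighbor sharing those switches can disturb it.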
Generic routing is crucial for achieving flexible partitioning and high utilization of UCDCs.

GENERIC ROUTING FOR FLEXIBLE PARTITIONING

In previous work related to processor allocation in network-based computing, researchers generally assumed that the underlying routing function is fixed. For example, most mesh-based architectures rely on dimension-order routing (DOR) because the deadlock problem inherent in most interconnection networks has deprived these networks of the routing flexibility available in other network types, such as the Internet. For contiguous allocation, this has influenced the research community to partition in submeshes, where DOR is applicable. Also, for noncontiguous partitioning of mesh-based servers, DOR provides the underlying routing. This might cause the internal communication within a partition to pass through several other partitions and interfere with their behavior.

Two recent developments have opened up new possibilities. First, researchers have offered methods for dynamically reconfiguring interconnection networks. These methods let an interconnection network change from an old routing function to a new one while the network is up and running—without creating deadlocks during the transition phase.8,9 Second, researchers have significantly improved topology-agnostic routing algorithms, and their performance can now compete with special-purpose routing algorithms.10,11

These developments allow for the dynamic creation of irregular subnetworks that serve a particular partition. Systems can use efficient topology-agnostic routing algorithms to keep traffic confined to a subnetwork, so that partitions remain separate from each other. Dynamic reconfiguration methods can change the routing function while the network is up and running to allow implementation of the routing function that a subnetwork needs for a new partition. This poses the challenge of developing efficient partitioning methods that map jobs to computing resources by exploiting the flexibility that generic routing and dynamic reconfiguration provide. The new partitioning algorithms must ensure sufficient performance within each partition. In the model we have described, the scheduling problem is still prominent—the scheduler’s task is to select, from the set of available jobs, the next job to be served, taking into account the partitioning strategy. Job throughput is thus an important data center metric.

FAULT TOLERANCE

Fault tolerance is important to both the user and owner of a UCDC, but for different reasons. The user often embeds fault tolerance in the application, which lets it gracefully handle the occurrence of a fault in one or more resources. In this case, the most important feature required from the UCDC is redundancy, specifically the replacement of a faulty component such as a broken compute node to retain the amount of resources dedicated to the user. Thus, the user handles faults, while the owner makes sure that faulty components are replaced.

From the UCDC owner’s perspective, making sophisticated fault tolerance part of the UCDC architecture helps simplify system administration and improve utilization in the case of faults. This makes it possible to, for example, have a scheme that seamlessly preempts resources from a lower-priority partition whenever a resource in a higher-priority partition fails.

Over the past few decades, researchers have extensively studied the ability to maintain continued operation in the presence of faults in interconnection networks. This has resulted in a plethora of methods for addressing different fault models (link/node faults, static/dynamic/transient faults), different topologies (meshes/tori/hypercubes/multistage/irregular), and places of application (local/end-to-end rerouting).5

Most methods developed for interconnection networks in general can also be applied in the system model we have described, but in our scenario new aspects that have not been considered previously become important. These are connected to the observation that failure of a node or link will most likely affect only a few—perhaps only one—of the current jobs or partitions. This means that most jobs could and should be allowed to continue as if nothing had happened. Methods for fault tolerance that are limited to the resources set aside for one partition therefore become important. This means that methods for fault tolerance restricted to only a subpart of the network should be studied. Further, methods for rearranging resources between partitions of different priorities after a fault has occurred must be studied.

PREDICTABLE SERVICE

In many cases, guaranteeing completely disjoint physical resources to different partitions will be impractical. This means that sharing switches and links between virtual servers will be commonplace. Guaranteeing network service to a virtual server that shares resources with other virtual servers, even if the software running on a different virtual server misbehaves, requires QoS guarantees in the interconnection network. Although researchers have studied QoS intensively
for mainstream packet-based networks over the years, beneficial results for interconnection networks have been scarce. The most important developments are the multimedia router,12 some work on reducing the resources needed for service differentiation,13 and some work specific to InfiniBand.14 These works’ focus, however, is somewhat different from the UCDC’s needs. In particular, when studying QoS guarantees in the interconnection network, we must assume that applications might misbehave and attempt to use far more of the network resources than their allotted share. Further, the class-based differentiation that most methods suggest is insufficient for the level of differentiation required in the UCDC. The virtual view of the available resources requires a more fine-grained QoS scheme that reflects fault tolerance and partitioning in the compute and storage nodes and in the network.

The Internet community has conducted much research on algorithms that support the type of guarantees we seek, such as high-granularity differentiation similar to IntServ (www.rfc-editor.org/rfc/rfc1633.txt). These results and methods could be ported to interconnection networks—a task more challenging than it might first appear. The naive approach would be to apply the mechanisms proposed for use, for example, in the Internet or asynchronous transfer mode to interconnection networks. Yet this approach must overcome the problem that researchers based these methods on the assumption that overutilization of one link will have no effect on the performance of any other. This is not true for interconnection networks because the use of link-level flow control introduces congestion trees, which reduce the network’s overall performance and might even include flows not destined for the overutilized link. Therefore, we must study existing results from the Internet community but within the context of interconnection networks, as well as develop new solutions.
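The congestion-tree effect can be reproduced in a deliberately oversimplified, credit-style simulation of our own (real link-level flow control tracks credits per virtual channel, which we collapse here into buffer occupancy): a chain of switches feeds a destination that stops draining, and full buffers creep upstream toward the source.

```python
# Sketch: lossless, credit-style flow control on a chain of switch buffers.
# When the last hop stops draining (a hotspot), credits dry up and full
# buffers spread upstream -- the congestion-tree effect described above.

def step(buffers, capacity, inject=True):
    """Advance one cycle: a packet moves forward only if the next buffer
    has a free slot (i.e., a credit is available). The last buffer never
    drains, modeling a congested destination."""
    for i in range(len(buffers) - 2, -1, -1):    # process downstream hops first
        if buffers[i] and buffers[i + 1] < capacity:
            buffers[i] -= 1
            buffers[i + 1] += 1
    if inject and buffers[0] < capacity:
        buffers[0] += 1                          # the source keeps injecting
    return buffers

bufs, cap = [0, 0, 0, 0], 2
for _ in range(12):
    step(bufs, cap)
assert bufs == [cap] * 4   # every buffer upstream of the hotspot is now full
```

Once the chain is full, any other flow that shares one of these buffers stalls as well, even if it is not destined for the hotspot, which is why QoS schemes designed for lossy networks cannot be ported unchanged.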
Three major topics that researchers must address to successfully implement the UCDC are high-granularity service differentiation, congestion control, and admission control. Recent work in the context of interconnection networks includes high-granularity12 and class-based service differentiation schemes.14 Some researchers have recently proposed congestion-control mechanisms,15,16 while other work proposes a series of admission-control mechanisms.17 The question of how the suggested mechanisms impact partitioning and fault-tolerance schemes must be answered, and the Grand Challenge of combining all these features in the UCDC remains to be addressed.
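As a flavor of the admission-control topic, the following is a minimal path-based reservation check. It is a hypothetical sketch of the general idea, not one of the cited mechanisms: a flow is admitted only if every link on its route can still carry the requested bandwidth.

```python
# Sketch: admit a flow only if its requested bandwidth fits within the
# residual capacity of every link on its path; otherwise reject outright.

def admit(flow_bw, path, reserved, capacity):
    """Reserve `flow_bw` on each link of `path` if all links can carry it;
    reject and reserve nothing if any link would be oversubscribed."""
    if any(reserved.get(link, 0) + flow_bw > capacity[link] for link in path):
        return False
    for link in path:
        reserved[link] = reserved.get(link, 0) + flow_bw
    return True

capacity = {("A", "B"): 10, ("B", "C"): 10}
reserved = {}
assert admit(6, [("A", "B"), ("B", "C")], reserved, capacity)   # fits
assert not admit(6, [("B", "C")], reserved, capacity)           # 6 + 6 > 10
assert admit(4, [("B", "C")], reserved, capacity)               # 6 + 4 = 10
```

In a UCDC such a check would run per partition and would have to account for the congestion-tree coupling between links, which is precisely what makes the problem harder than its Internet counterpart.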
We believe that more research will enable an interconnection network to support the same seamless virtualization found in other parts of hardware, such as CPUs. The interconnection network thus poses particular challenges as well as opportunities for a UCDC.
References
1. “N1 Grid Engine 6 Features and Capabilities,” white paper, Sun Microsystems, 2004.
2. “Adaptive Enterprise: Business and IT Synchronized to Capitalize on Change,” white paper, HP, 2005.
3. “Unleash the Power of E-Business on Demand,” white paper, IBM, 2003.
4. Sun Microsystems, Sun Grid Compute Utility—Reference Guide, part no. 819-5131-10, 2006.
5. J. Duato, S. Yalamanchili, and L. Ni, Interconnection Networks: An Engineering Approach, Morgan Kaufmann, 2003.
6. F. Wu, C.-C. Hsu, and L.-P. Chou, “Processor Allocation in the Mesh Multiprocessors Using the Leapfrog Method,” IEEE Trans. Parallel and Distributed Systems, vol. 14, no. 3, 2003, pp. 276-289.
7. D.P. Bunde, V.J. Leung, and J. Mache, “Communication Patterns and Allocation Strategies,” Proc. 18th Int’l Parallel and Distributed Processing Symp. (IPDPS 04), IEEE CS Press, 2004, p. 248.
8. J. Duato et al., “Part I: A Theory for Deadlock-Free Dynamic Network Reconfiguration,” IEEE Trans. Parallel and Distributed Systems, vol. 16, no. 5, 2005, pp. 412-427.
9. T. Pinkston, R. Pang, and J. Duato, “Deadlock-Free Dynamic Reconfiguration Schemes for Increased Network Dependability,” IEEE Trans. Parallel and Distributed Systems, vol. 14, no. 8, 2003, pp. 780-794.
10. O. Lysne et al., “Layered Routing in Irregular Networks,” IEEE Trans. Parallel and Distributed Systems, vol. 17, no. 1, 2006, pp. 51-65.
11. A. Jouraku, M. Koibuchi, and H. Amano, “An Effective Design of Deadlock-Free Routing Algorithms Based on 2D Turn Model for Irregular Networks,” IEEE Trans. Parallel and Distributed Systems, vol. 18, no. 3, 2007, pp. 320-333.
12. J. Duato et al., “MMR: A High-Performance Multimedia Router—Architecture and Design Trade-Offs,” Proc. Int’l Symp. High Performance Computer Architecture (HPCA 99), IEEE CS Press, 1999, pp. 300-309.
13. A. Martínez et al., “A New Cost-Effective Technique for QoS Support in Clusters,” IEEE Trans. Parallel and Distributed Systems, vol. 18, no. 12, 2007, pp. 1714-1726.
14. F.J. Alfaro, J.L. Sanchez, and J. Duato, “QoS in InfiniBand Subnetworks,” IEEE Trans. Parallel and Distributed Systems, vol. 15, no. 9, 2004, pp. 810-823.
15. J. Duato et al., “A New Scalable and Cost-Effective Congestion Management Strategy for Lossless Multistage Interconnection Networks,” Proc. 11th Int’l Symp. High-Performance Computer Architecture (HPCA 05), IEEE CS Press, 2005, pp. 108-119.
16. M. Gusat et al., “Congestion Control in InfiniBand Networks,” Proc. 13th Symp. High-Performance Interconnects, IEEE CS Press, 2005, pp. 158-159.
17. S.-A. Reinemo et al., “Admission Control for Diffserv-Based Quality of Service in Cut-Through Networks,” Proc. 10th Int’l Conf. High-Performance Computing (HiPC 03), Springer, 2003, pp. 118-129.
Olav Lysne is a research director at Simula Research Laboratory and a professor of computer science at the University of Oslo. His research focuses on interconnects, specifically effective routing, fault tolerance, and quality of service. Lysne received an MS and a Dr.Scient., both from the University of Oslo. Contact him at [email protected].

Sven-Arne Reinemo is a postdoctoral researcher at Simula Research Laboratory. His research focuses on routing and quality of service in interconnection networks. Reinemo received a Cand.Scient. and a Dr.Scient. in computer science from the University of Oslo. Contact him at [email protected].

Tor Skeie is an associate professor at Simula Research Laboratory and the University of Oslo. His research focuses on scalability, effective routing, fault tolerance, and quality of service in switched network topologies. Skeie received an MS and a PhD in computer science from the University of Oslo. Contact him at [email protected].

Åshild Grønstad Solheim is a PhD student at Simula Research Laboratory and the University of Oslo. Her research interests include resource partitioning, deadlock-free routing, and fault tolerance in interconnection networks. Solheim received an MS in computer science from the University of Oslo. Contact her at [email protected].

Thomas Sødring is a postdoctoral researcher at Simula Research Laboratory working on the ICON project. His research focuses on interconnects and the development of generic solutions for routing, fault tolerance, and quality of service. Sødring received a PhD in computer applications from Dublin City University. He is a member of the ACM. Contact him at [email protected].

Lars Paul Huse is a senior staff engineer at Sun Microsystems, Oslo. His research focuses on crafting software that effectively exploits fast interconnects, from hardware-close programming up to the user application interface. Huse received a PhD in computer science from the University of Oslo. Contact him at [email protected].

Bjørn Dag Johnsen is a principal engineer at Sun Microsystems, Oslo. His research focuses on the implementation, configuration, and management of large-scale fabric and cluster systems. Johnsen received an MS in computer science from the University of Bergen, Institute of Informatics, Norway. Contact him at [email protected].
September 2008