Search for: interconnection-networks
Total 101 records

    Topological properties of stretched graphs

    , Article IEEE International Conference on Computer Systems and Applications, 2006, Sharjah, 8 March 2006 through 8 March 2006 ; Volume 2006 , 2006 , Pages 647-650 ; 1424402123 (ISBN); 9781424402120 (ISBN) Shareghi, P ; Sarbazi Azad, H ; Sharif University of Technology
    IEEE Computer Society  2006
    Abstract
    We study a class of interconnection networks for multiprocessors, called the Stretched-G network, which is derived from a base graph G by replacing each edge of the base network with an array of processors. Two interesting features of the proposed topology are its area-efficient VLSI layout and superior scalability over the underlying base network while preserving most of its desirable properties. We conduct a general study on the topological properties of stretched networks. We first obtain their basic topological parameters and then present an optimal routing algorithm. We also present a unified approach to obtain the topological properties and the VLSI-layout of an arbitrary stretched... 
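The edge-replacement idea in this abstract can be illustrated with a small sketch. It assumes that "an array of processors" means a simple path of k new nodes inserted between the two endpoints of each base edge; the abstract is truncated, so the paper's exact definition may differ. The names `stretch_graph` and `node_set` are hypothetical.

```python
from itertools import count


def stretch_graph(edges, k):
    """Replace each base edge (u, v) with a path u - p1 - ... - pk - v.

    Assumption: the 'array of processors' is a linear chain of k new nodes.
    """
    new_edges = []
    fresh = count()  # generates unique ids for the inserted nodes
    for u, v in edges:
        chain = [u] + [("p", next(fresh)) for _ in range(k)] + [v]
        # consecutive pairs along the chain become the new edges
        new_edges.extend(zip(chain, chain[1:]))
    return new_edges


def node_set(edges):
    """Collect every node that appears as an endpoint of some edge."""
    return {x for e in edges for x in e}


# Base graph G: a 4-node ring, stretched with k = 2 nodes per edge.
square = [(0, 1), (1, 2), (2, 3), (3, 0)]
stretched = stretch_graph(square, k=2)
print(len(node_set(stretched)), len(stretched))
```

Under this assumption a base graph with V nodes and E edges grows to V + kE nodes and (k + 1)E edges, which is one way to read the scalability claim.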

    Characterization of spatial fault patterns in interconnection networks

    , Article Parallel Computing ; Volume 32, Issue 11-12 , 2006 , Pages 886-901 ; 01678191 (ISSN) Hoseiny Farahabady, M ; Safaei, F ; Khonsari, A ; Fathy, M ; Sharif University of Technology
    2006
    Abstract
    Parallel computers, such as multiprocessor systems-on-chip (Mp-SoCs), multicomputers and cluster computers, consist of hundreds or thousands of processing units and components (such as routers, channels and connectors) connected via an interconnection network, and collectively may undergo high failure rates. Therefore, these systems must be equipped with fault-tolerant mechanisms to ensure that the system keeps running in a degraded mode. Normally, the faulty components are coalesced into fault regions, which are classified into two major categories: convex and concave regions. In this paper, we propose the first solution to calculate the probability of... 

    Modelling and evaluation of adaptive routing in high-performance n-D tori networks

    , Article Simulation Modelling Practice and Theory ; Volume 14, Issue 6 , 2006 , Pages 740-751 ; 1569190X (ISSN) Sarbazi Azad, H ; Ould-Khaoua, M ; Sharif University of Technology
    2006
    Abstract
    Many fully-adaptive algorithms have been proposed to overcome the performance limitations of deterministic routing in networks used in high-performance multicomputers, such as the well-known regular n-D torus. This paper proposes a simple yet reasonably accurate analytical performance model to predict message communication latency in tori networks. The model requires O(1) running time, making it the fastest model yet reported in the literature. Extensive simulations reveal that the new performance model maintains a reasonable accuracy when the network operates under different traffic conditions. The model is then used to perform an extensive investigation into the performance merits of... 

    The grid-pyramid: A generalized pyramid network

    , Article Journal of Supercomputing ; Volume 37, Issue 1 , 2006 , Pages 23-45 ; 09208542 (ISSN) Hoseinyfarahabady, M. R ; Sarbazi Azad, H ; Sharif University of Technology
    2006
    Abstract
    The Pyramid network is a desirable network topology used as both a software data structure and a hardware architecture. In this paper, we propose a general definition for a class of pyramid networks that are based on grid connections between the nodes in each level. In contrast to the conventional pyramid network, in which the nodes in each level form a mesh, the connections between these nodes may also follow other grid-based topologies such as the torus, hypermesh or WK-recursive. Such pyramid networks form a wide class of interconnection networks that possess rich topological properties. We study a number of important properties of these topologies for general-purpose parallel processing... 

    Performance comparison of adaptive routing algorithms in the star interconnection network

    , Article 8th International Conference on High-Performance Computing in Asia-Pacific Region, HPC Asia 2005, Beijing, 30 November 2005 through 3 December 2005 ; Volume 2005 , 2005 , Pages 257-264 ; 0769524869 (ISBN); 9780769524863 (ISBN) Kiasari, A. E ; Sarbazi Azad, H ; Rezazad, M. S ; Sharif University of Technology
    2005
    Abstract
    The star graph was introduced as an attractive alternative to the well-known hypercube and its properties have been well studied in the past. Most of these studies have focused on topological properties and algorithmic aspects of this network. In this paper, the performance of nine fully-adaptive routing algorithms in the star network is studied under uniform (with and without locality) and hotspot traffic patterns. The performance factors considered are the average message latency and network utilization. We consider three basic routing algorithms (two hop scheme routing algorithms: positive-hop and negative-hop, and Msic's routing algorithm) and two other improved sets of improved... 

    Parallel 3-dimensional DCT computation on k-Ary n-cubes

    , Article 8th International Conference on High-Performance Computing in Asia-Pacific Region, HPC Asia 2005, Beijing, 30 November 2005 through 3 December 2005 ; Volume 2005 , 2005 , Pages 91-97 ; 0769524869 (ISBN); 9780769524863 (ISBN) Modarressi, M ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    The three-dimensional discrete cosine transform (3D DCT) has been widely used in many applications such as video compression. On the other hand, the k-ary n-cube is one of the most popular interconnection networks used in many recent multicomputers. As direct calculation of 3D DCT is very time consuming, many researchers have been working on developing algorithms and special-purpose architectures for fast computation of 3D DCT. This paper proposes a parallel algorithm for efficient calculation of 3D DCT on k-ary n-cube multicomputers. The time complexity of the proposed algorithm is O(N) for an N × N × N input data cube, while direct calculation of 3D DCT has a complexity of O(N^6). ©... 
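The O(N^6) cost of direct calculation follows from having N^3 output coefficients, each a sum over N^3 input samples. The sketch below is a sequential baseline only, not the paper's parallel algorithm. It assumes the unnormalized DCT-II form, and the name `dct3d_direct` is hypothetical.

```python
import math


def dct3d_direct(x):
    """Unnormalized 3D DCT-II of an N x N x N nested list by direct summation.

    Returns (coefficients, number_of_accumulated_terms).
    """
    n = len(x)
    # c[u][i] = cos(pi * (2i + 1) * u / (2N)), the 1D DCT-II basis
    c = [[math.cos(math.pi * (2 * i + 1) * u / (2 * n)) for i in range(n)]
         for u in range(n)]
    out = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    terms = 0
    for u in range(n):
        for v in range(n):
            for w in range(n):
                acc = 0.0
                for i in range(n):
                    for j in range(n):
                        for k in range(n):
                            acc += x[i][j][k] * c[u][i] * c[v][j] * c[w][k]
                            terms += 1
                out[u][v][w] = acc
    return out, terms


# A constant 2 x 2 x 2 cube: six nested loops give 2**6 accumulated terms.
ones = [[[1.0] * 2 for _ in range(2)] for _ in range(2)]
coeffs, terms = dct3d_direct(ones)
print(terms)
```

The six nested loops make the N^6 term count explicit; separable or fast DCT methods reduce this sequentially, and the abstract reports O(N) parallel time on the k-ary n-cube.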

    The stretched-hypercube: A VLSI efficient network topology

    , Article 8th International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN 2005, Las Vegas, NV, 7 December 2005 through 9 December 2005 ; Volume 2005 , 2005 , Pages 462-467 ; 0769525091 (ISBN); 9780769525099 (ISBN) Shareghi, P ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    In this paper, we introduce a new class of interconnection networks for multiprocessor systems which we refer to as Stretched-Hypercubes, or Stretched-Cube networks for short. These networks are obtained by replacing an edge of the well-known hypercube network with an array of processors. Two interesting features of the proposed topology are its area-efficient VLSI layout and superior scalability over the traditional hypercube network. Some topological properties of the proposed network are studied. In addition, an area-efficient VLSI layout for the stretched-cube is suggested and some comparisons between the proposed network and previously studied networks such as the star and hypercube... 

    Topological properties of necklace networks

    , Article 8th International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN 2005, Las Vegas, NV, 7 December 2005 through 9 December 2005 ; Volume 2005 , 2005 , Pages 40-45 ; 0769525091 (ISBN); 9780769525099 (ISBN) Shareghi, P ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    We study a class of interconnection networks for multiprocessors, called the Necklace-G network, which is derived from a base graph G by attaching an array of processors to each pair of adjacent nodes of G. One of the interesting features of the proposed topology is its scalability while preserving most of the desirable properties of the underlying base network G. We conduct a general study on the topological properties of necklace networks. We first obtain their basic topological parameters, and then present optimal routing and broadcasting algorithms. We also present a unified approach to obtain the topological properties and the VLSI-layout of an arbitrary necklace network based on the properties... 

    An empirical comparison of OTIS-mesh and OTIS-hypercube multicomputer systems under deterministic routing

    , Article 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005, Denver, CO, 4 April 2005 through 8 April 2005 ; Volume 2005 , 2005 ; 0769523129 (ISBN); 9780769523125 (ISBN) Najaf abadi, H. H ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    In optoelectronic OTIS architectures, electrical and optical interconnects are used for local and global communication, respectively. Interesting instances of the OTIS architecture are the OTIS-hypercube and OTIS-mesh. This paper conducts a performance evaluation and comparison of these networks under different structural conditions and traffic loads. All judgments are based on observations from extensive simulation results of the interconnection networks. We conclude that, when the OTIS-hypercube and equivalent 2-D OTIS-mesh are compared under the constraint of equal bisection bandwidth, the OTIS-hypercube offers superior performance. We however show that the... 

    The effect of virtual channel organization on the performance of interconnection networks

    , Article 19th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2005, Denver, CO, 4 April 2005 through 8 April 2005 ; Volume 2005 , 2005 ; 0769523129 (ISBN); 9780769523125 (ISBN) Rezazad, M ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    Most previous studies have assessed the performance issues for regular buffer and virtual channel organizations and have not considered an overall buffer size constraint. In this paper, the performance of mesh-based interconnection networks (mesh, torus and hypercube networks) under different traffic patterns (uniform, hotspot, and matrix-transpose) is studied. We investigate the effect of the number of virtual channels and their buffer lengths on the performance of these topologies when the total buffer size associated with each physical channel (and thus router buffer size) is fixed. The results show that the optimal number of virtual channels and buffer length highly depends on the traffic... 

    A constraint-based performance comparison of hypercube and star multicomputers with failures

    , Article 19th International Conference on Advanced Information Networking and Applications, AINA 2005, Taipei, 28 March 2005 through 30 March 2005 ; Volume 1 , 2005 , Pages 841-846 ; 1550445X (ISSN); 0769522491 (ISBN); 9780769522494 (ISBN) Rezazad, M ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    Many theoretical studies have compared the hypercube and star graphs from a graph theoretical viewpoint, under structural and algorithmic properties. None of these studies have, however, considered real working conditions and implementation constraints. In this paper, the hypercube and star graphs are compared in view of fault tolerance and technological implementation constraints. In order to realize a fair comparison, we use the unsafely-vector fault tolerant routing algorithm, recently introduced in [1] and [2], for the hypercube and star graph. Under two implementation constraints, namely constant bisection bandwidth and constant node pin-out, we have compared the performance of the two... 

    The star-pyramid graph: An attractive alternative to the pyramid

    , Article 10th Asia-Pacific Conference on Advances in Computer Systems Architecture, ACSAC 2005, Singapore, 24 October 2005 through 26 October 2005 ; Volume 3740 LNCS , 2005 , Pages 509-519 ; 03029743 (ISSN); 3540296433 (ISBN); 9783540296430 (ISBN) Imani, N ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    This paper introduces a new class of interconnection networks named Star-Pyramid, SP(n). A star-pyramid of dimension n is formed by piling up star graphs of dimensions 1 to n in a hierarchy, connecting any node in each i-dimensional star, 1 < i ≤ n, to a node in the (i - 1)-star whose index is reached by removing the symbol i from the index of the former node in the i-star graph. Having derived the properties of the new topology, including its topological properties, a simple routing algorithm and Hamiltonicity, we then compare the network properties of the proposed topology and the well-known pyramid topology. We show that the star-pyramid is more fault-tolerant and has a smaller network diameter than... 
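The construction in this abstract can be sketched in a few lines. The sketch assumes nodes of the i-star are labelled by permutations of the symbols 1..i, and it uses the standard star-graph adjacency (swap the first symbol with the j-th symbol). The function names are hypothetical and the code is an illustration, not the authors' formulation.

```python
from math import factorial


def star_neighbors(perm):
    """Neighbors in the star graph: swap the first symbol with the j-th one."""
    result = []
    for j in range(1, len(perm)):
        p = list(perm)
        p[0], p[j] = p[j], p[0]
        result.append(tuple(p))
    return result


def pyramid_parent(perm):
    """Node of the (i-1)-star obtained by removing symbol i from an i-star node."""
    i = len(perm)
    return tuple(s for s in perm if s != i)


def star_pyramid_size(n):
    """Total nodes when stars of dimensions 1..n are piled up: 1! + 2! + ... + n!."""
    return sum(factorial(i) for i in range(1, n + 1))


node = (2, 3, 1)                 # a node of the 3-star
print(star_neighbors(node))      # its links inside level 3
print(pyramid_parent(node))      # its link up to level 2
print(star_pyramid_size(3))
```

Each node of the i-star therefore has i - 1 links within its level plus one link to the level above, under the labelling assumed here.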

    Parallel clustering on the star graph

    , Article 6th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP, Melbourne, 2 October 2005 through 3 October 2005 ; Volume 3719 LNCS , 2005 , Pages 287-292 ; 03029743 (ISSN); 3540292357 (ISBN); 9783540292357 (ISBN) Fazeli, M ; Sarbazi Azad, H ; Farivar, R ; Sharif University of Technology
    2005
    Abstract
    In this paper, a parallel algorithm for data clustering is presented on a multi-computer with star topology. This algorithm is fast and requires a small amount of memory per processing element, which makes it suitable even for SIMD implementation. The proposed parallel algorithm completes in O(K + S^2 - T^2) steps for a clustering problem of N data patterns with M features per pattern and K clusters, where N·M = S!, K·M = T!, and M = R!, on an s-star interconnection network. © Springer-Verlag Berlin Heidelberg 2005  

    The Recursive Transpose-Connected Cycles (RTCC) interconnection network for multiprocessors

    , Article 20th Annual ACM Symposium on Applied Computing, Santa Fe, NM, 13 March 2005 through 17 March 2005 ; Volume 1 , 2005 , Pages 734-738 Farahabady, M. H ; Sarbazi Azad, H ; Sharif University of Technology
    2005
    Abstract
    In this paper, we propose a new modular topology for interconnection networks, the Recursive Transpose-Connected Cycles (RTCC). The RTCC has a recursive definition quite similar to that of fractal graphs and has interesting topological characteristics, making it suitable for use as the base topology of large-scale multicomputer interconnection networks. We study important properties of this topology such as diameter and bisection width, and implementation-related issues such as routing algorithms and the average message latency under VLSI layout constraints. In addition, we prove that the RTCC is a Hamiltonian graph. We conclude that, in light of most of the above-mentioned properties,... 

    Analysis of true fully adaptive routing with software-based deadlock recovery

    , Article Journal of Systems and Software ; Volume 71, Issue 3 , 2004 , Pages 259-270 ; 01641212 (ISSN) Khonsari, A ; Sarbazi Azad, H ; Ould Khaoua, M ; Sharif University of Technology
    2004
    Abstract
    Several analytical models of fully adaptive routing (AR) in wormhole-routed networks have recently been reported in the literature. All these models, however, have been discussed for routing algorithms with deadlock avoidance. Recent studies have revealed that deadlocks are quite rare in the network, especially when enough routing freedom is provided. Thus the hardware resources, e.g. virtual channels, dedicated for deadlock avoidance are not utilised most of the time. This consideration has motivated researchers to introduce fully adaptive routing algorithms with deadlock recovery. This paper describes a new analytical model of a true fully AR algorithm with software-based deadlock... 

    Constraint-based performance comparison of multi-dimensional interconnection networks with deterministic and adaptive routing strategies

    , Article Computers and Electrical Engineering ; Volume 30, Issue 3 , 2004 , Pages 167-182 ; 00457906 (ISSN) Sarbazi Azad, H ; Sharif University of Technology
    2004
    Abstract
    Several studies have examined the relative performance merits of the torus and hypercube taking into account the channel bandwidth constraints imposed by implementation technology. While the torus has been shown to outperform the hypercube under the constant wiring density constraint, the opposite conclusion has been reached when the constant pin-out constraint is considered. However, all these studies have assumed deterministic routing and have not taken into account the internal hardware cost of routers. This paper re-examines the performance merits of the torus and hypercube using both fully-adaptive and deterministic routing strategies. Moreover, it uses a new cost model which takes into... 

    Towards a more realistic comparative analysis of multicomputer networks

    , Article Concurrency and Computation: Practice and Experience ; Volume 16, Issue 13 , 2004 , Pages 1271-1289 ; 15320626 (ISSN) Sarbazi Azad, H ; Ould Khaoua, M ; Mackenzie, L. M ; Sharif University of Technology
    John Wiley and Sons Ltd  2004
    Abstract
    Several studies have examined the relative performance merits of the torus and hypercube taking into account the channel bandwidth constraints imposed by implementation technology. While the torus has been shown to outperform the hypercube under the constant wiring density constraint, the opposite conclusion has been reached when the constant pin-out constraint is considered. However, these studies have assumed a pure uniform traffic pattern and deterministic routing. The 'uniform traffic' assumption is not always justifiable in practice as there are many real-world parallel applications that exhibit non-uniform traffic patterns, which can create unbalanced traffic such as hotspots in the... 

    Analytical modelling of wormhole-routed k-ary n-cubes in the presence of matrix-transpose traffic

    , Article Journal of Parallel and Distributed Computing ; Volume 63, Issue 4 , 2003 , Pages 396-409 ; 07437315 (ISSN) Sarbazi Azad, H ; Ould-Khaoua, M ; Mackenzie, L. M ; Sharif University of Technology
    Academic Press Inc  2003
    Abstract
    Several analytical models of fully adaptive routing in wormhole-routed k-ary n-cubes under the uniform traffic pattern have recently been proposed in the literature. Although the uniform reference model has been widely used in the past, it does not always hold in practice, as there are many applications that exhibit non-uniform traffic patterns. So far, no study has described an analytical model of fully adaptive routing under non-uniform traffic conditions. This paper describes a new analytical model of fully adaptive routing in k-ary n-cubes in the presence of non-uniform traffic generated by matrix-transpose permutations, which is an important communication operation... 

    A mathematical model of deterministic wormhole routing in hypercube multicomputers using virtual channels

    , Article Applied Mathematical Modelling ; Volume 27, Issue 12 , 2003 , Pages 943-953 ; 0307904X (ISSN) Sarbazi Azad, H ; Sharif University of Technology
    Elsevier Inc  2003
    Abstract
    Although several analytical models have been proposed in the literature for binary n-cubes with deterministic routing, most of them have not included the effects of virtual channel multiplexing on network performance. The only mathematical model for deterministic wormhole routing in hypercubes with virtual channels was proposed in [Y. Boura, Design and Analysis of Routing Schemes and Routers for Wormhole-routed Mesh Architectures, Ph.D. Thesis, Department of Computer Science and Engineering, Pennsylvania State University, 1995] which uses complex combinatorial analysis with a computation time of O(N = 2^n) for an n-dimensional hypercube. This paper proposes a new and simple analytical model... 

    A parallel algorithm for Lagrange interpolation on the cube-connected cycles

    , Article Microprocessors and Microsystems ; Volume 24, Issue 3 , 2000 , Pages 135-140 ; 01419331 (ISSN) Sarbazi Azad, H ; Ould-Khaoua, M ; Mackenzie, L. M ; Sharif University of Technology
    Elsevier  2000
    Abstract
    This paper introduces a parallel algorithm for computing an N = n2^n point Lagrange interpolation on an n-dimensional cube-connected cycles (CCCn). The algorithm consists of three phases: initialisation, main and final. While there is no computation in the initialisation phase, the main phase is composed of n2^(n-1) steps, each consisting of four multiplications, four subtractions and one communication operation, and an additional step including one division and one multiplication. The final phase is carried out in two sub-phases. There are [n/2] steps in the first sub-phase, each including two additions and one communication, followed by the second sub-phase which comprises n steps each...
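For orientation, the quantity being parallelised is the classical Lagrange interpolating polynomial, and CCCn has n·2^n nodes, which matches the point count N in the abstract. The sketch below is a sequential reference only and does not reproduce the paper's three-phase parallel schedule. The names `lagrange_interpolate` and `ccc_num_nodes` are hypothetical.

```python
def lagrange_interpolate(xs, ys, x):
    """Evaluate the Lagrange polynomial through (xs[i], ys[i]) at the point x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                # basis factor (x - xj) / (xi - xj); the xs must be distinct
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def ccc_num_nodes(n):
    """Nodes in an n-dimensional cube-connected cycles: n nodes per hypercube corner."""
    return n * 2 ** n


# Three samples of f(x) = x^2 determine the parabola, so the value at 3 is recovered.
print(lagrange_interpolate([0, 1, 2], [0, 1, 4], 3))
print(ccc_num_nodes(3))
```

The sequential version needs on the order of N^2 multiplications for N points, which is the cost a parallel schedule on N processors aims to spread out.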