Loading...
Search for: benchmarking
0.01 seconds
Total 203 records

    Energy efficiency in cloud-based mapReduce applications through better performance estimation

    , Article Proceedings of the 2016 Design, Automation and Test in Europe Conference and Exhibition, DATE 2016, 14 March 2016 through 18 March 2016 ; 2016 , Pages 1339-1344 ; 9783981537062 (ISBN) Nabavinejad, S. M ; Goudarzi, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2016
    Abstract
    An important issue for efficient execution of MapReduce jobs on a cloud platform is selecting the best fitting virtual machine (VM) configuration(s) among the miscellany of choices that cloud providers offer. Wise selection of VM configurations can lead to better performance, cost and energy consumption. Therefore, it is crucial to explore the available configurations and choose the best one for each given MapReduce application. Executing the given application on all the configurations for comparison is a costly, time and energy consuming process. An alternative is to run the application on a subset of configurations (sample configurations) and estimate its performance on other... 

    An operating system level data migration scheme in hybrid DRAM-NVM memory architecture

    , Article Proceedings of the 2016 Design, Automation and Test in Europe Conference and Exhibition, DATE 2016, 14 March 2016 through 18 March 2016 ; 2016 , Pages 936-941 ; 9783981537062 (ISBN) Salkhordeh, R ; Asadi, H ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2016
    Abstract
    With the emergence of Non-Volatile Memories (NVMs) and their shortcomings such as limited endurance and high power consumption in write requests, several studies have suggested hybrid memory architecture employing both Dynamic Random Access Memory (DRAM) and NVM in a memory system. By conducting a comprehensive experiments, we have observed that such studies lack to consider very important aspects of hybrid memories including the effect of: a) data migrations on performance, b) data migrations on power, and c) the granularity of data migration. This paper presents an efficient data migration scheme at the Operating System level in a hybrid DRAM-NVM memory architecture. In the proposed... 

    Assessment of characteristic boundary conditions based on the artificial compressibility method in generalized curvilinear coordinates for solution of the euler equations

    , Article Computational Methods in Applied Mathematics ; 2017 ; 16094840 (ISSN) Parseh, K ; Hejranfar, K ; Sharif University of Technology
    Abstract
    The characteristic boundary conditions are applied and assessed for the solution of incompressible inviscid flows. The two-dimensional incompressible Euler equations based on the artificial compressibility method are considered and then the characteristic boundary conditions are formulated in the generalized curvilinear coordinates and implemented on both the far-field and wall boundaries. A fourth-order compact finite-difference scheme is used to discretize the resulting system of equations. The solution methodology adopted is more suitable for this assessment because the Euler equations and the high-accurate numerical scheme applied are quite sensitive to the treatment of boundary... 

    An exact algorithm for the minimum dilation triangulation problem

    , Article Journal of Global Optimization ; Volume 69, Issue 2 , 2017 , Pages 343-367 ; 09255001 (ISSN) Sattari, S ; Izadi, M ; Sharif University of Technology
    Abstract
    Given a triangulation of a point set on the plane, dilation of any pair of the points is the ratio of their shortest path length to their Euclidean distance. The maximum dilation over all pairs of points is called the dilation of this triangulation. Minimum dilation triangulation problem seeks a triangulation with the least possible dilation of an input point set. For this problem no polynomial time algorithm is known. We present an exact algorithm based on a branch and bound method for finding minimum dilation triangulations. This deterministic algorithm after generating an initial solution, iteratively computes a lower bound for the answer and then applies a branch and bound method to find... 

    Interpolation of sparse graph signals by sequential adaptive thresholds

    , Article 2017 12th International Conference on Sampling Theory and Applications, SampTA 2017, 3 July 2017 through 7 July 2017 ; 2017 , Pages 266-270 ; 9781538615652 (ISBN) Boloursaz Mashhadi, M ; Fallah, M ; Marvasti, F ; Sharif University of Technology
    Abstract
    This paper considers the problem of interpolating signals defined on graphs. A major presumption considered by many previous approaches to this problem has been low-pass/band-limitedness of the underlying graph signal. However, inspired by the findings on sparse signal reconstruction, we consider the graph signal to be rather sparse/compressible in the Graph Fourier Transform (GFT) domain and propose the Iterative Method with Adaptive Thresholding for Graph Interpolation (IMATGI) algorithm for sparsity promoting interpolation of the underlying graph signal. We analytically prove convergence of the proposed algorithm. We also demonstrate efficient performance of the proposed IMATGI algorithm... 

    A hybrid project scheduling and material ordering problem: modeling and solution algorithms

    , Article Applied Soft Computing Journal ; Volume 58 , 2017 , Pages 700-713 ; 15684946 (ISSN) Zoraghi, N ; Shahsavar, A ; Niaki, S. T. A ; Sharif University of Technology
    Abstract
    A novel combination of a multimode project scheduling problem with material ordering, in which material procurements are exposed to the total quantity discount policy is investigated in this paper. The study aims at finding an optimal Pareto frontier for a triple objective model derived for the problem. While the first objective minimizes the makespan of the project, the second objective maximizes the robustness of the project schedule and finally the third objective minimizes the total costs pertaining to renewable and nonrenewable resources involved in a project. Four well-known multi-objective evolutionary algorithms including non-dominated sorting genetic algorithm II (NSGAII), strength... 

    A fundamental tradeoff between computation and communication in distributed computing

    , Article IEEE Transactions on Information Theory ; 2017 ; 00189448 (ISSN) Li, S ; Maddah Ali, M. A ; Yu, Q ; Avestimehr, A. S ; Sharif University of Technology
    Abstract
    How can we optimally trade extra computing power to reduce the communication load in distributed computing? We answer this question by characterizing a fundamental tradeoff between computation and communication in distributed computing, i.e., the two are inversely proportional to each other. More specifically, a general distributed computing framework, motivated by commonly used structures like MapReduce, is considered, where the overall computation is decomposed into computing a set of “Map” and “Reduce” functions distributedly across multiple computing nodes. A coded scheme, named “Coded Distributed Computing” (CDC), is proposed to demonstrate that increasing the computation load of the... 

    QuARK: quality-configurable approximate STT-MRAM cache by fine-grained tuning of reliability-energy knobs

    , Article Proceedings of the International Symposium on Low Power Electronics and Design, 24 July 2017 through 26 July 2017 ; 2017 ; 15334678 (ISSN) ; 9781509060238 (ISBN) Hosseini Monazzah, A. M ; Shoushtari, M ; Miremadi, S. G ; Rahmani, A. M ; Dutt, N ; Sharif University of Technology
    Abstract
    Emerging STT-MRAM memories are promising alternatives for SRAM memories to tackle their low density and high static power consumption, but impose high energy consumption for reliable read/write operations. However, absolute data integrity is not required for many approximate computing applications, allowing energy savings with minimal quality loss. This paper proposes QuARK, a hardware/software approach for trading reliability of STT-MRAM caches for energy savings in the on-chip memory hierarchy of multi- A nd many-core systems running approximate applications. In contrast to SRAM-based cache-way-level actuators, QuARK utilizes fine-grained cache-line-level actuation knobs with different... 

    A power gating switch box architecture in routing network of SRAM-based FPGAs in dark silicon era

    , Article 20th Design, Automation and Test in Europe, DATE 2017, 27 March 2017 through 31 March 2017 ; 2017 , Pages 1342-1347 ; 9783981537093 (ISBN) Seifoori, Z ; Khaleghi, B ; Asadi, H ; ACM Special Interest Group on Design Automation (ACM SIGDA); Electronic System Design Alliance (ESDA); et al.; European Design and Automation Association (EDAA); European Electronic Chips and Systems Design Initiative (ECSI); IEEE Council on Electronic Design Automation (CEDA) ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2017
    Abstract
    Continuous down scaling of CMOS technology in recent years has resulted in exponential increase in static power consumption which acts as a power wall for further transistor integration. One promising approach to throttle the substantial static power of Field-Programmable Gate Array (FPGAs) is to power off unused routing resources such as switch boxes, known as dark silicon. In this paper, we present a Power gating Switch Box Architecture (PESA) for routing network of SRAM-based FPGAs to overcome the obstacle for further device integration. In the proposed architecture, by exploring various patterns of used multiplexers in switch boxes, we employ a configurable controller to turn off unused... 

    Supervised heart rate tracking using wrist-type photoplethysmographic (PPG) signals during physical exercise without simultaneous acceleration signals

    , Article 2016 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2016, 7 December 2016 through 9 December 2016 ; 2017 , Pages 1166-1170 ; 9781509045457 (ISBN) Essalat, M ; Boloursaz Mashhadi, M ; Marvasti, F ; IEEE Signal Processing Society; The Institute of Electrical and Electronics Engineers ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2017
    Abstract
    PPG based heart rate (HR) monitoring has recently attracted much attention with the advent of wearable devices such as smart watches and smart bands. However, due to severe motion artifacts (MA) caused by wristband stumbles, PPG based HR monitoring is a challenging problem in scenarios where the subject performs intensive physical exercises. This work proposes a novel approach to the problem based on supervised learning by Neural Network (NN). By simulations on the benchmark datasets [1], we achieve acceptable estimation accuracy and improved run time in comparison with the literature. A major contribution of this work is that it alleviates the need to use simultaneous acceleration signals.... 

    High-order weighted essentially nonoscillatory finite-difference formulation of the lattice boltzmann method in generalized curvilinear coordinates

    , Article Physical Review E ; Volume 95, Issue 2 , 2017 ; 24700045 (ISSN) Hejranfar, K ; Saadat, M. H ; Taheri, S ; Sharif University of Technology
    Abstract
    In this work, a high-order weighted essentially nonoscillatory (WENO) finite-difference lattice Boltzmann method (WENOLBM) is developed and assessed for an accurate simulation of incompressible flows. To handle curved geometries with nonuniform grids, the incompressible form of the discrete Boltzmann equation with the Bhatnagar-Gross-Krook (BGK) approximation is transformed into the generalized curvilinear coordinates and the spatial derivatives of the resulting lattice Boltzmann equation in the computational plane are solved using the fifth-order WENO scheme. The first-order implicit-explicit Runge-Kutta scheme and also the fourth-order Runge-Kutta explicit time integrating scheme are... 

    A heuristic filter based on firefly algorithm for nonlinear state estimation

    , Article 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016, 6 December 2016 through 9 December 2016 ; 2017 ; 9781509042401 (ISBN) Nobahari, H ; Raoufi, M ; Sharifi, A ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2017
    Abstract
    A new heuristic filter, called firefly filter, is proposed for state estimation of nonlinear stochastic systems. The new filter formulates the state estimation problem as a stochastic dynamic optimization and utilizes the firefly optimization algorithm to find and track the best estimation. The fireflies search the state space dynamically and are attracted to one other based on the perceived brightness. The performance of the proposed filter is evaluated for a set of benchmarks and the results are compared with the well-known filters like extended Kalman filter and particle filter, showing improvements in terms of estimation accuracy. © 2016 IEEE  

    An improved scheme for pre-computed patterns in core-based SoC architecture

    , Article Proceedings of 2016 IEEE East-West Design and Test Symposium, EWDTS 2016, 14 October 2016 through 17 October 2016 ; 2017 ; 9781509006939 (ISBN) Sadredini, E ; Rahimi, R ; Foroutan, P ; Fathy, M ; Navabi, Z ; Sharif University of Technology
    Abstract
    By advances in technology, integrated circuits have come to include more functionality and more complexity in a single chip. Although methods of testing have improved, but the increase in complexity of circuits, keeps testing a challenging problem. Two important challenges in testing of digital circuits are test time and accessing the circuit under test (CUT) for testing. These challenges become even more important in complex system on chip (SoC) zone. This paper presents an improved scheme for generating pre-computed test patterns in core-based systems on chip. This approach reduces the number of pre-computed test patterns and as the result, test application time (TAT) will be decreased.... 

    Assessment of characteristic boundary conditions based on the artificial compressibility method in generalized curvilinear coordinates for solution of the Euler equations

    , Article Computational Methods in Applied Mathematics ; Volume 18, Issue 4 , 2018 , Pages 717-740 ; 16094840 (ISSN) Parseh, K ; Hejranfar, K ; Sharif University of Technology
    De Gruyter  2018
    Abstract
    The characteristic boundary conditions are applied and assessed for the solution of incompressible inviscid flows. The two-dimensional incompressible Euler equations based on the artificial compressibility method are considered and then the characteristic boundary conditions are formulated in the generalized curvilinear coordinates and implemented on both the far-field and wall boundaries. A fourth-order compact finite-difference scheme is used to discretize the resulting system of equations. The solution methodology adopted is more suitable for this assessment because the Euler equations and the high-accurate numerical scheme applied are quite sensitive to the treatment of boundary... 

    A dynamic metaheuristic optimization model inspired by biological nervous systems: neural network algorithm

    , Article Applied Soft Computing Journal ; Volume 71 , 2018 , Pages 747-782 ; 15684946 (ISSN) Sadollah, A ; Sayyaadi, H ; Yadav, A ; Sharif University of Technology
    Abstract
    In this research, a new metaheuristic optimization algorithm, inspired by biological nervous systems and artificial neural networks (ANNs) is proposed for solving complex optimization problems. The proposed method, named as neural network algorithm (NNA), is developed based on the unique structure of ANNs. The NNA benefits from complicated structure of the ANNs and its operators in order to generate new candidate solutions. In terms of convergence proof, the relationship between improvised exploitation and each parameter under asymmetric interval is derived and an iterative convergence of NNA is proved theoretically. In this paper, the NNA with its interconnected computing unit is examined... 

    A discrete water cycle algorithm for solving the symmetric and asymmetric traveling salesman problem

    , Article Applied Soft Computing Journal ; Volume 71 , 2018 , Pages 277-290 ; 15684946 (ISSN) Osaba, E ; Ser, J. D ; Sadollah, A ; Bilbao, M. N ; Camacho, D ; Sharif University of Technology
    Abstract
    The water cycle algorithm (WCA) is a nature-inspired meta-heuristic recently contributed to the community in 2012, which finds its motivation in the natural surface runoff phase in water cycle process and on how streams and rivers flow into the sea. This method has been so far successfully applied to many engineering applications, spread over a wide variety of application fields. In this paper an enhanced discrete version of the WCA (coined as DWCA) is proposed for solving the Symmetric and Asymmetric Traveling Salesman Problem. Aimed at proving that the developed approach is a promising approximation method for solving this family of optimization problems, the designed solver has been... 

    Persian word embedding evaluation benchmarks

    , Article 26th Iranian Conference on Electrical Engineering, ICEE 2018, 8 May 2018 through 10 May 2018 ; 2018 , Pages 1583-1588 ; 9781538649169 (ISBN) Zahedi, M. S ; Bokaei, M. H ; Shoeleh, F ; Yadollahi, M. M ; Doostmohammadi, E ; Farhoodi, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2018
    Abstract
    Recently, there has been renewed interest in semantic word representation also called word embedding, in a wide variety of natural language processing tasks requiring sophisticated semantic and syntactic information. The quality of word embedding methods is usually evaluated based on English language benchmarks. Nevertheless, only a few studies analyze word embedding for low resource languages such as Persian. In this paper, we perform such an extensive word embedding evaluation in Persian language based on a set of lexical semantics tasks named analogy, concept categorization, and word semantic relatedness. For these evaluation tasks, we provide three benchmark data sets to show the... 

    High-Performance predictable NVM-based instruction memory for real-time embedded systems

    , Article IEEE Transactions on Emerging Topics in Computing ; 2018 ; 21686750 (ISSN) Bazzaz, M ; Hoseinghorban, A ; Poursafaei, F ; Ejlali, A ; Sharif University of Technology
    IEEE Computer Society  2018
    Abstract
    Worst case execution time and energy consumption are two of the most important design constraints of real-time embedded systems. Many recent studies have tried to improve the memory subsystem of embedded systems by using emerging non-volatile memories. However, accessing these memories imposes performance and energy overhead and using them as the code memory could increase the worst case execution time of the system. In this paper, a new code memory architecture for non-volatile memories is proposed which reduces the effective memory access latency by employing memory access interleaving technique. Unlike common instruction access latency improvement techniques such as prefetching and... 

    A fundamental tradeoff between computation and communication in distributed computing

    , Article IEEE Transactions on Information Theory ; Volume 64, Issue 1 , 2018 , Pages 109-128 ; 00189448 (ISSN) Li, S ; Maddah Ali, M. A ; Yu, Q ; Salman Avestimehr, A ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2018
    Abstract
    How can we optimally trade extra computing power to reduce the communication load in distributed computing? We answer this question by characterizing a fundamental tradeoff between computation and communication in distributed computing, i.e., the two are inversely proportional to each other. More specifically, a general distributed computing framework, motivated by commonly used structures like MapReduce, is considered, where the overall computation is decomposed into computing a set of “Map” and “Reduce” functions distributedly across multiple computing nodes. A coded scheme, named “coded distributed computing” (CDC), is proposed to demonstrate that increasing the computation load of the... 

    Dandelion: a unified code offloading system for wearable computing

    , Article IEEE Transactions on Mobile Computing ; 29 May , 2018 ; 15361233 (ISSN) Golkarifard, M ; Yang, J ; Huang, Z ; Movaghar, A ; Hui, P ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2018
    Abstract
    Execution speed seriously bothers application developers and users for wearable devices such as Google Glass. Intensive applications like 3D games suffer from significant delays when CPU is busy. Energy is another concern when the devices are in low battery level but users need them for urgency use. To ease such pains, one approach is to expand the computational power by cloud offloading. This paradigm works well when the available Internet access has enough bandwidth. Another way is to leverage nearby devices for computation-offloading, which is known as device-to-device (D2D) offloading. In this paper, we present Dandelion, a unified code offloading system for wearable computing. Dandelion...