Sharif Digital Repository / Sharif University of Technology / Search result

Fast data delivery for many-core processors

, Article IEEE Transactions on Computers ; Volume 67, Issue 10 , 2018 , Pages 1416-1429 ; 00189340 (ISSN) Bakhshalipour, M ; Lotfi Kamran, P ; Mazloumi, A ; Samandi, F ; Naderan Tahan, M ; Modarressi, M ; Sarbazi Azad, H ; Sharif University of Technology

Abstract

Server workloads operate on large volumes of data. As a result, processors executing these workloads encounter frequent L1-D misses. In a many-core processor, an L1-D miss causes a request packet to be sent to an LLC slice and a response packet to be sent back to the L1-D, which results in high overhead. While prior work targeted response packets, this work focuses on accelerating the request packets. Unlike aggressive OoO cores, simpler cores used in many-core processors cannot hide the latency of L1-D request packets. We observe that LLC slices that serve L1-D misses are strongly temporally correlated. Taking advantage of this observation, we design a simple and accurate predictor. Upon...

Characterizing the rate-memory tradeoff in cache networks within a factor of 2

, Article IEEE Transactions on Information Theory ; 2018 ; 00189448 (ISSN) Yu, Q ; Maddah Ali, M. A ; Avestimehr, A. S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2018

Abstract

We consider a basic caching system, where a single server with a database of N files (e.g. movies) is connected to a set of K users through a shared bottleneck link. Each user has a local cache memory with a size of M files. The system operates in two phases. a placement phase, where each cache memory is populated up to its size from the database, and a following delivery phase, where each user requests a file from the database, and the server is responsible for delivering the requested contents. The objective is to design the two phases to minimize the load (peak or average) of the bottleneck link. We characterize the rate-memory tradeoff of the above caching system within a factor of...

Learning overcomplete dictionaries from markovian data

, Article 10th IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2018, 8 July 2018 through 11 July 2018 ; Volume 2018-July , 2018 , Pages 218-222 ; 2151870X (ISSN); 9781538647523 (ISBN) Akhavan, S ; Esmaeili, S ; Babaie Zadeh, M ; Soltanian Zadeh, H ; Sharif University of Technology

IEEE Computer Society 2018

Abstract

We explore the dictionary learning problem for sparse representation when the signals are dependent. In this paper, a first-order Markovian model is considered for dependency of the signals, that has many applications especially in medical signals. It is shown that the considered dependency among the signals can degrade the performance of the existing dictionary learning algorithms. Hence, we propose a method using the Maximum Log-likelihood Estimator (MLE) and the Expectation Minimization (EM) algorithm to learn the dictionary from the signals generated under the first-order Markovian model. Simulation results show the efficiency of the proposed method in comparison with the...

QoR-aware power capping for approximate big data processing

, Article Proceedings of the 2018 Design, Automation and Test in Europe Conference and Exhibition, DATE 2018 ; Volume 2018-January , 19 April , 2018 , Pages 253-256 ; 9783981926316 (ISBN) Nabavinejad, S. M ; Zhan, X ; Azimi, R ; Goudarzi, M ; Reda, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2018

Abstract

To limit the peak power consumption of a cluster, a centralized power capping system typically assigns power caps to the individual servers, which are then enforced using local capping controllers. Consequently, the performance and throughput of the servers are affected, and the runtime of jobs is extended as a result. We observe that servers in big data processing clusters often execute big data applications that have different tolerance for approximate results. To mitigate the impact of power capping, we propose a new power-Capping aware resource manager for Approximate Big data processing (CAB) that takes into consideration the minimum Quality-of-Result (QoR) of the jobs. We use...

Regression-based convolutional 3D pose estimation from single image

, Article Electronics Letters ; Volume 54, Issue 5 , March , 2018 , Pages 292-293 ; 00135194 (ISSN) Ershadi Nasab, S ; Kasaei, S ; Sanaei, E ; Sharif University of Technology

Institution of Engineering and Technology 2018

Abstract

Estimation of 3D human pose from a single image is a challenging task because of ambiguities in projection from 3D space to the 2D image plane. A new two-stage deep convolutional neural network-based method is proposed for regressing the distance and angular difference matrices among body joints. Using the angular difference between body joints in addition to the distance between them in articulated objects such as human body can better model the structure of the shapes and increases the modelling capability of the learning method. Experimental results on HumanEva I and Human3.6M datasets show that the proposed method has substantial improvement in the mean per joint position error measure...

Median filtering forensics in compressed video

, Article IEEE Signal Processing Letters ; Volume 26, Issue 2 , 2019 , Pages 287-291 ; 10709908 (ISSN) Amanipour, V ; Ghaemmaghami, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2019

Abstract

Median filtering has received extensive attention from forensics analyzers, as a common content-preserving, smoothing, and denoising manipulation. We propose a detection scheme for median filtering of video sequences in compressed domain based on the singular value decomposition of the process matrix, which approximates the median filtering. Projection over some of the eigenspaces of the process matrix gives a set of features of small dimension, even as small as three, making the proposed scheme a fast and suitable detector for video median filtering. The experimental evaluations show that the proposed method outperforms the state-of-the-art detectors of median filtering, and its edge...

Divide and conquer frontend bottleneck

, Article 47th ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2020, 30 May 2020 through 3 June 2020 ; Volume 2020-May , 2020 , Pages 65-78 Ansari, A ; Lotfi Kamran, P ; Sarbazi Azad, H ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2020

Abstract

The frontend stalls caused by instruction and BTB misses are a significant source of performance degradation in server processors. Prefetchers are commonly employed to mitigate frontend bottleneck. However, next-line prefetchers, which are available in server processors, are incapable of eliminating a considerable number of L1 instruction misses. Temporal instruction prefetchers, on the other hand, effectively remove most of the instruction and BTB misses but impose significant area overhead. Recently, an old idea of using BTB-directed instruction prefetching is revived to address the limitations of temporal instruction prefetchers. While this approach leads to prefetchers with low area...

Harnessing pairwise-correlating data prefetching with runahead metadata

, Article IEEE Computer Architecture Letters ; Volume 19, Issue 2 , 2020 , Pages 130-133 ; ISSN: 15566056 Golshan, F ; Bakhshalipour, M ; Shakerinava, M ; Ansari, A ; Lotfi Kamran, P ; Sarbazi Azad, H ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2020

Abstract

Recent research revisits pairwise-correlating data prefetching due to its extremely low overhead. Pairwise-correlating data prefetching, however, cannot accurately detect where data streams end. As a result, pairwise-correlating data prefetchers either expose low accuracy or they lose timeliness when they are performing multi-degree prefetching. In this letter, we propose a novel technique to detect where data streams end and hence, control the multi-degree prefetching in the context of pairwise-correlated prefetchers. The key idea is to have a separate metadata table that operates one step ahead of the main metadata table. This way, the runahead metadata table harnesses the degree of...

Unsupervised grammar induction using a parent based constituent context model

, Article 18th European Conference on Artificial Intelligence, ECAI 2008, 21 July 2008 through 25 July 2008 ; Volume 178 , 2008 , Pages 293-297 ; 09226389 (ISSN); 978158603891 (ISBN) Mirroshandel, S. A ; Ghassem Sani, G ; Sharif University of Technology

IOS Press 2008

Abstract

Grammar induction is one of attractive research areas of natural language processing. Since both supervised and to some extent semi-supervised grammar induction methods require large treebanks, and for many languages, such treebanks do not currently exist, we focused our attention on unsupervised approaches. Constituent Context Model (CCM) seems to be the state of the art in unsupervised grammar induction. In this paper, we show that the performance of CCM in free word order languages (FWOLs) such as Persian is inferior to that of fixed order languages such as English. We also introduce a novel approach, called parent-based constituent context model (PCCM), and show that by using some...

Using background knowledge and context knowledge in ontology mapping

, Article Frontiers in Artificial Intelligence and Applications ; Volume 174, Issue 1 , 2008 , Pages 56-64 ; 09226389 (ISSN); 9781586038717 (ISBN) Fatemi, H ; Sayyadi, M ; Abolhassani, H ; Sharif University of Technology

IOS Press 2008

Abstract

Recent evaluations of mapping systems show that lack of background knowledge, most often domain specific knowledge, is one of the key problems of mapping systems these days. In fact, at present, most state of the art systems, for the tasks of mapping large ontologies, perform not with such high values of recall (~ 30%), because they mainly rely on label and structure based similarity measures. Disregarding context knowledge in ontology mapping is another drawback that almost all current approaches suffer from. In this paper we use the semantic web as background knowledge and introduce a novel approach for capturing context knowledge from the ontology for improving mapping results. We have...

Uncalibrated multi-view multiple humans association and 3D pose estimation by adversarial learning

, Article Multimedia Tools and Applications ; Volume 80, Issue 2 , 2021 , Pages 2461-2488 ; 13807501 (ISSN) Ershadi Nasab, S ; Kasaei, S ; Sanaei, E ; Sharif University of Technology

Springer 2021

Abstract

Multiple human 3D pose estimation is a useful but challenging task in computer vison applications. The ambiguities in estimation of 2D and 3D poses of multiple persons can be verified by using multi-view frames, in which the occluded or self-occluded body parts of some persons might be visible in other camera views. But, when cameras are moving and uncalibrated, estimating the association of multiple human body parts among different camera views is a challenging task. This paper presents novel methods for multiple human 3D pose estimation and pose association in multi-view camera frames in an uncalibrated camera setup using an adversarial learning framework. The generator is a 3D pose...

Data-Aware compression of neural networks

, Article IEEE Computer Architecture Letters ; Volume 20, Issue 2 , 2021 , Pages 94-97 ; 15566056 (ISSN) Falahati, H ; Peyro, M ; Amini, H ; Taghian, M ; Sadrosadati, M ; Lotfi Kamran, P ; Sarbazi Azad, H ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2021

Abstract

Deep Neural networks (DNNs) are getting deeper and larger which intensify the data movement and compute demands. Prior work focuses on reducing data movements and computation through exploiting sparsity and similarity. However, none of them exploit input similarity and only focus on sparsity and weight similarity. Synergistically analysing the similarity and sparsity of inputs and weights, we show that memory accesses and computations can be reduced by 5.7× and 4.1×, more than what can be decreased by exploiting only sparsity, and 3.9× and 2.1×, more than what can be decreased by exploiting only weight similarity. We propose a new data-aware compression approach, called DANA, to effectively...

COMET: Context-Aware IoU-guided network for small object tracking

, Article 15th Asian Conference on Computer Vision, ACCV 2020, 30 November 2020 through 4 December 2020 ; Volume 12623 LNCS , 2021 , Pages 594-611 ; 03029743 (ISSN); 9783030695316 (ISBN) Marvasti Zadeh, S. M ; Khaghani, J ; Ghanei Yakhdan, H ; Kasaei, S ; Cheng, L ; Sharif University of Technology

Springer Science and Business Media Deutschland GmbH 2021

Abstract

We consider the problem of tracking an unknown small target from aerial videos of medium to high altitudes. This is a challenging problem, which is even more pronounced in unavoidable scenarios of drastic camera motion and high density. To address this problem, we introduce a context-aware IoU-guided tracker (COMET) that exploits a multitask two-stream network and an offline reference proposal generation strategy. The proposed network fully exploits target-related information by multi-scale feature learning and attention modules. The proposed strategy introduces an efficient sampling strategy to generalize the network on the target and its parts without imposing extra computational...

Weight-based colour constancy using contrast stretching

, Article IET Image Processing ; Volume 15, Issue 11 , 2021 , Pages 2424-2440 ; 17519659 (ISSN) Abedini, Z ; Jamzad, M ; Sharif University of Technology

John Wiley and Sons Inc 2021

Abstract

One of the main issues in colour image processing is changing objects' colour due to colour of illumination source. Colour constancy methods tend to modify overall image colour as if it was captured under natural light illumination. Without colour constancy, the colour would be an unreliable cue to object identity. Till now, many methods in colour constancy domain are presented. They are in two categories; statistical methods and learning-based methods. This paper presents a new statistical weighted algorithm for illuminant estimation. Weights are adjusted to highlight two key factors in the image for illuminant estimation, that is contrast and brightness. The focus was on the convex part of...

Weight-based colour constancy using contrast stretching

, Article IET Image Processing ; Volume 15, Issue 11 , 2021 , Pages 2424-2440 ; 17519659 (ISSN) Abedini, Z ; Jamzad, M ; Sharif University of Technology

John Wiley and Sons Inc 2021

Abstract

One of the main issues in colour image processing is changing objects' colour due to colour of illumination source. Colour constancy methods tend to modify overall image colour as if it was captured under natural light illumination. Without colour constancy, the colour would be an unreliable cue to object identity. Till now, many methods in colour constancy domain are presented. They are in two categories; statistical methods and learning-based methods. This paper presents a new statistical weighted algorithm for illuminant estimation. Weights are adjusted to highlight two key factors in the image for illuminant estimation, that is contrast and brightness. The focus was on the convex part of...

COACH: Consistency aware check-pointing for nonvolatile processor in energy harvesting systems

, Article IEEE Transactions on Emerging Topics in Computing ; Volume 9, Issue 4 , December , 2021 , Pages 2076-2088 ; 21686750 (ISSN) Hosseinghorban, A ; Hosseini Monazzah, A. M ; Bazzaz, M ; Safaei, B ; Ejlali, A ; Sharif University of Technology

IEEE Computer Society 2021

Abstract

Recently, energy harvesting systems that utilize hybrid NVM-SRAM memory in their designs are introduced as a promising alternative for battery-operated systems. Since the ambient input power of an energy harvesting system fluctuates as the environmental conditions change, the system may stop the execution of programs until it receives enough energy to continue the execution. Resuming the execution of a program after the suspension may lead to data inconsistency in an energy harvesting system and threatens the correct functionality of the programs. In this article, we propose COACH, an energy-efficient consistency-aware memory scheme which guarantees the correct functionality and consistency...

Recurrent poisson factorization for temporal recommendation

, Article IEEE Transactions on Knowledge and Data Engineering ; 2018 ; 10414347 (ISSN) Hosseini, S ; Khodadadi, A ; Alizadeh, K ; Arabzadeh, A ; Farajtabar, M ; Zha, H ; Rabiee, H. R. R ; Sharif University of Technology

IEEE Computer Society 2018

Abstract

Poisson Factorization (PF) is the gold standard framework for recommendation systems with implicit feedback whose variants show state-of-the-art performance on real-world recommendation tasks. However, most of the previous work do not explicitly take into account the temporal behavior of users which is essential to recommend the right item to the right user at the right time. In this paper, we introduce a Recurrent Poisson Factorization (RPF) framework that generalizes the classical PF methods by utilizing a Poisson process for modeling the implicit feedback. RPF treats time as a natural constituent of the model, and takes important factors for recommendation into consideration to provide a...

Extracting implicit social relation for social recommendation techniques in user rating prediction

, Article 26th International World Wide Web Conference, WWW 2017 Companion, 3 April 2017 through 7 April 2017 ; 2019 , Pages 1343-1351 ; 9781450349147 (ISBN) Taheri, S. M ; Elahe Ghalebi, K ; Mahyar, H ; Grosu, R ; Firouzi, M ; Movaghar, A ; Sharif University of Technology

International World Wide Web Conferences Steering Committee 2019

Abstract

Recommendation plays an increasingly important role in our daily lives. Recommender systems automatically suggest items to users that might be interesting for them. Recent studies illustrate that incorporating social trust in Matrix Factorization methods demonstrably improves accuracy of rating prediction. Such approaches mainly use the trust scores explicitly expressed by users. However, it is often challenging to have users provide explicit trust scores of each other. There exist quite a few works, which propose Trust Metrics to compute and predict trust scores between users based on their interactions. In this paper, first we present how social relation can be extracted from users'...

Extracting implicit social relation for social recommendation techniques in user rating prediction

, Article 26th International World Wide Web Conference, WWW 2017 Companion, 3 April 2017 through 7 April 2017 ; 2019 , Pages 1343-1351 ; 9781450349147 (ISBN) Taheri, S. M ; Elahe Ghalebi, K ; Mahyar, H ; Grosu, R ; Firouzi, M ; Movaghar, A ; Sharif University of Technology

International World Wide Web Conferences Steering Committee 2019

Abstract

Recommendation plays an increasingly important role in our daily lives. Recommender systems automatically suggest items to users that might be interesting for them. Recent studies illustrate that incorporating social trust in Matrix Factorization methods demonstrably improves accuracy of rating prediction. Such approaches mainly use the trust scores explicitly expressed by users. However, it is often challenging to have users provide explicit trust scores of each other. There exist quite a few works, which propose Trust Metrics to compute and predict trust scores between users based on their interactions. In this paper, first we present how social relation can be extracted from users'...

Recurrent poisson factorization for temporal recommendation

, Article IEEE Transactions on Knowledge and Data Engineering ; Volume 32, Issue 1 , 2020 , Pages 121-134 Hosseini, S. A ; Khodadadi, A ; Alizadeh, K ; Arabzadeh, A ; Farajtabar, M ; Zha, H ; Rabiee, H. R ; Sharif University of Technology

IEEE Computer Society 2020

Abstract

Poisson Factorization (PF) is the gold standard framework for recommendation systems with implicit feedback whose variants show state-of-the-art performance on real-world recommendation tasks. However, they do not explicitly take into account the temporal behavior of users which is essential to recommend the right item to the right user at the right time. In this paper, we introduce Recurrent Poisson Factorization (RPF) framework that generalizes the classical PF methods by utilizing a Poisson process for modeling the implicit feedback. RPF treats time as a natural constituent of the model, and takes important factors for recommendation into consideration to provide a rich family of...