Dynamic temporal error concealment for video data in error-prone environments

, Article Iranian Conference on Machine Vision and Image Processing, MVIP ; 2013 , Pages 43-47 ; 21666776 (ISSN) ; 9781467361842 (ISBN) Marvasti Zadeh, S. M ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

IEEE Computer Society 2013

Abstract

Error concealment is a useful method for improving the damaged video quality in the decoder side. In this paper, a dynamic method with low computational complexity is presented to improve the visual quality of videos when up to 50% of the frames are damaged. In the proposed method, temporal replacement and the improved outer boundary matching algorithm are used for dynamical error concealment in inter-frames of videos. With the use of motion vectors (MVs) which are close to the damaged macroblock (MB) the method can determine whether the motion in specific areas is either regular, irregular, or zero. Then, based on this knowledge, different methods are performed. It adaptively selects a set...

A novel video temporal error concealment algorithm based on moment invariants

, Article 9th Iranian Conference on Machine Vision and Image Processing, 18 November 2015 through 19 November 2015 ; Volume 2016-February , 2015 , Pages 20-23 ; 21666776 (ISSN) ; 9781467385398 (ISBN) Marvasti Zadeh, S. M ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

IEEE Computer Society 2015

Abstract

Nowadays, the use of multimedia services such as video sequences is constantly growing. Unfortunately, due to the lack of reliable communication channels and video data sensitivity to transmission errors, the quality of received video might decrease. Therefore, decoder error concealment methods have been developed to retrieve the damaged or lost data. In this paper, a novel temporal error concealment (TEC) algorithm based on moment invariants is presented. It includes three main stages of: designation of candidate motion vectors (MVs) set, adaptive determination of block size in the current and reference frames for feature extraction, and error function calculation based on moment...

Adaptive spatio-temporal context learning for visual target tracking

, Article 10th Iranian Conference on Machine Vision and Image Processing, MVIP 2017, 22 November 2017 through 23 November 2017 ; Volume 2017-November , April , 2018 , Pages 10-14 ; 21666776 (ISSN) ; 9781538644041 (ISBN) Marvasti Zadeh, S. M ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

IEEE Computer Society 2018

Abstract

While visual target tracking is one of the noteworthy and the most active research areas in computer vision and machine learning, many challenges are still unresolved. In this paper, an adaptive generic target tracker is proposed that includes the adaptive determination of learning parameters from spatio-temporal context model, analysis of prior targets and confidence map for accurate target localization, and modified scale estimation scheme based on confidence map. According to spatio-temporal context model, the learning parameters are adaptively determined for achieving confidence map and target scale robustly. Moreover, analysis of the confidence map helps our tracker to change context...

Efficient scale estimation methods using lightweight deep convolutional neural networks for visual tracking

, Article Neural Computing and Applications ; 2021 ; 09410643 (ISSN) Marvasti Zadeh, S. M ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

Springer Science and Business Media Deutschland GmbH 2021

Abstract

In recent years, visual tracking methods that are based on discriminative correlation filters (DCFs) have been very promising. However, most of these methods suffer from a lack of robust scale estimation skills. Although a wide range of recent DCF-based methods exploit the features that are extracted from deep convolutional neural networks (CNNs) in their translation model, the scale of the visual target is still estimated by hand-crafted features. Whereas the exploitation of CNNs imposes a high computational burden, this paper exploits pre-trained lightweight CNNs models to propose two efficient scale estimation methods, which not only improve the visual tracking performance but also...

Adaptive exploitation of pre-trained deep convolutional neural networks for robust visual tracking

, Article Multimedia Tools and Applications ; Volume 80, Issue 14 , 2021 , Pages 22027-22076 ; 13807501 (ISSN) Marvasti-Zadeh, S.M ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

Springer 2021

Abstract

Due to the automatic feature extraction procedure via multi-layer nonlinear transformations, the deep learning-based visual trackers have recently achieved a great success in challenging scenarios for visual tracking purposes. Although many of those trackers utilize the feature maps from pre-trained convolutional neural networks (CNNs), the effects of selecting different models and exploiting various combinations of their feature maps are still not compared completely. To the best of our knowledge, all those methods use a fixed number of convolutional feature maps without considering the scene attributes (e.g., occlusion, deformation, and fast motion) that might occur during tracking. As a...

A new method for separation of speech signals in convolutive mixtures

, Article 13th European Signal Processing Conference, EUSIPCO 2005, Antalya, 4 September 2005 through 8 September 2005 ; 2005 , Pages 2210-2213 ; 1604238216 (ISBN); 9781604238211 (ISBN) Ferdosizadeh, M ; Babaie Zadeh, M ; Marvasti, F. A ; Sharif University of Technology

2005

Abstract

In this paper, the performance of the gradient method based on Score Function Difference (SFD) in the separation of i.i.d. and periodic signals will be investigated. We will see that this algorithm will separate periodic signals better than i.i.d. ones. By using this experimental result and the fact that voiced frames of speech signals are approximately periodic, a modified algorithm named VDGaradient has been proposed for separation of speech signals in synthetic convolutive mixtures. In this method, voiced frames of speech signal will be used as the input to the gradient method, then the resulting separating system will be applied to separate sources completely

Deep Learning for Visual Tracking: A Comprehensive Survey

, Article IEEE Transactions on Intelligent Transportation Systems ; 2021 ; 15249050 (ISSN) Marvasti Zadeh, S. M ; Cheng, L ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2021

Abstract

Visual target tracking is one of the most sought-after yet challenging research topics in computer vision. Given the ill-posed nature of the problem and its popularity in a broad range of real-world scenarios, a number of large-scale benchmark datasets have been established, on which considerable methods have been developed and demonstrated with significant progress in recent years - predominantly by recent deep learning (DL)-based methods. This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics. It also extensively evaluates and analyzes the leading visual tracking methods. First, the fundamental...

Deep learning for visual tracking: a comprehensive survey

, Article IEEE Transactions on Intelligent Transportation Systems ; Volume 23, Issue 5 , 2022 , Pages 3943-3968 ; 15249050 (ISSN) Marvasti Zadeh, S. M ; Cheng, L ; Ghanei Yakhdan, H ; Kasaei, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2022

Abstract

Visual target tracking is one of the most sought-after yet challenging research topics in computer vision. Given the ill-posed nature of the problem and its popularity in a broad range of real-world scenarios, a number of large-scale benchmark datasets have been established, on which considerable methods have been developed and demonstrated with significant progress in recent years - predominantly by recent deep learning (DL)-based methods. This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics. It also extensively evaluates and analyzes the leading visual tracking methods. First, the fundamental...

COMET: Context-Aware IoU-guided network for small object tracking

, Article 15th Asian Conference on Computer Vision, ACCV 2020, 30 November 2020 through 4 December 2020 ; Volume 12623 LNCS , 2021 , Pages 594-611 ; 03029743 (ISSN); 9783030695316 (ISBN) Marvasti Zadeh, S. M ; Khaghani, J ; Ghanei Yakhdan, H ; Kasaei, S ; Cheng, L ; Sharif University of Technology

Springer Science and Business Media Deutschland GmbH 2021

Abstract

We consider the problem of tracking an unknown small target from aerial videos of medium to high altitudes. This is a challenging problem, which is even more pronounced in unavoidable scenarios of drastic camera motion and high density. To address this problem, we introduce a context-aware IoU-guided tracker (COMET) that exploits a multitask two-stream network and an offline reference proposal generation strategy. The proposed network fully exploits target-related information by multi-scale feature learning and attention modules. The proposed strategy introduces an efficient sampling strategy to generalize the network on the target and its parts without imposing extra computational...

Effective fusion of deep multitasking representations for robust visual tracking

, Article Visual Computer ; 2021 ; 01782789 (ISSN) Marvasti Zadeh, S. M ; Ghanei Yakhdan, H ; Kasaei, S ; Nasrollahi, K ; Moeslund, T. B ; Sharif University of Technology

Springer Science and Business Media Deutschland GmbH 2021

Abstract

Visual object tracking remains an active research field in computer vision due to persisting challenges with various problem-specific factors in real-world scenes. Many existing tracking methods based on discriminative correlation filters (DCFs) employ feature extraction networks (FENs) to model the target appearance during the learning process. However, using deep feature maps extracted from FENs based on different residual neural networks (ResNets) has not previously been investigated. This paper aims to evaluate the performance of 12 state-of-the-art ResNet-based FENs in a DCF-based framework to determine the best for visual tracking purposes. First, it ranks their best feature maps and...

Effective fusion of deep multitasking representations for robust visual tracking

, Article Visual Computer ; Volume 38, Issue 12 , 2022 , Pages 4397-4417 ; 01782789 (ISSN) Marvasti Zadeh, S. M ; Ghanei Yakhdan, H ; Kasaei, S ; Nasrollahi, K ; Moeslund, T. B ; Sharif University of Technology

Springer Science and Business Media Deutschland GmbH 2022

Abstract

Visual object tracking remains an active research field in computer vision due to persisting challenges with various problem-specific factors in real-world scenes. Many existing tracking methods based on discriminative correlation filters (DCFs) employ feature extraction networks (FENs) to model the target appearance during the learning process. However, using deep feature maps extracted from FENs based on different residual neural networks (ResNets) has not previously been investigated. This paper aims to evaluate the performance of 12 state-of-the-art ResNet-based FENs in a DCF-based framework to determine the best for visual tracking purposes. First, it ranks their best feature maps and...

Multiple wavelet denoising for embolic signal enhancement

, Article 2007 IEEE International Conference on Telecommunications and Malaysia International Conference on Communications, ICT-MICC 2007, Penang, 14 May 2007 through 17 May 2007 ; February , 2007 , Pages 658-664 ; 1424410940 (ISBN); 9781424410941 (ISBN) Marvasti, S ; Ghandi, M ; Marvasti, F ; Markus, H. S ; Gillies, D ; Sharif University of Technology

2007

Abstract

Transcranial Doppler ultrasound can be used to detect circulating cerebral eraboli. Embolie signals have characteristic transient chirps suitable for wavelet analysis. We have implemented and evaluated the first online selective selective wavelet transient enhancement filter to amplify embolic signals in a preprocessing system. Our approach is similar to wavelet de-noising for signal enhancement, but, in order to retain blood flow information, we do not use traditional threshold methods. The selective wavelet amplifier uses the matched filter properties of wavelets to enhance embolic signals significantly and improve classification performance using a novel noise tolerant approach. Even the...

A geometric approach for separating several speech signals

, Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) ; Volume 3195 , 2004 , Pages 798-806 ; 03029743 (ISSN); 3540230564 (ISBN); 9783540230564 (ISBN) Babaie Zadeh, M ; Mansour, A ; Jutten, C ; Marvasti, F ; Sharif University of Technology

Springer Verlag 2004

Abstract

In this paper a new geometrical approach for separating speech signals is presented. This approach can be directly applied to separate more than two speech signals. It is based on clustering the observation points, and then fitting a line (hyper-plane) onto each cluster. The algorithm quality is shown to be improved by using DCT coefficients of speech signals, as opposed to using speech samples. © Springer-Verlag 2004

Learning overcomplete dictionaries from markovian data

, Article 10th IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2018, 8 July 2018 through 11 July 2018 ; Volume 2018-July , 2018 , Pages 218-222 ; 2151870X (ISSN); 9781538647523 (ISBN) Akhavan, S ; Esmaeili, S ; Babaie Zadeh, M ; Soltanian Zadeh, H ; Sharif University of Technology

IEEE Computer Society 2018

Abstract

We explore the dictionary learning problem for sparse representation when the signals are dependent. In this paper, a first-order Markovian model is considered for dependency of the signals, that has many applications especially in medical signals. It is shown that the considered dependency among the signals can degrade the performance of the existing dictionary learning algorithms. Hence, we propose a method using the Maximum Log-likelihood Estimator (MLE) and the Expectation Minimization (EM) algorithm to learn the dictionary from the signals generated under the first-order Markovian model. Simulation results show the efficiency of the proposed method in comparison with the...

Contourlet-based image watermarking using optimum detector in a noisy environment

, Article IEEE Transactions on Image Processing ; Volume 19, Issue 4 , 2010 , Pages 967-980 ; 10577149 (ISSN) Akhaee, M. A ; Sahraeian, S. M. E ; Marvasti, F ; Sharif University of Technology

2010

Abstract

In this paper, an improved multiplicative image watermarking system is presented. Since human visual system is less sensitive to the image edges, watermarking is applied in the contourlet domain, which represents image edges sparsely. In the presented scheme, watermark data is embedded in directional subband with the highest energy. By modeling the contourlet coefficients with General Gaussian Distribution (GGD), the distribution of watermarked noisy coefficients is analytically calculated. The tradeoff between the transparency and robustness of the watermark data is solved in a novel fashion. At the receiver, based on the Maximum Likelihood (ML) decision rule, an optimal detector by the aid...

Blind image watermarking based onsample rotation with optimal detector

, Article European Signal Processing Conference, 24 August 2009 through 28 August 2009, Glasgow ; 2009 , Pages 278-282 ; 22195491 (ISSN) Sahraeian, S. M. E ; Akhaee, M. A ; Marvasti, F ; Sharif University of Technology

2009

Abstract

This paper present a simple watermarking approach based on the rotation of low frequency components of image blocks. The rotation process is performed with less distortion by projection of the samples on specific lines according to message bit. To have optimal detection Maximum Likelihood criteria has been used. Thus, by computing the distribution of rotated noisy samples the optimum decoder is presented and its performance is analytically investigated. The privilege of this proposed algorithm is its inherent robustness against gain attack as well as its simplicity. Experimental results confirm the validity of the analytical derivations and also high robustness against common attacks. ©...

Seismic reliability assessment of structures using artificial neural network

, Article Journal of Building Engineering ; Volume 11 , 2017 , Pages 230-235 ; 23527102 (ISSN) Vazirizade, S. M ; Nozhati, S ; Allameh Zadeh, M ; Sharif University of Technology

Elsevier Ltd 2017

Abstract

Localization and quantification of structural damage and estimating the failure probability are key outputs in the reliability assessment of structures. In this study, an Artificial Neural Network (ANN) is used to reduce the computational effort required for reliability analysis and damage detection. Toward this end, one demonstrative structure is modeled and then several damage scenarios are defined. These scenarios are considered as training data sets for establishing an ANN model. In this regard, the relationship between structural response (input) and structural stiffness (output) is established using ANN models. The established ANN is more economical and achieves reasonable accuracy in...

Vision-based trajectory tracking controller for autonomous close proximity operations

, Article 2008 IEEE Aerospace Conference, AC, Big Sky, MT, 1 March 2008 through 8 March 2008 ; 2008 ; 1095323X (ISSN) ; 1424414881 (ISBN); 9781424414888 (ISBN) Sashafi, F ; Khansari Zadeh, S. M ; Sharif University of Technology

2008

Abstract

Tight unmanned aerial vehicle (UAV) autonomous missions such as formation flight and aerial refueling (AR) requires an active controller that works in conjunction with a precise vision-based sensor that is able to extract In-front aircraft relative position and orientation from captured images. A key point in implementing such a sensor is its robustness in the presence of noises and other uncertainties. In this paper, a new vision-based algorithm that uses neural networks to estimate the In-front aircraft relative orientation and position is developed. The accuracy and robustness of the proposed algorithm has been validated via a detailed modeling and a complete virtual environment based on...

Scale invariant feature transform using oriented pattern

, Article Canadian Conference on Electrical and Computer Engineering ; 2014 Daneshvar, M. B ; Babaie-Zadeh, M ; Ghorshi, S ; Sharif University of Technology

2014

Abstract

Image matching plays an important role in many aspects of computer vision. Our proposed method is based on Scale Invariant Feature Transform (SIFT) which is one of the popular image matching methods. The main ideas behind our method are removing the excess keypoints, adding oriented patterns to descriptor, and decreasing the size of the descriptors. By doing these changes to SIFT, we would have oriented patterns of keypoints. In addition, the numbers of keypoints have been reduced and the places of keypoints would be selected more accurately, and also the size of the descriptors has been reduced

Mining social network for extracting topic of textual conversations

, Article 5th International Conference on Soft Computing As Transdisciplinary Science and Technology, CSTST '08, Cergy-Pontoise, 28 October 2008 through 31 October 2008 ; October , 2008 , Pages 232-237 ; 9781605580463 (ISBN) Moradian Zadeh, P ; Mohi, M ; Moshkenani, M. S ; Sharif University of Technology

2008

Abstract

Developing internet usage and services urged play strong role for social network. Social networks are environment which uses internet as interface to provide relations between people, in the other word to interchange data and information between persons. Email and Instant Messengers are popular examples of them. Whereas these environments are continuously and instantly developing, revising and viewing by humans, they are good places for mining. In this paper, the topic of exchanged information between users in this type of networks will be our target. Our method is to use a hierarchical dictionary of semantically related topics and words that is mapped to a graph. Then extracted keywords...