Loading...
Search for:
feature-vector
0.006 seconds
Total 23 records
An image annotation rectifying method based on deep features
, Article 2nd International Conference on Digital Signal Processing, ICDSP 2018, 25 February 2018 through 27 February 2018 ; 2018 , Pages 88-92 ; 9781450364027 (ISBN) ; Jamzad, M ; Sharif University of Technology
Association for Computing Machinery
2018
Abstract
Automatic image annotation methods generate a list of tags for each test image and present it in a matrix structure. To achieve a more accurate annotation, we propose a method with the aim of correcting the tag list. In our method, we detect an indicator for each group of tags and use it to rectify the annotation results. To find a correct indicator, we apply a deep feature vector generated by the “AlexNet” model. Using this indicator, we determine the suitable tags for an image. The purposed method is independent of feature vector, dataset, and annotation method. It can be applied to the currently available annotation methods. Our experiments showed improvement in all annotation methods...
EEG Signal Processing in BCI Applications
, M.Sc. Thesis Sharif University of Technology ; Haj Sadeghi, Khosrow (Supervisor)
Abstract
Brain-inspired methods are now widely used to process the data generated by the brain with the aim of improving our understanding of how the brain functions and produces the remarkable intelligence exhibited by humans, which is the source of all realizations, perception and actions. Therefore brain-computer interface (BCI) is one of the most challenging scientific problems which focuses scientists attention, in most cases these systems are based on EEG signals recorded from the surface of the scalp because this method of the brain activity monitoring is noninvasive, easy to use and quit inexpensive. Brain computer interface (BCI) systems analyse the EEG signals and translate person’s...
From windows to logos: analyzing outdoor images to aid flyer classification
, Article 15th International Conference on Image Analysis and Recognition, ICIAR 2018, 27 June 2018 through 29 June 2018 ; Volume 10882 LNCS , 2018 , Pages 175-184 ; 03029743 (ISSN); 9783319929996 (ISBN) ; Tomuro, N ; Bagheri Shouraki, S ; Sharif University of Technology
Springer Verlag
2018
Abstract
The goal of this paper was to create a new method for analyzing the online real estate flyers based on their property types. We created an algorithm which identifies the buildings and windows from the buildings in order to extract some useful features for classifying the flyers. Our novel approach for building recognition has two main steps: 1- Building Detector 2- Region Growing. Our novel window detection algorithm uses vanishing point to identify nearly the best angle for applying window detection. It transforms the 2D image into 3D and rotates the 3D image around the z-axis and picks the appropriate angle based on the vanishing points. Using these two novel techniques we were be able to...
A new image texture extraction algorithm based on Matching Pursuit Gabor wavelets
, Article 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, Philadelphia, PA, 18 March 2005 through 23 March 2005 ; Volume II , 2005 , Pages II740-II744 ; 15206149 (ISSN); 0780388747 (ISBN); 9780780388741 (ISBN) ; Rabiee, H. R ; Ghanbari, M ; Shamsollahi, M. B ; Sharif University of Technology
2005
Abstract
Feature vector extraction, based on local image texture, is a primitive algorithm for many other applications, like segmentation, clustering and identification. If these feature vectors are a good match to the human visual system (HVS), we can expect to get the appropriate results by using them. Gabor filters has been used for this purpose successfully. In this paper we introduce a novel refinement, with the use of Matching Pursuit (MP) to improve the Gabor based texture feature extractor. With this improvement, we show that the separability of different textures will increase. Another consideration in this work is computation complexity. Therefore, we limit the basis function set to reduce...
Wavelet transform and fusion of linear and non linear method for face recognition
, Article DICTA 2009 - Digital Image Computing: Techniques and Applications, 1 December 2009 through 3 December 2009, Melbourne ; 2009 , Pages 296-302 ; 9780769538662 (ISBN) ; Kasaei, S ; Neissi, N. A ; Sharif University of Technology
Abstract
This work presents a method to increase the face recognition accuracy using a combination of Wavelet, PCA, KPCA, and RBF Neural Networks. Preprocessing, feature extraction and classification rules are three crucial issues for face recognition. This paper presents a hybrid approach to employ these issues. For preprocessing and feature extraction steps, we apply a combination of wavelet transform, PCA and KPCA. During the classification stage, the Neural Network (RBF) is explored to achieve a robust decision in presence of wide facial variations. At first derives a feature vector from a set of downsampled wavelet representation of face images, then the resulting PCA-based linear features and...
Content Based Image Retrieval Using Segmentation Similarity Measure
,
M.Sc. Thesis
Sharif University of Technology
;
Jamzad, Mansour
(Supervisor)
Abstract
Content Based Image Retrieval (CBIR) is a research area in computer vision. This area comprises of two main steps, low level feature extraction such as color, texture and shape extraction and also similarity measures for comparison of images. The challenge in this system is the existence semantic gap between the low level visual features and the high level image semantics. The aim of research in this field is to reduce this semantic gap. In this study the images are divided into regions using Meanshift method, for color segmentation and then moments of each region as color feature are calculated. Also for extracting texture the images are divided into regions using Jseg method, and then...
The effect of a two steps searching mechanism Using Feature Vectors Related to Image Class in Improving the Performance of CBIR System
,
M.Sc. Thesis
Sharif University of Technology
;
Jamzad, Mansoor
(Supervisor)
;
Manzuri Shalmani, Mohammad Taghi
(Co-Advisor)
Abstract
Nowadays, retrieval is an inseparable part of user activities and due to growing usage of Content-Based Image Retrieval (CBIR), it has become a hot and challenging research topic specially in the past decade. The most important challenge that retrieval systems (including CBIR systems) are facing is the semantic gap between abstractions in the user’s mind and what is searched. One of the ways of dealing with this challenge is getting more information from the user about what he needs and so decreasing the distance between user’s will and what he gives to search engine as the description of his need. In this research, the class of query image is supposed to be given. For using this...
Estimating Stopping Time Using Function Approximation Algorithms in Reinforcement Learning
, M.Sc. Thesis Sharif University of Technology ; Alishahi, Kasra (Supervisor) ; Haji Mirsadeghi, Mir omid (Supervisor)
Abstract
We study the expected value of stopping times in stochastic processes. Since there is no rigorous solution for computing stopping times in many processes, our approach is based on estimation using well-known methods in the Reinforcement Learning literature. The primary method in this research is the temporal difference algorithm. With some modifications, we can study the role of some state features in determining the stopping time. Moreover, without a complicated mathematical analysis, we can find functions closely enough to the goal function.Furthermore, we compare our proposed algorithm to the well-known regression methods and show our algorithm's advantages and disadvantages. The primary...
Universal image steganalysis using singular values of DCT coefficients
, Article 2013 10th International ISC Conference on Information Security and Cryptology ; 2013 ; Gaemmaghami, S ; Sharif University of Technology
IEEE Computer Society
2013
Abstract
We propose a blind image steganalysis method based on the Singular Value Decomposition (SVD) of the Discrete Cosine Transform (DCT) coefficients that are revisited in this work. We compute geometric mean, mean of log values, and statistical moments (mean, variance and skewness) of the SVDs of the DCT sub-blocks that are averaged over the whole image to construct a 480-element feature vector for steganalysis. These features are fed to a Support Vector Machine (SVM) classifier to discriminate between stego and cover images. Experimental results show that the proposed method outperforms most powerful steganalyzers when applied to some well-known steganography algorithms
MMRO: A feature selection criterion for MR images based on alpha stable filter responses
, Article 2011 7th Iranian Conference on Machine Vision and Image Processing, MVIP 2011 - Proceedings ; 2011 ; 9781457715358 (ISBN) ; Fatemizadeh, E ; Sharif University of Technology
Abstract
In feature-based image registration, feature selection and reduction methods play an important role in decreasing computational burden of these operations. In this paper, a new approach is introduced to reduce the dimension of extracted feature vectors of MR images. This approach is based on the selection of the maximum and minimum responses of the alpha stable filter for the MR images over the extracted features with different orientation in frequency domain. This algorithm selects the rotation invariant features which are suitable for image registration purposes. It has been shown that these features could efficiently describe the image elements. The discriminating ability of the features...
Video keyframe analysis using a segment-based statistical metric in a visually sensitive parametric space
, Article IEEE Transactions on Image Processing ; Volume 20, Issue 10 , Oct , 2011 , Pages 2730-2737 ; 10577149 (ISSN) ; Ghaemmaghami, S ; Shirmohammadi, S ; Sharif University of Technology
2011
Abstract
This paper addresses a new approach to the keyframe extraction problem employing generalized Gaussian density (GGD) parameters of wavelet transform subbands along with Kullback-Leibler distance (KLD) measurement. Shot and cluster boundaries are selected using KLDs between GGD feature vectors, and then keyframes are located based on similarity and dissimilarity criteria. Objective and subjective evaluations show the high accuracy of this new approach compared with traditional methods
Learning strengths and weaknesses of classifiers for RGB-D semantic segmentation
, Article 9th Iranian Conference on Machine Vision and Image Processing, 18 November 2015 through 19 November 2015 ; Volume 2016-February , 2015 , Pages 176-179 ; 21666776 (ISSN) ; 9781467385398 (ISBN) ; Kasaei, S ; Sharif University of Technology
IEEE Computer Society
Abstract
3D scene understanding is an open challenge in the field of computer vision. Most of the focus is on 2D methods in which the semantic labeling of each RGB pixel is considered. But, in this paper, the 3D semantic labeling of RGB-D images is considered. In the proposed method, to extract some meaningful features, the superpixel generation algorithm is applied to the RGB image to segment it into a set of disjoint pixels. After that, the set of three powerful classifiers are utilized to semantically label each superpixel. In the proposed method, the probability outputs of these classifiers are concatenated as the novel feature vector for each superpixel. Consequently, to analyze the strengths...
A new word clustering method for building n-gram language models in continuous speech recognition systems
, Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 18 June 2008 through 20 June 2008, Wroclaw ; Volume 5027 LNAI , 2008 , Pages 286-293 ; 03029743 (ISSN) ; 354069045X (ISBN); 9783540690450 (ISBN) ; Sameti, H ; Hafezi, N ; Momtazi, S ; Sharif University of Technology
2008
Abstract
In this paper a new method for automatic word clustering is presented. We used this method for building n-gram language models for Persian continuous speech recognition (CSR) systems. In this method, each word is specified by a feature vector that represents the statistics of parts of speech (POS) of that word. The feature vectors are clustered by k-means algorithm. Using this method causes a reduction in time complexity which is a defect in other automatic clustering methods. Also, the problem of high perplexity in manual clustering methods is abated. The experimental results are based on "Persian Text Corpus" which contains about 9 million words. The extracted language models are evaluated...
Hippocampal shape analysis in the Laplace Beltrami feature space for temporal lobe epilepsy diagnosis and lateralization
, Article Proceedings - International Symposium on Biomedical Imaging ; 2012 , Pages 150-153 ; 19457928 (ISSN) ; 9781457718588 (ISBN) ; Gandomkar, Z ; Soltaman Zadeh, H ; Moghadasi, S. R ; Sharif University of Technology
IEEE
2012
Abstract
Shape analysis plays an important role in many medical imaging studies. One of the recent shape analysis methods uses the Laplace Beltrami operator which is also used in this paper for hippocampal shape comparison. We proposed a feature vector which consists of size measures and shape descriptors based on Laplace Beltrami eigenvalues and eigenfunctions. The aforementioned feature space is utilised for automatic differentiating normal subjects from epileptic patients as well as distinguishing epileptic patients with left temporal lobe epilepsy (LTLE) from patients with right temporal lobe epilepsy (RTLE). Achieved results are diagnostic accuracy of 93% with 95% sensitivity and lateralization...
Support vector data description for spoken digit recognition
, Article BIOSIGNALS 2012 - Proceedings of the International Conference on Bio-Inspired Systems and Signal Processing ; 2012 , Pages 32-37 ; 9789898425898 (ISBN) ; Ghasemi, A ; Tavanaei, M ; Sameti, H ; Manzuri, M. T ; Inst. Syst. Technol. Inf., Control Commun. (INSTICC) ; Sharif University of Technology
2012
Abstract
A classifier based on Support Vector Data Description (SVDD) is proposed for spoken digit recognition. We use the Mel Frequency Discrete Wavelet Coefficients (MFDWC) and the Mel Frequency cepstral Coefficients (MFCC) as the feature vectors. The proposed classifier is compared to the HMM and results are promising and we show the HMM and SVDD classifiers have equal accuracy rates. The performance of the proposed features and SVDD classifier with several kernel functions are evaluated and compared in clean and noisy speech. Because of multi resolution and localization of the Wavelet Transform (WT) and using SVDD, experiments on the spoken digit recognition systems based on MFDWC features and...
Mel-scaled Discrete Wavelet Transform and dynamic features for the Persian phoneme recognition
, Article 2011 International Symposium on Artificial Intelligence and Signal Processing, AISP 2011, 15 June 2011 through 16 June 2011 ; June , 2011 , Pages 138-140 ; 9781424498345 (ISBN) ; Manzuri, M. T ; Sameti, H ; Sharif University of Technology
2011
Abstract
In this paper we use a feature vector consisting of the Mel Frequency Discrete Wavelet Coefficients to recognize spoken phonemes in the Persian language. The purpose of using wavelet in feature extraction is to benefit from its multi resolution analysis and localization property in time and frequency domains. The MFDWCs are obtained by applying the Discrete Wavelet Transform (DWT) to the Mel-scaled log filter bank energies of a speech frame. Feature vectors are used for the HMM-based phoneme recognition on a portion of the FarsDat Persian language database consisting of 35 hour recorded data for training and 15 hour for testing. We evaluate the performance of new features for clean speech...
Car type recognition in highways based on wavelet and contourlet feature extraction
, Article Proceedings of the 2010 International Conference on Signal and Image Processing, ICSIP 2010, 15 December 2010 through 17 December 2010, Chennai ; 2010 , Pages 353-356 ; 9781424485949 (ISBN) ; Jamzad, M ; Sharif University of Technology
2010
Abstract
Recently many works focus on the vehicle type recognition because it is important in security and authentication systems. Computational complexity and low recognition rate especially when the system has to recognize among a large number of vehicles, are two major problems in vehicle type recognition. In recent years wavelet and contourlet transform have been applied in the recognition tasks successfully. In this paper we proposed a method for recognizing vehicle type in different lighting conditions. We used wavelet and contourlet as tools for feature extraction. These features are powerful and robust to illumination and scale variation. We reduced the dimension of feature vector by resizing...
Off-line Arabic/Farsi handwritten word recognition using RBF neural network and genetic algorithm
, Article Proceedings - 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2010, 29 October 2010 through 31 October 2010, Xiamen ; Volume 3 , 2010 , Pages 352-357 ; 9781424465835 (ISBN) ; Alamdar, F ; Azmi, R ; Haratizadeh, S ; Sharif University of Technology
2010
Abstract
In this paper an off-line ArabiclFarsi handwritten recognition Algorithm on a subset of Farsi name is proposed. In this system, There is no sub-word segmentation phase. Script database includes 3300 images of 30 Farsi common names. The features are wavelet coefficients extracted from smoothed word image profiles in four standard directions. The Centers of competitive layer of RBF neural network have been determined by combining GA and K-Means clustering algorithm. Weights of supervised layer has been trained by using LMS rule and the distances of feature vector of each sample to the centre of RBF network have been computed based on warping function. Experimental results show advantages of...
Skin detection using contourlet texture analysis
, Article 2009 14th International CSI Computer Conference, CSICC 2009, 20 October 2009 through 21 October 2009, Tehran ; 2009 , Pages 367-372 ; 9781424442621 (ISBN) ; Rohban, M. H ; Kasaei, S ; Sharif University of Technology
Abstract
A combined texture- and color-based skin detection is proposed in this paper. Nonsubsampled contourlet transform is used to represent texture of the whole image. Local neighbor contourlet coefficients of a pixel are used as feature vectors to classify each pixel. Dimensionality reduction is addressed through principal component analysis (PCA) to remedy the curse of dimensionality in the training phase. Before texture classification, the pixel is tested to determine whether it is skin-colored. Therefore, the classifier is learned to discriminate skin and non-skin texture for skin colored regions. A multi-layer perceptron is then trained using the feature vectors in the PCA reduced space. The...
A possibilistic approach for building statistical language models
, Article ISDA 2009 - 9th International Conference on Intelligent Systems Design and Applications, 30 November 2009 through 2 December 2009, Pisa ; 2009 , Pages 1014-1018 ; 9780769538723 (ISBN) ; Sameti, H ; Sharif University of Technology
Abstract
Class-based n-gram language models are those most frequently-used in continuous speech recognition systems, especially for languages for which no richly annotated corpora are available. Various word clustering algorithms have been proposed to build such class-based models. In this work, we discuss the superiority of soft approaches to class construction, whereby each word can be assigned to more than one class. We also propose a new method for possibilistic word clustering. The possibilistic C-mean algorithm is used as our clustering method. Various parameters of this algorithm are investigated; e.g., centroid initialization, distance measure, and words' feature vector. In the experiments...