A comparative study on single-channel noise estimation methods for speech enhancement

, Article International Conference on Intelligent Systems Design and Applications, ISDA ; 2012 , Pages 645-650 ; 21647143 (ISSN) ; 9781467351188 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2012

Abstract

This paper studies a number of well-known noise estimation techniques and provides a comparative performance analysis of them within a speech enhancement platform. Two types of evaluation data that simulate consistent and inconsistent noisy conditions are prepared in the presence of six noise types at different SNR levels. The performance of the speech enhancement systems and the spectral distance between the estimated and original noise spectra are used as evaluation criteria. The evaluations indicate that a simple VAD method outperforms the noise estimation methods in most of the consistent noisy conditions
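As a rough illustration of what a "simple VAD method" for noise estimation can look like, the sketch below updates a running noise spectrum only in frames judged to be speech-free. It is not the method evaluated in the paper: the energy-ratio decision rule, the function name `vad_noise_estimate`, and the parameters `threshold_ratio`, `alpha` and `init_frames` are all assumptions chosen for illustration.

```python
def vad_noise_estimate(frames_power, threshold_ratio=2.0, alpha=0.9, init_frames=2):
    """Track a noise power spectrum, updating it only in non-speech frames.

    frames_power: list of per-frame power spectra (each a list of floats).
    All parameter values are hypothetical defaults, not taken from the paper.
    """
    n_bins = len(frames_power[0])
    # Bootstrap: assume the first few frames contain noise only.
    noise = [
        sum(frame[k] for frame in frames_power[:init_frames]) / init_frames
        for k in range(n_bins)
    ]
    estimates = []
    for frame in frames_power:
        # Crude energy-based VAD: speech if the frame is much louder than the noise.
        is_speech = sum(frame) > threshold_ratio * sum(noise)
        if not is_speech:
            # Recursive averaging of the noise spectrum in noise-only frames.
            noise = [alpha * n + (1.0 - alpha) * p for n, p in zip(noise, frame)]
        estimates.append(list(noise))
    return estimates
```

During frames flagged as speech the estimate is simply held, which is the usual weakness of VAD-gated estimators when the noise changes while someone is talking.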

Hidden-Markov-model-based voice activity detector with high speech detection rate for speech enhancement

, Article IET Signal Processing ; Volume 6, Issue 1 , February , 2012 , Pages 54-63 ; 17519675 (ISSN) Veisi, H ; Sameti, H ; Sharif University of Technology

2012

Abstract

A new voice activity detection (VAD) algorithm with soft decision output in the Mel-frequency domain is developed based on a hidden Markov model (HMM) and is incorporated in an HMM-based speech enhancement system. The proposed VAD uses a two-state ergodic HMM representing speech presence and speech absence. The states are constructed from the noisy speech and noise HMMs used in the speech enhancement system. This composite model provides robust detection of speech segments in the presence of noise and obviates the need for extra modeling in HMM-based speech enhancement applications. As the main purpose of the proposed VAD is to detect speech segments accurately, a hang-over mechanism is proposed and...
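The abstract is cut off before the hang-over mechanism is described, so the sketch below shows only the generic idea behind hang-over schemes: after a speech frame, keep reporting speech for a few more frames so that weak speech tails are not clipped. The function name `apply_hangover` and the parameter `hang_frames` are hypothetical, and the binary input is an assumption (the VAD in the paper produces soft decisions).

```python
def apply_hangover(decisions, hang_frames=3):
    """Extend each detected speech frame by up to `hang_frames` extra frames.

    decisions: iterable of per-frame binary VAD decisions (truthy = speech).
    Generic hang-over smoothing, not the mechanism proposed in the paper.
    """
    smoothed = []
    counter = 0  # remaining hang-over frames
    for is_speech in decisions:
        if is_speech:
            counter = hang_frames  # re-arm the hang-over on every speech frame
            smoothed.append(True)
        elif counter > 0:
            counter -= 1  # still inside the hang-over window
            smoothed.append(True)
        else:
            smoothed.append(False)
    return smoothed
```

A longer hang-over raises the speech detection rate at the cost of labeling more noise frames as speech.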

The integration of principal component analysis and cepstral mean subtraction in parallel model combination for robust speech recognition

, Article Digital Signal Processing: A Review Journal ; Volume 21, Issue 1 , 2011 , Pages 36-53 ; 10512004 (ISSN) Veisi, H ; Sameti, H ; Sharif University of Technology

2011

Abstract

This paper addresses the problem of automatic speech recognition in real applications in which the speech signal is altered by various noises. Feature compensation and model compensation robustness methods are studied. Parallel model combination (PMC) and its recent advances are reviewed and a novel algorithm called PC-PMC is proposed. This algorithm utilizes the normalization ability of cepstral mean subtraction (CMS) and the compression and de-correlation capability of principal component analysis (PCA) in combination with the PMC model transformation method. The PC-PMC algorithm takes advantage of the additive noise compensation ability of PMC and the convolutional noise removal capability of CMS and PCA. In...

Cepstral-domain HMM-based speech enhancement using vector Taylor series and parallel model combination

, Article 2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012, 2 July 2012 through 5 July 2012 ; July , 2012 , Pages 298-303 ; 9781467303828 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2012

Abstract

The speech enhancement problem using a hidden Markov model (HMM) and minimum mean square error (MMSE) estimation in the cepstral domain is studied. This noise reduction approach can be considered as weighted-sum filtering of the noisy speech signal in which the filter weights are estimated using the HMM of noisy speech. To obtain an accurate estimate of the noisy speech HMM, vector Taylor series (VTS) is proposed and compared with the parallel model combination (PMC) technique. Furthermore, the proposed cepstral-domain HMM-based speech enhancement systems are compared with the renowned autoregressive HMM (AR-HMM) approach. The evaluation results confirm the superiority of the cepstral domain approach in comparison...

A parallel cepstral and spectral modeling for HMM-based speech enhancement

, Article 17th DSP 2011 International Conference on Digital Signal Processing, Proceedings, 6 July 2011 through 8 July 2011, Corfu ; 2011 ; 9781457702747 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2011

Abstract

An HMM-based speech enhancement method in the Mel-frequency domain is introduced and improved. It is shown that hidden Markov modeling in the Mel-frequency domain is beneficial due to its effective representation of the speech spectrum; however, speech enhancement in this domain requires an inversion from the Mel-frequency domain to the spectral domain, which introduces distortion artifacts in the spectrum estimation. To reduce the distortion effects of the inversion while retaining the advantages of robust modeling in the Mel-frequency domain, a parallel cepstral and spectral (PCS) modeling is proposed. In PCS, concurrent modeling in both the cepstral and spectral domains is performed. The performances of the...

Speech enhancement using hidden Markov models in Mel-frequency domain

, Article Speech Communication ; Volume 55, Issue 2 , 2013 , Pages 205-220 ; 01676393 (ISSN) Veisi, H ; Sameti, H ; Sharif University of Technology

2013

Abstract

A hidden Markov model (HMM)-based minimum mean square error speech enhancement method in the Mel-frequency domain is studied and a parallel cepstral and spectral (PCS) modeling is proposed. Both Mel-frequency spectral (MFS) and Mel-frequency cepstral (MFC) features are studied and evaluated for speech enhancement. To estimate the clean speech waveform from a noisy signal, an inversion from the Mel-frequency domain to the spectral domain is required, which introduces distortion artifacts in the spectrum estimation and the filtering. To reduce the corrupting effects of the inversion, the PCS modeling is proposed. This method performs concurrent modeling in both cepstral and magnitude spectral...

An improved parallel model combination method for noisy speech recognition

, Article Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009 ; 2009 , Pages 237-242 ; 9781424454792 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2009

Abstract

In this paper, a novel method called PC-PMC is proposed to improve the performance of automatic speech recognition systems in noisy environments. This method is based on the parallel model combination (PMC) technique and uses the normalization ability of Cepstral Mean Subtraction (CMS) and the compression and decorrelation capabilities of Principal Component Analysis (PCA). It takes advantage of both the additive noise compensation of PMC and the convolutive noise removal ability of CMS and PCA. The first problem to be solved in realizing PC-PMC is that the PMC algorithm requires invertible modules in the front-end of the system, while CMS normalization is not an invertible process. Also, it is...
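The CMS step mentioned in this abstract can be sketched in a few lines. A fixed convolutive channel appears as a constant additive offset on every cepstral frame, so subtracting the per-coefficient mean over an utterance removes it. The sketch covers only plain CMS, not PC-PMC; the function name `cepstral_mean_subtraction` and the list-of-lists frame layout are assumptions.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-coefficient mean, taken over all frames, from each frame.

    frames: list of cepstral vectors (each a list of floats of equal length).
    """
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each cepstral coefficient across the utterance.
    means = [sum(frame[k] for frame in frames) / n_frames for k in range(n_coeffs)]
    return [[frame[k] - means[k] for k in range(n_coeffs)] for frame in frames]


# A constant channel offset added to every frame disappears after CMS.
clean = [[1.0, 2.0], [3.0, 0.0], [2.0, 4.0]]
channel = [0.5, -1.0]  # hypothetical cepstral-domain channel offset
distorted = [[c + h for c, h in zip(frame, channel)] for frame in clean]
normalized_clean = cepstral_mean_subtraction(clean)
normalized_distorted = cepstral_mean_subtraction(distorted)
```

The sketch also shows why CMS is not invertible, as the abstract notes: the subtracted mean is discarded, so the original frames cannot be recovered from the normalized ones alone.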

Improving the performance of speech recognition systems using fault-tolerant techniques

, Article 2008 9th International Conference on Signal Processing, ICSP 2008, Beijing, 26 October 2008 through 29 October 2008 ; 2008 , Pages 579-582 ; 9781424421794 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2008

Abstract

In this paper, the use of fault-tolerant techniques in speech recognition systems is studied and evaluated as a way to make these systems robust to noise. Recognizer redundancy is implemented to utilize the strengths of several recognition methods, each of which has acceptable performance in a specific condition. Duplication-with-comparison and NMR methods are evaluated with majority and plurality voting on a telephony Persian speech-enabled IVR system. The evaluation results show two promising outcomes: first, the approach improves performance considerably; second, it makes it possible to detect outputs with low confidence. © 2008 IEEE
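The two voting rules named in the abstract can be sketched as follows. Majority voting accepts a hypothesis only if more than half of the recognizers agree on it, while plurality voting accepts the single most frequent hypothesis. The function name `vote` and the convention of returning `None` when no winner exists are assumptions; returning `None` is one simple way to flag a low-confidence output, and the paper may handle that case differently.

```python
from collections import Counter


def vote(hypotheses, rule="plurality"):
    """Combine the outputs of redundant recognizers by voting.

    hypotheses: list of recognition results, one per recognizer.
    Returns the winning hypothesis, or None if the rule yields no winner.
    """
    ranked = Counter(hypotheses).most_common()
    top_label, top_count = ranked[0]
    if rule == "majority":
        # Strictly more than half of the recognizers must agree.
        return top_label if top_count > len(hypotheses) / 2 else None
    # Plurality: the most frequent hypothesis wins unless it is tied.
    if len(ranked) > 1 and ranked[1][1] == top_count:
        return None
    return top_label
```

With two recognizers, majority voting reduces to duplication-with-comparison: the output is accepted only when both agree.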

Noise and speaker robustness in a Persian continuous speech recognition system

, Article 2007 9th International Symposium on Signal Processing and its Applications, ISSPA 2007, Sharjah, 12 February 2007 through 15 February 2007 ; 2007 ; 1424407796 (ISBN); 9781424407798 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2007

Abstract

In this paper, VTLN speaker normalization and the MLLR and MAP adaptation methods are investigated in a Persian HMM-based speaker-independent large vocabulary continuous speech recognition system. Speaker and environmental noise robustness are achieved in real-world applications for this system. A search-based method is used in VTLN to find speaker-relative warping factors. The warping factors are applied to the signal's spectrum to normalize the effect of VTL variation between speakers. In the MLLR framework, Gaussian mean and covariance transformations in global and full adaptation are examined. In this method, regression-tree-based adaptation in batch-supervised fashion is used. Also the standard...

The combination of CMS with PMC for improving robustness of speech recognition systems

, Article 13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008, Kish Island, 9 March 2008 through 11 March 2008 ; Volume 6 CCIS , 2008 , Pages 825-829 ; 18650929 (ISSN); 3540899847 (ISBN); 9783540899846 (ISBN) Veisi, H ; Sameti, H ; Sharif University of Technology

2008

Abstract

This paper addresses the robustness problem of automatic speech recognition systems for real applications in the presence of noise. The PMCC algorithm is proposed for combining the PMC technique with the CMS method. The proposed algorithm utilizes the CMS normalization ability within the PMC method, taking advantage of both methods to compensate for the effects of both additive and convolutional noises. Also, we have investigated VTLN for speaker normalization and MLLR and MAP for speaker and acoustic adaptation. Different combinations of these methods are used to achieve robustness and make the system usable in real applications. Our evaluations are done on 4 different real noisy tasks on Nevisa recognition...

An optimum MMSE post-filter for Adaptive Noise Cancellation in automobile environment

, Article 2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012 ; 2012 , Pages 431-435 ; 9781467303828 (ISBN) Khorram, S ; Sameti, H ; Veisi, H ; Sharif University of Technology

2012

Abstract

Adaptive Noise Cancellation (ANC) is an effective dual-channel technique for background noise reduction. Due to the presence of uncorrelated noise components at the two inputs in vehicular environments, ANC does not provide sufficient background noise reduction. To alleviate this problem, a complementary linear filter is added to the ANC structure. The filter coefficients are determined so that the enhanced signal is an MMSE estimate of the speech signal. Therefore, the ANC structure is modified to a dual-channel Wiener structure. We prove that this structure is identical to the LMS-type ANC followed by a Wiener post-filter. A new method is proposed for the noise spectrum estimation in the...
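For orientation, the LMS-type ANC that this abstract builds on can be sketched as below. The sketch covers only the baseline dual-channel canceller, not the MMSE post-filter the paper proposes. The function name `lms_noise_canceller`, the step size `mu`, the tap count and the synthetic test signals are all assumptions.

```python
import math


def lms_noise_canceller(primary, reference, n_taps=4, mu=0.05):
    """Baseline LMS adaptive noise canceller.

    primary:   samples of speech plus noise (main microphone).
    reference: samples of correlated noise (reference microphone).
    Returns the error signal (the enhanced output) and the final filter weights.
    """
    weights = [0.0] * n_taps
    history = [0.0] * n_taps  # most recent reference samples, newest first
    output = []
    for d, x in zip(primary, reference):
        history = [x] + history[:-1]
        noise_hat = sum(w * h for w, h in zip(weights, history))
        error = d - noise_hat  # enhanced sample
        # LMS update: step along the negative gradient of the squared error.
        weights = [w + mu * error * h for w, h in zip(weights, history)]
        output.append(error)
    return output, weights


# Example: the primary channel holds only a scaled copy of the reference noise.
reference = [math.sin(0.3 * n) for n in range(500)]
primary = [0.5 * r for r in reference]
residual, weights = lms_noise_canceller(primary, reference, n_taps=1, mu=0.1)
```

In this idealized example the noise at the two inputs is perfectly correlated, so the residual decays toward zero. The uncorrelated components described in the abstract are what this baseline cannot remove.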

An algebraic gain estimation method to improve the performance of HMM-based speech enhancement systems

, Article Proceedings - 2010 18th Iranian Conference on Electrical Engineering, ICEE 2010, 11 May 2010 through 13 May 2010 ; 2010 , Pages 336-339 ; 9781424467600 (ISBN) Mariooryad, S ; Sameti, H ; Veisi, H ; Sharif University of Technology

2010

Abstract

An extension to the conventional Hidden Markov Model (HMM)-based speech enhancement method is developed. An algebraic method is proposed to estimate the gains of speech and noise in order to improve the quality of the estimated speech. Different pronunciations and intonations may affect the speech gain. Besides, the noise gain may vary remarkably from one environment to another. This may lead to a mismatch between the energy contour of the trained models and the energy contour of the noisy speech signal. In this work, the speech gain and noise gain are estimated simultaneously using an algebraic method in order to match the gain of the noisy speech and the noisy model. To carry out this procedure an extension of least square...

LP-based over-sampled subband adaptive noise canceller for speech enhancement in diffuse noise fields

, Article 2008 9th International Conference on Signal Processing, ICSP 2008, Beijing, 26 October 2008 through 29 October 2008 ; 2008 , Pages 157-161 ; 9781424421794 (ISBN) Khorram, S ; Sameti, H ; Veisi, H ; Sharif University of Technology

2008

Abstract

Adaptive Noise Cancellers (ANCs) do not provide sufficient noise reduction in diffuse noise fields. In this paper, a new hybrid structure is proposed as a solution to this problem. The proposed system is a combination of two subsystems, an ANC and a new multistage post-filter. The post-filter is based on linear prediction (LP) and attempts to extract the speech component by using intermediate ANC signals. The system is implemented on an over-sampled DFT filterbank with different analysis and synthesis prototype filters. The experimental results using various quality measures show that the proposed system is superior to both the subband ANC and subband LP-based speech enhancement systems. ©...

A new lattice LP-based post filter for adaptive noise cancellers in mobile and vehicular applications

, Article Proceedings of the 8th IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2008, 16 December 2008 through 19 December 2008, Sarajevo ; 2008 , Pages 407-412 ; 9781424435555 (ISBN) Khorram, S ; Sameti, H ; Veisi, H ; Abutalebi, H. R ; Sharif University of Technology

2008

Abstract

Adaptive Noise Cancellation (ANC) is a well-known technique for background noise reduction in automobile and vehicular environments. The noise fields in automobile and other vehicle interiors closely obey the diffuse noise field model. However, ANC does not provide sufficient noise reduction in diffuse noise fields. In this paper, a new multistage post-filter is designed for ANC as a solution to diffuse noise conditions. The designed post-filter is a single-channel Linear Prediction (LP)-based speech enhancement system. The LP is performed by an adaptive lattice filter and attempts to extract speech components by using intermediate ANC signals. The post-filter has no...

Automatic noise recognition based on neural network using LPC and MFCC feature parameters

, Article 2012 Federated Conference on Computer Science and Information Systems, FedCSIS 2012, 9 September 2012 through 12 September 2012 ; 2012 , Pages 69-73 ; 9781467307086 (ISBN) Haghmaram, R ; Aroudi, A ; Ghezel, M. H ; Veisi, H ; Sharif University of Technology

2012

Abstract

This paper studies the automatic noise recognition problem based on RBF and MLP neural network classifiers using linear predictive and Mel-frequency cepstral coefficients (LPC and MFCC). We first briefly review the architecture of each network as an automatic noise recognition (ANR) approach, then compare them to each other and investigate the factors and criteria that influence final recognition performance. The proposed networks are evaluated on 15 stationary and non-stationary types of noise with a frame length of 20 ms in terms of correct classification rate. The results demonstrate that the MLP network using LPCs is a precise ANR with an accuracy rate of 99.9%, while the RBF network with MFCCs...

Study of equivalent mechanical properties and energy absorption of composite honeycomb structures

, Article International Journal of Applied Mechanics ; Volume 15, Issue 6 , 2023 ; 17588251 (ISSN) Farrokhabadi, A ; Gharehbaghi, H ; Malekinejad, H ; Sebghatollahi, M ; Noroozi, Z ; Veisi, H ; Sharif University of Technology

World Scientific 2023

Abstract

In this study, an analytical model based on classical laminate theory (CLT) is proposed to predict the equivalent mechanical characteristics of three-dimensional (3D) printed fiber-reinforced polylactic acid (PLA) honeycomb structures. Employing fiber-reinforced PLA provides higher rigidity and strength than structures made of pure isotropic materials. Tensile tests and finite element studies are conducted to verify the developed analytical relationships. A good agreement is found between the experimental, numerical, and analytical results. Consequently, the mechanical characteristics of the aforementioned structures can be properly predicted using the presented...

Progressive bearing failure modeling of composites with double-bolted joints at mesoscale level

, Article Archive of Applied Mechanics ; Volume 84, Issue 5 , May , 2014 , Pages 657-669 ; 09391533 (ISSN) Veisi, H ; Hosseini Kordkheili, S. A ; Toozandehjani, H ; Sharif University of Technology

2014

Abstract

Both numerical and experimental studies are carried out to investigate the strength of composite double-bolted joints and the bearing damage propagation. A mesoscale-level progressive damage model along with an analytical formulation is used to predict the bearing failure of carbon-epoxy composite plates. This damage model is implemented as a user material subroutine in the commercial software ABAQUS, and the maximum failure load is calculated. In order to validate the numerical results, experimental tests are also conducted, and comparison between the results shows excellent agreement. Furthermore, the effects of the bolt distances on the maximum failure load are studied. The results...

Thermally developing electroosmotic flow of power-law fluids in a parallel plate microchannel

, Article International Journal of Thermal Sciences ; Volume 61 , 2012 , Pages 106-117 ; 12900729 (ISSN) Sadeghi, A ; Saidi, M.H ; Veisi, H ; Fattahi, M ; Sharif University of Technology

2012

Abstract

The present investigation considers the thermally developing electroosmotic flow of power-law fluids through a parallel plate microchannel. Both the viscous dissipation and Joule heating effects are taken into account and a step change in wall temperature is considered to represent physically conceivable thermal entrance conditions. Expressions for the dimensionless temperature and Nusselt number in the form of infinite series are presented. In general, the resultant eigenvalue problem is solved numerically; nevertheless, an analytical solution is presented for the regions close to the entrance. A parametric study reveals that increasing values of the Peclet number result in higher wall...

Persian language modeling using recurrent neural networks

, Article 9th International Symposium on Telecommunication, IST 2018, 17 December 2018 through 19 December 2018 ; 2019 , Pages 207-210 ; 9781538682746 (ISBN) Hosseini Saravani, H ; Bahrani, M ; Veisi, H ; Besharati, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2019

Abstract

In this paper, recurrent neural networks are applied to language modeling of Persian, using word embeddings as the word representation. To this end, unidirectional and bidirectional Long Short-Term Memory (LSTM) networks are used, and the perplexity of Persian language models on a 100-million-word data set is evaluated. The effect of various parameters, including the number of hidden layers and the size of the LSTM units, on the performance of the networks in reducing the perplexity of the models is investigated. Among the different LSTM language models, the best perplexity, 59.05, is achieved by a 2-layer bidirectional LSTM model. Comparing this value with the perplexity of the classical...
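Perplexity, the metric reported in this abstract, is the exponential of the average negative log-probability that a model assigns to each token of a test text. The sketch below computes it from a list of per-token probabilities. The function name `perplexity` and the input format are assumptions, and the paper's evaluation pipeline is not reproduced here.

```python
import math


def perplexity(token_probs):
    """Perplexity of a sequence given the model probability of each token.

    token_probs: list of probabilities in (0, 1], one per token.
    """
    # Average negative log-likelihood per token, then exponentiate.
    log_sum = sum(math.log(p) for p in token_probs)
    return math.exp(-log_sum / len(token_probs))
```

Read this way, a perplexity of 59.05 means the model is on average about as uncertain as if it were choosing uniformly among roughly 59 words at each position.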

A complexity-based approach in image compression using neural networks

, Article World Academy of Science, Engineering and Technology ; Volume 35 , 2009 , Pages 684-694 ; 2010376X (ISSN) Veisi, H ; Jamzad, M ; Sharif University of Technology

2009

Abstract

In this paper we present an adaptive method for image compression that is based on the complexity level of the image. The basic compressor/de-compressor structure of this method is a multilayer perceptron artificial neural network. In the adaptive approach, different back-propagation artificial neural networks are used as compressor and de-compressor. This is done by dividing the image into blocks, computing the complexity of each block, and then selecting one network for each block according to its complexity value. Three complexity measures, called Entropy, Activity and Pattern-based, are used to determine the level of complexity in image blocks, and their ability in complexity estimation...
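The block-wise selection step described above can be sketched with the entropy measure, the only one of the three whose general definition is standard. The sketch uses Shannon entropy of the pixel-value histogram; the paper's exact formulation may differ. The function names, the threshold values, and the idea of returning a network index are hypothetical.

```python
import math
from collections import Counter


def block_entropy(block):
    """Shannon entropy (in bits) of the pixel values in one image block.

    block: list of rows, each a list of integer gray levels.
    """
    pixels = [p for row in block for p in row]
    total = len(pixels)
    counts = Counter(pixels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def select_network(entropy, thresholds=(1.0, 2.0)):
    """Map a complexity value to the index of the network used for the block.

    thresholds: hypothetical boundaries between complexity classes;
    index 0 is the network for the least complex blocks.
    """
    return sum(entropy > t for t in thresholds)
```

A flat block has zero entropy and goes to network 0, while a block whose four pixels all differ has 2 bits of entropy and goes to a higher-complexity network under these example thresholds.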