Loading...
Search for: gradient-descent
0.011 seconds

    ICA by Mutual Information minimization: An approach for avoiding local minima

    , Article 13th European Signal Processing Conference, EUSIPCO 2005, Antalya, 4 September 2005 through 8 September 2005 ; 2005 , Pages 253-256 ; 1604238216 (ISBN); 9781604238211 (ISBN) Babaie Zadeh, M ; Bahmani, B ; Jutten, C ; Sharif University of Technology
    2005
    Abstract
    Using Mutual Information (MI) minimization is very common in Blind Source Separation (BSS). However, it is known that gradient descent approaches may trap in local minima of MI in constrained models. In this paper, it is proposed that this problem may be solved using a 'poor' estimation of the derivative of MI  

    A novel approach to quantized matrix completion using huber loss measure

    , Article IEEE Signal Processing Letters ; Volume 26, Issue 2 , 2019 , Pages 337-341 ; 10709908 (ISSN) Esmaeili, A ; Marvasti, F ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    In this paper, we introduce a novel and robust approach to quantized matrix completion. First, we propose a rank minimization problem with constraints induced by quantization bounds. Next, we form an unconstrained optimization problem by regularizing the rank function with Huber loss. Huber loss is leveraged to control the violation from quantization bounds due to two properties: first, it is differentiable; and second, it is less sensitive to outliers than the quadratic loss. A smooth rank approximation is utilized to endorse lower rank on the genuine data matrix. Thus, an unconstrained optimization problem with differentiable objective function is obtained allowing us to advantage from... 

    Speech activity detection using deep neural networks

    , Article 2017 25th Iranian Conference on Electrical Engineering, ICEE 2017, 2 May 2017 through 4 May 2017 ; 2017 , Pages 1564-1568 ; 9781509059638 (ISBN) Shahsavari, S ; Sameti, H ; Hadian, H ; Sharif University of Technology
    Abstract
    In this paper, we introduce a new dataset for SAD and evaluate certain common methods such as GMM, DNN, and RNN on it. We have collected our dataset in a semi-supervised approach, using subtitled movies, with a labeling accuracy of 95%. This semi-automatic method can help us collect huge amounts of labeled audio data with very high diversity in language, speaker, and channel. We model the problem of SAD as a classification task to two classes of speech and non-speech. When using GMM for this problem, we use two separate mixtures to model speech and non-speech. In the case of neural networks, we use a softmax layer at the end of the network, with two neurons which represent speech and... 

    A new framework to train autoencoders through non-smooth regularization

    , Article IEEE Transactions on Signal Processing ; Volume 67, Issue 7 , 2019 , Pages 1860-1874 ; 1053587X (ISSN) Amini, S ; Ghaemmaghami, S ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    Deep structures consisting of many layers of nonlinearities have a high potential of expressing complex relations if properly initialized. Autoencoders play a complementary role in training a deep structure by initializing each layer in a greedy unsupervised manner. Due to the high capacity presented by autoencoders, these structures need to be regularized. While mathematical regularizers (based on weight decay, sparsity, etc.) and structural ones (by way of, e.g., denoising and dropout) have been well studied in the literature, quite a few papers have addressed the problem of training autoencoder with non-smooth regularization. In this paper, we address the problem of training autoencoder... 

    Fuzzy adaptive sliding mode control of chaos

    , Article 2006 ASME International Mechanical Engineering Congress and Exposition, IMECE2006, Chicago, IL, 5 November 2006 through 10 November 2006 ; 2006 ; 10716947 (ISSN); 0791837904 (ISBN); 9780791837900 (ISBN) Arjmand, M. T ; Layeghi, N ; Salarieh, H ; Alasty, A ; Sharif University of Technology
    American Society of Mechanical Engineers (ASME)  2006
    Abstract
    A simple and systematic approach is developed for modeling and adaptive control of an unknown (or uncertain) chaotic system of the form x(n) = f(X) + g(X)u, using only input-output data obtained from the underlying dynamic system. Two different fuzzy identification methods, i.e. least-squares and gradient descent, are used for identifying the unknown functions f (X) and g (X). Based on the fuzzy modeling, an adaptive controller is devised, which works through sliding mode method. The presented procedure is illustrated by using the chaotic system-modified Duffing's equation as an example, on which simulation results demonstrate the effectiveness of the proposed adaptive algorithm. Copyright ©... 

    A complementary method for preventing hidden neurons' saturation in feed forward neural networks training

    , Article Iranian Journal of Electrical and Computer Engineering ; Volume 9, Issue 2 , SUMMER-FALL , 2010 , Pages 127-133 ; 16820053 (ISSN) Moallem, P ; Ayoughi, S. A ; Sharif University of Technology
    2010
    Abstract
    In feed forward neural networks, hidden layer neurons' saturation conditions, which are the cause of flat spots on the error surface, is one of the main disadvantages of any conventional gradient descent learning algorithm. In this paper, we propose a novel complementary scheme for the learning based on a suitable combination of anti saturated hidden neurons learning process and accelerating methods like the momentum term and the parallel tangent technique. In our proposed method, a normalized saturation criterion (NSC) of hidden neurons, which is introduced in this paper, is monitored during learning process. When the NSC is higher than a specified threshold, it means that the algorithm... 

    A hybrid method of modified cat swarm optimization and gradient descent algorithm for training anfis

    , Article International Journal of Computational Intelligence and Applications ; Volume 12, Issue 2 , June , 2013 ; 14690268 (ISSN) Orouskhani, M ; Mansouri, M ; Orouskhani, Y ; Teshnehlab, M ; Sharif University of Technology
    2013
    Abstract
    This paper introduces a novel approach for tuning the parameters of the adaptive network-based fuzzy inference system (ANFIS). In the commonly used training methods, the antecedent and consequent parameters of ANFIS are trained by gradient-based algorithms and recursive least square method, respectively. In this study, a new swarm-based meta-heuristic optimization algorithm, so-called "Cat Swarm Optimization", is used in order to train the antecedent part parameters and gradient descent algorithm is applied for training the consequent part parameters. Experimental results for prediction of Mackey-Glass model and identification of two nonlinear dynamic systems reveal that the performance of... 

    Real-time Bus Holding Control Strategy to Reduce Passenger Waiting Time

    , M.Sc. Thesis Sharif University of Technology Asgharzdeh, Mohammad Amin (Author) ; Shafahi, Yusof (Supervisor)
    Abstract
    Travel time is the most significant element in choosing among different modes of transportation. In order to increase the public transportation’s utility we need to utilize some methods to reduce public transportation travel time and passenger delay. Bus system, as one of the major types of public transportation, is mainly influenced by the traffic flow more than other types of public transportation. In order to increase the bus system’s reliability we need to utilize a controlling system so that we could increase its movement discipline and reduce line’s disruption. In this research we proposed an optimization model based on bus holding strategy and the real-time data from the route. This... 

    Data-driven Methods for Cooperative Control of Wheeled Mobile Robots

    , M.Sc. Thesis Sharif University of Technology Qahremani, Sina (Author) ; Sadati, Nasser (Supervisor)
    Abstract
    Employing wheeled mobile robots is growing in industry, transportation, space and defense industry and many other social fields as well. These robots are used to execute distinct forms of operations and tasks such as exploring the surface of the earth and other planets, serving in public places, backing natural disasters and warehousing, and so forth. In some cases, the assigned mission may not be capable of being performed as intended by a single robot. In this case, several robots will work together to execute a particular mission. Several research topics that are under investigation currently include the interacting procedure of robots as a multi-agent system in order to perform the... 

    Intelligent control of chaos using linear feedback controller and neural network identifier

    , Article Communications in Nonlinear Science and Numerical Simulation ; Volume 17, Issue 12 , 2012 , Pages 4731-4739 ; 10075704 (ISSN) Sadeghpour, M ; Khodabakhsh, M ; Salarieh, H ; Sharif University of Technology
    2012
    Abstract
    A method for controlling chaos when the mathematical model of the system is unknown is presented in this paper. The controller is designed by the pole placement algorithm which provides a linear feedback control method. For calculating the feedback gain, a neural network is used for identification of the system from which the Jacobian of the system in its fixed point can be approximated. The weights of the neural network are adjusted online by the gradient descent algorithm in which the difference between the system output and the network output is considered as the error to be decreased. The method is applied on both discrete-time and continuous-time systems. For continuous-time systems,... 

    Efficient iterative Semi-Supervised Classification on manifold

    , Article Proceedings - IEEE International Conference on Data Mining, ICDM ; 2011 , Pages 228-235 ; 15504786 (ISSN); 9780769544090 (ISBN) Farajtabar, M ; Rabiee, H. R ; Shaban, A ; Soltani Farani, A ; National Science Foundation (NSF) - Where Discoveries Begin; University of Technology Sydney; Google; Alberta Ingenuity Centre for Machine Learning; IBM Research ; Sharif University of Technology
    Abstract
    Semi-Supervised Learning (SSL) has become a topic of recent research that effectively addresses the problem of limited labeled data. Many SSL methods have been developed based on the manifold assumption, among them, the Local and Global Consistency (LGC) is a popular method. The problem with most of these algorithms, and in particular with LGC, is the fact that their naive implementations do not scale well to the size of data. Time and memory limitations are the major problems faced in large-scale problems. In this paper, we provide theoretical bounds on gradient descent, and to overcome the aforementioned problems, a new approximate Newton's method is proposed. Moreover, convergence... 

    Towards improving robustness of deep neural networks to adversarial perturbations

    , Article IEEE Transactions on Multimedia ; Volume 22, Issue 7 , 2020 , Pages 1889-1903 Amini, S ; Ghaemmaghami, S ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    Deep neural networks have presented superlative performance in many machine learning based perception and recognition tasks, where they have even outperformed human precision in some applications. However, it has been found that human perception system is much more robust to adversarial perturbation, as compared to these artificial networks. It has been shown that a deep architecture with a lower Lipschitz constant can generalize better and tolerate higher level of adversarial perturbation. Smooth regularization has been proposed to control the Lipschitz constant of a deep architecture and in this work, we show how a deep convolutional neural network (CNN), based on non-smooth regularization... 

    Spatio-temporal VLAD encoding of visual events using temporal ordering of the mid-level deep semantics

    , Article IEEE Transactions on Multimedia ; Volume 22, Issue 7 , 2020 , Pages 1769-1784 Soltanian, M ; Amini, S ; Ghaemmaghami, S ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    Classification of video events based on frame-level descriptors is a common approach to video recognition. In the meanwhile, proper encoding of the frame-level descriptors is vital to the whole event classification procedure. While there are some pretty efficient video descriptor encoding methods, temporal ordering of the descriptors is often ignored in these encoding algorithms. In this paper, we show that by taking into account the temporal inter-frame dependencies and tracking the chronological order of video sub-events, accuracy of event recognition is further improved. First, the frame-level descriptors are extracted using convolutional neural networks (CNNs) pre-trained on ImageNet,... 

    An optimization based approach embedded in a fuzzy connectivity algorithm for airway tree segmentation

    , Article Proceedings of the 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS'08 - "Personalized Healthcare through Technology", 20 August 2008 through 25 August 2008, Vancouver, BC ; 2008 , Pages 4011-4014 ; 9781424418152 (ISBN) Yousefi Rizi, F ; Ahmadian, A. R ; Fatemizadeh, E ; Alirezaie, J ; Sharif University of Technology
    2008
    Abstract
    The main problem with airway segmentation methods which significantly influences their accuracy is leakage into the extra-luminal regions due to thinness of the airway wall during the process of segmentation. This phenomenon potentially makes large regions of lungparenchyma to be wrongly identified as airways. A solution to this problem in the previous methods was based on leak detection followed by reducing leakage during the segmentation process. This has been dealt with adjusting the segmentation parameters and performing the re-segmentation process on the pre-segmented area. This makes the algorithm very exhaustive and more dependent on the user interaction. The method presented here is...