Loading...
Search for: short-term-memory
0.011 seconds
Total 41 records

    Automated Lip-Reading robotic system based on convolutional neural network and long short-term memory

    , Article 13th International Conference on Social Robotics, ICSR 2021, 10 November 2021 through 13 November 2021 ; Volume 13086 LNAI , 2021 , Pages 73-84 ; 03029743 (ISSN) ; 9783030905248 (ISBN) Gholipour, A ; Taheri, A ; Mohammadzade, H ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2021
    Abstract
    In Iranian Sign Language (ISL), alongside the movement of fingers/arms, the dynamic movement of lips is also essential to perform/recognize a sign completely and correctly. In a follow up of our previous studies in empowering the RASA social robot to interact with individuals with hearing problems via sign language, we have proposed two automated lip-reading systems based on DNN architectures, a CNN-LSTM and a 3D-CNN, on the robotic system to recognize OuluVS2 database words. In the first network, CNN was used to extract static features, and LSTM was used to model temporal dynamics. In the second one, a 3D-CNN network was used to extract appropriate visual and temporal features from the... 

    Continuous emotion recognition during music listening using EEG signals: A fuzzy parallel cascades model

    , Article Applied Soft Computing ; Volume 101 , 2021 ; 15684946 (ISSN) Hasanzadeh, F ; Annabestani, M ; Moghimi, S ; Sharif University of Technology
    Elsevier Ltd  2021
    Abstract
    A controversial issue in artificial intelligence is human emotion recognition. This paper presents a fuzzy parallel cascades (FPC) model for predicting the continuous subjective emotional appraisal of music by time-varying spectral content of electroencephalogram (EEG) signals. The EEG, along with an emotional appraisal of 15 subjects, was recorded during listening to seven musical excerpts. The emotional appraisement was recorded along the valence and arousal emotional axes as a continuous signal. The FPC model was composed of parallel cascades with each cascade containing a fuzzy logic-based system. The FPC model performance was evaluated using linear regression (LR), support vector... 

    Language Modeling Using Recurrent Neural Networks

    , M.Sc. Thesis Sharif University of Technology Rahimi, Adel (Author) ; Sameti, Hossein (Supervisor)
    Abstract
    This thesis examines the differences and the similarities between the two famous RNN blocks the Long Short Term Memory and the Gated Recurrent Unit. It measure different aspects such as computational complexity, Word Error Rate, and subjective human evaluation in the task of text generation.In the computational complexity experiment results show that the LSTM takes more time to compute, in comparison to the GRU. Moving on into the next experiment the GRU slightly outperforms the LSTM in terms of WER but the perplexity for the language models tested was the same. This shows that slight differences in the perplexity does not drastically change the WER. Having said, the results suggest that the... 

    LSTM-Based ecg classification for continuous monitoring on personal wearable devices

    , Article IEEE Journal of Biomedical and Health Informatics ; Volume 24, Issue 2 , 2020 , Pages 515-523 Saadatnejad, S ; Oveisi, M ; Hashemi, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    Objective: A novel electrocardiogram (ECG) classification algorithm is proposed for continuous cardiac monitoring on wearable devices with limited processing capacity. Methods: The proposed solution employs a novel architecture consisting of wavelet transform and multiple long short-term memory (LSTM) recurrent neural networks (see Fig. 1). Results: Experimental evaluations show superior ECG classification performance compared to previous works. Measurements on different hardware platforms show the proposed algorithm meets timing requirements for continuous and real-time execution on wearable devices. Conclusion: In contrast to many compute-intensive deep-learning based approaches, the... 

    Providing a Method Based on Signal Transformations and Machine Learning Tools for Forecasting in Stock Market

    , M.Sc. Thesis Sharif University of Technology Parhizkari, Amir (Author) ; Khedmati, Majid (Supervisor)
    Abstract
    Obtaining high profit is the ultimate goal of an investor in the financial market. The key to achieve high profits in stock trading is to find the right time to trade with minimum business risk. However, it is difficult, often, to make decision about the best time to buy or sell some stocks due to the extremely dynamic and volatile behavior of the stock market. In order to resolve these problems, two steps have been followed in this research:1) Create a model to predict the final price of the stock with small error rate, and 2) Suggest the best stocks for trading to the trader. In order to achieve the goals of the first step, the stock price data of Hcltech, Maruti, Axisbank is selected and... 

    Recurrent Neural Network Language Modeling For Persian

    , M.Sc. Thesis Sharif University of Technology Hosseini Saravani, Habib (Author) ; Bahrani, Mohammad (Supervisor) ; Veisi, Hadi (Supervisor)
    Abstract
    Neural Networks have been applied to Language Modeling to solve a major problem that N-gram language models could not overcome: discreteness of the words. Generally, neural networks were successful in solving this problem and improved Language Modeling by reducing the perplexity of the models. Neural networks can find grammatical and semantic connections among the words using word embedding which maps each word to a low dimensional feature vector of real numbers. In this research, different kinds of neural network applied to Language Modeling has been reviewed. Also, it has been tried to reduce the perplexity of Persian language models on a 100-million scale data set using a single-layer... 

    Design and Implementation of a Tool Ttip Force Estimation Algorithm for Surgical Robotic Systems Using Proximal Sensor using Neural Network

    , M.Sc. Thesis Sharif University of Technology Mansoury, Bahman (Author) ; Farahmand, Farzam (Supervisor)
    Abstract
    One of the most important problems in the field of robotic surgery is the measurement of the laparoscopic instrument Tip force. Many efforts have been made with the use of distal sensors to issues such as high cost and need to be disinfected, but the use of proximal sensors for estimating the Tip force of the tool is very rare, which can improve distal sensor problems.In this project, first, using data obtained from simulating a set-up for the Sina robot, two static MLP, and dynamic NARX neural networks are trained to evaluate the use of machine learning algorithms to determine whether the Tip force neural network can be used. Did the robot estimate Sina's surgeon? Then, by comparing this... 

    Real Time Trend Forecasting of Noisy Signal Using Deep Recurrent LSTM Network

    , M.Sc. Thesis Sharif University of Technology Aghaee, Arman (Author) ; Vosoughi Vahdat, Bijan (Supervisor)
    Abstract
    Artificial neural networks are mathematical models inspired by the nervous system and brain. The types and applications of these networks are very widespread nowadays, and it seems that they can be used to track the signals well and estimate the data of the next. In this research, we try to present a model that can predict the future of the trend of noisy signals that have unpredictable behavior, or in other words, chaotic signals. Such research is also widely used in the medical sciences, including the diagnosis of epileptic seizures or heart attacks. In this research, a study with high volatility financial data has been done as an example on this issue and the proposed model tries to be... 

    Real-Time Hand Pose Estimation Using Camera Vision System

    , M.Sc. Thesis Sharif University of Technology Kiani, Mahmoud (Author) ; Hashemi, Matin (Supervisor) ; Namvar, Mehrzad (Supervisor)
    Abstract
    Hand pose estimation is something that has applications in many fields, including augmented and virtual reality systems, as well as mixed reality. Hand gesture recognition and classification applications including sign language recognition and non-handheld senarios (such as storefront contactless Survey systems) that have found special cases in the Qovid-19 pandemic period Shows the highest hand pose estimation importance. In our work, we target 2D and 3D estimation at the same time and also use RGB camera as a sensor to record input data. It becomes more economical to achieve Compared with using RGBD or depth sensors . There is only one RGB image in our input. Also there is no contract to... 

    Designing an Automatic Lip-reading System for Persian Words Using Deep Neural Networks and Implementing it on Rasa Social Robot

    , M.Sc. Thesis Sharif University of Technology Gholipour, Amir (Author) ; Taheri, Alireza (Supervisor) ; Mohammadzadeh, Hoda (Supervisor)
    Abstract
    In Iranian Sign Language (ISL), alongside the movement of fingers, the movement of the lips is also essential for to perform words completely and correctly. The purpose of current study is to provide an automated lip-reading system using deep neural networks and implement it on Rasa social robot; So that the robot can recognize a limited number of specified Persian words. To do this, we propose an automated lip-reading system based on convolutional neural networks and long short-term memories. Convolutional neural networks in extracting features from images and long short-term memories in modeling temporal dynamics have achieved good results. We have also recorded a database in Persian... 

    A Novel Resource Allocation Algorithm in Edge Computing with Deep Reinforcement Learning

    , M.Sc. Thesis Sharif University of Technology Rahmati, Iman (Author) ; Movaghar, Ali (Supervisor)
    Abstract
    With the explosion of mobile smart devices, many computation intensive applications have emerged, such as interactive gaming and augmented reality. Mobile edge computing (EC) is put forward, as an extension of cloud computing, to meet the low-latency require- ments of the applications. In mobile edge computing systems, an edge node may have a high load when a large number of mobile devices offload their tasks to it. those offloaded tasks may experience large processing delay or even be dropped when their deadlines expire. Due to the uncertain load dynamics at the edge nodes, it is challenging for each device to determine its offloading decision (i.e., whether to offload or not, and which... 

    Mutation Prediction of Infectious Viruses Based on Different Machine Learning Approaches

    , M.Sc. Thesis Sharif University of Technology Ehteshami, Khashayar (Author) ; Ghafourian Ghahramani, Amir Ali (Supervisor) ; Kavousi, Kaveh (Supervisor)
    Abstract
    Predicting the evolution of viruses is vital in controlling, preventing, and treating diseases. Mutations that evade the host immune system can propagate and persist through generations, making it crucial to anticipate and combat them effectively. The 1918 H1N1 pandemic serves as an example of the devastating impact of pandemics caused by viral mutations. By predicting mutations in advance, we can identify potential future pandemics and take effective preventative measures to mitigate their impact. Proteins play a vital role in the functioning of viruses. They are involved in various processes, such as replication, transcription, and host cell invasion. Any changes in the protein sequence... 

    An AI Based Cryptocurrency Trading System

    , M.Sc. Thesis Sharif University of Technology Yasrebi, Amir Abbas (Author) ; Khayyat, Amir Ali Akbar (Supervisor)
    Abstract
    Cryptocurrencies are not only regarded as a trustworthy method of financial transaction validated by a decentralized cryptographic system as opposed to a centralized authority, but also as one of the most popular and lucrative forms of trade and investing. Predicting the price of a cryptocurrency is a challenging topic in time-series research. Its intricacy is due to the volatility and large swings of cryptocurrencies' price. The emergence of brand-new cryptocurrencies, which might present a profitable trading opportunity but lack sufficient historical data for technical analysis, prompted us to develop a trading strategy that could be applied universally. The forecast of the next timestep's... 

    Energy Management of a Residential Prosumer Employing Machine Learning Techniques

    , M.Sc. Thesis Sharif University of Technology Derakhshan Mahboob, Fatemeh (Author) ; Moeini Eghtaei, Moein (Supervisor)
    Abstract
    This thesis proposes an energy management model for a residential prosumer building which is a multi-carrier energy hub in a grid-connected mode. Power may be bought and sold between the building energy system and the power grid. Solar panels and solar water heaters are integrated into the building's energy system to create electricity and heat, which may be used in a variety of ways: used immediately to meet the home's demands, stored in batteries and heat banks for later use, or sold back to the power grid to earn income. The energy dispatch is managed such that the overall energy consumption cost is minimized, considering the variation in electricity price (grid tariff), renewable energy... 

    Speech-Driven Facial Reenactment

    , M.Sc. Thesis Sharif University of Technology Jalalifar, Ali (Author) ; Karbalaei Aghajan, Hamid (Supervisor)
    Abstract
    Creating talking heads from audio input is interesting from both scientific and practical viewpoints, e.g. constructing virtual computer generated characters, aiding hearing-impaired people, live dubbing of videos with translated audio, etc. Due to its wide variety of applications, audio to video has been the focus of intensive research in recent years. Mapping audio to facial images with accurate lip-sync is an extremely difficult task because it is a mapping form 2-Dimensional to 3-Dimensional space and also because humans are expert at detecting any out-of-sync lip movements with respect to an audio.Approaches to automatically generating natural looking speech animation usually involve... 

    Face Verification Resistant to Spoofing based on Lib Movements

    , M.Sc. Thesis Sharif University of Technology Khanehgir, Saeed (Author) ; Ghaemmaghami, Shahrokh (Supervisor) ; Eghlidos, Taraneh (Co-Supervisor)
    Abstract
    Identity verification is a key part of identity reidentification process. Nowadays, Identity reidentification using face-based algorithms are popular in learning and vision area due to their generality and accessibility of this body organ. Using a fake image, occlusions on face and appearance changes like makeup can cause distortion in face verification systems which can be a drop in function of such systems. Most of these face verification models like DeepFace, FaceNet, ArcFace and SphereFace use convolution networks as their major architecture. These models, in addition to their large storage consuming and high computational complexity, due to using face as their major feature, are not... 

    The effects of a short-term memory task on postural control of stroke patients

    , Article Topics in Stroke Rehabilitation ; Volume 22, Issue 5 , 2015 , Pages 335-341 ; 10749357 (ISSN) Mehdizadeh, H ; Taghizadeh, G ; Ghomashchi, H ; Parnianpour, M ; Khalaf, K ; Salehi, R ; Esteki, A ; Ebrahimi, I ; Sangelaji, B ; Sharif University of Technology
    Taylor and Francis Ltd  2015
    Abstract
    Background: Many studies have been conducted on the changes in the balance capabilities of stroke patients. However, results regarding the effects of dual-task activities on postural control in these patients have been variable. Objective: To evaluate the effects of a short-term memory task on the sway characteristics of stroke patients. Method: Center of pressure (COP) fluctuations were measured in three levels of postural difficulty (rigid surface with closed and open eyes and foamsurface with closed eyes), aswell as two levels of cognitive difficulty (easy and difficult). COP parameters included mean velocity, standard deviation of velocity in both medial-lateral (M.L) and... 

    Persian language modeling using recurrent neural networks

    , Article 9th International Symposium on Telecommunication, IST 2018, 17 December 2018 through 19 December 2018 ; 2019 , Pages 207-210 ; 9781538682746 (ISBN) Hosseini Saravani, H ; Bahrani, M ; Veisi, H ; Besharati, S ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    In this paper, recurrent neural networks are applied to language modeling of Persian, using word embedding as word representation. To this aim, unidirectional and bidirectional Long Short-Term Memory (LSTM) networks are used, and the perplexity of Persian language models on a 100-million-word data set is evaluated. The effect of various parameters, including number of hidden layers and size of LSTM units, on the performance of the networks in reducing the perplexity of the models are investigated. Among different LSTM language models, the best perplexity, which is equal to 59.05, is achieved from a 2-layer bidirectional LSTM model. Comparing this value with the perplexity of the classical... 

    A generalizable model for seizure prediction based on deep learning using CNN-LSTM architecture

    , Article 2018 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2018, 26 November 2018 through 29 November 2018 ; 2019 , Pages 469-473 ; 9781728112954 (ISBN) Shahbazi, M ; Aghajan, H ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    This work proposes a novel deep learning-based model for prediction of epileptic seizures using multichannel EEG signals. Multichannel images are first constructed by applying short-time Fourier transform (STFT) to Electroencephalography (EEG) signals. After a preprocessing step, a CNN-LSTM neural network is trained on the STFTs in order to capture the spectral, spatial and temporal features within and between the EEG segments and classify them as preictal or interictal stage. The proposed method achieves a sensitivity of 98.21%, a false prediction rate (FPR) of 0.13/h and a mean prediction time of 44.74 minutes on the CHB-MIT dataset. As the main contribution of this work, by using a... 

    Unsupervised anomaly detection with LSTM autoencoders using statistical data-filtering

    , Article Applied Soft Computing ; Volume 108 , 2021 ; 15684946 (ISSN) Maleki, S ; Maleki, S ; Jennings, N. R ; Sharif University of Technology
    Elsevier Ltd  2021
    Abstract
    To address one of the most challenging industry problems, we develop an enhanced training algorithm for anomaly detection in unlabelled sequential data such as time-series. We propose the outputs of a well-designed system are drawn from an unknown probability distribution, U, in normal conditions. We introduce a probability criterion based on the classical central limit theorem that allows evaluation of the likelihood that a data-point is drawn from U. This enables the labelling of the data on the fly. Non-anomalous data is passed to train a deep Long Short-Term Memory (LSTM) autoencoder that distinguishes anomalies when the reconstruction error exceeds a threshold. To illustrate our...