Loading...
Search for: learning-algorithm
0.017 seconds
Total 227 records

    Subcutaneous insulin administration by deep reinforcement learning for blood glucose level control of type-2 diabetic patients

    , Article Computers in Biology and Medicine ; Volume 148 , 2022 ; 00104825 (ISSN) Raheb, M. A ; Niazmand, V. R ; Eqra, N ; Vatankhah, R ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    Background: Type-2 diabetes mellitus is characterized by insulin resistance and impaired insulin secretion in the human body. Many endeavors have been made in terms of controlling and reducing blood glucose via the medium of automated controlling tools to increase precision and efficiency and reduce human error. Recently, reinforcement learning algorithms are proved to be powerful in the field of intelligent control, which was the motivation for the current study. Methods: For the first time, a reinforcement algorithm called normalized advantage function (NAF) algorithm has been applied as a model-free reinforcement learning method to regulate the blood glucose level of type-2 diabetic... 

    Fast multidimensional dictionary learning algorithms and their application in 3D inverse synthetic aperture radar image restoration and noise reduction

    , Article IET Radar, Sonar and Navigation ; Volume 16, Issue 9 , 2022 , Pages 1484-1502 ; 17518784 (ISSN) Mehrpooya, A ; Nazari, M ; Abbasi, Z ; Karbasi, S. M ; Nayebi, M. M ; Bastani, M. H ; Sharif University of Technology
    John Wiley and Sons Inc  2022
    Abstract
    By generalising dictionary learning (DL) algorithms to multidimensional (MD) mode and using them in applications where signals are inherently multidimensional, such as in three-dimensional (3D) inverse synthetic aperture radar (ISAR) imaging, it is possible to achieve much higher speed and less computational complexity. In this study, the formulation of the multidimensional dictionary learning (MDDL) problem is expressed and two algorithms are proposed to solve it. The first one is based on the method of optimum directions (MOD) algorithm for 1D dictionary learning (1DDL), which uses alternating minimisation and gradient projection approach. As the MDDL problem is non-convex, the second... 

    Angle-incremental range estimation for FDA-MIMO radar via hybrid sparse learning

    , Article Digital Signal Processing: A Review Journal ; Volume 130 , 2022 ; 10512004 (ISSN) Karbasi, S. M ; Sharif University of Technology
    Elsevier Inc  2022
    Abstract
    In this paper, a target parameter estimation problem is addressed for the recently emerging frequency diverse array multiple-input multiple-output (FDA-MIMO) radar system, utilizing sparse learning. The scene is modeled as a two dimensional (2D) angle-incremental range grid. To solve the resulting sparse problem, the recently proposed user-parameter free algorithms including block sparse learning via iterative minimization (BSLIM), iterative adaptive approach (IAA), sparse iterative covariance-based estimation (SPICE), likelihood-based estimation of sparse parameters (LIKES), and orthogonal matching pursuit (OMP) are applied which achieve excellent parameter estimation performance. However,... 

    Active learning of causal structures with deep reinforcement learning

    , Article Neural Networks ; Volume 154 , 2022 , Pages 22-30 ; 08936080 (ISSN) Amirinezhad, A ; Salehkaleybar, S ; Hashemi, M ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    We study the problem of experiment design to learn causal structures from interventional data. We consider an active learning setting in which the experimenter decides to intervene on one of the variables in the system in each step and uses the results of the intervention to recover further causal relationships among the variables. The goal is to fully identify the causal structures with minimum number of interventions. We present the first deep reinforcement learning based solution for the problem of experiment design. In the proposed method, we embed input graphs to vectors using a graph neural network and feed them to another neural network which outputs a variable for performing... 

    Quick generation of SSD performance models using machine learning

    , Article IEEE Transactions on Emerging Topics in Computing ; Volume 10, Issue 4 , 2022 , Pages 1821-1836 ; 21686750 (ISSN) Tarihi, M ; Azadvar, S ; Tavakkol, A ; Asadi, H ; Sarbazi Azad, H ; Sharif University of Technology
    IEEE Computer Society  2022
    Abstract
    Increasing usage of Solid-State Drives (SSDs) has greatly boosted the performance of storage backends. SSDs perform many internal processes such as out-of-place writes, wear-leveling, and garbage collection. These operations are complex and not well documented which make it difficult to create accurate SSD simulators. Our survey indicates that aside from complex configuration, available SSD simulators do not support both sync and discard requests. Past performance models also ignore the long term effect of I/O requests on SSD performance, which has been demonstrated to be significant. In this article, we utilize a methodology based on machine learning that extracts history-aware features at... 

    A constrained multi-item EOQ inventory model for reusable items: Reinforcement learning-based differential evolution and particle swarm optimization

    , Article Expert Systems with Applications ; Volume 207 , 2022 ; 09574174 (ISSN) Fallahi, A ; Amani Bani, E ; Akhavan Niaki, S. T ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    The growing environmental concerns, governmental regulations, and significant cost savings are the primary motivations for companies to consider the reuse and recovery of products in their inventory system. The previous research ignored several realistic features of reusable items inventory systems, such as the presence of multiple products and operational constraints. For the first time, this paper presents a new multiproduct economic order quantity inventory model for an inventory system of reusable products. The goal of the model is to determine the optimal replenishment quantity and reuse quantity of each item so that the system's total cost is minimized. Several operational constraints... 

    Comparison of classic classifiers, metaheuristic algorithms and convolutional neural networks in hyperspectral classification of nitrogen treatment in tomato leaves

    , Article Remote Sensing ; Volume 14, Issue 24 , 2022 ; 20724292 (ISSN) Benmouna, B ; Pourdarbani, R ; Sabzi, S ; Fernandez Beltran, R ; García-Mateos, G ; Molina Martínez, J. M ; Sharif University of Technology
    MDPI  2022
    Abstract
    Tomato is an agricultural product of great economic importance because it is one of the most consumed vegetables in the world. The most crucial chemical element for the growth and development of tomato is nitrogen (N). However, incorrect nitrogen usage can alter the quality of tomato fruit, rendering it undesirable to customers. Therefore, the goal of the current study is to investigate the early detection of excess nitrogen application in the leaves of the Royal tomato variety using a non-destructive hyperspectral imaging system. Hyperspectral information in the leaf images at different wavelengths of 400–1100 nm was studied; they were taken from different treatments with normal nitrogen... 

    Graph centrality algorithms for hardware trojan detection at gate-level netlists

    , Article International Journal of Engineering, Transactions A: Basics ; Volume 35, Issue 7 , 2022 , Pages 1375-1387 ; 17281431 (ISSN) Hashemi, M ; Momeni, A ; Pashrashid, A ; Mohammadi, S ; Sharif University of Technology
    Materials and Energy Research Center  2022
    Abstract
    The rapid growth in the supply chain of electronic devices has led companies to purchase Intellectual Property or Integrated Circuits from unreliable sources. This dispersion in the design to fabrication stages of IP/IC has led to new attacks called hardware Trojans. Hardware Trojans can bargain information, reduce performance, or cause failure. Various methods have been introduced to detect or prevent hardware Trojans. Machine learning methods are one of these. Selecting the type and number of input variables in the learning algorithm has an important role in the performance of the learning model. Some previous hardware Trojan detection studies have used structural gate-level features to... 

    Joint topology learning and graph signal recovery using variational bayes in Non-gaussian noise

    , Article IEEE Transactions on Circuits and Systems II: Express Briefs ; Volume 69, Issue 3 , 2022 , Pages 1887-1891 ; 15497747 (ISSN) Torkamani, R ; Zayyani, H ; Marvasti, F ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2022
    Abstract
    This brief proposes a joint graph signal recovery and topology learning algorithm using a Variational Bayes (VB) framework in the case of non-Gaussian measurement noise. It is assumed that the graph signal is Gaussian Markov Random Field (GMRF) and the graph weights are considered statistical with the Gaussian prior. Moreover, the non-Gaussian noise is modeled using two distributions: Mixture of Gaussian (MoG), and Laplace. All the unknowns of the problem which are graph signal, Laplacian matrix, and the (Hyper)parameters are estimated by a VB framework. All the posteriors are calculated in closed forms and the iterative VB algorithm is devised to solve the problem. The efficiency of the... 

    Predicting the objective and priority of issue reports in software repositories

    , Article Empirical Software Engineering ; Volume 27, Issue 2 , 2022 ; 13823256 (ISSN) Izadi, M ; Akbari, K ; Heydarnoori, A ; Sharif University of Technology
    Springer  2022
    Abstract
    Software repositories such as GitHub host a large number of software entities. Developers collaboratively discuss, implement, use, and share these entities. Proper documentation plays an important role in successful software management and maintenance. Users exploit Issue Tracking Systems, a facility of software repositories, to keep track of issue reports, to manage the workload and processes, and finally, to document the highlight of their team’s effort. An issue report is a rich source of collaboratively-curated software knowledge, and can contain a reported problem, a request for new features, or merely a question about the software product. As the number of these issues increases, it... 

    Model-free LQR design by Q-function learning

    , Article Automatica ; Volume 137 , 2022 ; 00051098 (ISSN) Farjadnasab, M ; Babazadeh, M ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    Reinforcement learning methods such as Q-learning have shown promising results in the model-free design of linear quadratic regulator (LQR) controllers for linear time-invariant (LTI) systems. However, challenges such as sample-efficiency, sensitivity to hyper-parameters, and compatibility with classical control paradigms limit the integration of such algorithms in critical control applications. This paper aims to take some steps towards bridging the well-known classical control requirements and learning algorithms by using optimization frameworks and properties of conic constraints. Accordingly, a new off-policy model-free approach is proposed for learning the Q-function and designing the... 

    An integrated human stress detection sensor using supervised algorithms

    , Article IEEE Sensors Journal ; Volume 22, Issue 8 , 2022 , Pages 8216-8223 ; 1530437X (ISSN) Mohammadi, A ; Fakharzadeh, M ; Baraeinejad, B ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2022
    Abstract
    This paper adopts a holistic approach to stress detection issues in software and hardware phases and aims to develop and evaluate a specific low-power and low-cost sensor using physiological signals. First, a stress detection model is presented using a public data set, where four types of signals, temperature, respiration, electrocardiogram (ECG), and electrodermal activity (EDA), are processed to extract 65 features. Using Kruskal-Wallis analysis, it is shown that 43 out of 65 features demonstrate a significant difference between stress and relaxed states. K nearest neighbor (KNN) algorithm is implemented to distinguish these states, which yields a classification accuracy of 96.0 ± 2.4%. It... 

    Demand forecasting based machine learning algorithms on customer information: an applied approach

    , Article International Journal of Information Technology (Singapore) ; Volume 14, Issue 4 , 2022 , Pages 1937-1947 ; 25112104 (ISSN) Zohdi, M ; Rafiee, M ; Kayvanfar, V ; Salamiraad, A ; Sharif University of Technology
    Springer Science and Business Media B.V  2022
    Abstract
    Demand forecasting has always been a concern for business owners as one of the main activities in supply chain management. Unlike the past, that forecasting was done with the help of a limited amount of information, today, using advanced technologies and data analytics, forecasting is performed with machine learning algorithms and data-driven methods. Patterns and trends of demand, customer information, preferences, suggestions, and post-consumption feedbacks are some types of data that are used in various demand forecasting efforts. Traditional statistical methods and techniques are biased in demand prediction and are not accurate; so, machine learning algorithms as more popular techniques... 

    Performance evaluation of slag-based concrete at elevated temperatures by a novel machine learning approach

    , Article Construction and Building Materials ; Volume 358 , 2022 ; 09500618 (ISSN) Toufigh, V ; Palizi, S ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    Ground granulated blast furnace slag is a sustainable material and supplementary for cement in the concrete industry. Different behavioral aspects must be assessed to achieve reliable sustainable materials, including post-fire mechanical properties. One robust tool is the machine learning approach to train prediction models. This study proposes a novel machine learning algorithm, hybrid support vector regression and dolphin echolocation algorithm (SVR-DE), to predict the post-fire compressive strength ratio of slag-based concrete. In this regard, SVR hyper-parameters were tuned by the DE optimization algorithm. Four kernel functions were implemented in SVR formulation: linear, sigmoid,... 

    Intelligent flight-data-recorders; a step toward a new generation of learning aircraft

    , Article 8th International Conference on Control, Decision and Information Technologies, CoDIT 2022, 17 May 2022 through 20 May 2022 ; 2022 , Pages 1545-1549 ; 9781665496070 (ISBN) Malaek, S. M ; Alipour, E ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2022
    Abstract
    To understand how aerial accidents occur, the installation of flight data recorders (FDR) and cockpit voice recorders (CVR) have become mandatory by law. However, such devices play a passive role and are used once an accident has occurred. Recent advances in machine learning techniques and their application in solving engineering problems are the keys to creating a more active role for both FDR as well as CVR. Here, we investigate a new approach to bringing intelligence to an FDR (called I-FDR). Through a continuous data-mining process, an I-FDR could bring better situational awareness to the flight crew. An I-FDR, similar to the FDR, records all pertinent flying parameters. In addition,... 

    Computation offloading strategy for autonomous vehicles

    , Article 27th International Computer Conference, Computer Society of Iran, CSICC 2022, 23 February 2022 through 24 February 2022 ; 2022 ; 9781665480277 (ISBN) Farimani, M. K ; Karimian Aliabadi, S ; Entezari Maleki, R ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2022
    Abstract
    Vehicular edge computing is a progressing technology which provides processing resources to the internet of vehicles using the edge servers deployed at roadside units. Vehicles take advantage by offloading their computationintensive tasks to this infrastructure. However, concerning time-sensitive applications and the high mobility of vehicles, cost-efficient task offloading is still a challenge. This paper establishes a computation offloading strategy based on deep Q-learning algorithm for vehicular edge computing networks. To jointly minimize the system cost including offloading failure rate and the total energy consumption of the offloading process, the vehicle tasks offloading problem is... 

    Adaptive actuator failure compensation on the basis of contraction metrics

    , Article IEEE Control Systems Letters ; Volume 6 , 2022 , Pages 1376-1381 ; 24751456 (ISSN) Boveiri, M ; Tavazoei, M. S ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2022
    Abstract
    This letter develops an adaptive actuator failure compensation method for nonlinear systems with unmatched parametric uncertainty based on contraction metrics. The proposed method, which is constructed by benefiting from the recent achievements on contraction metrics based adaptive control techniques, ensures the closed-loop stability and asymptotic tracking of the desired trajectory in the presence of actuator failures. In particular, a sufficient convex condition is derived for constructing a valid metric, by which a quadratic program-based controller is obtained to determine the inputs of the actuators. The introduced method is more general than the common adaptive actuator failure... 

    A multi-agent deep reinforcement learning framework for algorithmic trading in financial markets

    , Article Expert Systems with Applications ; Volume 208 , 2022 ; 09574174 (ISSN) Shavandi, A ; Khedmati, M ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    Algorithmic trading based on machine learning is a developing and promising field of research. Financial markets have a complex, uncertain, and dynamic nature, making them challenging for trading. Some financial theories, such as the fractal market hypothesis, believe that the markets behave based on the collective psychology of investors who trade with different investment horizons and interpretations of information. Accordingly, a multi-agent deep reinforcement learning framework is proposed in this paper to trade on the collective intelligence of multiple agents, each of which is an expert trader on a specific timeframe. The proposed framework works in a hierarchical structure in which... 

    WLFS: Weighted label fusion learning framework for glioma tumor segmentation in brain MRI

    , Article Biomedical Signal Processing and Control ; Volume 68 , 2021 ; 17468094 (ISSN) Barzegar, Z ; Jamzad, M ; Sharif University of Technology
    Elsevier Ltd  2021
    Abstract
    Glioma is a common type of tumor that develops in the brain. Due to many differences in the shape and appearance, accurate segmentation of glioma for identifying all parts of the tumor and its surrounding tissues in cancer detection is a challenging task in cancer detection. In recent researches, the combination of atlas-based segmentation and machine learning methods have presented superior performance over other automatic brain MRI segmentation algorithms. To overcome the side effects of limited existing information on atlas-based segmentation, and the long training and the time consuming phase of learning methods, we proposed a semi-supervised learning framework by introducing a... 

    Mapping the spatiotemporal variability of salinity in the hypersaline Lake Urmia using Sentinel-2 and Landsat-8 imagery

    , Article Journal of Hydrology ; Volume 595 , 2021 ; 00221694 (ISSN) Bayati, M ; Danesh Yazdi, M ; Sharif University of Technology
    Elsevier B.V  2021
    Abstract
    The spatiotemporal dynamic of salinity concentration (SC) in saline lakes is strongly dependent on the rate of water flow into the lake, water circulation, wind speed, evaporation rate, and the phenomenon of salt precipitation and dissolution. Although in-situ observations most reliably quantify water quality metrics, the spatiotemporal distribution of such data are typically limited and cannot be readily extrapolated for either long-term projections or extensive areas. Alternatively, remotely sensed imagery has facilitated less expensive and a stronger ability to estimate water quality over a wide range of spatiotemporal resolutions. This study introduces an adaptive learning model that...