Loading...
Search for: reinforcement--psychology
0.003 seconds

    Individual differences in nucleus accumbens dopamine receptors predict development of addiction-like behavior: A computational approach

    , Article Neural Computation ; Volume 22, Issue 9 , 2010 , Pages 2334-2368 ; 08997667 (ISSN) Piray, P ; Keramati, M. M ; Dezfouli, A ; Lucas, C ; Mokri, A ; Sharif University of Technology
    2010
    Abstract
    Clinical and experimental observations show individual differences in the development of addiction. Increasing evidence supports the hypothesis that dopamine receptor availability in the nucleus accumbens (NAc) predisposes drug reinforcement. Here, modeling striatal-midbrain dopaminergic circuit, we propose a reinforcement learning model for addiction based on the actor-critic model of striatum. Modeling dopamine receptors in the NAc as modulators of learning rate for appetitive-but not aversive-stimuli in the critic-but not the actor-we define vulnerability to addiction as a relatively lower learning rate for the appetitive stimuli, compared to aversive stimuli, in the critic. We... 

    Speed/accuracy trade-off between the habitual and the goal-directed processes

    , Article PLoS Computational Biology ; Volume 7, Issue 5 , 2011 ; 1553734X (ISSN) Keramati, M ; Dezfouli, A ; Piray, P ; Sharif University of Technology
    2011
    Abstract
    Instrumental responses are hypothesized to be of two kinds: habitual and goal-directed, mediated by the sensorimotor and the associative cortico-basal ganglia circuits, respectively. The existence of the two heterogeneous associative learning mechanisms can be hypothesized to arise from the comparative advantages that they have at different stages of learning. In this paper, we assume that the goal-directed system is behaviourally flexible, but slow in choice selection. The habitual system, in contrast, is fast in responding, but inflexible in adapting its behavioural strategy to new conditions. Based on these assumptions and using the computational theory of reinforcement learning, we... 

    Neuromuscular control of the point to point and oscillatory movements of a sagittal arm with the actor-critic reinforcement learning method

    , Article Computer Methods in Biomechanics and Biomedical Engineering ; Volume 8, Issue 2 , 2005 , Pages 103-113 ; 10255842 (ISSN) Golkhou, V ; Parnianpour, M ; Lucas, C ; Sharif University of Technology
    2005
    Abstract
    In this study, we have used a single link system with a pair of muscles that are excited with alpha and gamma signals to achieve both point to point and oscillatory movements with variable amplitude and frequency. The system is highly nonlinear in all its physical and physiological attributes. The major physiological characteristics of this system are simultaneous activation of a pair of nonlinear musclelike- actuators for control purposes, existence of nonlinear spindle-like sensors and Golgi tendon organlike sensor, actions of gravity and external loading. Transmission delays are included in the afferent and efferent neural paths to account for a more accurate representation of the reflex... 

    Active learning of causal structures with deep reinforcement learning

    , Article Neural Networks ; Volume 154 , 2022 , Pages 22-30 ; 08936080 (ISSN) Amirinezhad, A ; Salehkaleybar, S ; Hashemi, M ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    We study the problem of experiment design to learn causal structures from interventional data. We consider an active learning setting in which the experimenter decides to intervene on one of the variables in the system in each step and uses the results of the intervention to recover further causal relationships among the variables. The goal is to fully identify the causal structures with minimum number of interventions. We present the first deep reinforcement learning based solution for the problem of experiment design. In the proposed method, we embed input graphs to vectors using a graph neural network and feed them to another neural network which outputs a variable for performing...