JN Watch the video to learn how APS reaches out to developing nations.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
 QUICK SEARCH:   [advanced]


     


J Neurophysiol (October 10, 2007). doi:10.1152/jn.00364.2007
This Article
Right arrow Full Text (PDF)
Right arrow Supplemental Figures
Right arrow All Versions of this Article:
98/6/3648    most recent
00364.2007v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Farries, M. A.
Right arrow Articles by Fairhall, A. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Farries, M. A.
Right arrow Articles by Fairhall, A. L.
Submitted on April 2, 2007
Accepted on October 9, 2007

Reinforcement Learning with Modulated Spike Timing-Dependent Synaptic Plasticity

Michael Alan Farries1* and Adrienne L. Fairhall2

1 Biology, University of Texas San Antonio, San Antonio, Texas, United States
2 Physiology and Biophysics, University of Washington, Seattle, Washington, United States

* To whom correspondence should be addressed. E-mail: michael.farries{at}utsa.edu.

Spike timing-dependent synaptic plasticity (STDP) has emerged as the preferred framework linking patterns of pre- and postsynaptic activity to changes in synaptic strength. Although synaptic plasticity is widely believed to be a major component of learning, it is unclear how STDP itself could serve as a mechanism for general purpose learning. On the other hand, algorithms for reinforcement learning work on a wide variety of problems, but lack an experimentally established neural implementation. Here, we combine these paradigms in a novel model in which a modified version of STDP achieves reinforcement learning. We build this model in stages, identifying a minimal set of conditions needed to make it work. Using a performance-modulated modification of STDP in a two-layer feedforward network, we can train output neurons to generate arbitrarily selected spike trains or population responses. Furthermore, a given network can learn distinct responses to several different input patterns. We also describe in detail how this model might be implemented biologically. Thus, our model offers a novel and biologically plausible implementation of reinforcement learning that is capable of training a neural population to produce a very wide range of possible mappings between synaptic input and spiking output.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Visit Other APS Journals Online
Copyright © 2007 by the The American Physiological Society.