Peter Dayan: Difference between revisions

Peter Dayan
FRS
Peter Dayan FRS
	Peter Dayan at the Royal Society admissions day in London, July 2018
Born	Peter Samuel Dayan; 1965 (age 58–59)
Alma mater	University of Cambridge (BA); University of Edinburgh (PhD)
Known for	Q-learning
Awards	Rumelhart Prize (2012); The Brain Prize (2017)
	Scientific career
Fields	Computational neuroscience; Reinforcement learning
Institutions	Max Planck Institute for Biological Cybernetics; University College London; Massachusetts Institute of Technology; Uber; University of Toronto; Salk Institute
Thesis	Reinforcing connectionism : learning the statistical way (1991)
Doctoral advisor	David Willshaw
Website	www.kyb.tuebingen.mpg.de/person/95844/251691

Browse history interactively

← Previous edit Next edit →

Content deleted Content added

VisualWikitext

Inline

Revision as of 09:18, 11 December 2019

Peter Samuel Dayan FRS is director at the Max Planck Institute for Biological Cybernetics in Tübingen, Germany. He is co-author of Theoretical Neuroscience,^[4] an influential textbook on computational neuroscience. He is known for applying bayesian methods from machine learning and artificial intelligence to understand neural function and is particularly recognized for relating neurotransmitter levels to prediction errors and Bayesian uncertainties.^[5] He has pioneered the field of reinforcement learning (RL) where he helped develop the Q-learning algorithm, and made contributions to unsupervised learning, including the wake-sleep algorithm for neural networks and the Helmholtz machine.^[6]^[7]^[8]

Education

Dayan studied mathematics at the University of Cambridge and then continued for a PhD in artificial intelligence at the University of Edinburgh School of Informatics on statistical learning ^[9] supervised by David Willshaw and David Wallace, focusing on associative memory and reinforcement learning.^[9]

Career and research

After his PhD, Dayan held postdoctoral research positions with Terry Sejnowski at the Salk Institute and Geoffrey Hinton at the University of Toronto. He then took up an assistant professor position at the Massachusetts Institute of Technology (MIT), and moved to the Gatsby Charitable Foundation computational neuroscience unit at University College London (UCL) in 1998, becoming professor and director in 2002.^[10] In September 2018, the Max Planck Society announced his appointment as a director at the Max Planck Institute for Biological Cybernetics.^[11]

Awards and honours

Dayan was elected a Fellow of the Royal Society (FRS) in 2018.^[12] He was awarded the Rumelhart Prize in 2012 and The Brain Prize in 2017.^[12]

References

^ Ghahramani, Zoubin (2017). "Welcoming Peter Dayan to Uber AI Labs". uber.com. Archived from the original on 15 March 2018.
^ Shead, Sam (2018). "Elon Musk Signed A 350-Year-Old Book With DeepMind's Demis Hassabis". forbes.com. Retrieved 9 February 2019.
^ Kumaran, Dharshan; Banino, Andrea; Blundell, Charles; Hassabis, Demis; Dayan, Peter (2016). "Computations Underlying Social Hierarchy Learning: Distinct Neural Mechanisms for Updating and Representing Self-Relevant Information". Neuron. 92 (5): 1135–1147. doi:10.1016/j.neuron.2016.10.052. ISSN 0896-6273. PMC 5158095. PMID 27930904.
^ Dayan, Peter; Abbott, Laurence (2014). Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems. Cambridge: MIT Press. ISBN 9780262541855. OCLC 952504127.
^ Schultz, W.; Dayan, P.; Montague, P. R. (1997). "A Neural Substrate of Prediction and Reward" (PDF). Science. 275 (5306): 1593–1599. doi:10.1126/science.275.5306.1593. ISSN 0036-8075. PMID 9054347.
^ Watkins, Christopher J. C. H.; Dayan, Peter (1992). "Q-learning". Machine Learning. 8 (3–4): 279–292. doi:10.1007/BF00992698. ISSN 0885-6125.
^ Dayan, Peter (1992). "The convergence of TD (λ) for general λ". Machine Learning. 8 (3/4): 341–362. doi:10.1023/A:1022632907294. ISSN 0885-6125.
^ Peter, Dayan; Hinton, Geoffrey E.; Neal, Radford M.; Zemel, Richard S. (1995). "The helmholtz machine". Neural Computation. 7 (5): 889–904. doi:10.1162/neco.1995.7.5.889. PMID 7584891.
^ ^a ^b Dayan, Peter Samuel (1991). Reinforcing connectionism: learning the statistical way (PhD thesis). hdl:1842/14754. EThOS uk.bl.ethos.649240.
^ "Peter Dayan". gatsby.ucl.ac.uk. Archived from the original on 25 March 2019.
^ Anon (2018). "Peter Dayan and Li Zhaoping appointed to the Max Planck Institute for Biological Cybernetics". mpg.de. Archived from the original on 3 April 2019. Retrieved 2 October 2018.
^ ^a ^b Anon (2018). "Professor Peter Dayan FRS". royalsociety.org. London: Royal Society. Retrieved 22 May 2018. One or more of the preceding sentences incorporates text from the royalsociety.org website where:
“All text published under the heading 'Biography' on Fellow profile pages is available under Creative Commons Attribution 4.0 International License.” --Royal Society Terms, conditions and policies at the Wayback Machine (archived 2016-11-11)

This article incorporates text available under the CC BY 4.0 license.

This biography of an academic is a stub. You can help Wikipedia by expanding it.

This article about a neuroscientist is a stub. You can help Wikipedia by expanding it.

[uber-1] Ghahramani, Zoubin (2017). "Welcoming Peter Dayan to Uber AI Labs". uber.com. Archived from the original on 15 March 2018.

[2] Shead, Sam (2018). "Elon Musk Signed A 350-Year-Old Book With DeepMind's Demis Hassabis". forbes.com. Retrieved 9 February 2019.

[KumaranBanino2016-3] Kumaran, Dharshan; Banino, Andrea; Blundell, Charles; Hassabis, Demis; Dayan, Peter (2016). "Computations Underlying Social Hierarchy Learning: Distinct Neural Mechanisms for Updating and Representing Self-Relevant Information". Neuron. 92 (5): 1135–1147. doi:10.1016/j.neuron.2016.10.052. ISSN 0896-6273. PMC 5158095. PMID 27930904.

[4] Dayan, Peter; Abbott, Laurence (2014). Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems. Cambridge: MIT Press. ISBN 9780262541855. OCLC 952504127.

[SchultzDayan1997-5] Schultz, W.; Dayan, P.; Montague, P. R. (1997). "A Neural Substrate of Prediction and Reward" (PDF). Science. 275 (5306): 1593–1599. doi:10.1126/science.275.5306.1593. ISSN 0036-8075. PMID 9054347.

[q-6] Watkins, Christopher J. C. H.; Dayan, Peter (1992). "Q-learning". Machine Learning. 8 (3–4): 279–292. doi:10.1007/BF00992698. ISSN 0885-6125.

[Dayan1992-7] Dayan, Peter (1992). "The convergence of TD (λ) for general λ". Machine Learning. 8 (3/4): 341–362. doi:10.1023/A:1022632907294. ISSN 0885-6125.

[8] Peter, Dayan; Hinton, Geoffrey E.; Neal, Radford M.; Zemel, Richard S. (1995). "The helmholtz machine". Neural Computation. 7 (5): 889–904. doi:10.1162/neco.1995.7.5.889. PMID 7584891.

[phd-9] Dayan, Peter Samuel (1991). Reinforcing connectionism: learning the statistical way (PhD thesis). hdl:1842/14754. EThOS uk.bl.ethos.649240.

[10] "Peter Dayan". gatsby.ucl.ac.uk. Archived from the original on 25 March 2019.

[11] Anon (2018). "Peter Dayan and Li Zhaoping appointed to the Max Planck Institute for Biological Cybernetics". mpg.de. Archived from the original on 3 April 2019. Retrieved 2 October 2018.

[frs-12] Anon (2018). "Professor Peter Dayan FRS". royalsociety.org. London: Royal Society. Retrieved 22 May 2018. One or more of the preceding sentences incorporates text from the royalsociety.org website where:
“All text published under the heading 'Biography' on Fellow profile pages is available under Creative Commons Attribution 4.0 International License.” --Royal Society Terms, conditions and policies at the Wayback Machine (archived 2016-11-11)

[1]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

@@ Line 48: / Line 48: @@
 | footnotes         =
 }}
-'''Peter Samuel Dayan''' {{post-nominals|country=GBR|FRS}} is director at the [[Max Planck Institute for Biological Cybernetics]] in [[Tübingen]], Germany. He is co-author of ''Theoretical Neuroscience'',<ref>{{Cite book|url=https://www.worldcat.org/title/theoretical-neuroscience-computational-and-mathematical-modeling-of-neural-systems/oclc/952504127|title=Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems|last=Dayan|first=Peter|last2=Abbott|first2=Laurence|date=2014|publisher=MIT Press|isbn=9780262541855|location=Cambridge|language=English}}</ref> an influential textbook on [[computational neuroscience]]. He is known for applying [[bayesian method]]s from [[machine learning]] and [[artificial intelligence]] to understand neural function and is particularly recognized for relating [[neurotransmitter]] levels to prediction errors and Bayesian uncertainties.<ref name="SchultzDayan1997">{{cite journal|last1=Schultz|first1=W.|last2=Dayan|first2=P.|last3=Montague|first3=P. R.|title=A Neural Substrate of Prediction and Reward|journal=Science|volume=275|issue=5306|year=1997|pages=1593–1599|issn=0036-8075|doi=10.1126/science.275.5306.1593|pmid=9054347|url=http://www.cs.utexas.edu/~dana/Reward.pdf}} {{closed access}}</ref> He has pioneered the field of [[reinforcement learning]] (RL) where he helped develop the [[Q-learning]] algorithm, and made contributions to [[unsupervised learning]], including the [[wake-sleep algorithm]] for [[neural network]]s and the [[Helmholtz machine]].<ref name=q>{{cite journal|last1=Watkins|first1=Christopher J. C. H.|last2=Dayan|first2=Peter|title=Q-learning|journal=Machine Learning|volume=8|issue=3-4|year=1992|pages=279–292|issn=0885-6125|doi=10.1007/BF00992698}}</ref><ref name="Dayan1992">{{cite journal|last1=Dayan|first1=Peter|journal=Machine Learning|title=The convergence of TD (λ) for general λ|volume=8|issue=3/4|year=1992|pages=341–362|issn=08856125|doi=10.1023/A:1022632907294}}</ref><ref>{{Cite journal|title = The helmholtz machine.|journal = Neural computation|date = 1995|pages = 889-904|volume = 7|first = Dayan|last = Peter|authorlink1=Peter Dayan|first2 = Geoffrey E.|last2 = Hinton|authorlink2=Geoffrey Hinton|first3 = Radford M.|last3 = Neal|authorlink3=Radford M. Neal|first4 = Richard S.|last4 = Zemel|authorlink4=Richard Zemel}} {{closed access}}</ref>
+'''Peter Samuel Dayan''' {{post-nominals|country=GBR|FRS}} is director at the [[Max Planck Institute for Biological Cybernetics]] in [[Tübingen]], Germany. He is co-author of ''Theoretical Neuroscience'',<ref>{{Cite book|title=Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems|last=Dayan|first=Peter|last2=Abbott|first2=Laurence|date=2014|publisher=MIT Press|isbn=9780262541855|location=Cambridge|language=English|oclc = 952504127}}</ref> an influential textbook on [[computational neuroscience]]. He is known for applying [[bayesian method]]s from [[machine learning]] and [[artificial intelligence]] to understand neural function and is particularly recognized for relating [[neurotransmitter]] levels to prediction errors and Bayesian uncertainties.<ref name="SchultzDayan1997">{{cite journal|last1=Schultz|first1=W.|last2=Dayan|first2=P.|last3=Montague|first3=P. R.|title=A Neural Substrate of Prediction and Reward|journal=Science|volume=275|issue=5306|year=1997|pages=1593–1599|issn=0036-8075|doi=10.1126/science.275.5306.1593|pmid=9054347|url=http://www.cs.utexas.edu/~dana/Reward.pdf}} {{closed access}}</ref> He has pioneered the field of [[reinforcement learning]] (RL) where he helped develop the [[Q-learning]] algorithm, and made contributions to [[unsupervised learning]], including the [[wake-sleep algorithm]] for [[neural network]]s and the [[Helmholtz machine]].<ref name=q>{{cite journal|last1=Watkins|first1=Christopher J. C. H.|last2=Dayan|first2=Peter|title=Q-learning|journal=Machine Learning|volume=8|issue=3–4|year=1992|pages=279–292|issn=0885-6125|doi=10.1007/BF00992698}}</ref><ref name="Dayan1992">{{cite journal|last1=Dayan|first1=Peter|journal=Machine Learning|title=The convergence of TD (λ) for general λ|volume=8|issue=3/4|year=1992|pages=341–362|issn=08856125|doi=10.1023/A:1022632907294}}</ref><ref>{{Cite journal|title = The helmholtz machine.|journal = Neural Computation|date = 1995|pages = 889–904|volume = 7|issue = 5|first = Dayan|last = Peter|authorlink1=Peter Dayan|first2 = Geoffrey E.|last2 = Hinton|authorlink2=Geoffrey Hinton|first3 = Radford M.|last3 = Neal|authorlink3=Radford M. Neal|first4 = Richard S.|last4 = Zemel|authorlink4=Richard Zemel|doi = 10.1162/neco.1995.7.5.889|pmid = 7584891}} {{closed access}}</ref>
 ==Education==
-Dayan studied mathematics at the [[University of Cambridge]] and then continued for a [[PhD]] in [[artificial intelligence]] at the [[University of Edinburgh School of Informatics]] on statistical learning <ref name=phd>{{Cite thesis|last=Dayan|first=Peter Samuel|degree=PhD|date=1991|title=Reinforcing connectionism: learning the statistical way|url=https://www.era.lib.ed.ac.uk/handle/1842/14754|website=lib.ed.ac.uk|hdl=1842/14754|language=en|id={{EThOS|uk.bl.ethos.649240}}}} {{free access}}</ref> supervised by [[d:Q54978760|David Willshaw]] and [[David Wallace (physicist)|David Wallace]], focusing on [[Association (psychology)|associative]] [[memory]] and [[reinforcement learning]].<ref name=phd/>
+Dayan studied mathematics at the [[University of Cambridge]] and then continued for a [[PhD]] in [[artificial intelligence]] at the [[University of Edinburgh School of Informatics]] on statistical learning <ref name=phd>{{Cite thesis|last=Dayan|first=Peter Samuel|degree=PhD|date=1991|title=Reinforcing connectionism: learning the statistical way|hdl=1842/14754|language=en|id={{EThOS|uk.bl.ethos.649240}}}} {{free access}}</ref> supervised by [[d:Q54978760|David Willshaw]] and [[David Wallace (physicist)|David Wallace]], focusing on [[Association (psychology)|associative]] [[memory]] and [[reinforcement learning]].<ref name=phd/>
 ==Career and research==

v t e Fellows of the Royal Society elected in 2018
Fellows	Jim Al-Khalili Polly Arnold Jillian Banfield Margaret Brimble Neil Brockdorff Frank Caruso Vincenzo Cerundolo Kevin Costello Robert Crabtree Philip Dawid Peter Dayan Richard Dixon Gregory Edgecombe Wenfei Fan Roger Goody Robin Grimes Gregory Hannon Demis Hassabis Judy Hirst Graeme Jameson Harren Jhoti Sophien Kamoun Andrew King Dimitri Kullmann Dominic Kwiatkowski Richard Marais Cathie Martin Elon Musk Peter O'Hearn Vassilis Pachnis Tracy Palmer Colin Prentice Lalita Ramakrishnan Nancy Reid Graham Richards David Richardson Sheila Rowan Ingrid Scheffer Michelle Simmons John Smol Timothy Softley John Speakman Graeme Stephens Angela Strank Charles Swanton Peter Visscher Guy Wilkinson Geordie Williamson Daniel Wise Nikolay Zheludev
Honorary	David Willetts (Baron Willetts of Havant)
Foreign	Carolyn Bertozzi Martin Chalfie Sebsebe Demissew Jeffrey Friedman Fabiola Gianotti Albrecht Hofmann Butler Lampson Tullio Pozzan Joachim Sauer Adi Shamir

Authority control databases
International	ISNI VIAF WorldCat
National	Germany United States France BnF data Netherlands Poland Israel Catalonia
Academics	ORCID Mathematics Genealogy Project Scopus zbMATH DBLP MathSciNet
Other	IdRef