Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

006.31
Szepesvári, Csaba.
       Algorithms for reinforcement learning [electronic resource] / / Csaba Szepesvári. .- San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) :: Morgan & Claypool,, c2010. .- 1 electronic text (xii, 89 p. : ill.) :. digital file. ** Synthesis lectures on artificial intelligence and machine learning, * # 9 1939-4616 ; )
Q325.6 / .S942 2010 - Synthesis digital library of engineering and computer science. Synthesis lectures on artificial intelligence and machine learning, # 9. .
Part of: Synthesis digital library of engineering and computer science. Series from website.
Includes bibliographical references (p. 73-88).
Abstract freely available; full-text restricted to subscribers or individual document purchasers.
Compendex INSPEC Google scholar Google book search


Mode of access: World Wide Web.
System requirements: Adobe Acrobat Reader.
ISBN: 9781608454938 (electronic bk.)
10.2200/S00268ED1V01Y201005AIM009 doi
Subject Headings:
Reinforcement learning;--Mathematical models.
Reinforcement learning Markov Decision Processes Temporal difference learning Stochastic approximation Two-timescale stochastic approximation Monte-Carlo methods Simulation optimization Function approximation Stochastic gradient methods Least-squares methods Overfitting Bias-variance tradeoff Online learning Active learning Planning Simulation PAC-learning Q-learning Actor-critic methods Policy gradient Natural gradient
Copy Details:
Acc. No.: EBKE265, Full Call No.: , Item type: E books , Location: ,
------------------------- --------------------- ------ --------- ------- ------- --------- --------

Powered by Koha