Mathematical foundations of reinforcement learning (Record no. 200032)

MARC details
000 -LEADER
fixed length control field 02440nam a22002657a 4500
003 - CONTROL NUMBER IDENTIFIER
control field OSt
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20260603121952.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 260603b |||||||| |||| 00| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
ISBN 9789819739462
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title eng
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 006.31 ZHA-M
100 ## - MAIN ENTRY--AUTHOR NAME
Personal name Zhao, Shiyu.
245 ## - TITLE STATEMENT
Title Mathematical foundations of reinforcement learning
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication China:
Name of publisher Jsinghua University Press,
Year of publication 2025
300 ## - PHYSICAL DESCRIPTION
Number of Pages xvi, 275p.
505 ## - FORMATTED CONTENTS NOTE
Formatted contents note Front Matter<br/>Pages i-xvi<br/><br/>Basic Concepts<br/>Shiyu Zhao<br/>Pages 1-13<br/>State Values and Bellman Equation<br/>Shiyu Zhao<br/>Pages 15-34<br/>Optimal State Values and Bellman Optimality Equation<br/>Shiyu Zhao<br/>Pages 35-55<br/>Value Iteration and Policy Iteration<br/>Shiyu Zhao<br/>Pages 57-76<br/>Monte Carlo Methods<br/>Shiyu Zhao<br/>Pages 77-99<br/>Stochastic Approximation<br/>Shiyu Zhao<br/>Pages 101-124<br/>Temporal-Difference Methods<br/>Shiyu Zhao<br/>Pages 125-150<br/>Value Function Methods<br/>Shiyu Zhao<br/>Pages 151-189<br/>Policy Gradient Methods<br/>Shiyu Zhao<br/>Pages 191-214<br/>Actor-Critic Methods<br/>Shiyu Zhao<br/>Pages 215-236<br/>Back Matter<br/>Pages 237-275
520 ## - SUMMARY, ETC.
Summary, etc This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability.<br/><br/>The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods.<br/><br/>With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term Artificial intelligence
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term Reinforcement learning
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term Machine learning.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term Machine learning
General subdivision algorithm
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://doi.org/10.1007/978-981-97-3944-8
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type Books and Monographs
Holdings
Full call number Accession Number Koha item type Lost status Damaged status Permanent Location Current Location Shelving location Date acquired Source of acquisition
006.31 ZHA-M 102891 Books and Monographs     Central Library, NIT Jalandhar Central Library, NIT Jalandhar General Stacks 20.05.2026 Delhi, Narendra Publishing House
006.31 ZHA-M 102892 Books and Monographs     Central Library, NIT Jalandhar Central Library, NIT Jalandhar General Stacks 20.05.2026 Delhi, Narendra Publishing House
Dr. Sanjeev, Librarian
Managed by: Dr. D. P. Tripathi, Deputy Librarian, Central Library
For any query / question, please mail at circulation.liby@nitj.ac.in 

Powered by Koha