Normal view MARC view ISBD view

Mathematical foundations of reinforcement learning (Record no. 200032)

MARC details
000 -LEADER
fixed length control field	02440nam a22002657a 4500
003 - CONTROL NUMBER IDENTIFIER
control field	OSt
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20260603121952.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	260603b \|\|\|\|\|\|\|\| \|\|\|\| 00\| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
ISBN	9789819739462
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title	eng
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.31 ZHA-M
100 ## - MAIN ENTRY--AUTHOR NAME
Personal name	Zhao, Shiyu.
245 ## - TITLE STATEMENT
Title	Mathematical foundations of reinforcement learning
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication	China:
Name of publisher	Jsinghua University Press,
Year of publication	2025
300 ## - PHYSICAL DESCRIPTION
Number of Pages	xvi, 275p.
505 ## - FORMATTED CONTENTS NOTE
Formatted contents note	Front Matter<br/>Pages i-xvi<br/><br/>Basic Concepts<br/>Shiyu Zhao<br/>Pages 1-13<br/>State Values and Bellman Equation<br/>Shiyu Zhao<br/>Pages 15-34<br/>Optimal State Values and Bellman Optimality Equation<br/>Shiyu Zhao<br/>Pages 35-55<br/>Value Iteration and Policy Iteration<br/>Shiyu Zhao<br/>Pages 57-76<br/>Monte Carlo Methods<br/>Shiyu Zhao<br/>Pages 77-99<br/>Stochastic Approximation<br/>Shiyu Zhao<br/>Pages 101-124<br/>Temporal-Difference Methods<br/>Shiyu Zhao<br/>Pages 125-150<br/>Value Function Methods<br/>Shiyu Zhao<br/>Pages 151-189<br/>Policy Gradient Methods<br/>Shiyu Zhao<br/>Pages 191-214<br/>Actor-Critic Methods<br/>Shiyu Zhao<br/>Pages 215-236<br/>Back Matter<br/>Pages 237-275
520 ## - SUMMARY, ETC.
Summary, etc	This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability.<br/><br/>The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods.<br/><br/>With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term	Artificial intelligence
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term	Reinforcement learning
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term	Machine learning.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical Term	Machine learning
General subdivision	algorithm
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier	https://doi.org/10.1007/978-981-97-3944-8
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type	Books and Monographs

Holdings
Full call number	Accession Number	Koha item type	Lost status	Damaged status	Permanent Location	Current Location	Shelving location	Date acquired	Source of acquisition
006.31 ZHA-M	102891	Books and Monographs			Central Library, NIT Jalandhar	Central Library, NIT Jalandhar	General Stacks	20.05.2026	Delhi, Narendra Publishing House
006.31 ZHA-M	102892	Books and Monographs			Central Library, NIT Jalandhar	Central Library, NIT Jalandhar	General Stacks	20.05.2026	Delhi, Narendra Publishing House

Print
Suggest for purchase
Send to device
More searches

Search for this title in:
Other Libraries (WorldCat) Other Databases (Google Scholar) Online Stores (Bookfinder.com) Open Library (openlibrary.org)