Mathematical foundations of reinforcement learning (Record no. 200032)
[ view plain ]
| 000 -LEADER | |
|---|---|
| fixed length control field | 02440nam a22002657a 4500 |
| 003 - CONTROL NUMBER IDENTIFIER | |
| control field | OSt |
| 005 - DATE AND TIME OF LATEST TRANSACTION | |
| control field | 20260603121952.0 |
| 008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
| fixed length control field | 260603b |||||||| |||| 00| 0 eng d |
| 020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
| ISBN | 9789819739462 |
| 041 ## - LANGUAGE CODE | |
| Language code of text/sound track or separate title | eng |
| 082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER | |
| Classification number | 006.31 ZHA-M |
| 100 ## - MAIN ENTRY--AUTHOR NAME | |
| Personal name | Zhao, Shiyu. |
| 245 ## - TITLE STATEMENT | |
| Title | Mathematical foundations of reinforcement learning |
| 260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) | |
| Place of publication | China: |
| Name of publisher | Jsinghua University Press, |
| Year of publication | 2025 |
| 300 ## - PHYSICAL DESCRIPTION | |
| Number of Pages | xvi, 275p. |
| 505 ## - FORMATTED CONTENTS NOTE | |
| Formatted contents note | Front Matter<br/>Pages i-xvi<br/><br/>Basic Concepts<br/>Shiyu Zhao<br/>Pages 1-13<br/>State Values and Bellman Equation<br/>Shiyu Zhao<br/>Pages 15-34<br/>Optimal State Values and Bellman Optimality Equation<br/>Shiyu Zhao<br/>Pages 35-55<br/>Value Iteration and Policy Iteration<br/>Shiyu Zhao<br/>Pages 57-76<br/>Monte Carlo Methods<br/>Shiyu Zhao<br/>Pages 77-99<br/>Stochastic Approximation<br/>Shiyu Zhao<br/>Pages 101-124<br/>Temporal-Difference Methods<br/>Shiyu Zhao<br/>Pages 125-150<br/>Value Function Methods<br/>Shiyu Zhao<br/>Pages 151-189<br/>Policy Gradient Methods<br/>Shiyu Zhao<br/>Pages 191-214<br/>Actor-Critic Methods<br/>Shiyu Zhao<br/>Pages 215-236<br/>Back Matter<br/>Pages 237-275 |
| 520 ## - SUMMARY, ETC. | |
| Summary, etc | This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability.<br/><br/>The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods.<br/><br/>With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning. |
| 650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical Term | Artificial intelligence |
| 650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical Term | Reinforcement learning |
| 650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical Term | Machine learning. |
| 650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
| Topical Term | Machine learning |
| General subdivision | algorithm |
| 856 ## - ELECTRONIC LOCATION AND ACCESS | |
| Uniform Resource Identifier | https://doi.org/10.1007/978-981-97-3944-8 |
| 942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
| Koha item type | Books and Monographs |
| Full call number | Accession Number | Koha item type | Lost status | Damaged status | Permanent Location | Current Location | Shelving location | Date acquired | Source of acquisition |
|---|---|---|---|---|---|---|---|---|---|
| 006.31 ZHA-M | 102891 | Books and Monographs | Central Library, NIT Jalandhar | Central Library, NIT Jalandhar | General Stacks | 20.05.2026 | Delhi, Narendra Publishing House | ||
| 006.31 ZHA-M | 102892 | Books and Monographs | Central Library, NIT Jalandhar | Central Library, NIT Jalandhar | General Stacks | 20.05.2026 | Delhi, Narendra Publishing House |
