新書推薦:

《
给孩子的考古
》
售價:NT$
296.0

《
文明的重建:战后德国五十年(译林思想史)从大屠杀刽子手到爱好和平的民主主义者,揭秘战后德国五十年奇迹般的复兴之路!
》
售價:NT$
505.0

《
推荐系统核心技术与实践
》
售價:NT$
505.0

《
乌合之众:群体心理研究
》
售價:NT$
347.0

《
流浪的君子:孔子的最后二十年 王健文
》
售價:NT$
254.0

《
咨询的奥秘2:咨询师的百宝箱(珍藏版)
》
售價:NT$
356.0

《
中国近代思想与学术的系谱(增订版)
》
售價:NT$
500.0

《
张元济的生平与事业:从清代改革家到二十世纪出版家
》
售價:NT$
398.0
|
內容簡介: |
博塞克斯编著的《抽象动态规划国际知名大学原版教材信息技术学科与电气工程学科系列》采用一种简洁的方式介绍动态规划的理论与方法。首先把动态规划的核心问题表述为一类抽象影射的不动点问题,然后将决定不动点问题求解难度的主要因素概括为上述抽象投影射的两个性质,接着顺序讨论了各种典型情况下的相应不动点问题的主要性质和求解方法。
|
目錄:
|
1.Introduction
1.1.Structure of Dynamic Programming Problems
1.2.Abstract Dynamic Programming Models
1.2.1.Problem Formulation
1.2.2.Monotonicity and Contraction Assumptions
1.2.3.Some Examples
1.2.4.Approximation-Related Mappings
1.3.Organization of the Book
1.4.Notes, Sources, and Exercises
2.Contractive Models
2.1.Fixed Point Equation and Optimality Conditions
2.2.Limited Lookahead Policies
2.3.Value Iteration
2.3.1.Approximate Value Iteration
2.4.Policy Iteration
2.4.1.Approximate Policy Iteration
2.5.Optimistic Policy Iteration
2.5.1.Convergence of Optimistic Policy Iteration
2.5.2.Approximate Optimistic Policy Iteration
2.6.Asynchronous Algorithms
2.6.1.Asynchronous Value Iteration
2.6.2.Asynchronous Policy Iteration
2.6.3.Policy Iteration with a Uniform Fixed Point
2.7.Notes, Sources, and Exercises
3.Semicontractive Models
3.1.Semicontractive Models and Regular Policies
3.1.1.Fixed Points, Optimality Conditions, and
Algorithmic Results
3.1.2.Illustrative Example: Deterministic Shortest
Path Problems
3.2.Irregular Policies and a Perturbation Approach
3.2.1.The Case Where Irregular Policies Have Infinite
Cost
3.2.2.The Case Where Irregular Policies Have Finite
Cost - Perturbations
3.3.Algorithms
3.3.1.Asynchronous Value Iteration
3.3.2.Asynchronous Policy Iteration
3.3.3.Policy Iteration with Perturbations
3.4.Notes, Sources, and Exercises
4.Noncontractive Models
4.1.Noncontractive Models
4.2.Finite Horizon Problems
4.3.Infinite Horizon Problems
4.3.1.Fixed Point Properties and Optimality Conditions
4.3.2.Value Iteration
4.3.3.Policy Iteration
4.4.Semicontractive-Monotone Increasing Models
4.4.1.Value and Policy Iteration Algorithms
4.4.2.Some Applications
4.4.3.Linear-Quadratic Problems
4.5.Affine Monotonic Models
4.5.1.Increasing Affine Monotonic Models
4.5.2.Nonincreasing Affine Monotonic Models
4.5.3.Exponential Cost Stochastic Shortest Path
Problems
4.6.An Overview of Semicontractive Models and Results
4.7.Notes, Sources, and Exercises
5.Models with Restricted Policies
5.1.A Framework for Restricted Policies
5.1.1.General Assumptions
5.2.Finite Horizon Problems
5.3.Contractive Models
5.4.Borel Space Models
5.5.Notes, Sources, and Exercises
Appendix A: Notation and Mathematical Conventions
Appendix B: Contraction Mappings
Appendix C: Measure Theoretic Issues
Appendix D: Solutions of Exercises
References
Index
|
|