A Menu of Designs for Reinforcement Learning Over Time (Paper)

1991, The MIT Press eBooks, cited by 610

Evolutionary Algorithms and Applications

Abstract

This chapter contains sections titled: Introduction and Overview, A Simple Two-Component Adaptive Critic Design, HDP and Dynamic Programming, Alternative Ways to Figure 3.2 in Adapting the Action Network, Alternatives to HDP in Adapting the Critic Network, Some Topics for Further Research, Equations and Code For Implementation, References

Authors

Paul J. Werbos
