A Menu of Designs for Reinforcement Learning Over Time (paper)

1991, The MIT Press eBooks, cited by 610
Evolutionary Algorithms and Applications

Abstract

This chapter contains sections titled: Introduction and Overview, A Simple Two-Component Adaptive Critic Design, HDP and Dynamic Programming, Alternative Ways to Figure 3.2 in Adapting the Action Network, Alternatives to HDP in Adapting the Critic Network, Some Topics for Further Research, Equations and Code For Implementation, References
