Learning complex goals with iterated amplification 事件

REGULATION2018-10-22影响: MEDIUM

Learning complex goals with iterated amplification We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or a reward function. Although this idea is in its very early stages and we have only completed experiments on simple toy algorithmic domains, we’ve decided to present it in its preliminary state

Learning complex goals with iterated amplification · 相关人物

暂无数据