Learning complex goals with iterated amplification 事件
REGULATION2018-10-22影响: MEDIUM
Learning complex goals with iterated amplification We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or a reward function. Although this idea is in its very early stages and we have only completed experiments on simple toy algorithmic domains, we’ve decided to present it in its preliminary state
Learning complex goals with iterated amplification · 相关人物
暂无数据