Overview of the Transformer-based Models for NLP Tasks 论文

2020Annals of Computer Science and Information Systems引用 374顶会

Topic ModelingNatural Language Processing TechniquesMachine Learning in Healthcare

人工智能 Natural Language Processing Techniques Topic Modeling Machine Learning in Healthcare

作者

摘要

proposed a new neural network architecture named Transformer. That modern architecture quickly revolutionized the natural language processing world. Models like GPT and BERT relying on this Transformer architecture have fully outperformed the previous state-of-theart networks. It surpassed the earlier approaches by such a wide margin that all the recent cutting edge models seem to rely on these Transformer-based architectures. In this paper, we provide an overview and explanations of the latest models. We cover the auto-regressive models such as GPT, GPT-2 and XLNET, as well as the auto-encoder architecture such as BERT and a lot of post-BERT models like RoBERTa, ALBERT, ERNIE 1.0/2.0.

作者查看全部 (4)

Omar Abou Khaled

Elena Mugellini

Jacky Casas

Anthony Gillioz

Overview of the Transformer-based Models for NLP Tasks 论文

摘要

作者查看全部 (4)

相关技术查看全部 (3)

相关事件

相关文章