The Unreasonable Effectiveness of Data 论文
2009IEEE Intelligent Systems引用 1780
Semantic Web and OntologiesData Quality and ManagementTopic Modeling
相关技术:Topic Modeling
摘要
Problems that involve interacting with humans, such as natural language understanding, have not proven to be solvable by concise, neat formulas like F = ma. Instead, the best approach appears to be to embrace the complexity of the domain and address it by harnessing the power of data: if other humans engage in the tasks and generate large amounts of unlabeled, noisy data, new algorithms can be used to build high-quality models from the data.