Distilling Answer-Set Programming Rules from LLMs for Neurosymbolic Visual Question Answering 文章

ArXiv CS.AI2026-06-03NEWSen作者: Thomas Eiter, Nelson Higuera Ruiz, Johannes Oetsch

摘要

arXiv:2606.03269v1 Announce Type: new Abstract: Visual Question Answering (VQA) is the task of answering questions about images, requiring the integration of multimodal input and reasoning. Modular approaches that incorporate logic-based representations into the reasoning component offer clear advantages over end-to-end trained systems, particularly in terms of interpretability. However, adapting or extending these representations when task requirements change can place a significant burden on developers. To address this challenge, we present an approach for distilling rules from Large Language Models (LLMs). Our method prompts an LLM to extend an initial VQA reasoning theory, expressed as an answer-set program, to meet new requirements of the task. Examples from VQA datasets guide the LLM, validate the results, and help correct erroneous rules by leveraging feedback from the ASP solver. We demonstrate that our approach is effective across diverse VQA datasets.

Distilling Answer-Set Programming Rules from LLMs for Neurosymbolic Visual Question Answering 文章

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (5)