Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions (Event)

Name: Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions
Start: 2026-05-28

PRODUCT_LAUNCH | 2026-05-28 | Impact: MEDIUM

arXiv:2605.27750v1 Announce Type: cross

Abstract: Recent work has shown that Vision-Language Models (VLMs) used for optical character recognition (OCR) can generate plausible but visually unsupported text, suggesting reliance on language priors. Comparing open-weight VLMs with traditional OCR baselines on low-resource Ancient Greek critical editions, we show that VLM errors often remain fl

Artificial Intelligence
