TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI · Event

PRODUCT_LAUNCH · 2026-06-01 · Impact: MEDIUM

arXiv:2603.22867v1 · Announce Type: cross

Abstract: Multimodal stacks that mix ViTs, CNNs, GNNs, and transformer NLP strain embedded platforms because their compute/memory patterns diverge and hard real-time targets leave little slack. TRINE is a single-bitstream FPGA accelerator and compiler that executes end-to-end multimodal inference without reconfiguration. Layers are unified as DDMM/SDDMM/SpMM and mapped to a mod
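
The abstract breaks off here. To illustrate the three primitives it names, here is a minimal plain-Python sketch. It assumes the usual readings of the acronyms, which the excerpt does not spell out: DDMM as dense-dense matrix multiply, SDDMM as sampled dense-dense matrix multiply, and SpMM as sparse-dense matrix multiply. The function names and data layouts are hypothetical and say nothing about how TRINE itself implements or schedules these operations.

```python
# Illustrative only: reference versions of the three matrix primitives
# named in the abstract. Names and layouts are assumptions, not TRINE's API.

def ddmm(a, b):
    """Dense x dense: returns the full product A @ B as a list of rows."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]

def sddmm(mask, a, b):
    """Sampled dense-dense: compute (A @ B)[i][j] only where mask[i][j] is nonzero.

    Returns a dict {(i, j): value} holding just the sampled entries.
    """
    inner = len(b)
    return {(i, j): sum(a[i][k] * b[k][j] for k in range(inner))
            for i, mask_row in enumerate(mask)
            for j, m in enumerate(mask_row) if m}

def spmm(sparse, rows, b):
    """Sparse x dense: `sparse` maps (i, k) -> value for the nonzeros of A."""
    cols = len(b[0])
    out = [[0] * cols for _ in range(rows)]
    for (i, k), v in sparse.items():
        for j in range(cols):
            out[i][j] += v * b[k][j]
    return out

a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]
full = ddmm(a, b)                              # every output entry
sampled = sddmm([[1, 0], [0, 1]], a, b)        # diagonal entries only
sparse_out = spmm({(0, 0): 1, (1, 1): 4}, 2, b)  # A has two nonzeros
```

The three functions differ only in which operand or output is sparse, which is what makes a single shared formulation across layer types plausible.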
