Multi-Modal Learning meets Genetic Programming: Analyzing Alignment in Latent Space Optimization 文章

ArXiv CS.AI2026-06-02NEWSen作者: Benjamin L\'eger, Kazem Meidani, Christian Gagn\'e

摘要

arXiv:2604.08324v3 Announce Type: replace-cross Abstract: Symbolic regression (SR) aims to discover mathematical expressions from data, a task traditionally tackled using Genetic Programming (GP) through combinatorial search over symbolic structures. Latent Space Optimization (LSO) methods use neural encoders to map symbolic expressions into continuous spaces, transforming the combinatorial search into continuous optimization. SNIP (Meidani et al., 2024), a contrastive pre-training model inspired by CLIP, advances LSO by introducing a multi-modal approach: aligning symbolic and numeric encoders in a shared latent space to learn the phenotype-genotype mapping, enabling optimization in the numeric space to implicitly guide symbolic search. However, this relies on fine-grained cross-modal alignment, whereas literature on similar models like CLIP reveals that such an alignment is typically coarse-grained.