Token Optimization Strategies for LLM-Based Oracle-to-PostgreSQL Migration 文章

ArXiv CS.AI2026-05-28NEWSen作者: Oleg Grynets, Dmytro Babarytskyi, Vasyl Lyashkevych

摘要

arXiv:2605.28557v1 Announce Type: cross Abstract: LLMs are increasingly used for software modernization, code translation, and database migration. However, LLM-based Oracle2PostgreSQL migration remains constrained by high token consumption, long-context degradation, dialect-specific semantic differences, and the risk of semantic drift during query transformation. Direct inclusion of large Oracle SQL/PL-SQL artefacts, schema definitions, procedural logic, and migration instructions into the model context increases cost and may reduce generation quality. This paper shows token optimization as a constrained transformation problem in LLM-based Oracle2PostgreSQL migration. The study formalizes and evaluates twelve token optimization strategies: baseline representation, context pruning, minification, DSL-based semantic compression, metadata augmentation, context refactoring, schema distillation, adaptive routing, AST-based minification, identifier masking, output constraint enforcement,…

摘要可能不完整,可查看原文