This paper addresses the challenges of schema evolution in document-oriented databases by introducing SchemaMigrate, a system that combines refutational normalization-based schema inclusion checking with LLM-guided code generation to produce executable Python migration scripts. The system detects breaking changes and generates migration scripts for complex structural transformations.
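To make the notion of a breaking change concrete, the sketch below checks a much simpler condition than the refutational normalization-based inclusion check the paper describes. It assumes flat, JSON-Schema-like dictionaries with only `properties` and `required` keys, and flags changes that could cause a document valid under the old schema to be rejected by the new one. The function name `find_breaking_changes` and the example schemas are hypothetical and are not taken from SchemaMigrate.

```python
from typing import Any

Schema = dict[str, Any]


def find_breaking_changes(old: Schema, new: Schema) -> list[str]:
    """List reasons a document valid under `old` might be rejected by `new`.

    Simplified stand-in for schema inclusion checking: handles only flat
    object schemas with per-field `type` and a top-level `required` list.
    """
    problems: list[str] = []
    old_props = old.get("properties", {})
    new_props = new.get("properties", {})
    old_required = set(old.get("required", []))
    new_required = set(new.get("required", []))

    # A field that becomes required breaks old documents that omit it.
    for name in sorted(new_required - old_required):
        problems.append(f"field '{name}' is newly required")

    # A field whose type is changed or newly constrained breaks old
    # documents that carry a value of the previously allowed type.
    for name in sorted(old_props.keys() & new_props.keys()):
        old_type = old_props[name].get("type")
        new_type = new_props[name].get("type")
        if new_type is not None and old_type != new_type:
            problems.append(
                f"field '{name}' changed type from {old_type} to {new_type}"
            )

    return problems


# Hypothetical example schemas (assumptions for illustration only).
old_schema = {
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
    "required": ["name"],
}
new_schema = {
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "string"},
        "email": {"type": "string"},
    },
    "required": ["name", "email"],
}

breaking = find_breaking_changes(old_schema, new_schema)
```

Here `breaking` contains two entries: one for the newly required `email` field and one for the type change on `age`. An empty list means this simplified check found nothing, which is weaker than a proof of inclusion.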
Key findings
SchemaMigrate achieves 83.3% accuracy on specification conformance tests and 100% accuracy on real-world scenarios.
The system reports an average confidence of 82% for the migration scripts it generates.
Among the baselines compared, SchemaMigrate is the only system that combines accurate breaking change detection with practical migration generation.
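As an illustration of what a structural migration script can look like, the following hand-written sketch moves flat address fields into a nested `address` object. It is not output from SchemaMigrate. The field names (`street`, `city`, `zip`) and the function `migrate_document` are assumptions chosen for the example.

```python
import copy
from typing import Any

Document = dict[str, Any]

# Hypothetical flat fields that the new schema expects under "address".
ADDRESS_KEYS = ("street", "city", "zip")


def migrate_document(doc: Document) -> Document:
    """Return a migrated copy of `doc` with address fields nested.

    The input is left untouched, and running the migration twice gives
    the same result as running it once.
    """
    migrated = copy.deepcopy(doc)

    # Collect whichever flat address fields are present and remove them
    # from the top level.
    address = {key: migrated.pop(key) for key in ADDRESS_KEYS if key in migrated}

    # Merge into an existing nested object if one is already there.
    if address:
        migrated.setdefault("address", {}).update(address)

    return migrated


original = {"name": "Ada", "street": "1 Main St", "city": "London"}
result = migrate_document(original)
```

After this runs, `result` holds `name` plus a nested `address` object with `street` and `city`, while `original` is unchanged. Working on a copy and keeping the transformation idempotent makes such a script safer to re-run over a partially migrated collection.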
Limitations & open questions
The paper does not discuss the limitations of the current implementation of SchemaMigrate.
Further research is needed to improve the system's handling of more complex schema evolution scenarios.