💻 Code Intelligence
📷 CVPR2026 · 2 paper notes
- GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning
  GeoTikzBridge builds a 2.5M-pair image–TikZ code dataset, reported as the largest of its kind, along with the first auxiliary-line instruction dataset. It trains a code generation model that accurately reconstructs geometric figures and can serve as a plug-and-play module to strengthen the geometric reasoning of arbitrary MLLMs/LLMs.
- MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction
  This paper proposes MM-ReCoder, the first multimodal LLM with genuine self-correction capability for chart-to-code generation. It uses a two-stage, multi-turn GRPO reinforcement learning framework: shared-first-turn training first optimizes correction ability, then full-trajectory training optimizes coding ability. With only 7B parameters, MM-ReCoder achieves an 86.5% low-level score on ChartMimic, rivaling Qwen3-VL-235B.
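The MM-ReCoder note mentions GRPO, which scores each sampled response relative to the other responses in its sampling group rather than against a learned value function. The sketch below shows only that group-relative normalization step. It is not the paper's implementation: the function name, the reward values, and the use of population standard deviation are all assumptions made for illustration. Reading "shared-first-turn" as "several second-turn corrections sampled from one common first-turn attempt" is likewise an interpretation of the summary, not a confirmed detail.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group (GRPO-style).

    Hypothetical helper: subtracts the group mean and divides by the
    group standard deviation. `eps` avoids division by zero when every
    sample in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; implementations vary
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four second-turn corrections that are assumed to
# start from the same first-turn attempt (illustrative values only).
correction_rewards = [0.62, 0.80, 0.55, 0.71]
advantages = group_relative_advantages(correction_rewards)
```

Under this normalization, corrections that score above the group mean receive a positive advantage and those below receive a negative one, so the update signal depends only on how a correction compares with its siblings.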