🧠 VLM Reasoning
🎞️ ECCV2024 · 1 paper note
- NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
  NavGPT-2 feeds the hidden-layer representations of a frozen LLM, used as vision-language features, into a topological-map navigation policy network. This closes the performance gap between LM-based agents and VLN-specific models while retaining the LLM's interpretable navigational reasoning, and it shows strong data efficiency.
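
The split described above, frozen features feeding a trainable policy over map nodes, can be illustrated with a toy sketch. This is not the NavGPT-2 implementation: `frozen_features`, `TopoMapPolicy`, the feature size, the node names, and the linear scorer are all hypothetical stand-ins chosen for illustration, and the "LLM" here is just a fixed pseudo-random feature function with no trainable parameters.

```python
import math
import random
import zlib

HIDDEN_DIM = 8  # hypothetical feature size, not taken from the paper


def frozen_features(instruction: str, view: str) -> list[float]:
    """Stand-in for frozen-LLM hidden states: fixed, with nothing to train."""
    seed = zlib.crc32(f"{instruction}|{view}".encode())
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(HIDDEN_DIM)]


class TopoMapPolicy:
    """Hypothetical trainable linear scorer over nodes of a topological map."""

    def __init__(self, dim: int, seed: int = 0) -> None:
        rng = random.Random(seed)
        self.w = [rng.uniform(-0.1, 0.1) for _ in range(dim)]

    def action_probs(self, node_features: dict[str, list[float]]) -> dict[str, float]:
        # Score every candidate node, then apply a numerically stable softmax.
        scores = {
            node: sum(wi * fi for wi, fi in zip(self.w, feats))
            for node, feats in node_features.items()
        }
        top = max(scores.values())
        exps = {node: math.exp(s - top) for node, s in scores.items()}
        total = sum(exps.values())
        return {node: e / total for node, e in exps.items()}

    def update(self, node_features: dict[str, list[float]], target: str, lr: float = 0.1) -> None:
        # One cross-entropy gradient step; only the policy weights change.
        probs = self.action_probs(node_features)
        for node, feats in node_features.items():
            grad = probs[node] - (1.0 if node == target else 0.0)
            for i, fi in enumerate(feats):
                self.w[i] -= lr * grad * fi


instruction = "walk past the sofa and stop at the door"
nodes = {name: frozen_features(instruction, name) for name in ("hallway", "kitchen", "door")}

policy = TopoMapPolicy(HIDDEN_DIM)
before = policy.action_probs(nodes)["door"]
for _ in range(50):
    policy.update(nodes, target="door")
after = policy.action_probs(nodes)["door"]

print(f"P(door) before: {before:.3f}, after: {after:.3f}")
```

In this sketch the feature function is never updated, so all learning happens in the small policy head. That mirrors, in a very reduced form, why keeping the LLM frozen can leave its reasoning behavior intact while the navigation policy is trained separately.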