Skip to content

✏️ Knowledge Editing

💬 ACL2026 · 10 paper notes

📌 Same area in other venues: 📷 CVPR2026 (2) · 🔬 ICLR2026 (15) · 🧪 ICML2026 (8) · 🤖 AAAI2026 (4) · 🧠 NeurIPS2025 (6)

🔥 Top topics: LLM ×3

Aligning Language Models with Real-time Knowledge Editing

Introduces CRAFT (a continuously updated Chinese financial knowledge editing dataset) and KEDAS (a knowledge editing alignment paradigm based on diverse edit augmentation and adaptive inference) to resolve the difficulty of balancing success rate, locality, and portability in real-time knowledge editing scenarios.

Can Factual Opinions Be Edited (Manipulated) in Large Language Models?

This paper points out that existing knowledge editing techniques can be used to manipulate the "documented stances of public figures" (factual opinions). To address this, the authors construct the FOE benchmark with evidence and find that current methods result in "surface-level opinion changes with contradictory evidence." They propose a two-stage Self-Generated Evidence-Aligned method, enabling edited models to provide self-consistent evidence for manipulated opinions without relying on explicit instructions.

CLaRE-ty Amid Chaos: Quantifying Representational Entanglement to Predict Ripple Effects in LLM Editing

CLARE proposes a lightweight representational method that quantifies the degree of entanglement between facts through forward activations of a single intermediate layer. It is used to predict ripple effects in model editing, achieving an average improvement of 62.2% in Spearman correlation compared to gradient-based methods, while being 2.74x faster with 2.85x less memory consumption.

EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing

Ours proposes EvoEdit, which achieves large-scale sequential knowledge editing by dynamically evolving a null-space projector. It efficiently injects new knowledge while maintaining existing knowledge, preserving SOTA performance at the 10K editing scale and running 3.5x faster than AlphaEdit.

FABLE: Fine-grained Fact Anchoring for Unstructured Model Editing

This paper identifies that existing unstructured model editing methods, while capable of holistic recall of edited text, fail to provide access to fine-grained facts. It proposes the FABLE framework, which uses a two-stage hierarchical strategy to anchor fine-grained facts in shallow layers and integrate holistic narratives in deep layers, and constructs the UnFine diagnostic benchmark for systematic evaluation.

HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning

HiEdit utilizes hierarchical reinforcement learning to decompose "lifelong model editing" into two subtasks: high-level layer selection and low-level gradient update calculation. This allows the hypernetwork to adaptively modify only half of the layers based on specific knowledge, improving the strong baseline RLEdit by an average of 8.48%.

One Mask to Rule Them All: On Hidden Facts after Editing and How to Find Them

This paper discovers that ROME / MEMIT does not truly overwrite old knowledge but suppresses it through a shared overattention mechanism; a sparse binary mask can reverse most edits and reduce the success rate of new edits from 98% to 38%.

Representation Interventions Enable Lifelong Knowledge Memory Control in LLMs

This paper proposes RILKE, which transforms lifelong knowledge editing from "modifying model weights" to "applying low-rank interventions in the hidden representation space." Through robust training, query-adaptive routing, and shared subspace modules, RILKE maintains near-perfect editing success rates and strong generalization after 1,000 unstructured knowledge edits while significantly reducing storage overhead.

Spectral Characterization and Mitigation of Sequential Knowledge Editing Collapse

The paper explains why sequential knowledge editing causes LLM general ability collapse from the perspective of SVD spectral structure and proposes REVIVE. By filtering update components that interfere with the dominant singular subspace within the singular vector basis of the original weights, REVIVE enables editors like MEMIT, RECT, and AlphaEdit to maintain both editing success rates and general capabilities under 10,000 to 20,000 continuous edits.

The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models

The SA-MCQ diagnostic framework is proposed to reveal the "surface compliance" phenomenon in knowledge editing—where editors achieve high scores on standard benchmarks but fail to truly overwrite internal beliefs. Models revert to original parametric memory in discriminative self-assessment, and sequential editing accumulates representational residue, leading to cognitive instability.