Skip to content

GraphNarrator: Generating Textual Explanations for Graph Neural Networks

Conference: ACL 2025
arXiv: 2410.15268
Code: None
Area: Graph Learning
Keywords: GNN Explainability, Natural Language Explanation, Text-Attributed Graphs, Expert Iteration, Knowledge Distillation

TL;DR

GraphNarrator is the first method to generate natural language explanations for Graph Neural Networks (GNNs). By utilizing saliency graph verbalization for pseudo-label generation, information-theoretic metric-driven expert iteration for self-improvement, and knowledge distillation for training an end-to-end explainer, it achieves faithful, concise, and human-friendly explanations of GNN decisions.


Background & Motivation

Background: Graph representation learning is widely applied in recommendation systems, social networks, and other fields, but the internal decision-making process of GNNs remains opaque. Existing GNN explanation methods primarily provide node- or edge-level importance scores. In the context of Text-Attributed Graphs (TAGs), these token-importance-based explanations are redundant and lack synthesis, making them difficult for humans to comprehend.

Limitations of Prior Work: (1) Saliency graph or feature importance methods (e.g., GNNExplainer) only provide importance scores without explaining semantic information; (2) In multi-node subgraphs, token-by-token importance annotations are highly fragmented and redundant; (3) External language models lack awareness of the model's internal decision-making processes, leading to unreliable zero-shot generated explanations.

Key Insight: Natural language explanations are abstractive, concise, and readable, making them a more human-comprehensible modality of explanation. However, due to the lack of ground-truth explanation data, the key challenge of "training a high-quality explanation generator without labels" must be addressed.


Method

Overall Architecture

GraphNarrator employs a three-stage pipeline: (1) saliency explanation generation and verbalization → (2) expert-iteration-driven pseudo-label generator training → (3) knowledge distillation for training an end-to-end explainer.

Key Designs

1. Saliency Paragraph Verbalization (Saliency Paragraph): First, saliency methods (e.g., LRP, Input×Grad) are used to obtain node and token importance scores. Then, the ego-graph is decomposed into a tree structure via BFS, which is then organized into a hierarchical document paragraph using pre-order traversal. The text of each node serves as a section, child nodes serve as subsections, and cross-edges are represented by citation hook-ups. Importance scores are appended following each token, in the form of token(score).

2. Information-Theoretic Explanation Quality Metrics: Three training objectives are proposed: - Input Faithfulness \(f_S\): Uses PMI to measure the mutual information between the explanation and important input regions, which is approximated via masked token prediction. - Output Faithfulness \(f_F\): Uses PMI to measure the mutual information between the explanation and the model's prediction. - Brevity \(f_B\): The ratio of explanation length to input length, encouraging concise expressions.

3. Expert Iteration Self-Improvement: Closed-loop optimization based on Expert Iteration: measuring explanation quality → filtering high-quality explanations → fine-tuning the pseudo-label generator → generating a new round of candidate explanations.

Loss & Training

Input faithfulness is estimated by sampling masked token predictions across different thresholds \(\tau\):

\[f_S \approx \int_0^1 P(\tau) \cdot \log \frac{P_{MLM}(\mathcal{R}_\tau | \mathcal{G}_{M_\tau}, E)}{P_{MLM}(\mathcal{R}_\tau | \mathcal{G}_{M_\tau})} d\tau\]

The final objective balances \(f_S\) (maximization), \(f_F\) (maximization), and \(f_B\) (minimization) through multi-objective optimization.


Experiments

Main Results

Dataset Method Simul.↑ PMI-10%↑ PMI-20%↑ PMI-30%↑ Brevity↓
DBLP LLaMA3.1 8B 0.63 0.139 0.109 0.077 0.394
DBLP GPT-4o 0.82 0.142 0.101 0.085 0.385
DBLP GraphNarrator 0.95 0.155 0.108 0.085 0.354
Cora GPT-4o 0.95 0.414 0.284 0.225 0.357
Cora GraphNarrator 0.97 0.418 0.290 0.227 0.315
Book-History GPT-4o 0.89 0.456 0.313 0.240 0.768
Book-History GraphNarrator 0.96 0.533 0.374 0.291 0.506

Ablation Study

The effectiveness of expert iteration is verified by comparing different iteration rounds (see the original paper for details), showing a steady improvement in the quality of pseudo-labels in each round. The balanced configuration of the three training objectives (\(f_S\), \(f_F\), \(f_B\)) with top-50% filtering achieves the best performance.

Ablation Dimension Observation
W/o Expert Iteration Both explanation faithfulness and brevity deteriorate
Only \(f_S\) preserved Explanations are wordy with poor brevity
Only \(f_B\) preserved Explanations are too short with poor faithfulness

Key Findings

  • GraphNarrator consistently outperforms GPT-4o zero-shot explanations across all datasets, particularly showing a significant advantage in Simulatability (DBLP: 0.95 vs 0.82).
  • On the Book-History dataset, PMI-10% increases by 16.9% and brevity improves by 34.1%.
  • The small-scale model based on LLaMA3.1 8B surpasses GPT-4o after fine-tuning, validating the effectiveness of Expert Iteration.

Highlights & Insights

  • Proposes the first method to generate natural language explanations for GNNs, pioneering a new paradigm for graph model explainability.
  • Innovatively converts saliency graph explanations into structured documents (Saliency Paragraphs), allowing LLMs to comprehend graph structural information.
  • Proposes an information-theoretic unsupervised metric for explanation quality, enabling the evaluation and optimization of explanations without human annotations.
  • Achieves efficient end-to-end explanation generation through Expert Iteration and knowledge distillation.

Limitations & Future Work

  • Relies on saliency methods as the initial signal, whose inherent limitations may propagate to the final explanations.
  • Evaluated only on node classification tasks, without being extended to other task types like link prediction or graph classification.
  • Pseudo-label generation relies on GPT-4o-mini, which limits applicability in resource-constrained scenarios.
  • The human evaluation scale is relatively small (50 samples × 3 annotators), and statistical significance remains to be further validated.
  • GNN Explainability: GNNExplainer (Ying et al., 2019), PGExplainer (Luo et al., 2020), etc., provide structural explanations.
  • Natural Language Explanation: e-SNLI (Camburu et al., 2018), Chain-of-Thought (Wei et al., 2022), etc., target textual models.
  • SMV Methods: Feldhus et al. (2022) verbalize saliency maps for text classification model explanation.
  • Expert Iteration: The iterative self-improvement framework of Dong et al. (2023).

Rating

Dimension Score
Novelty ⭐⭐⭐⭐⭐
Technical Depth ⭐⭐⭐⭐
Experimental Thoroughness ⭐⭐⭐⭐
Writing Quality ⭐⭐⭐⭐
Value ⭐⭐⭐⭐
Overall Rating 8/10