Skip to content

T-GINEE: A Tensor-Based Multilayer Graph Representation Learning

Conference: ICML 2026
arXiv: 2605.28300
Code: To be confirmed
Area: Graph Learning / Representation Learning
Keywords: Multilayer graphs, Tensor decomposition, Generalized Estimating Equations (GEE), Cross-layer dependencies

TL;DR

T-GINEE combines CP tensor decomposition and Generalized Estimating Equations (GEE) to explicitly model cross-layer dependencies in multilayer networks. It offers theoretical guarantees and superior scalability, breaking the OOM (Out-of-Memory) limitations of other tensor methods on million-node graphs such as DBLP and Stack Overflow.

Background & Motivation

Background: Multilayer networks are ubiquitous in the real world (e.g., social networks with friend/colleague/family relations; biological networks with protein co-expression/physical interactions). These systems require learning low-dimensional vector representations to capture complex cross-layer relationships.

Limitations of Prior Work: Traditional graph embedding methods (GNNs, Matrix Factorization) mostly process layers independently (ignoring inter-layer correlations) or use simple aggregation strategies (causing information loss) in multilayer settings. They lack a rigorous theoretical foundation to characterize how embeddings should encode subtle mutual influences between layers.

Key Challenge: Current multilayer graph learning lacks a systematic theoretical framework to explicitly model cross-layer residual dependencies.

Goal: Design a statistical framework for multilayer graph learning that possesses both theoretical guarantees and computational efficiency.

Key Insight: The GEE framework from biostatistics can model correlated data in a principled way, while CP tensor decomposition can elegantly represent multilayer graph structures. Combining these two allows for the explicit capture of inter-layer covariance.

Core Idea: Parameterize the multilayer graph parameter tensor via CP tensor decomposition and jointly model intra-layer and cross-layer dependencies through a tensor-based GEE framework.

Method

Overall Architecture

(1) Decompose the multilayer adjacency tensor \(\mathcal{A}_{n \times n \times M}\) into a CP form consisting of node embeddings \(\alpha_{n \times R}\) and layer-specific embeddings \(\beta_{M \times R}\); (2) Obtain the marginal probabilities \(\mathcal{P}\) by applying a link function \(g^{-1}\) to the parameter tensor \(\Theta\); (3) Solve for parameters under a tensor-based GEE framework weighted by a working covariance matrix.

Key Designs

  1. Symmetric CP Tensor Decomposition:

    • Function: Decomposes the parameter tensor as \(\Theta = \sum_{r=1}^R \alpha^{(r)} \circ \alpha^{(r)} \circ \beta^{(r)}\).
    • Mechanism: The symmetry ensures the representation of undirected graphs. Shared latent factors \(\alpha\) capture the common roles of nodes across different relationship types, while layer-specific factors \(\beta\) model the heterogeneity of each layer. The total parameter count is reduced from \(O(n^2 M)\) to \(O((n + M) R)\).
    • Design Motivation: Real-world entities typically interact through a few latent features across multiple relations; CP decomposition captures this sparsity while maintaining interpretability.
  2. Tensor-Based Generalized Estimating Equations (T-GEE) Framework:

    • Function: Estimates the parameter vector \(\gamma\) by minimizing the weighted sum of squared residuals.
    • Mechanism: GEE only specifies the first two moments (mean and covariance) of the response without assuming a full distribution. It weights residuals using a working covariance matrix \(\Sigma^w_{i, j} = \Gamma^{1/2}_{i, j} W \Gamma^{1/2}_{i, j}\) to explicitly model cross-layer correlations.
    • Design Motivation: Standard unweighted least squares ignore inter-layer correlations, leading to inefficient estimation. The GEE weighting strategy assigns more reasonable weights to correlated layers via co-variation data, improving parameter estimation efficiency.
  3. Working Covariance + Flexible Link Functions:

    • Function: Estimates a shared correlation matrix \(W\) from a residual pool. Flexible link functions \(g^{-1}\) such as logit, probit, or sparsity-aware logit \(g^{-1}(x) = \frac{s}{1 + e^{-x}}\) are supported.
    • Mechanism: Working covariance estimation allows for the misspecification of the correlation structure. Sparsity-aware link functions explicitly handle sparse networks.
    • Design Motivation: Practical networks are often sparse, where standard logit gradients may explode in sparse regions. The sparsity-aware design fits sparse observations naturally by introducing a coefficient \(s \in (0, 1)\).

Key Experimental Results

Main Results

Method CP Tucker NMF SVD LSE MASE NNTUCK SPECK HOSVD T-GINEE
Synthetic AUC 0.449 0.529 0.722 0.813 0.223 0.382 0.611 0.760 0.850 0.940

T-GINEE significantly outperforms the runner-up HOSVD (0.850). The substantial gap between the basic CP decomposition and T-GINEE (0.449 → 0.940) quantifies the benefits of the statistical regularization framework.

Main Results (Real-world Data)

Method AUCS Krackhardt WAT Yeast DBLP (Million Nodes) Stack Overflow (Million Nodes)
HOSVD 0.897 0.783 0.820 0.902 OOM OOM
SVD 0.877 0.932 0.719 0.879 0.6093 0.9682
T-GINEE 0.920 0.948 0.838 0.921 0.6478 0.9831

CP, Tucker, and HOSVD all failed on DBLP and Stack Overflow due to OOM errors, whereas T-GINEE successfully handled these million-node graphs.

Ablation Study & Generalization

  • Heterogeneous Synthesis: Under low overdetermination ratios, the learned covariance matrix \(W\) correctly recovers the inter-layer similarity order.
  • New Layer Generalization: In zero-shot transfer experiments on DBLP-5K—training on the first four layers and predicting edges in the fifth—T-GINEE achieved an AUC of 0.7733, surpassing MGCN and MR-GCN even though the latter were retrained on the fifth layer.

Highlights & Insights

  • Principled Cross-layer Modeling: This work is the first to seamlessly combine the GEE framework (a standard tool in biostatistics) with tensor decomposition, providing a rigorous statistical foundation for multilayer graph learning.
  • Double Theoretical Guarantees: Theorem 3.1 proves \(\sqrt{N}\) consistency, and Theorem 3.2 establishes asymptotic normality.
  • Breakthrough Scalability: The sparse tensor implementation and mini-batch sampling reduce complexity from \(O(n^2 M)\) to \(O(R |E|)\), maintaining competitiveness on million-node graphs.
  • Inter-layer Knowledge Transfer: The learned \(\alpha\) and \(\beta\) decompositions naturally support zero-shot cross-layer generalization.

Limitations & Future Work

  • The theoretical analysis (Theorem 3.2) requires overdetermination conditions, which limits the applicability of asymptotic normality in extremely sparse scenarios.
  • There is currently no data-driven method for the optimal selection of rank \(R\) and the sparsity coefficient \(s\).
  • Layer Heterogeneity: The assumption of fully shared node embeddings \(\alpha\) across layers may be too strong for completely heterogeneous relationship types.
  • Future Work: Introduce hierarchical or soft-sharing mechanisms; develop provable mini-batch theory; design adaptive selection for rank and sparsity coefficients.
  • vs. Traditional Tensor Decomposition (CP/Tucker): Traditional methods assume edge independence without covariance modeling; T-GINEE explicitly learns a working covariance.
  • vs. Multilayer GNNs (MGCN/MR-GCN): GNNs rely on local neighborhood aggregation via graph convolution, while T-GINEE adopts a global tensor perspective and statistical inference.
  • Insight: The combination of the GEE paradigm and tensor decomposition can be generalized to other multivariate data scenarios.

Rating

  • Novelty: ⭐⭐⭐⭐⭐ The integration of GEE and tensor decomposition introduces a new theoretical paradigm for multilayer graphs.
  • Experimental Thoroughness: ⭐⭐⭐⭐⭐ Comprehensive evaluation across synthetic data, small benchmarks, and million-node graphs compared against strong GNN baselines.
  • Writing Quality: ⭐⭐⭐⭐⭐ Clear motivation, rigorous methodological presentation, and well-defined experimental conclusions.
  • Value: ⭐⭐⭐⭐⭐ Fills a theoretical gap in multilayer graph learning, advancing both statistical frameworks and scalable designs.