Skip to content

I-CAM-UV: Integrating Causal Graphs over Non-Identical Variable Sets Using Causal Additive Models with Unobserved Variables

Conference: AAAI 2026 arXiv: 2603.03207 Code: Not released Area: Causal Inference Keywords: causal discovery, causal additive model, unobserved variables, non-identical variable sets, graph integration, combinatorial search

TL;DR

This paper proposes I-CAM-UV, a method that enumerates consistent DAGs satisfying structural constraints derived from multiple CAM-UV outputs over non-identical variable sets, recovering causal relations lost due to unobserved variables, and introduces an optimal-first search algorithm exploiting cost monotonicity for efficient combinatorial search.

Background & Motivation

  • Practical demand for causal discovery: Identifying causal relations from observational data is a fundamental scientific task; randomized controlled trials are often infeasible due to cost or ethical constraints, making observational causal discovery increasingly important.
  • Prevalence of multi-dataset scenarios: In practice, multiple datasets often share the same research targets but observe different variable subsets (e.g., different laboratories measuring different indicators), making their integration a key challenge.
  • Unobserved confounders: Even with a moderate number of variables (around 10), unobserved confounders remain common, yet most existing methods assume causal sufficiency.
  • Limitations of naive overlap approaches: Estimating causal graphs independently per dataset and overlapping them leads to missed true causal relations due to confounding by unobserved variables; moreover, some variable pairs are never jointly observed across any dataset.
  • Shortcomings of existing methods: ION/IOD/COmbINE output PAGs (containing uncertainty), and CD-MiNi supports only linear non-Gaussian models, neither handling nonlinear causal relations.
  • New opportunity via CAM-UV: CAM-UV identifies unobserved causal paths (UCPs) and unobserved backdoor paths (UBPs) arising from unobserved variables, providing structural consistency constraints that enable multi-dataset causal graph integration.

Method

Overall Architecture

Given \(m\) datasets \(V_1, \dots, V_m \subset V\) over non-identical variable sets, CAM-UV is first applied to each dataset to obtain a mixed graph \(G_k = (V_k, A_k, N_k)\) containing directed and undirected edges. I-CAM-UV then enumerates all DAGs satisfying consistency constraints. The core pipeline is: (1) overlay all directed edges to obtain \(\hat{G}\); (2) collect the set of undetermined edges \(E = E_{\text{imp}} \cup E_{\text{uno}}\) (edges unidentified due to UCP/UBP and variable pairs never jointly observed); (3) assign directions or exclude each edge in \(E\) to enumerate all consistent DAGs.

Key Design 1: Consistency Definition Based on UCP/UBP

  • Function: Defines structural consistency between a candidate DAG and CAM-UV outputs.
  • Mechanism: For each dataset \(V_k\), variable pairs in the identified set \(I_k\) should have neither UCP nor UBP between them in the candidate DAG, while variable pairs connected by undirected edges \(N_k\) must have either a UCP or UBP. The candidate DAG must satisfy consistency with all \(m\) datasets simultaneously.
  • Design Motivation: Undirected edges in CAM-UV precisely reflect the existence of unobserved paths, and exploiting this structural information constrains the candidate solution space. Theorem 1 proves that under ideal conditions (full variable coverage, error-free CAM-UV), the true causal graph is always consistent.

Key Design 2: Inconsistency Cost Relaxation

  • Function: Introduces an inconsistency cost \(C(\tilde{G})\) to relax the problem from exact consistency enumeration to approximate consistency enumeration.
  • Mechanism: The total number of variable pairs violating consistency across all datasets is used as the cost; all DAGs with inconsistency cost \(\leq C^* + b\) are enumerated, where \(C^*\) is the minimum cost and \(b\) is a user-specified parameter.
  • Design Motivation: In practice, CAM-UV may produce estimation errors (e.g., spurious undirected edges or missing directed edges), making strict consistency infeasible. The relaxation allows recovery of the true DAG even under estimation errors.

Key Design 3: Optimal-First Search Based on Monotonicity

  • Function: Designs an efficient combinatorial search algorithm to enumerate all DAGs satisfying the cost constraint.
  • Mechanism: A cost lower bound function \(\tilde{C}(t, \tilde{G})\) is defined and shown to be monotone during search (Theorem 2), enabling a priority queue (heap) to search in ascending cost order and terminate early once the dequeued state's cost exceeds \(C^* + b\).
  • Design Motivation: Exhaustively evaluating all \(3^{|E|}\) connection patterns is infeasible. Monotonicity enables substantial pruning of the search space. UCP/UBP existence can be determined via BFS in \(O(|A|)\) time, ensuring efficient evaluation of each search state.

Key Design 4: Polynomial-Time UCP/UBP Detection

  • Function: Designs fast sub-algorithms for detecting UCPs and UBPs.
  • Mechanism: For UCP, BFS is launched from \(v_i\) to check reachability to unobserved parents of \(v_j\); for UBP, ancestor sets \(S_i, S_j\) of the unobserved parents of \(v_i\) and \(v_j\) are computed, and \(S_i \cap S_j \neq \emptyset\) is checked.
  • Design Motivation: UCP/UBP existence must be evaluated repeatedly for each search state; \(O(|A|)\) detection ensures the overall time efficiency of the search.

Loss & Training

This paper presents a non-parametric method without neural network training. The overall pipeline is: (1) independently run the CAM-UV estimation algorithm (including regression analysis and independence testing) on each dataset; (2) collect the overlaid graph and undetermined edge set; (3) run combinatorial search to enumerate consistent DAGs. The core optimization objective is to minimize the inconsistency cost \(C(\tilde{G})\).

Key Experimental Results

Experimental Setup

100 synthetic datasets were generated using the Erdős–Rényi random graph model (10 variables, edge probability 0.3, nonlinear functions), constructing \(m \in \{2, 3\}\) sub-datasets with non-identical variable sets, each containing \(|U| \in \{3, 4\}\) unobserved variables and 1000 samples.

Table 1: F1 Score Comparison Across Methods and Settings

Method \((m{=}2, \|U\|{=}3)\) \((m{=}2, \|U\|{=}4)\) \((m{=}3, \|U\|{=}3)\) \((m{=}3, \|U\|{=}4)\)
I-CAM-UV Highest recall Highest recall Highest recall Highest recall
CAM-UV-OVL F1 close to I-CAM-UV F1 close to I-CAM-UV Below I-CAM-UV Below I-CAM-UV
PC-OVL Low Low Low Low
Imputation Low Low Low Low
CD-MiNi Low Low Low Low

I-CAM-UV significantly outperforms all competing methods in recall across all settings (including observed and unobserved variable pairs), though its precision is slightly lower than CAM-UV-OVL, with overall F1 scores being comparable between the two.

Table 2: Statistics on Number of Enumerated DAGs and Computation Time for I-CAM-UV

Metric \((m{=}2, \|U\|{=}3)\) \((m{=}2, \|U\|{=}4)\) \((m{=}3, \|U\|{=}3)\) \((m{=}3, \|U\|{=}4)\)
Median DAG count Small Moderate Moderate Large
Median enumeration time <1s Several seconds Several seconds Tens of seconds
Maximum enumeration time Outliers up to hundreds of seconds Outliers larger

The number of enumerated DAGs varies widely (from 1 to thousands), but the precision distribution shows a unimodal clustering pattern, indicating that most enumerated DAGs are of comparable quality. The optimal-first search yields acceptable computation time on most instances.

Highlights & Insights

  • Novel problem formulation: This is the first work to leverage UCP/UBP structural information from CAM-UV for causal graph integration over non-identical variable sets, enabling identification of causal relations—especially causal directions between variable pairs never jointly observed—that are entirely undetectable by naive overlay approaches.
  • Solid theoretical guarantees: The paper proves consistency of the true DAG under ideal conditions (Theorem 1), monotonicity of the cost function (Theorem 2), and correctness of the algorithm.
  • Elegant algorithm design: Optimal-first search combined with monotonicity-based pruning and polynomial-time UCP/UBP detection effectively reduces an exponential search space.
  • Strong practical relevance: Non-identical variable sets across multiple datasets is a common scenario in real-world scientific research.

Limitations & Future Work

  • The worst-case time complexity remains exponential \(O(3^{|E|})\), rendering the algorithm potentially infeasible for large variable counts or large \(|E|\).
  • Accuracy is highly dependent on the quality of the underlying CAM-UV estimates, whose theoretical completeness has not been fully established.
  • The method may output a large number of DAGs, making manual inspection impractical; effective compact representations or ranking mechanisms are lacking.
  • Validation is limited to synthetic data; experiments on real-world datasets are absent.
  • Comparison with a broader range of nonlinear causal discovery methods is not conducted.
  • Multi-dataset causal discovery: ION (Tillman et al. 2009), IOD (Tillman & Spirtes 2011), and COmbINE (Triantafillou et al. 2010) are constraint-based methods outputting PAGs rather than complete DAGs; CD-MiNi (Huang et al. 2020) is based on LiNGAM and handles only linear settings.
  • Causal additive models: CAM (Bühlmann et al. 2014) assumes nonlinear additive functional relationships; CAM-UV (Maeda & Shimizu 2021) extends CAM to allow unobserved variables; Pham et al. (2025) improve CAM-UV estimation accuracy.
  • Functional model methods: LiNGAM (Shimizu et al. 2006) addresses linear non-Gaussian settings; ANM (Hoyer et al. 2009) handles general additive noise models.

Rating

  • Novelty: ⭐⭐⭐⭐ — Leveraging UCP/UBP information from CAM-UV for multi-variable-set integration is an entirely new direction.
  • Experimental Thoroughness: ⭐⭐⭐ — Synthetic experiments are well-designed but lack real-world data validation.
  • Writing Quality: ⭐⭐⭐⭐ — Theoretical derivations are clear, examples are abundant, and algorithm descriptions are rigorous.
  • Value: ⭐⭐⭐⭐ — Addresses a practically meaningful open problem, though scalability remains to be verified.