Causal Bayesian networks via probability trees

October 31, 2020 — February 26, 2025

algebra
causal
graphical models
how do science
machine learning
measure
networks
probability
statistics
Figure 1

Many words here I don’t yet know so let’s fill in the LLM summary for now, which starts from (Ortega 2011, 2015).

TODO: relate to Shafer (1996).

1 Connection to Pearlian structural causal models

The relationship to Structural Causal Models (SCMs) is

  • Graphical representation: Like Judea Pearl’s DAG-based SCMs, probability trees encode causal dependencies through directed structures, though trees explicitly model sequential decision points rather than general networks.
  • Interventional logic: Both frameworks support counterfactual reasoning, but probability trees implement interventions through branch modifications (e.g., fixing node values) rather than graph surgery via Pearl’s do-operator.

The tree structure imposes natural identifiability through:

  1. Temporal ordering of nodes
  2. Restricted parent sets at each branching point This avoids needing external tools like do-calculus for many identification problems.

There is a deepmind demonstration notebook (Genewein et al. 2020).

2 Connection to mechanised causal graphs

Another obvious formalism in this domain is mechanised causal graphs.

Once again, I got an LLM to attempt to summarise the relationship between these two formalisms:

Both methodologies: 1. Extend traditional causal models by adding hierarchical elements to Pearlian SCMs 2. Enable policy-aware reasoning about decision-making agents 3. Support intervention analysis through modified graph operations

2.1 Key Distinctions

Aspect Probability Tree Models Mechanised Causal Graphs
Core innovation Temporal decomposition of causal pathways Mechanism/node distinction in SCMs
Intervention scope Branch-specific probability modifications Mechanism-level policy interventions
Variable types Single-class variables with temporal order Dual-class (object + mechanism) variables
Agent modeling Implicit through decision nodes Explicit policy variables (𝐷,𝑋)

3 References

Genewein, McGrath, Déletang, et al. 2020. Algorithms for Causal Reasoning in Probability Trees.”
Görgen. 2017. An algebraic characterisation of staged trees : their geometry and causal implications.”
Herlau. 2022. Probability Trees and the Value of a Single Intervention.”
Leonelli, and Varando. 2022. Structural Learning of Simple Staged Trees.”
———. 2023. Context-Specific Causal Discovery for Categorical Data Using Staged Trees.” In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics.
Ortega. 2011. Bayesian Causal Induction.” arXiv:1111.0708 [Cs, Stat].
———. 2015. Subjectivity, Bayesianism, and Causality.” Pattern Recognition Letters, Philosophical Aspects of Pattern Recognition,.
Park, Buchholz, Schölkopf, et al. 2023. A Measure-Theoretic Axiomatisation of Causality.” In Proceedings of the 37th International Conference on Neural Information Processing Systems. NIPS ’23.
Shafer. 1996. The Art of Causal Conjecture.
Varando, Carli, and Leonelli. 2024. Staged Trees and Asymmetry-Labeled DAGs.” Metrika.