Causal Bayesian networks via probability trees

October 31, 2020 — February 26, 2025

algebra

causal

graphical models

how do science

machine learning

measure

networks

probability

statistics

Suspiciously similar content

Many words here I don’t yet know so let’s fill in the LLM summary for now, which starts from (Ortega 2011, 2015).

TODO: relate to Shafer (1996).

1 Connection to Pearlian structural causal models

The relationship to Structural Causal Models (SCMs) is

Graphical representation: Like Judea Pearl’s DAG-based SCMs, probability trees encode causal dependencies through directed structures, though trees explicitly model sequential decision points rather than general networks.
Interventional logic: Both frameworks support counterfactual reasoning, but probability trees implement interventions through branch modifications (e.g., fixing node values) rather than graph surgery via Pearl’s do-operator.

The tree structure imposes natural identifiability through:

Temporal ordering of nodes
Restricted parent sets at each branching point This avoids needing external tools like do-calculus for many identification problems.

There is a deepmind demonstration notebook (Genewein et al. 2020).

2 Connection to mechanised causal graphs

Another obvious formalism in this domain is mechanised causal graphs.

Once again, I got an LLM to attempt to summarise the relationship between these two formalisms:

Both methodologies: 1. Extend traditional causal models by adding hierarchical elements to Pearlian SCMs 2. Enable policy-aware reasoning about decision-making agents 3. Support intervention analysis through modified graph operations

2.1 Key Distinctions

Aspect	Probability Tree Models	Mechanised Causal Graphs
Core innovation	Temporal decomposition of causal pathways	Mechanism/node distinction in SCMs
Intervention scope	Branch-specific probability modifications	Mechanism-level policy interventions
Variable types	Single-class variables with temporal order	Dual-class (object + mechanism) variables
Agent modeling	Implicit through decision nodes	Explicit policy variables (𝐷,𝑋)

3 References

Genewein, McGrath, Déletang, et al. 2020. “Algorithms for Causal Reasoning in Probability Trees.”

Görgen. 2017. “An algebraic characterisation of staged trees : their geometry and causal implications.”

Herlau. 2022. “Probability Trees and the Value of a Single Intervention.”

Leonelli, and Varando. 2022. “Structural Learning of Simple Staged Trees.”

———. 2023. “Context-Specific Causal Discovery for Categorical Data Using Staged Trees.” In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics.

Ortega. 2011. “Bayesian Causal Induction.” arXiv:1111.0708 [Cs, Stat].

———. 2015. “Subjectivity, Bayesianism, and Causality.” Pattern Recognition Letters, Philosophical Aspects of Pattern Recognition,.

Park, Buchholz, Schölkopf, et al. 2023. “A Measure-Theoretic Axiomatisation of Causality.” In Proceedings of the 37th International Conference on Neural Information Processing Systems. NIPS ’23.

Shafer. 1996. The Art of Causal Conjecture.

Varando, Carli, and Leonelli. 2024. “Staged Trees and Asymmetry-Labeled DAGs.” Metrika.