Multi-agent causality
Game theory and decision theory for lots of interacting agents
March 9, 2025 — March 9, 2025
Notes on decision theory and causality where agents make decisions, in the context of iterated games in multi-agent systems, with applications to AI safety.
Extending causal DAGs to include agents and decisions.
0.1 Multi-agent graphs
There seems to be a long series of works attempting this (Heckerman and Shachter 1994; Dawid 2002; Koller and Milch 2003). I am working from Hammond et al. (2023) and MacDermott, Everitt, and Belardinelli (2023), which introduce the One Ring that unifies them all, in the form of something called a Mechanised Multi-Agent Influence Diagram, a.k.a. an MMAID.
Cf. Liu et al. (2024).
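To make "causal DAG plus agents and decisions" concrete for myself, here is a minimal sketch of a plain multi-agent influence diagram as a data structure: chance, decision and utility nodes, where decision and utility nodes belong to an agent. The class name `MAID`, its methods, and the two-agent example are my own hypothetical illustration, not an API from any of the cited papers, and I have left out the mechanism nodes that make the diagram "mechanised".

```python
from dataclasses import dataclass, field
from graphlib import CycleError, TopologicalSorter  # Python 3.9+


@dataclass
class MAID:
    """Hypothetical minimal multi-agent influence diagram (no mechanism nodes)."""

    kinds: dict = field(default_factory=dict)    # node -> "chance" | "decision" | "utility"
    owner: dict = field(default_factory=dict)    # decision/utility node -> agent
    parents: dict = field(default_factory=dict)  # node -> set of parent nodes

    def add_node(self, name, kind, agent=None, parents=()):
        if kind not in {"chance", "decision", "utility"}:
            raise ValueError(f"unknown node kind: {kind}")
        if kind != "chance" and agent is None:
            raise ValueError("decision and utility nodes need an owning agent")
        self.kinds[name] = kind
        if agent is not None:
            self.owner[name] = agent
        self.parents[name] = set(parents)

    def order(self):
        """Return nodes with parents first; raises CycleError if not a DAG."""
        return list(TopologicalSorter(self.parents).static_order())

    def decisions_of(self, agent):
        """Decision nodes controlled by the given agent, sorted by name."""
        return sorted(
            n for n, k in self.kinds.items()
            if k == "decision" and self.owner[n] == agent
        )


# Assumed toy example: alice observes X and acts, bob observes alice's action.
maid = MAID()
maid.add_node("X", "chance")
maid.add_node("D1", "decision", agent="alice", parents=["X"])
maid.add_node("D2", "decision", agent="bob", parents=["D1"])
maid.add_node("U1", "utility", agent="alice", parents=["D1", "D2"])
maid.add_node("U2", "utility", agent="bob", parents=["X", "D1", "D2"])

print(maid.order())
print(maid.decisions_of("alice"))
```

The only structural check here is acyclicity, via the standard-library `graphlib`. Anything about strategies, equilibria or mechanism variables would need to sit on top of this.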
1 Commitment races
Commitment races are important in international relations. They also seem popular in AI safety theory, although I’m not sure why, since I don’t understand how AIs can credibly commit to anything. Setting up credible signals of commitment seems difficult, and probably exceptional, for very opaque systems.
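For intuition about why anyone would race to commit, here is a toy sketch using the game of Chicken. The payoff numbers and function names are hypothetical, chosen only for illustration, and the sketch simply assumes that a commitment is credible and observed by the other player, which is exactly the part I am doubtful about above.

```python
# Hypothetical Chicken payoffs: (row player's payoff, column player's payoff).
PAYOFFS = {
    ("swerve", "swerve"): (0, 0),
    ("swerve", "dare"): (-1, 1),
    ("dare", "swerve"): (1, -1),
    ("dare", "dare"): (-10, -10),
}
ACTIONS = ("swerve", "dare")


def best_response_col(row_action):
    """Column player's best reply to a row action it knows is fixed."""
    return max(ACTIONS, key=lambda a: PAYOFFS[(row_action, a)][1])


def outcome_if_row_commits(row_action):
    """Payoffs when row commits first and column best-responds."""
    return PAYOFFS[(row_action, best_response_col(row_action))]


# Committing first to "dare" forces the other player to swerve.
print(outcome_if_row_commits("dare"))    # (1, -1)
print(outcome_if_row_commits("swerve"))  # (-1, 1)

# If both players commit to "dare" before seeing the other's commitment,
# nobody gets to best-respond and both crash.
print(PAYOFFS[("dare", "dare")])         # (-10, -10)
```

Under these assumed payoffs, whoever commits first does best, so both players want to commit as early as possible, and simultaneous commitment gives the worst outcome for everyone.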