FDT was developed as an alternative to causal decision theory (CDT) and evidential decision theory (EDT), aiming to address perceived shortcomings in both theories when applied to certain decision problems and game-theoretic scenarios. It builds upon and formalizes concepts from Yudkowsky's earlier Timeless Decision Theory (TDT).
The core principle of Functional Decision Theory is that rational agents should conceptualize their decision-making process as implementing a mathematical function. Rather than asking "What should I do?" an FDT agent asks "What output from the function that I implement would lead to the best outcomes?"[4]
This approach differs fundamentally from traditional decision theories in how it conceptualizes the relationship between an agent's decision and outcomes:
Causal Decision Theory: recommends actions based on their direct causal consequences.[5] CDT agents ask "What will happen if I take this action?" focusing on the causal chain that flows from their decision.
Evidential Decision Theory: recommends the action the agent would most want to learn it is going to take.[6] EDT agents treat their choice as evidence about the state of the world and choose the action they would be most pleased to discover they had chosen.
Functional Decision Theory: recommends actions by treating the agent's decision as determining the output of all computationally similar processes.[1] FDT agents ask "What would happen if the mathematical function I implement returned this output?", considering logical rather than merely causal connections; the three decision rules are contrasted schematically below.
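The following sketch loosely follows the notation of Yudkowsky and Soares rather than reproducing their full formal definition: A stands for the agent's act, V for the agent's utility, and FDT(P, G) for the output of the agent's decision function given its beliefs P and the decision problem G. The "intervention" in the FDT rule is on the output of the decision function itself, a logical rather than a physical dependence.

```latex
\begin{align*}
\text{CDT:}\quad & \arg\max_{a}\ \mathbb{E}\bigl[V \mid \operatorname{do}(A = a)\bigr]\\
\text{EDT:}\quad & \arg\max_{a}\ \mathbb{E}\bigl[V \mid A = a\bigr]\\
\text{FDT:}\quad & \arg\max_{a}\ \mathbb{E}\bigl[V \mid \operatorname{do}\bigl(\mathrm{FDT}(P, G) = a\bigr)\bigr]
\end{align*}
```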
FDT is grounded in three main philosophical arguments:[1][7]
Precommitment. FDT proponents argue that rational agents should be willing to precommit to certain strategies when they know doing so will lead to better outcomes. FDT naturally incorporates this willingness to precommit without requiring separate justification.
Information value. Traditional decision theories can treat free information as harmful: in some decision problems a CDT or EDT agent would pay to avoid learning relevant facts, in tension with the principle of total evidence.[8] FDT avoids this by focusing on functional relationships rather than merely causal or evidential ones.
Utility. FDT is designed to maximize expected utility across a broader range of scenarios than competing theories, particularly in cases involving prediction, simulation, or strategic interaction with similar agents.
FDT builds upon and supersedes Yudkowsky's earlier Timeless Decision Theory (TDT), introduced in 2010.[9] Yudkowsky and Soares describe FDT as a replacement for TDT that gives a more formal and precise treatment of the same underlying intuitions.[1]
In Newcomb's problem, an agent faces two boxes: one transparent containing $1,000, and one opaque containing either $1,000,000 or nothing. A reliable predictor, who has made similar predictions in the past and has been correct 99% of the time, claims to have placed $1,000,000 in the opaque box if she predicted that the agent would leave the transparent box behind. The predictor has already made her prediction and left. The agent can take either just the opaque box or both boxes.[10]
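The four possible outcomes can be tabulated as follows (amounts as stated above):

```latex
\begin{array}{l|cc}
 & \text{Predicted one-boxing} & \text{Predicted two-boxing}\\
\hline
\text{Take only the opaque box} & \$1{,}000{,}000 & \$0\\
\text{Take both boxes} & \$1{,}001{,}000 & \$1{,}000
\end{array}
```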
CDT recommends taking both boxes because the opaque box's contents are already causally determined by the predictor's past action. Since the agent's current choice cannot causally influence what was already placed in the box, CDT reasons that taking both boxes always yields $1,000 more than taking only the opaque box, regardless of what's inside it.[11]
EDT recommends taking only the opaque box because the agent's choice serves as evidence about what the predictor likely placed in the box. Since the predictor is highly accurate, choosing to one-box is strong evidence that the predictor predicted this choice and therefore placed $1,000,000 in the opaque box. EDT asks: "What choice would I be happiest to learn that I made?" and concludes that learning one has one-boxed (and thus likely received $1,000,000) is preferable to learning one has two-boxed (and thus likely received only $1,000).
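With the 99% accuracy figure given above, the evidential expected values work out as follows (a direct calculation from the stated payoffs, not taken from the cited sources):

```latex
\begin{align*}
\mathbb{E}[\text{one-box}] &= 0.99 \times \$1{,}000{,}000 + 0.01 \times \$0 = \$990{,}000\\
\mathbb{E}[\text{two-box}] &= 0.99 \times \$1{,}000 + 0.01 \times \$1{,}001{,}000 = \$11{,}000
\end{align*}
```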
FDT recommends taking only the opaque box,[12] reasoning that the agent's decision-making process and the predictor's prediction process are functionally linked through computational similarity. FDT treats the question not as "What should I choose, given that the box contents are fixed?" but rather "What output from my decision function would lead to the best overall outcome?" Since the predictor bases her prediction on modeling the agent's decision function, choosing to one-box functionally determines that the predictor placed $1,000,000 in the box, while choosing to two-box functionally determines that she placed nothing.[1]
The key distinction is that while EDT relies on evidential correlation between choice and outcome, FDT posits a functional connection: the same computational process that determines the agent's choice also determines (through the predictor's modeling) what was placed in the box.
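The reasoning of the three theories in this problem can be made concrete with a short, illustrative calculation. The sketch below is not from the cited sources; the payoffs, the 99% accuracy figure, and the function names are assumptions chosen to mirror the setup above. It shows that CDT evaluates each act against a fixed belief about the prediction (and two-boxing dominates for every such belief), while EDT and FDT both arrive at one-boxing, though for the different reasons just described.

```python
# Illustrative sketch (not from the cited sources): expected payoffs for
# one-boxing vs. two-boxing in Newcomb's problem under CDT-, EDT-, and
# FDT-style reasoning, using the 99%-accurate predictor described above.

ACCURACY = 0.99            # probability the prediction matches the agent's actual choice
OPAQUE_PRIZE = 1_000_000   # placed in the opaque box iff one-boxing was predicted
TRANSPARENT_PRIZE = 1_000  # always in the transparent box

def payoff(choice: str, predicted: str) -> int:
    """Money received given the agent's choice and the predictor's prediction."""
    opaque = OPAQUE_PRIZE if predicted == "one-box" else 0
    return opaque + (TRANSPARENT_PRIZE if choice == "two-box" else 0)

def cdt_value(choice: str, p_predicted_one_box: float) -> float:
    """CDT: the prediction is already fixed and causally independent of the
    current choice, so each act is evaluated against a fixed belief about it."""
    return (p_predicted_one_box * payoff(choice, "one-box")
            + (1 - p_predicted_one_box) * payoff(choice, "two-box"))

def edt_value(choice: str) -> float:
    """EDT: the choice is evidence about the prediction, so condition on it."""
    other = "two-box" if choice == "one-box" else "one-box"
    return ACCURACY * payoff(choice, choice) + (1 - ACCURACY) * payoff(choice, other)

def fdt_value(choice: str) -> float:
    """FDT: the predictor runs a model of the same decision function, so fixing
    the function's output also fixes the prediction (up to the 1% model error).
    The numbers coincide with EDT here, but the dependence is treated as
    functional/subjunctive rather than merely evidential."""
    other = "two-box" if choice == "one-box" else "one-box"
    return ACCURACY * payoff(choice, choice) + (1 - ACCURACY) * payoff(choice, other)

if __name__ == "__main__":
    for p in (0.0, 0.5, 1.0):
        # Two-boxing yields exactly $1,000 more for every fixed belief p.
        print(f"CDT, P(predicted one-box)={p}:",
              {c: cdt_value(c, p) for c in ("one-box", "two-box")})
    print("EDT:", {c: edt_value(c) for c in ("one-box", "two-box")})
    print("FDT:", {c: fdt_value(c) for c in ("one-box", "two-box")})
```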
^ Yudkowsky, Eliezer; Soares, Nate (2017). "Functional Decision Theory: A New Theory of Instrumental Rationality". arXiv:1710.05060.
^ Good, I. J. (1967). "On the Principle of Total Evidence". The British Journal for the Philosophy of Science. 17 (4): 319–321. doi:10.1093/bjps/17.4.319.
^ Yudkowsky, Eliezer (2010). "Timeless Decision Theory" (PDF). Machine Intelligence Research Institute (previously known as the Singularity Institute).