In acausal trade, agents cooperate by each predicting what the other wants and doing it, even though they might have absolutely no way of every communicating or affecting each other, nor any direct evidence that the other exists.
- 1 Background: Super-rationality and the one-shot Prisoner's Dilemma
- 2 Description
- 3 Prediction mechanisms
- 4 Decision Theories
- 5 An example of acausal trade with simple resource requirements
- 6 Acausal trade with complex resource requirements
- 7 Ordinary trade
- 8 Relevance to Friendly AI
- 9 See also
- 10 References
Background: Super-rationality and the one-shot Prisoner's Dilemma
The concept of acausal trade emerged out of the much-debated question of how to achieve cooperation on a Prisoner's Dilemma, where, by design, the two players are not allowed to communicate. On the one hand, an player in the one-shot Prisoner's Dilemma considering the causal consequences of a decision finds that defection always produces a better result. On the other hand, if the other player symmetrically reasons this way, the result is an equilibrium of Defect/Defect, which is bad for both agents. If they can somehow converge on mutual cooperation, they will each do better, on their own individual utility measure, than the Defect/Defect equilibrium. The question is what decision theory allows this beneficial equilibrium in the one-shot Prisoner's Dilemma, rather than falling into the negative-sum defection equilibrium.
Douglas Hofstadter (see references) coined the term "super-rationality" to express this state of convergence. He illustrated it with a game in which twenty players, who do not know each other's identities, each get an offer. If exactly one player asks for the prize of a billion dollars, they get it, but if none or multiple players ask, no one gets it. The "correct" decision--the decision which maximizes utility for each player, if players symmetrically make the same decision--is to randomize a one-in-20 chance of asking for the prize. Players cannot communicate, but each can reason that the others are reasoning similarly. Hofstadter's insight was an important starting point for further investigation of acausal game theory.
Gary Drescher (see references) developed the concept further, introducing the term "acausal subjunctive morality" for an ethical system of behavior based on this mechanism.
Acausal trade goes one step beyond super-rationality. The agents do not need to be identical, nor do they need to have the same utility function for themselves. Moreover, the agents do not need to be told in advance what the other agents are like, or even if they exist. In acausal trade, an agent may have to surmise the potential existence of the other agent, and calculate probability estimates about what the other agent would want it to do.
We have two (or more) agents, possibly separated so that no interaction is possible. This can be simply because each is not aware of the location of the other; or is prevented from communicating with the other, or doing anything that will practically have an effect on the other. The agent may be in the other's future, and so unable to affect it.
In order to more clearly illustrate a situation in which interaction is absolutely impossible, other less prosaic scenarios are sometimes described: For example, the agents may be outside each other's light cones, or they may be in separate worlds in a multiverse. The Everett multiworld interpretation of quantum mechanics is sometimes used for this example, but is not necessary: we can talk of different counterfactual "impossible possible worlds," or simply talk about probability distributions that an agent has for different possibilities.
In acausal trade, each agent can do something the other wants, and values that thing less than the other does. This is the usual condition for trade. However, acausal trade can happen even if the two are completely separated and have no opportunity to affect each other causally. One example: An FAI whose utility function takes into account the well-being of all humans, including not only those in its present and its future but also those in its past. (Note that the seemingly exotic utility functions brought into the discussion, including this one, are actually not so outlandish: Many humans value the wellbeing of other humans, and are distressed at harm done to people in the past, even though they can do nothing to change this.)
In acausal trade, the agents cannot count on the usual enforcement mechanisms to ensure cooperation--there is no expectation of future interactions or an outside enforcer. The agents cooperate because each knows that the other can somehow predict its behavior very well, like Omega in Newcomb's problem. Each knows that if it defects (respectively: cooperates), the other will know this, and defect (respectively: cooperate), and so the best choice is to cooperate, since, as usual in trade, Cooperate/Cooperate is better for both sides than Defect/Defect.
Acausal trade can also be described in terms of precommitment: Both agents precommit to cooperate, and each has reason to think that the other is also precommitting. gain a probabalistic belief).
Acausal trade can occur with a variety of mechanisms for allowing the agents to conclude that the other exists, and to predict each other's behavior.
1. For the belief that the trading partner exists, we may have the simple case in which the agents are told this as part of the setup for the game. But more interesitng is the case in which the agent concludes that the other is likely to exist. There is no need for it to be certain of this: As with all beliefs, a certain subjective probability about the other's existence suffices.
A superintelligence might conclude that other superintelligences would exist in other worlds because superintelligence (powerful optimization ability) [Basic_AI_drives|is an attractor] for superintelligences. The algorithms will take into account that acausal trade is one of the tricks a good optimizer may well use to optimize its utility function, thus implying the likely existence of the superintelligent acausal trading partner.
To take a more prosaic example, a person might conclude that other people are in similar situations because there are mant humans with similar mental architectures, all of whom want similar things for themselves and many of whom face the same challenges and resource constraints. 2. Diffrent paths have been desribed for the agents to preduct each other's behavior:
- They might known each other's mental architectures (source code).
- In particular, they might know that they have identical or similar mental architecture, so that the each one knows that its own mental processes approximately simulate the other's. See Gary Drescher's "acausal subjunctive cooperation."
- They might be able to simulate each other, or to predict the other's behavior analytically. The simulation may be approximate, and the predictions probabilistic, to avoid tractability problems of simulating something of the same complexity of oneself. (Simulation in this sense does not require full knowlege of the other agent's source code and the ability to run the other's source code in precise simulation: Even we humans simulate each other's thoughts, imprecisely, to guess what the other would do.)
- More broadly, acausal trade may be possible for two agents that know nothing at all about each other, except that the other is an extremely powerful superintelligence. Seen mathematically, this is just an optimization problem seeking the best possible algorithm for an agent's utility function. An optimum may be reached with algorithms in which each intelligence sacrifices the opportunity to generate some utility for itself, and instead generates utility for the other, while the other symmetrically does the same. In this case, any other would be suboptimal, both sides would know that, and so neither would deviate from the optimum. (If, counterfactually, one agent were to find it best to do so, the other would conclude that this was the case and so defect, which would reduce the first one's utility.)
The reasoning behind acausal trade resembles that behind recently developed decision theories, such as Updateless decision theory and Timeless decision theory. Like UDT and TDT, acausal trade takes into account reflexive considerations about the algorithms of the agent and of other players, unlike better-known forms of Decision theory, such as Causal decision theory (as well as Evidential Decision Theory). In CDT, the agent's algorithm (implementation) is treated as uncaused by the rest of the universe, so that it only influences the universe through its decision and subsequent action. In contrast, in UDT and TDT, the agents' own algorithms are treated as causal nodes, influenced by other factors, such as the logical requirement of optimality in a utility-function maximizer which "causes" them to make certain choices. In these theories and in acausal trade, the agent cannot escape the fact that its decision to defect or cooperate constitutes strong Bayesian evidence as to what the other agent will do, and so it is better off cooperating in the acausal trade.
Objections to acausal trade resemble the objection to UDT: Why shouldn't the agent choose to defect or to not go through on its side of the acausal trade? Whatever the considerations for cooperation, it can "at the last moment" choose to cheat and get a better result. However, this takes into account only the direct effect of the the decision. It's important to take into account that the agent's decision itself is not severed from the aspects of reality other than the direct effect of the decision. The decision is the result of other factors, namely the algorithm, and the algorithm itself is influenced by the utility function. However the algorithm was chosen, designed, or evolved, clearly it is one that does better at optimizing this utility function than some other arbitrary function. The decision thus allows reverse causal inference about the algorithm and the utility function.
An example of acausal trade with simple resource requirements
At its most abstract, the agents in this model are simply optimization algorithms, and agents can run each other as subalgorithms if that is useful in improving optimization power. Let T be a utility function for which time is most valuable as a resource; while for utility function S, space is most valuable.
In choosing the best algorithms for T and S, we must deal with subjective uncertainty as to the environments in which they will operate. For our toy example, there is some probability that the algorithm will be run in an environment where time is in abundance and some probability that it will be run in a space-rich universe.
(This uncertainty is sometimes expressed in terms of multiverse theory, but only requires ordinary subjective uncertainty as is typical in decision theory, i.e., that the algorithms should be able to optimize in either kind of environment, weighting each environment by its probability.)
If the algorithm for T is instantiated in a space-rich environment, it will only be able to gain a small amount of utility for itself, but S would be able to gain a lot of utility; and vice versa.
The question is what algorithm for T, and respectively for S, provides the most optimization power, the highest expected value given the probability distributions on the environment.
The algorithm for T can prove about S's algorithm that it will "do a favor" for the utility function T, i.e., that S's algorithm will run T's algorithm as a subalgorithm, if S's algorithm finds itself instantiated in a time-rich environment. Symmetric statements can be made reversing S and T.
The algorithms also estimate that an agent seeking the other utility function, and ready to acausally trade, is a strong possibility and likely to exist in other possibly universes.
Acausal trade with complex resource requirements
In the toy example above, resource requirements are very simple: Time and space. In general, given that agents can have complex and arbitrary goals, as humans do humans, an agent will not be able to conclude that a specific trading partner exists and will acausally cooperate.
However, an agent can analyze the distribution of probabilities for the existence of other agents, and weight its actions accordingingly, so that it has a higher probability of doing the "favor" for a more-likely acausal trading partner. (Alternatively, this can be described as variants of an agent across a multiverse doing different "favors" for different acausal trading partners.) The goods given and received come into rough balance, so long as the probabalistic sampling reflects an accurate distribution, as will happen in the limiting case of increasing super-intelligence.
Ordinary trade is usually analyzed causally: The two parties can talk, exchange information, threaten retribution, promise payment, and bind themselves legally. When the trade is carried out as promised, or else one or more parties defect, they can give rewards or carry out threats.
Nonetheless, from the perspective of Updateless decision theory and related acausal theories, even ordinary trade can be analyzed acausally. Using UDT, an agent's optimal choice is to analyze a timeless outside view of the full range of possibile algorithm, taking into account the full causal network, including all threats and promises, and choose the algorithm that gives the best results. The principles of acausal trade can be applied even if interaction is possible. If threats and promises, punishments and rewards, are involved, they are simply part of the utility function that is being optimized.
And whether or not the agent actually uses UDT, we can analyze the reasoning behind trade using a timeless, acausal perspective: Optimal agents are simply choosing the best algorithm, and the best algorithm often involves trade, with threats and promises simply entering into the utility calculation.
Relevance to Friendly AI
Acausal trade may be a way to get the cooperation of a future AI. If we know that
- the AI would want us to behave a certain way,
- and we can prove that it will do good things for us, once it arises, if we do what it wants now,
- and that it can prove the symmetrical statement -- i.e., that we do what it wants, if we've proven this behavior about it --
then we can trade with it, even though it does not yet exist.
This approach rests on being able to prove certain facts about human behavior; or to put it another way, for humans to be able to commit to behavior.
- "AI deterrence"
- "The AI in a box boxes you"
- A story that shows acausal trade in action.
- "Hail Mary, Value Porosity, and Utility Diversification," Nick Bostrom, the first paper from academia to rely on the concept of acausal trade.
- Towards an idealized decision theory, by Nate Soares and Benja Fallenstein discusses acausal interaction scenarios that shed light on new directions in decision theory.
- Hofstadter's Superrationality essays, published in Metamagical Themas (LW discussion)
- Alexander Kruel, The Acausal Trade Argument.
- Jaan Tallinn, Why Now? A Quest in Metaphysics.
- Gary Drescher, Good and Real, MIT Press, 1996.