Metamagical Themas, page 99
At first, you're disappointed, but then a flash crosses your mind: "Maybe I can at least get a first-hand report about Zilenko Buznani's playing from Jane. Since she and I hear everything through. virtually the same ears, it would be almost as good as my going if she would go." This is comforting for a moment, until it occurs to you that something is wrong here. For the same reasons as you do, Jane will insist on hearing Concert A. After all, she loves music in the same way as you do-that's precisely why you wish she would tell you about Concert B! The more you feel Jane's taste is the same as yours, the more you wish she would go to the other concert, so that you could know what it was like to have gone to it. But the more her taste is the same is yours, the less she will want to go to it!
The two of you are tied together by a bond of common taste. And if it turns out that you are different enough in taste to disagree about which concert is better, then that will tend to make you lose interest in what she might report, since you no longer can trust her opinion as that of someone who hears music "through your ears". In other words, hoping she'll choose Concert B is pointless, since it undermines your reasons for caring which concert she chooses!
The analogy is clear, I hope. Choosing D undermines your reasons for doing so. To the extent that all of you really are rational thinkers, you really will think in the same tracks. And my letter was supposed to establish beyond doubt the notion that you are all "in synch"; that is, to ensure that you can depend on the others' thoughts to be rational, which is all you need.
Well, not quite. You need to depend not just on their being rational, but on their depending on everyone else to be rational, and on their depending on everyone to depend on everyone to be rational-and so on. A group of reasoners in this relationship to each other I call superrational. Superrational thinkers, by recursive definition, include in their calculations the fact that they are in a group of superrational thinkers. In this way, they resemble elementary particles that are renormalized.
A renormalized electron's style of interacting with, say, a renormalized photon takes into account that the photon's quantum-mechanical structure includes "virtual electrons" and that the electron's quantum-mechanical structure includes "virtual photons"; moreover it takes into account that all these virtual particles (themselves renormalized) also interact with one another. An infinite cascade of possibilities ensues but is taken into account in one fell swoop by nature. Similarly, superrationality, or renormalized reasoning, involves seeing all the consequences of the fact that other renormalized reasoners are involved in the same situation-and doing so in a finite swoop rather than succumbing to an infinite regress of reasoning about reasoning about reasoning ...
* * *
`C' is the answer I was hoping to receive from everyone. I was not so optimistic as to believe that literally everyone would arrive at this conclusion, but I expected a majority would-thus my dismay when the early returns strongly favored defecting. As more phone calls came in, I did receive some C's, but for the wrong reasons. Dan Dennett cooperated, saying, "I would rather be the person who bought the Brooklyn Bridge than the person who sold it. Similarly, I'd feel better spending $3 gained by cooperating than $10 gained by defecting."
Charles Brenner, who I'd figured to be a sure-fire D, took me by surprise and C'd. When I asked him why, he candidly replied, "Because I don't want to go on record in an international journal as a defector." Very well. Know, World, that Charles Brenner is a cooperator!
Many people flirted with the idea that everybody would think "about the same", but did not take it seriously enough. Scott Buresh confided to me: "It was not an easy choice. I found myself in an oscillation mode: back and forth. I made an assumption: that everybody went through the same mental processes I went through. Now I personally found myself wanting to cooperate roughly one third of the time. Based on that figure and the assumption that I was typical, I figured about one third of the people would cooperate. So I computed how much I stood to make in a field where six or seven people cooperate. It came out that if I were a D, I'd get about three times as much as if I were a C. So I'd have to defect. Water seeks out its own level, and I sank to the lower righthand corner of the matrix." At this point, I told Scott that so far, a substantial majority had defected. He reacted swiftly: "Those rats-how can they all defect? It makes me so mad! I'm really disappointed in your friends, Doug."
So was I, when the final results were in: Fourteen people had defected and six had cooperated-exactly what the networks would have predicted! Defectors thus received $43 while cooperators got $15. I wonder what Dorothy's saying to Peter about now? I bet she's chuckling and saying, "I told you I'd do better this way, didn't l?" Ah, me ... What can you do with people like that?
A striking aspect of Scott Buresh's answer is that, in effect, he treated his own brain as a simulation of other people's brains and ran the simulation enough to get a sense of what a "typical person" would do. This is very much in the spirit of my letter. Having assessed what the statistics are likely to be, Scott then did a cool-headed calculation to maximize his profit, based on the assumption of six or seven cooperators. Of course, it came out in favor of defecting. In fact, it would have, no matter what the number of cooperators was! Any such calculation will always come out in favor of defecting. As long as you feel your decision is independent of others' decisions, you should defect. What Scott failed to tike into account was that cool-headed calculating people should take into account that cool-headed calculating people should take into account that cool-headed calculating people should take into account that ...
This sounds awfully hard to take into account in a finite way, but actually it's the easiest thing in the world. All it means is that all these heavy-duty rational thinkers are going to see that they are in a symmetric situation, so that whatever reason dictates to one, it will dictate to all. From that point on, the process is very simple. Which is better for an individual if it is a universal choice: C or D? That's all.
* * *
Actually, it's not quite all, for I've swept one possibility under the rug: maybe throwing a die could be better than making a deterministic choice. Like Chris Morgan, you might think the best thing to do is to choose C with probability p and D with probability 1-p. Chris arbitrarily let p be 1/2, but it could be any number between 0 and 1, where the two extremes represent Ding and C'ing respectively. What value of p would be chosen by superrational players? It is easy to figure out in a two-person Prisoner's Dilemma, where you assume that both players use the same value of p. The expected earnings for each, as a function of p, come out to be I+ 3p -p2, which grows monotonically as p increases from 0 to 1. Therefore, the optimum value ofp is 1, meaning certain cooperation. In the case of more players, the computations get more complex but the answer doesn't change: the expectation is always maximal when p equals 1. Thus this approach confirms the earlier one, which didn't entertain probabilistic strategies. - Rolling a die to determine what you'll do didn't add anything new to the standard Prisoner's Dilemma, but what about the modified-matrix version I gave in the P. S. to my letter? I'll let you figure that one out for yourself. And what about the Platonia Dilemma? There, two things are very clear: (1) if you decide not to send a telegram, your chances of winning are zero; (2) if everyone sends a telegram, your chances of winning are zero. If you believe that what you choose will be the same as what everyone else chooses because you are all superrational, then neither of these alternatives is very appealing. With dice, however, a new option presents itself to roll a die with probability p of coming up "good" and then to send in your name if and only if "good" comes up.
Now imagine twenty people all doing this, and figure out what value of p maximizes the likelihood of exactly one person getting the go-ahead. It turns out that it is p = 1/20, or more generally, p=1/N where N is the number of participants. In the limit where N approaches infinity, the chance that exactly one person will get the go-ahead is 1/e, which is just-under 37 percent. With twenty superrational players all throwing icosahedral dice, the chance that you will come up the big winner is very close to 1/(20e), which is a little below two percent. That's not at all bad! Certainly it's a lot better than zero percent.
The objection many people raise is: "What if my roll comes up bad? Then why shouldn't I send in my name anyway? After all, if I fail to, I'll have no chance whatsoever of winning. I'm no better off than if I had never rolled my die and had just voluntarily withdrawn!" This objection seems overwhelming at first, but actually it is fallacious, being based on a misrepresentation of the meaning of "making a decision". A genuine decision to abide by the throw of a die means that you really must abide by the throw of the die; if under certain circumstances you ignore the die and do something else, then you never made the decision you claimed to have made. Your decision is revealed by your actions, not by your words before acting!
If you like the idea of rolling a die but fear that your will power may not be up to resisting the temptation to defect, imagine a third "Policansky button": this one says `R' for "Roll", and if you press it, it rolls a die (perhaps simulated) and then instantly and irrevocably either sends your name or does not, depending on which way the die came up. This way you are never allowed to go back on your decision after the die is cast. Pushing that button is making a genuine decision to abide by the roll of a die. It would be easier on any ordinary human to be thus shielded from the temptation, but any superrational player would have no trouble holding back after a bad roll.
* * *
This talk of holding back in the face of strong temptation brings me to the climax of this column: the announcement of a Luring Lottery open to all readers and nonreaders of Scientific American. The prize of this lottery is $1,000,000/N, where N is the number of entries submitted. Just think: If you are the only entrant (and if you submit only one entry), a cool million is yours! Perhaps, though, you doubt this will come about. It does seem a trifle iffy. If you'd like to increase your chances of winning, you are encouraged to send in multiple entries-no limit! Just send in one postcard per entry. If you send in 100 entries, you'll have 100 times the chance of some poor slob who sends in just one. Come to think of it, why should you have to send in multiple entries separately? Just send one postcard with your name and address and a positive integer (telling how many entries you're making) to:
Luring Lottery
c/o Scientific American
415 Madison Avenue
New York, N.Y. 10017
You will be given the same chance of winning as if you had sent in that number of postcards with `1' written on them. Illegible, incoherent, ill-specified, or incomprehensible entries will be disqualified. Only entries received by midnight June 30, 1983 will be considered. Good luck to you (but certainly not to any-other reader of this column)!
Post Scriptum.
The emotions churned up by the Prisoner's Dilemma are among the strongest I have ever encountered, and for good reason. Not only is it a wonderful intellectual puzzle, akin to some of the most famous paradoxes of all time, but also it captures in a powerful and pithy way the essence of a myriad deep and disturbing situations that we are familiar with from life. Some are choices we make every day; others are the kind of agonizing choices that we all occasionally muse about but hope the world will never make us face.
My friend Bob Wolf, a mathematician whose specialty is logic, adamantly advocated choosing D in the case of the letters I sent out. To defend his choice, he began by saying that it was clearly "a paradox with no rational solution", and thus there was no way to know what people would do. Then he said, "Therefore, I will choose D. I do better that way than any other way." I protested strenuously: "How dare you say `therefore' when you've just gotten through describing this situation as a paradox and claiming there is no rational answer? How dare you say logic is forcing an answer down your throat, when the premise of your `logic' is that there is no logical answer?" I never got what I considered a satisfactory answer from Bob, although neither of us could budge the other. However, I did finally get some insight into Bob's vision when he, pushed hard by my probing, invented a situation with a new twist to it, which I call "Wolf's Dilemma".
Imagine that twenty people are selected from your high school graduation class, you among them. You don't know which others have been selected, and you are told they are scattered all over the country. All you know is that they are all connected to a central computer. Each of you is in a little cubicle, seated on a chair and facing one button on an otherwise blank wall. You are given ten minutes to decide whether or not to push your button. At the end of that time, a light will go on for ten seconds, and while it is on, you may either push or refrain from pushing. All the responses will then go to the central computer, and one minute later, they will result in consequences. Fortunately, the consequences can only be good. If you pushed your button, you will get $100, no strings attached, emerging from a small slot below the button. If nobody pushed their button, then everybody will get $1,000. But if there was even a single button-pusher, the refrainers will get nothing at all.
Bob asked me what I would do. Unhesitatingly, I said, "Of course I would not push the button. It's obvious!" To my amazement, though, Bob said he'd push the button with no qualms. I said, "What if you knew your co-players were all logicians?" He said that would make no difference to him. Whereas I gave credit to everybody for being able to see that it was to everyone's advantage to refrain, Bob did not. Or at least he expected that there is enough "flakiness" in people that he would prefer not to rely on the rationality of nineteen other people. But of course in assuming the flakiness of others, he would be his own best example-ruining everyone else's chances of getting $1,000.
What bothered me about Wolf's Dilemma was what I have come to call reverberant doubt. Suppose you are wondering what to do. At first it's obvious that everybody should avoid pushing their button. But you do realize that among twenty people, there might be one who is slightly hesitant and who might waver a bit. This fact is enough to worry you a tiny bit, and thus to make you waver, ever so slightly. But suddenly you realize that if you are wavering, even just a tiny bit, then most likely everyone is wavering a tiny bit. And that's considerably worse than what you'd thought at first-namely, that just one person might be wavering. Uh-oh! Now that you can imagine that everybody is at least contemplating pushing their button, the situation seems a lot more serious. In fact, now it seems quite probable that at least one person will push their button. But if that's the case, then pushing your own button seems the only sensible thing to do. As you catch yourself thinking this thought, you realize it must be the same as everyone else's thought. At this point, it becomes plausible that the majority of participants -possibly even all-will push their button! This clinches it for you, and so you decide to push yours.
Isn't this an amazing and disturbing slide from certain restraint to certain pushing? It is a cascade, a stampede, in which the tiniest flicker of a doubt has become amplified into the gravest avalanche of doubt. That's what I mean by "reverberant doubt". And one of the annoying things about it is that the brighter you are, the more quickly and clearly you see what there is to fear. A bunch of amiable slowpokes might well be more likely to unanimously refrain and get the big payoff than a bunch of razor-sharp logicians who all think perversely recursively reverberantly. It's that "smartness" to see that initial flicker of a doubt that triggers the whole avalanche and sends rationality a-tumblin' into-the abyss. So, dear reader . . . if you push that button in front of you, do you thereby lose $900 or do you thereby gain $100?
* * *
Wolf's Dilemma is not the same as the Prisoner's Dilemma. In the Prisoner's Dilemma, pressure towards defection springs from hope for asymmetry (i.e., hope that the other player might be dumber than you and thus make the opposite choice) whereas in Wolf's Dilemma, pressure towards button-pushing springs from fear of asymmetry (i.e., fear that the other player might be dumber than you and thus make the opposite choice). This difference shows up clearly in the games' payoff matrices for the two-person case (compare Figure 30-lb with Figure 29-1c). In the Prisoner's Dilemma, the temptation T is greater than the reward R (5 > 3), whereas in Wolf's Dilemma, R is greater than T (1,000 > 100).
Bob Wolf's choice in his own dilemma revealed to me something about his basic assessment of people and their reliability (or lack thereof). Since his adamant decision to be a button-pusher even in this case stunned me, I decided to explore that cynicism a bit more, and came up with this modified Wolf's Dilemma.
Imagine, as before, that twenty people have been selected from your high school graduation class, and are escorted to small cubicles with one button on the wall. This time, however, each of you is strapped into a chair, and a device containing a revolver is attached to your head. Like it or not, you are now going to play Russian roulette, the odds of your death to be determined by your choice. For anybody who pushes their button, the odds of survival will be set at 90 percent-only one chance in ten of dying. Not too bad, but given that there are twenty of you, it means that almost certainly one or two of you will die, possibly more. And what happens to the refrainers? It all depends on how many of them there are. Let's say there are N refrainers. For each one of them, their chance of being shot will be one in N2. For instance, if five people don't push, each of them will have only a 1/25 chance of dying. If ten people refrain, they will each get a 99 percent chance of survival. The bad cases are, of course, when nearly everybody pushes their button ("playing it safe", so to speak), leaving the refrainers in a tiny minority of three, two, or even one. If you're the sole refrainer, it's curtains for you-one chance in one of your death. Bye-bye! For two refrainers, it's one chance in four for each one. That means there's nearly a 50 percent chance that at least one of the two will perish.

