Phil 101: Other Minds and the Turing Test

The next question we want to discuss is how we know whether various creatures other than ourselves (other adult humans, animals like chimps, dolphins, octopuses, and spiders, extraterrestrials, and machines/AIs) think, feel, and enjoy other mental states. Some preliminary points before we begin:

  1. In each of these cases, our aim is not to find a way to prove with certainty that they have mental states. It’s unlikely we can even prove with certainty that other adult humans exist, much less that they have the mental states we think they do. Our goal is the more modest one of determining whether there are at least reasonable grounds for thinking that different creatures on this list have mental states.

    Also, it’s one thing to figure out what we do have good reasons to think, and another thing to figure out how to answer philosophical challenges from skeptics about how it could be possible to have such reasons. If this were an epistemology class, we would spend time on the second issue, but it’s not. I take it we agree we do have good reasons to think other adult humans have thoughts and feelings, even if we haven’t yet figured out what’s the best philosophical response to give to skeptics who challenge those reasons.

  2. Our aim here is also not to settle whether creatures on the list have exactly all of the same kinds of mental states that we do. Perhaps we don’t have all the same kinds of mental states as each other: maybe I see colors and hear music and taste food somewhat differently than you do. If dolphins and octopuses have mentality, it wouldn’t be surprising if some kinds of experiences or feelings they have are completely alien to human experience, and vice versa. This point is especially important to keep in mind when we think about the mentality of extraterrestrials and machines/AIs.

    It may be an interesting question whether there are fundamental differences between human thought and feeling, on the one hand, and whatever mental states machines/AIs will ever be capable of having, on the other. But that’s not what we want to focus on. Our question is rather: Will machines/AIs ever be able to genuinely think or feel at all? Will they ever really be able to have genuine perspectives, opinions, preferences, self-consciousness, a real mental life — even if it may be different in some ways from our own?

    For any disanalogy you think there is between humans and machines — whenever you’re tempted to say “Machines will never be able to X” — ask yourself:

    1. Is having feature X something that’s really necessary, if you’re to genuinely think or feel at all? Or is it just an idiosyncrasy of the way we humans happen to think?
    2. Would it be in principle impossible for a machine to be programmed to have feature X? Why? Why would it be harder to program a machine to have X than to program it to do other things?
    3. Why do you think you have better reason to believe other adult humans have feature X than you could ever have that a machine has it?
  3. Considerations that seem relevant to whether other creatures have mentality divide into facts about their behavior and facts about their physical makeup. As we move further down our list of candidates, the physical makeup diverges more and more from our own. It’s not clear how important a role this should play in our thinking, though. Some aspects of our own physical makeup — our eye color, whether we’re right- or left-handed, how healthy our legs are, what we look like — we take to have little to do with what kinds of thoughts and feelings we have. (Even if it did take us unconscionably long to acknowledge this with respect to the last aspect.) What persuaded us that these aspects of our makeup matter less to what thoughts and feelings we’re capable of having than how healthy our brains are? Presumably it was facts about behavior that led us to think that. People with different eye colors, or differently functioning legs, can still behave in ways that make it seem they’re planning, reasoning, reflecting on their decisions and capacities, avoiding some stimuli and seeking out others. So presumably these kinds of behavioral considerations should take the lead when we’re trying to figure out how good our reasons are to attribute mentality to other kinds of creatures — even if their physical makeup is very different from our own.

    One notable kind of behavior adult humans engage in is using language. Even if we don’t always understand someone else’s language, we can often tell that they’re using one, and this plays an important role in our willingness to count them as thinking and reasoning in ways akin to ourselves. With animals, this is where their intelligent behavior differs most from our own. Chimps, dolphins, some birds, and other animals have some kinds of language-like communication. But there are theoretically important differences between what they do and what humans do. With some kinds of extraterrestrials and with machines/AIs there may be fewer differences on this score. (With machines/AIs, though, some will argue that they can consume and produce language but never really understand it.)

  4. On the face of it, the issues we’re considering now are independent of what stand you take on the materialism/dualism debate. Whether you think mentality is a matter of what’s happening physically or what’s happening in a soul, it seems open to you to take any stance about which creatures on our list have mentality and which don’t. If you’re a dualist, then the question whether spiders have mentality is a question of whether spiders have souls. So too with extraterrestrials and machines/AIs. I don’t know how to figure out whether a spider has a soul, but I don’t know how to figure out whether you do, either. If you can have a soul, then for all I know, perhaps a spider can too, or a machine/AI. When we make human babies in the usual way, on the dualist picture somehow they end up with souls. Perhaps if we make human babies by genetic engineering, they’ll end up with souls too. Perhaps when we program AIs they’ll end up with souls too. I don’t know how it works. I don’t think the dualists do either.

  5. Some interesting issues come up in the Leiber dialogue that aren’t central to the discussion we’ll be having now. One of these is his list of three questions: (a) do the chimp and/or AI have thoughts and feelings; (b) if so, are their thoughts/feelings enough to make them persons — that is, creatures with certain kinds of rights, protections, responsibilities to others; and (c) if they are persons, do they have a right to not have the space station shut down? Our discussion will be focused only on the first of these.

    Another idea that comes up in that dialogue is the Buddhist idea that nothing is a person, that the division between a self and the rest of reality is an illusion or a matter of perspective. We may encounter this idea again later in the course, but we’re not going to be able to discuss or think about it except in passing. It’s an interesting view, one that some academic philosophical discussions engage with seriously. But it’s not going to play a central role in the discussion we’re exploring.

The rest of these notes will focus on the question whether we can ever have good reasons to attribute mentality to machines/AIs.

The Turing Test

In his 1950 article “Computing Machinery and Intelligence,” the mathematician and logician Alan Turing puts forward a test for determining whether machines are genuinely thinking. The test works like this. A human judge carries on remote, typed conversations with a machine and with another human, and has to guess which is the machine and which the human. If the machine is often able to fool the human judge into thinking it’s human, then that machine passes the test, and Turing claims we should regard it as genuinely thinking, and as having genuine intelligence. This has come to be known as the Turing Test for machine intelligence.

Note that Turing is only claiming that passing his test is enough for being intelligent, or for reasonably being counted as intelligent. Turing’s test may be very hard; it may set the bar too high. Perhaps chimps are intelligent, though they can’t pass the test. Perhaps someday there really will be intelligent machines that nonetheless aren’t able to pass Turing’s Test. Turing acknowledges this; he doesn’t want to say that being able to pass his test is a necessary condition for being intelligent. He’s only saying that the machines which are able to pass his test are intelligent, or should reasonably be counted as such.

Naive Judges and Simple Chatbots

Turing doesn’t say very much about who’s supposed to be judging these tests. But that’s important, because it’s very easy to fool computer neophytes into thinking that some program is really intelligent, even if the program is in fact totally stupid. One early computer program called ELIZA pretended to be a psychotherapist holding “conversations” with its patients. ELIZA was a very simple program. (You can nowadays get a version of ELIZA even for bargain-level smartphones.) Nobody who understands the program is at all tempted to call it intelligent. What the program does is this. It searches the user’s input for keywords like the word “father.” If it finds a keyword, it issues back some canned response, like “Do you often think about your father?” Here is more background about this program.
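To make the simplicity concrete, here is a minimal sketch (in Python) of the kind of keyword-and-canned-response loop just described. This is my own toy illustration, not Weizenbaum’s actual ELIZA code; the keywords and responses are made up.

```python
import random

# Illustrative keyword -> canned-response rules, in the spirit of ELIZA.
# (Made-up examples, not Weizenbaum's original script.)
RULES = {
    "father": ["Do you often think about your father?"],
    "mother": ["Tell me more about your mother."],
    "dream":  ["What does that dream suggest to you?"],
    "sad":    ["I am sorry to hear you are sad. Why do you think that is?"],
}

# Generic fallbacks for when no keyword matches.
DEFAULTS = ["Please go on.", "Why do you say that?", "How does that make you feel?"]

def reply(user_input: str) -> str:
    """Scan the input for a keyword; if one is found, return a canned
    response tied to it. Otherwise fall back on a generic prompt."""
    lowered = user_input.lower()
    for keyword, responses in RULES.items():
        if keyword in lowered:
            return random.choice(responses)
    return random.choice(DEFAULTS)

print(reply("I had an argument with my father yesterday."))
# -> "Do you often think about your father?"
```

Nothing in this program analyzes what the user’s sentence means. It is just matching surface strings, which is why nobody who has seen how it works is tempted to call it intelligent.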

More links on chatbots:

As I said, no one who really understands ELIZA wants to claim that this program is intelligent. If we’re ever going to construct a real artificial intelligence, it will take a much more sophisticated approach than was used to make ELIZA.

When Turing Tests are set up at public computing exhibitions, and the judges are just people taken off the street, people who aren’t very familiar with computer programs of this sort, then chatbot programs using the same underlying structure as ELIZA turn out to be able to fool those judges about half the time. Here is an article about a chatbot that has successfully used such a strategy. Here is another more recent example.

Hence, if you want to say that passing the Turing Test really is a good test for intelligence, then it’s going to make a difference who’s judging the Turing Test. We’d have to use better judges than just ordinary people off the street.

Failure Just Around the Corner?

As I mentioned in class, when I was young I loved Spider-Man comics. Before he gained his superpowers, Peter Parker was a stereotypical weak, clumsy, nerdy kid. The school jock Flash Thompson bullied him around. After Peter became Spider-Man, he was in fact no longer so weak or clumsy. But he went to great efforts to keep up that appearance, so that no one would figure out his secret identity. Thus when Flash bullied him, he went through the motions and pretended to be hurt. But what his schoolmates were seeing was misleading. The illusion was fragile; it was liable to fall apart at any moment, and in the comics it did fall apart several times.

A second case to consider. Suppose you end up somehow turning your mother’s delicate crystal vase into something that looks the same but is really indestructible. For some reason you want to keep this a secret. You go through a complex dance trying to give everybody else the impression that the vase is still fragile. But really it’s not. Its real disposition is to be indestructible. But with effort you might get people not to see that. You might manage to make it seem fragile, at least for a while.

One thought that comes up with the Turing Test is that even if machines/AIs turn out to do really well at the test, maybe their successes would be unstable in these same kinds of ways. Maybe failure would be just around the corner, just as soon as someone thought up the right question to ask.

I mention this thought just to acknowledge it and be able to refer back to it. I don’t have anything useful to say to address it. Peter/Spider-Man’s schoolmates might have good reasons to think he’s still weak and clumsy, even though he’s not. Your mother might have good reasons to think her vase is still fragile, even though it’s not. We might have good reason to think a machine/AI’s performance on the Turing Test will continue to impress, even though it’s going to break down after the next question. We can’t rule these possibilities out.

Let’s suppose for the sake of argument, though, that this isn’t what’s going to happen. Let’s suppose some machine/AI has so far acted quite flexibly and apparently intelligently, and that its ability to do so is robustly reliable. It’s no more likely to break down after the next question than adult humans are. What should we think in that case? Should we agree with Turing that this would be a good reason to count the machine as having real intelligence, thoughts, preferences, and so on?

What Would Reliably Passing the Turing Test Establish?

So some machine/AI turns out to pass Turing’s Test, even when the test is administered by sophisticated, trained and knowledgeable judges. And we suppose it can do this in a way that’s robustly reliable. There’s no trick question we just haven’t figured out yet that’s going to make it break down or go into an infinite loop.

Some theorists think that even if that happens, we still wouldn’t have good reasons to attribute mentality to the machine/AI. I’ll call this the anti-machine camp. The opposing, pro-machine camp thinks we would.

Some people are so pro-machine, they’d say that some currently existing machines/AIs already have mentality. GPT-3 has been in headlines recently; here is an impressive transcript of a conversation with this algorithm. But let’s keep our focus on imagined future machines/AIs, who are able to do much better on the Turing Test, and be much more reliable and flexible, than anything currently out there.

We’ll talk more about the anti-machine view below. For the moment, let’s sort out different ways of holding the pro-machine view.

  1. One strong form of this view is called behaviorism. This view says that if a machine can reliably behave intelligently and flexibly in the ways we’re imagining, that’s all there is to really having thoughts, intelligence, feelings, and other mental states. There’s never any more than that going on, even in the case of adult humans. It’d be impossible, it’d make no sense, for machines/AIs to behave as we’re imagining but still lack some extra mentality that humans really have.

  2. A more moderate view would agree with the behaviorist about some mental states, like thinking, reasoning, planning. But for other mental states like pains, feelings, emotions, this view would say there is a gap between how the creature acts and what’s really going on inside its mind. For example, it’d at least be possible for a machine/AI to act angry but not really feel anger. So for such mental states, the machine/AI’s behavior doesn’t constitute or guarantee that the mental states are really there. It may or may not provide reasonable grounds for thinking the mental states are there. That we’d still have to figure out.

  3. Going even more moderate, we could take this stance even about mental states like thinking, reasoning, and planning. In all of these cases, we could say, the machine’s behavior may provide reasonable grounds for thinking it has these mental states. But it doesn’t guarantee it. This is the perspective of most contemporary pro-machine theorists (and may also have been Turing’s own view).

    What more would it take for a machine/AI to really have these mental states?

    On these views, it turns on the structure of the machine’s internal algorithms or programming. One way a machine might manage to pass the Turing Test is by having a giant lookup table of all the possible inputs and what output to give in response to each of them. Like an ELIZA program on steroids. (There’s a toy sketch of this idea just after this list.) If the lookup table were large enough, perhaps such a machine/AI would be able to fool us reliably. We might not be able to find any trick question that would expose its limitations. Even if so, this camp of pro-machine theorists wouldn’t want to count such a machine/AI as really thinking. The program it’s using is too simple and direct.

    On the other hand, if the machine’s programming analyzes the meanings of the questions we put to it, and does something like the kind of processing on those meanings that our human brains do, then these theorists would want to say the machine is thinking.

    So some kinds of programs count as thinking, and others don’t, and in principle a machine’s external behavior — what outputs it gives to different inputs — might not guarantee it has the one programming rather than the other. This is why I hedged Turing’s proposal earlier, and said if a machine passes the test, that might just make it reasonable to count it as being intelligent. Whether it really is intelligent could turn on issues like what the internal algorithms are, that the Turing Test by itself might not give us access to.
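Here is the toy sketch of the “giant lookup table” idea promised above. It is only meant to show the shape of the strategy; the entries are invented, and a real table would need one for every possible conversation history, which is astronomically large.

```python
# A toy version of the "giant lookup table" strategy. The keys are entire
# conversation histories; the values are the next reply to give.
# (These entries are invented for illustration.)
LOOKUP_TABLE = {
    ("Hello!",):
        "Hi there. How are you today?",
    ("Hello!", "Hi there. How are you today?", "Fine. Do you like poetry?"):
        "Some of it, though I never warmed to the metaphysical poets.",
}

def respond(conversation_so_far: tuple) -> str:
    """Look the whole conversation history up in the table and return the
    stored reply. No analysis of meaning happens anywhere."""
    return LOOKUP_TABLE.get(conversation_so_far, "Hmm, let me think about that.")

print(respond(("Hello!",)))  # -> "Hi there. How are you today?"
```

From the outside, a big enough table could behave just like a far more sophisticated program. The difference this third kind of pro-machine theorist cares about lies entirely in how the answers get produced.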

I’ll assume we’re dealing with a pro-machine theorist of this third sort.

One way to get a machine/AI with the right kind of programming might be to build it to run the same kind of “program” as human brains run. Our hardware would be different, but the machine might for all that be processing information in the same abstract ways our brains do.

In the Lycan reading from last week, we heard about Henrietta, who has her neurons replaced one by one with synthetic digital substitutes. Eventually her brain has no organic parts left. If the substitutes do the same causal work that the neurons they’re replacing did, then from the outside, we won’t see any difference in Henrietta. She’ll keep walking and talking, processing information and making plans, the same as she always did. Lycan argues that Henrietta herself wouldn’t notice any difference either. When she has just one neuron replaced, none of its neighboring neurons “notice” any difference. And over the process of gradually replacing all her neurons, there doesn’t seem to be any point at which she’d lose her ability to think or feel. Her new brain would keep working the same way her old one always had.

So why should the difference in what they’re physically made of matter? Shouldn’t any hardware that runs the same “program” as her original brain have the same mental life as the original?

This perspective is taken up in many places in fiction and film — such as the jewel computer in Egan’s story from last week. In more limited ways, the characters in The Matrix and in Doctorow’s story from last week get to acquire certain abilities or memories, or have certain experiences, by loading new “programs” into their brains. All of this speaks to the intuitive force of the idea that our mental lives are driven by what “programs” our brains are running.

Anti-Machine Arguments

The anti-machine theorists think that machines/AIs will never have real thoughts or mental states of their own. They can at best simulate thought and intelligence. All that passing the Turing Test would show is that a machine is a good simulation of a real thinker.

This is the position of the opposing attorney in the Leiber dialogue. He admits that a machine might be “creative” in some sense, such as when it discovers new solutions to math problems, but he argues that the machine never really understands what it’s doing. When humans work on problems, by contrast, they genuinely have insights and realize what’s going on. Humans genuinely experience their thoughts, the meanings of their sentences, and what’s happening in their environment.

Near the end of the dialogue, the machine/AI they’re arguing about comes on stage itself, and responds to this attorney that it seems to it (the AI) that it also has inner experiences. It asks the attorney what makes him so sure that other adult humans really have genuine thoughts and other mental states. Presumably the most important reason for thinking so is how they act and behave. And doesn’t the machine/AI also behave in the same flexible and apparently intelligent ways?

If the attorney thinks he has better reasons for thinking that other humans have real mentality, what are those reasons?

  1. One difference that anti-machine theorists often allege between humans and machines is that the latter can’t make mistakes. (Hofstadter discusses this in his dialogue when they talk about predicting the weather.)

    Turing spends some time discussing this allegation in his article. He introduces an important distinction between what he calls errors of functioning and errors of conclusion. Examples of errors of functioning would be mechanical or electrical faults that prevent a machine from doing what it’s designed to do. We can also include “bugs” in the machine’s programming as errors of functioning. These prevent the machine from working as it’s intended to work. Errors of conclusion, on the other hand, would be mistakes like saying “19” in response to the question “What is 8+9?” Now it is true that humans make many errors of this second sort; but Turing points out that there’s no reason why machines shouldn’t make errors of this sort too. Whether a machine will make certain errors of conclusion really depends on the nature of its programming. If we program the machine to add in the ways calculators do, and the machine executes its program perfectly, then it will always give the right answer to addition problems. But if we instead program the machine to do math in the ways that humans actually reason mathematically, then it might very well answer some addition problems incorrectly. (There’s a small sketch of this point just after this list.)

    You might protest: But won’t some low-level part of the machine still need to be adding and multiplying correctly, in order for the machine to run any program? Yes, but it’s equally true that your low-level neurons need to add and process electrochemical signals properly, for you to be doing any thinking. That doesn’t make you a perfect adder. You don’t know what your neurons are doing. That neural activity might constitute your making an arithmetic mistake. Why can’t the same be true for the machine?

    People often assume that if we ever succeed in constructing a real artificial intelligence, it will be much more “rational” and “logical” and “unemotional” than human beings are. I don’t see why that’s so. Why couldn’t the AI be running programming that makes it much less logical, and much more emotional, than human beings?


    [Illustration: sam brown, explodingdog]

    What tends to happen is we think of the machine running smoothly and perfectly in the sense of not breaking down, suffering no errors of functioning. So we naturally assume that the machine won’t make any errors of conclusion either. We naturally assume that it will always “do the rational thing.” But that doesn’t really follow. Whether the machine makes mistakes, and whether it “acts rational” or “acts emotional,” will depend on the nature of its programming…

  2. Another objection that Turing discusses in his article has to do with the thought that machines only have fixed patterns of behavior; they can only do what we program them to do.

    In a sense this might be true. What the machine does depends on what its program tells it to do. But that doesn’t mean that the machine’s behavior will always be fixed and rigid, in the way the ELIZA program and other simple chatbots’ responses seem fixed and rigid.

    Here are some lines of thought pushing back against that inference.
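Going back to the distinction under objection 1: here is a minimal, made-up sketch of how a machine that suffers no errors of functioning (no hardware faults, no bugs; the code below runs exactly as written) could still be programmed so that it makes occasional errors of conclusion, the way humans do. The 5% “slip rate” is arbitrary.

```python
import random

def calculator_add(x: int, y: int) -> int:
    """Add the way a calculator does: as long as the hardware is
    functioning, the answer is always correct."""
    return x + y

def humanlike_add(x: int, y: int) -> int:
    """A crude model of human-style adding: every so often, slip by a
    little (as if a carry were forgotten), even though the program is
    running exactly as intended, with no error of functioning anywhere."""
    answer = x + y
    if random.random() < 0.05:                 # occasional slip
        answer += random.choice([-2, -1, 1, 2])
    return answer

print(calculator_add(8, 9))   # always 17
print(humanlike_add(8, 9))    # usually 17, but occasionally e.g. 19
```

Both programs are functioning perfectly; whether errors of conclusion show up depends entirely on which program the machine is running.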

A Difficult Passage

When re-reading Turing’s article, I noticed this passage:

It is not possible to produce a set of rules purporting to describe what a man should do in every conceivable set of circumstances… To attempt to provide rules of conduct to cover every eventuality, even those arising from traffic lights, appears to be impossible. With all this I agree.

From this it is argued that we cannot be machines. I shall try to reproduce the argument, but I fear I shall hardly do it justice. It seems to run something like this, “If each man had a definite set of rules of conduct by which he regulated his life he would be no better than a machine. But there are no such rules, so men cannot be machines.” The undistributed middle is glaring.

What the heck does that last sentence mean? I can’t expect you to know. I hope when you come across passages like this you will at least be able to work out from context what the author must in general be getting at. I hope it was clear that Turing doesn’t approve of the argument he’s reporting here, and that the passages that come next in his article—where he distinguishes between “rules of conduct” and “laws of behavior”—are meant to be part of a reply to the argument. Some of you may have been industrious enough to google the term “undistributed middle” to try to figure out more specifically what Turing was saying. (If so, great. That disposition will serve you well.)

What you will find is that this is a term from an older logical system. We don’t use the expression so much anymore—in fact I myself had to look up specifically which fallacy this is. An example of the fallacy of undistributed middle would be the argument “All newts are gross. Harry is gross. So Harry is a newt.” I hope that even without the benefit of any formal training in logic, you’ll be able to see that this is not a good form of argument. (Of course there can be instances of this form whose premises and conclusion are all true, but that doesn’t make this a good form of argument.)

Now I have to scratch my head and speculate a bit to figure out why Turing thought the argument he was discussing displayed this form. I don’t think it’s fair for him to say that the presence of this fallacy in the argument he reports is “glaring.” Here’s my best guess at what Turing is thinking.

We begin with the claim:

  1. If you had a definite set of rules of conduct by which you regulated your life — rules that prescribed and guided all your choices — you would be a machine.

As we discussed in connection with Leibniz’s Law, claims of the form “If D, then M” are always equivalent to “contrapositive” claims of the form “If not-M, then not-D.” (Compare: if Fido is a dog, then Fido is mortal. Equivalent to: if Fido is immortal, then Fido is not a dog.) So 1 is equivalent to:

  2. If you are not a machine (or as Turing puts it, if you are “better than” a machine), then you don’t have a definite set of rules that prescribe and guide all your conduct.

Now Turing is imagining that his opponents continue the argument like this:

  3. I don’t have a definite set of rules that prescribe and guide all my conduct.

  4. Therefore, I am not (or: I am “better than”) a machine.

The argument from 2 and 3 to 4 does display the fallacy of undistributed middle that we described above. Turing’s text doesn’t make this as clear as it might have, though, since it has the beginning premise in form 1 rather than the (equivalent) form 2.
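To make the shared form explicit, here is one way of laying the newt argument and the reconstructed argument (2 and 3, therefore 4) side by side. This schematic gloss is mine, not anything in Turing’s text.

```latex
% The two arguments share the same invalid form:
%
%   All A are B.     All newts are gross.    All non-machines lack a complete set of rules of conduct.
%   h is B.          Harry is gross.         I lack a complete set of rules of conduct.
%   So, h is A.      So, Harry is a newt.    So, I am not a machine.
%
\[
\frac{\forall x\,\bigl(A(x) \rightarrow B(x)\bigr) \qquad B(h)}{A(h)}
\qquad \text{(invalid: the middle term $B$ is ``undistributed'')}
\]
```

The premises tell us that everything A covers falls inside B, and that h falls inside B; that leaves it entirely open whether h falls inside A.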

But what fundamentally is Turing thinking his opponents get wrong?

He’s imagining that even if some machines have definite rules that explicitly script their conduct in every situation they encounter, others may not. The point of the passages that come next in his article is to distinguish between the idea of having such complete and explicit “rules of conduct” and there being low-level “laws of behavior” that settle in advance how the machine (or the human being) will respond to any given stimulus. Turing would agree that there are low-level laws of behavior strictly governing what the machine does, but there may be such laws for us too. He’d agree that humans don’t have complete and explicit rules of conduct telling us what to do in every situation, but he’d say machines won’t necessarily have that either. Machines and we might both have to figure out what to do, rather than follow some high-level recipe already explicitly written out in advance.

I think I understand the distinction Turing is making, but I’m not entirely sure that I do. How about you? Can you make sense of the idea that there may be some low-level laws of behavior (say your genes, and everything that’s happened to you up until this point in your life) that govern how you will act, even though you don’t have rules you consult that explicitly guide every choice you make? What more would you say to better explain this distinction? Can you make sense of the idea that some machine might also lack such high-level complete and explicit rules of conduct?
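One way to picture the contrast is with a pair of toy programs (my illustration, not Turing’s). The first has an explicit rule of conduct written out for each situation. The second’s responses are completely settled by low-level arithmetic over some numerical weights, yet nowhere in it is there a rule like “when you see a red light, stop” for it to consult. The weights are invented just for the example.

```python
# (a) Explicit "rules of conduct": a rule is written out for each situation.
RULES_OF_CONDUCT = {
    "red light": "stop",
    "green light": "go",
}

def rule_follower(situation: str) -> str:
    """Consult the rule book; a situation nobody anticipated has no answer."""
    return RULES_OF_CONDUCT.get(situation, "??")

# (b) Low-level "laws of behavior" only: the response is fully determined by
# arithmetic over weights, but no line of code states a rule of conduct.
WEIGHTS = {"red": -1.0, "green": 1.0, "light": 0.1, "flashing": -0.3}

def weight_based(situation: str) -> str:
    """Sum the weights of the words in the description and act on the total."""
    score = sum(WEIGHTS.get(word, 0.0) for word in situation.split())
    return "go" if score > 0 else "stop"

print(rule_follower("red and green light together"))   # -> "??" (no rule covers this)
print(weight_based("red and green light together"))    # -> a definite answer anyway
```

The second program’s behavior is just as strictly governed by low-level laws as the first’s, but there is no complete, explicit recipe of conduct written out in advance anywhere. That seems to be the kind of distinction Turing is drawing.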

There’s a lot here for us to wrestle with. Hopefully though this will help you better track how the words Turing actually wrote here are supposed to fit into his larger argument.

For reference, here again are some related readings I assigned in the class:

And some optional links I also gave you:

Here are some more optional links:

Turing is an interesting character who made huge contributions to several areas of thought, beyond what we’re looking at in class. If you read about his life, you’ll see he was also persecuted, and criminally prosecuted, for being homosexual, and may have committed suicide as a result. Or his death may have been a tragic accident; from what I’ve read it seems to be unclear. In any event, much of our contemporary life has been profoundly shaped by his contributions.

Among Turing’s other important contributions were: the notion of an (abstract) Turing Machine and other ideas in the foundations of logic and computer science; his central role in breaking the German Enigma code during World War II; and being part of the development of early computers. If (like me) you find the last topic interesting and want to read more, here are some links: