X-Git-Url: http://lambda.jimpryor.net/git/gitweb.cgi?a=blobdiff_plain;f=topics%2F_week4_fixed_point_combinator.mdwn;h=d006a5fcac7c7c162c05767ce3897f0ef3206b85;hb=fd807027783d237968080f5c1b5e0d6817574892;hp=17b7b1c368d8eb52c2cdf51715a27d31b3c010e0;hpb=ee6a2e3f2edbc040298165943d874423d866bbc6;p=lambda.git diff --git a/topics/_week4_fixed_point_combinator.mdwn b/topics/_week4_fixed_point_combinator.mdwn index 17b7b1c3..d006a5fc 100644 --- a/topics/_week4_fixed_point_combinator.mdwn +++ b/topics/_week4_fixed_point_combinator.mdwn @@ -19,15 +19,15 @@ How could we compute the length of a list? Without worrying yet about what lambd In OCaml, you'd define that like this: - let rec get_length = fun lst -> - if lst == [] then 0 else 1 + get_length (tail lst) - in ... (* here you go on to use the function "get_length" *) + let rec length = fun lst -> + if lst == [] then 0 else 1 + length (tail lst) + in ... (* here you go on to use the function "length" *) In Scheme you'd define it like this: - (letrec [(get_length - (lambda (lst) (if (null? lst) 0 [+ 1 (get_length (cdr lst))] )) )] - ... ; here you go on to use the function "get_length" + (letrec [(length + (lambda (lst) (if (null? lst) 0 [+ 1 (length (cdr lst))] )) )] + ... ; here you go on to use the function "length" ) Some comments on this: @@ -36,67 +36,67 @@ Some comments on this: 2. `cdr` is function that gets the tail of a Scheme list. (By definition, it's the function for getting the second member of an ordered pair. It just turns out to return the tail of a list because of the particular way Scheme implements lists.) -3. I use `get_length` instead of the convention we've been following so far of hyphenated names, as in `make-list`, because we're discussing OCaml code here, too, and OCaml doesn't permit the hyphenated variable names. OCaml requires variables to always start with a lower-case letter (or `_`), and then continue with only letters, numbers, `_` or `'`. Most other programming languages are similar. Scheme is very relaxed, and permits you to use `-`, `?`, `/`, and all sorts of other crazy characters in your variable names. +3. I use `length` instead of the convention we've been following so far of hyphenated names, as in `make-list`, because we're discussing OCaml code here, too, and OCaml doesn't permit the hyphenated variable names. OCaml requires variables to always start with a lower-case letter (or `_`), and then continue with only letters, numbers, `_` or `'`. Most other programming languages are similar. Scheme is very relaxed, and permits you to use `-`, `?`, `/`, and all sorts of other crazy characters in your variable names. 4. I alternate between `[ ]`s and `( )`s in the Scheme code just to make it more readable. These have no syntactic difference. The main question for us to dwell on here is: What are the `let rec` in the OCaml code and the `letrec` in the Scheme code? -Answer: These work like the `let` expressions we've already seen, except that they let you use the variable `get_length` *inside* the body of the function being bound to it---with the understanding that it will there refer to the same function that you're then in the process of binding to `get_length`. So our recursively-defined function works the way we'd expect it to. In OCaml: +Answer: These work like the `let` expressions we've already seen, except that they let you use the variable `length` *inside* the body of the function being bound to it---with the understanding that it will there refer to the same function that you're then in the process of binding to `length`. So our recursively-defined function works the way we'd expect it to. In OCaml: - let rec get_length = fun lst -> - if lst == [] then 0 else 1 + get_length (tail lst) - in get_length [20; 30] + let rec length = fun lst -> + if lst == [] then 0 else 1 + length (tail lst) + in length [20; 30] (* this evaluates to 2 *) In Scheme: - (letrec [(get_length - (lambda (lst) (if (null? lst) 0 [+ 1 (get_length (cdr lst))] )) )] - (get_length (list 20 30))) + (letrec [(length + (lambda (lst) (if (null? lst) 0 [+ 1 (length (cdr lst))] )) )] + (length (list 20 30))) ; this evaluates to 2 - + If you instead use an ordinary `let` (or `let*`), here's what would happen, in OCaml: - let get_length = fun lst -> - if lst == [] then 0 else 1 + get_length (tail lst) - in get_length [20; 30] + let length = fun lst -> + if lst == [] then 0 else 1 + length (tail lst) + in length [20; 30] (* fails with error "Unbound value length" *) Here's Scheme: - (let* [(get_length - (lambda (lst) (if (null? lst) 0 [+ 1 (get_length (cdr lst))] )) )] - (get_length (list 20 30))) - ; fails with error "reference to undefined identifier: get_length" + (let* [(length + (lambda (lst) (if (null? lst) 0 [+ 1 (length (cdr lst))] )) )] + (length (list 20 30))) + ; fails with error "reference to undefined identifier: length" Why? Because we said that constructions of this form: - let get_length = A + let length = A in B really were just another way of saying: - (\get_length. B) A + (\length. B) A -and so the occurrences of `get_length` in A *aren't bound by the `\get_length` that wraps B*. Those occurrences are free. +and so the occurrences of `length` in A *aren't bound by the `\length` that wraps B*. Those occurrences are free. -We can verify this by wrapping the whole expression in a more outer binding of `get_length` to some other function, say the constant function from any list to the integer 99: +We can verify this by wrapping the whole expression in a more outer binding of `length` to some other function, say the constant function from any list to the integer 99: - let get_length = fun lst -> 99 - in let get_length = fun lst -> - if lst == [] then 0 else 1 + get_length (tail lst) - in get_length [20; 30] + let length = fun lst -> 99 + in let length = fun lst -> + if lst == [] then 0 else 1 + length (tail lst) + in length [20; 30] (* evaluates to 1 + 99 *) -Here the use of `get_length` in `1 + get_length (tail lst)` can clearly be seen to be bound by the outermost `let`. +Here the use of `length` in `1 + length (tail lst)` can clearly be seen to be bound by the outermost `let`. -And indeed, if you tried to define `get_length` in the lambda calculus, how would you do it? +And indeed, if you tried to define `length` in the lambda calculus, how would you do it? - \lst. (isempty lst) zero (add one (get_length (extract-tail lst))) + \lst. (isempty lst) zero (add one (length (extract-tail lst))) -We've defined all of `isempty`, `zero`, `add`, `one`, and `extract-tail` in earlier discussion. But what about `get_length`? That's not yet defined! In fact, that's the very formula we're trying here to specify. +We've defined all of `isempty`, `zero`, `add`, `one`, and `extract-tail` in earlier discussion. But what about `length`? That's not yet defined! In fact, that's the very formula we're trying here to specify. What we really want to do is something like this: @@ -116,14 +116,14 @@ So how could we do it? And how do OCaml and Scheme manage to do it, with their ` 2. If you tried this in Scheme: - (define get_length - (lambda (lst) (if (null? lst) 0 [+ 1 (get_length (cdr lst))] )) ) - - (get_length (list 20 30)) + (define length + (lambda (lst) (if (null? lst) 0 [+ 1 (length (cdr lst))] )) ) + + (length (list 20 30)) You'd find that it works! This is because `define` in Scheme is really shorthand for `letrec`, not for plain `let` or `let*`. So we should regard this as cheating, too. -3. In fact, it *is* possible to define the `get_length` function in the lambda calculus despite these obstacles. This depends on using the "version 3" implementation of lists, and exploiting its internal structure: that it takes a function and a base value and returns the result of folding that function over the list, with that base value. So we could use this as a definition of `get_length`: +3. In fact, it *is* possible to define the `length` function in the lambda calculus despite these obstacles. This depends on using the "version 3" implementation of lists, and exploiting its internal structure: that it takes a function and a base value and returns the result of folding that function over the list, with that base value. So we could use this as a definition of `length`: \lst. lst (\x sofar. successor sofar) zero @@ -134,9 +134,9 @@ lists and numbers. The reason we can do this is that our "version 3," fold-based implementation of lists, and Church's implementations of numbers, have a internal structure that *mirrors* the common recursive operations we'd use lists and numbers for. In a sense, the recursive -structure of the `get_length` operation is built into the data +structure of the `length` operation is built into the data structure we are using to represent the list. The non-recursive -version of get_length exploits this embedding of the recursion into +version of length exploits this embedding of the recursion into the data type. This is one of the themes of the course: using data structures to @@ -160,7 +160,7 @@ an implementation may not terminate doesn't mean that such a function isn't well-defined. The point of interest here is that its definition requires recursion in the function definition.) -Neither do the resources we've so far developed suffice to define the +Neither do the resources we've so far developed suffice to define the [[!wikipedia Ackermann function]]: A(m,n) = @@ -184,10 +184,10 @@ But functions like the Ackermann function require us to develop a more general t ###Fixed points### In general, we call a **fixed point** of a function f any value *x* -such that f x is equivalent to *x*. For example, +such that f x is equivalent to *x*. For example, consider the squaring function `sqare` that maps natural numbers to their squares. `square 2 = 4`, so `2` is not a fixed point. But `square 1 = 1`, so `1` is a -fixed point of the squaring function. +fixed point of the squaring function. There are many beautiful theorems guaranteeing the existence of a fixed point for various classes of interesting functions. For @@ -210,7 +210,7 @@ In the lambda calculus, we say a fixed point of an expression `f` is any formula You should be able to immediately provide a fixed point of the identity combinator I. In fact, you should be able to provide a whole -bunch of distinct fixed points. +bunch of distinct fixed points. With a little thought, you should be able to provide a fixed point of the false combinator, KI. Here's how to find it: recall that KI @@ -233,7 +233,7 @@ might be true. We'll return to this point below. ###How fixed points help definie recursive functions### -Recall our initial, abortive attempt above to define the `get_length` function in the lambda calculus. We said "What we really want to do is something like this: +Recall our initial, abortive attempt above to define the `length` function in the lambda calculus. We said "What we really want to do is something like this: \list. if empty list then zero else add one (... (tail lst)) @@ -247,7 +247,7 @@ length function. Call that function `length`. Then we have At this point, we have a definition of the length function, though it's not complete, since we don't know what value to use for the symbol `length`. Technically, it has the status of an unbound -variable. +variable. Imagine now binding the mysterious variable: @@ -261,13 +261,13 @@ a function that accurately computes the length of a list---as long as the argument we supply is already the length function we are trying to define. (Dehydrated water: to reconstitute, just add water!) -But this is just another way of saying that we are looking for a fixed point. +But this is just another way of saying that we are looking for a fixed point. Assume that `h` has a fixed point, call it `LEN`. To say that `LEN` is a fixed point means that h LEN <~~> LEN -But this means that +But this means that (\list . if empty list then zero else add one (LEN (tail list))) <~~> LEN @@ -275,7 +275,7 @@ So at this point, we are going to search for fixed point. The strategy we will present will turn out to be a general way of finding a fixed point for any lambda term. -##Deriving Y, a fixed point combinator## +##Deriving Y, a fixed point combinator## How shall we begin? Well, we need to find an argument to supply to `h`. The argument has to be a function that computes the length of a @@ -283,11 +283,11 @@ list. The function `h` is *almost* a function that computes the length of a list. Let's try applying `h` to itself. It won't quite work, but examining the way in which it fails will lead to a solution. - h h <~~> \list . if empty list then zero else 1 + h (tail list) + h h <~~> \list . if empty list then zero else 1 + h (tail list) There's a problem. The diagnosis is that in the subexpression `h (tail list)`, we've applied `h` to a list, but `h` expects as its -first argument the length function. +first argument the length function. So let's adjust h, calling the adjusted function H: @@ -306,12 +306,12 @@ argument. `H` itself fits the bill: H H <~~> (\h \list . if empty list then zero else 1 + ((h h) (tail list))) H <~~> \list . if empty list then zero else 1 + ((H H) (tail list)) == \list . if empty list then zero else 1 + ((\list . if empty list then zero else 1 + ((H H) (tail list))) (tail list)) - <~~> \list . if empty list then zero + <~~> \list . if empty list then zero else 1 + (if empty (tail list) then zero else 1 + ((H H) (tail (tail list)))) - + We're in business! -How does the recursion work? +How does the recursion work? We've defined `H` in such a way that `H H` turns out to be the length function. In order to evaluate `H H`, we substitute `H` into the body of the lambda term. Inside the lambda term, once the substitution has @@ -353,7 +353,7 @@ That is, Yh is a fixed point for h. Works! -##What is a fixed point for the successor function?## +##What is a fixed point for the successor function?## Well, you might think, only some of the formulas that we might give to the `successor` as arguments would really represent numbers. If we said something like: @@ -457,9 +457,9 @@ returns itself (a copy of `sink`); if the argument is boolean false sink true true false ~~> I sink true true true false ~~> I -So we make `sink = Y (\f b. b f I)`: +So we make `sink = Y (\f b. b f I)`: - 1. sink false + 1. sink false 2. Y (\fb.bfI) false 3. (\f. (\h. f (h h)) (\h. f (h h))) (\fb.bfI) false 4. (\h. [\fb.bfI] (h h)) (\h. [\fb.bfI] (h h)) false @@ -476,7 +476,7 @@ argument, we can throw it away unreduced. Now we try the next most complex example: - 1. sink true false + 1. sink true false 2. Y (\fb.bfI) true false 3. (\f. (\h. f (h h)) (\h. f (h h))) (\fb.bfI) true false 4. (\h. [\fb.bfI] (h h)) (\h. [\fb.bfI] (h h)) true false @@ -490,7 +490,7 @@ is again I. You should be able to see that `sink` will consume as many `true`s as we throw at it, then turn into the identity function after it -encounters the first `false`. +encounters the first `false`. The key to the recursion is that, thanks to Y, the definition of `sink` contains within it the ability to fully regenerate itself as @@ -542,7 +542,7 @@ true, then (1) is not true. Without pretending to give a serious analysis of the paradox, let's assume that sentences can have for their meaning boolean functions like the ones we have been working with here. Then the sentence *John -is John* might denote the function `\x y. x`, our `true`. +is John* might denote the function `\x y. x`, our `true`. Then (1) denotes a function from whatever the referent of *this sentence* is to a boolean. So (1) denotes `\f. f true false`, where @@ -585,7 +585,7 @@ context, (2) might denote Y C (\f. (\h. f (h h)) (\h. f (h h))) I - (\h. C (h h)) (\h. C (h h))) + (\h. C (h h)) (\h. C (h h))) C ((\h. C (h h)) (\h. C (h h))) C (C ((\h. C (h h))(\h. C (h h)))) C (C (C ((\h. C (h h))(\h. C (h h))))) @@ -627,7 +627,7 @@ just this juncture? One obstacle to thinking this through is the fact that a sentence normally has only two truth values. We might consider instead a noun -phrase such as +phrase such as (3) the entity that this noun phrase refers to @@ -639,7 +639,7 @@ for any object. The chameleon nature of (3), by the way (a description that is equally good at describing any object), makes it particularly well suited as a -gloss on pronouns such as *it*. In the system of +gloss on pronouns such as *it*. In the system of [Jacobson 1999](http://www.springerlink.com/content/j706674r4w217jj5/), pronouns denote (you guessed it!) identity functions...