translating_between_OCaml_Scheme_and_Haskell.mdwn

   1 The functional programming literature tends to use one of four languages: Scheme, OCaml, Standard ML (SML), or Haskell. With experience, you'll grow comfortable switching between these. At the beginning, though, it can be confusing.
   2
   3 The easiest translations are between OCaml and SML. These languages are both derived from a common ancestor, ML. For the most part, the differences between them are only superficial. [Here's a translatio nmanual](http://www.mpi-sws.org/~rossberg/sml-vs-ocaml.html).
   4
   5 In some respects these languages are closer to Scheme than to Haskell: Scheme, OCaml and SML all default to call-by-value evaluation order, and all three have native syntax for mutation and other imperative idioms (though that's not central to their design). Haskell is different in both respects: the default evaluation order is call-by-name (strictly speaking, it's "call-by-need", which is a more efficient cousin), and the only way to have mutation or the like is through the use of monads.
   6
   7 On both sides, however, the non-default evaluation order can also be had by using special syntax. And in other respects, OCaml and SML are more like Haskell than they are like Scheme. For example, OCaml and SML and Haskell all permit you to declare types and those types are *statically checked*: that is, your program won't even start to be interpreted unless all the types are consistent. In Scheme, on the other hand, type-checking only happens when your program is running, and the language is generally much laxer about what it accepts as well typed. (There's no problem having a list of mixed numbers and booleans, for example... and you don't need to wrap them in any sum type to do so.)
   8
   9 Additionally, the syntax of OCaml and SML is superficially much closer to Haskell's than to Scheme's.
  10
  11 ##Comments, Whitespace, and Brackets##
  12
  13                 -- this is a single line comment in Haskell
  14
  15                 {- this
  16                    is a multiline
  17                    comment in Haskell -}
  18
  19                 (* this is a single or multiline
  20                    comment in OCaml *)
  21
  22                 ; this is a single line comment in Scheme
  23
  24                 #| this is a
  25                    multiline comment
  26                    in Scheme |#
  27
  28                 #;(this is
  29                     (another way to
  30                       (comment out (a block) (of Scheme code))))
  31
  32 *       Haskell is sensitive to linespace and indentation: it matters how your code is lined up. OCaml and Scheme don't care about this, though they recommend following some conventions for readability.
  33
  34 *       In Haskell, a block of code can be bracketed with `{` and `}`, with different expressions separated by `;`. But usually one would use line-breaks and proper indentation instead. In OCaml, separating expressions with `;` has a different meaning, having to do with how side-effects are sequenced. Instead, one can bracket a block of code with `(` and `)` or with `begin` and `end`. In Scheme, of course, every parentheses is significant.
  35
  36
  37 ##Scheme and OCaml##
  38
  39 *       You can [try Scheme in your web browser](http://tryscheme.sourceforge.net/). This is useful if you don't have Racket or another Scheme implementation installed---but don't expect it to have all the bells and whistles of a mature implementation!
  40
  41 *       **Type Variants and Pattern Matching** If you want to reproduce this kind of OCaml code:
  42
  43                 type lambda_expression = Var of char | Lam of char * lambda_expression | App of lambda_expression * lambda_expression;;
  44
  45                 let rec free_vars (expr : lambda_expression) : char list =
  46                   match expr with
  47                     | Var label -> [label]
  48                     | Lam (label, body) -> remove label (free_vars body)
  49                     | App (left, right) -> merge (free_vars left) (free_vars right);;
  50
  51         in Scheme, you have two choices. First, the quick hack:
  52
  53                 ; we use the symbols 'var and 'lam as tags, and assume
  54                 ; that an expression will always be a pair of one of these forms:
  55                 ;       (cons 'var symbol)
  56                 ;       (cons (cons 'lam symbol) expression)
  57                 ;       (cons expression expression)
  58
  59                 (define (free-vars expr)
  60                   (cond
  61                     [(eq? (car expr) 'var) (list (cdr expr))]
  62                     [(and? (pair? (car expr)) (eq? (car (car expr)) 'lam))
  63                       (remove (cdr (car expr)) (free-vars (cdr expr)))]
  64                     [else (merge (free-vars (car expr)) (free-vars (cdr expr)))]))
  65
  66         Second, you can create real datatypes and pattern-match on them. There are several tools for doing this. I'll describe the `define-datatype` and `cases` forms developed for the book *Essentials of Programming Languages* (EoPL) by Friedman and Wand.
  67
  68         (Alternatives include the `struct` form in Racket, see <http://docs.racket-lang.org/guide/define-struct.html>. Also `define-record-type` from srfi-9 and srfi-57; see also <http://docs.racket-lang.org/r6rs-lib-std/r6rs-lib-Z-H-7.html>.)
  69
  70         Here is how the tools from EoPL work. You must begin your file either with `#lang eopl` or with the first two lines below:
  71
  72                 #lang racket
  73                 (require eopl/eopl)
  74
  75                 (define-datatype lambda-expression lambda-expression?
  76                   (var (label symbol?))
  77                   (lam (label symbol?) (body lambda-expression?))
  78                   (app (left lambda-expression?) (right lambda-expression?)))
  79
  80                 (define (free-vars expr)
  81                   (cases lambda-expression expr
  82                     (var (label) (list label))
  83                     (lam (label body) (remove label (free-vars body)))
  84                     (app (left right) (remove-duplicates (append (free-vars left) (free-vars right))))))
  85
  86
  87 *       Scheme has excellent support for working with implicit or "first-class" **continuations**, using either `call/cc` or any of various delimited continuation operators. See <http://docs.racket-lang.org/reference/cont.html?q=shift&q=do#%28part._.Classical_.Control_.Operators%29>.
  88
  89         In Scheme you can use these forms by default (they're equivalent):
  90
  91                 (call/cc (lambda (k) ...))
  92                 (let/cc k ...)
  93
  94         If your program declares `(require racket/control)`, you can also use:
  95
  96                 (begin ... (reset ... (shift k ...) ...) ...)
  97
  98                 (begin ... (prompt ... (control k ...) ...) ...)
  99
 100                 (begin ... (prompt ... (abort value) ...) ...)
 101
 102         These last three forms are also available in OCaml, but to use them you'll need to compile and install Oleg Kiselyov's "delimcc" or "caml-shift" library (these names refer to the same library), which you can find [here](http://okmij.org/ftp/continuations/implementations.html#caml-shift). You'll already need to have OCaml installed. It also helps if you already have the findlib package installed, too, [as we discuss here](http://lambda.jimpryor.net/how_to_get_the_programming_languages_running_on_your_computer/). If you're not familiar with how to compile software on your computer, this might be beyond your reach for the time being.
 103
 104         But assuming you do manage to compile and install Oleg's library, here's how you'd use it in an OCaml session:
 105
 106                 #require "delimcc";; (* loading Oleg's library this way requires the findlib package *)
 107                 open Delimcc;; (* this lets you say e.g. new_prompt instead of Delimcc.new_prompt *)
 108                 let p = new_prompt ();;
 109                 let prompt thunk = push_prompt p thunk;;
 110                 let foo =
 111                   ...
 112                   prompt (fun () ->
 113                     ...
 114                     shift p (fun k -> ...)
 115                     ...
 116                     (* or *)
 117                     control p (fun k -> ...)
 118                     ...
 119                     (* or *)
 120                     abort p value
 121                     ...
 122                   )
 123                   ...
 124
 125         There is also a library for using *undelimited* continuations in OCaml, but it's shakier than Oleg's delimited continuation library.
 126
 127 We won't say any more about translating to and from Scheme.
 128
 129
 130 ##Haskell and OCaml##
 131
 132 We will however try to give some general advice about how to translate between OCaml and Haskell.
 133
 134 *       Again, it may sometimes be useful to [try Haskell in your web browser](http://tryhaskell.org/)
 135 *       There are many Haskell tutorials and textbooks available. This is probably the most actively developed: [Haskell Wikibook](http://en.wikibooks.org/wiki/Haskell)
 136 *       [Yet Another Haskell Tutorial](http://www.cs.utah.edu/~hal/docs/daume02yaht.pdf) (much of this excellent book has supposedly been integrated into the Haskell Wikibook)
 137 *       All About Monads has supposedly also been integrated into the Haskell Wikibook
 138 *       (A not-so-)[Gentle Introduction to Haskell](http://web.archive.org/web/http://www.haskell.org/tutorial/) (archived)
 139 *       [Learn You a Haskell for Great Good](http://learnyouahaskell.com/)
 140
 141
 142 #Type expressions#
 143
 144 *       In Haskell, you say a value has a certain type with: `value :: type`. You express the operation of prepending a new `int` to a list of `int`s with `1 : other_numbers`. In OCaml it's the reverse: you say `value : type` and `1 :: other_numbers`.
 145
 146 *       In Haskell, type names and constructors both begin with capital letters, and type variables always appear after their constructors, in Curried form. And the primary term for declaring a new type is `data` (short for "abstract datatype").
 147 So we have:
 148
 149                 data Either a b = Left a | Right b;
 150                 data FooType a b = Foo_constructor1 a b | Foo_constructor2 a b;
 151
 152         In printed media, Haskell type variables are often written using Greek letters, like this:
 153
 154         <pre><code>type Either &alpha; &beta; = Left &alpha; | Right &beta;
 155         </code></pre>
 156
 157         Some terminology: in this type declaration, `Either` is known as a *type-constructor*, since it takes some types <code>&alpha;</code> and <code>&beta;</code> as arguments and yields a new type. We call <code>Left &alpha;</code> one of the *variants* for the type <code>Either &alpha; &beta;</code>. `Left` and `Right` are known as *value constructors* or *data constructors* or just *constructors*. You can use `Left` in any context where you need a function, for example:
 158
 159                 map Left [1, 2]
 160
 161         In OCaml, value constructors are still capitalized, but type names are lowercase. Type variables take the form `'a` instead of `a`, and if there are multiple type variables, they're not Curried but instead have to be grouped in a tuple. The syntax for whether they appear first or second is also somewhat different. So we have instead:
 162
 163                 type ('a,'b) either = Left of 'a | Right of 'b;;
 164                 type ('a,'b) foo_type = Foo_constructor1 of 'a * 'b | Foo_constructor2 of 'a * 'b;;
 165
 166         In OCaml, constructors aren't full-fledged functions, so you need to do this instead:
 167
 168                 List.map (fun x -> Left x) [1; 2]
 169
 170         Apart from these differences, there are many similarities between Haskell's and OCaml's use of constructors. For example, in both languages you can do:
 171
 172                 let Left x = Left 1 in x + 1
 173
 174 *       In addition to the `data` keyword, Haskell also sometimes uses `type` and `newtype` to declare types. `type` is used just to introduce synonyms. If you say:
 175
 176                 type Weight = Integer
 177                 type Person = (Name, Address)    -- supposing types Name and Address to be declared elsewhere
 178
 179         then you can use a value of type `Integer` wherever a `Weight` is expected, and vice versa. `newtype` and `data` on the other hand, create genuinely new types. `newtype` is basically just an efficient version of `data` that you can use in special circumstances. `newtype` must always take one type argument and have one value constructor. For example:
 180
 181                 newtype PersonalData a = PD a
 182
 183         You could also say:
 184
 185                 data PersonalData a = PD a
 186
 187         And `data` also allows multiple type arguments, and multiple variants and value constructors.
 188
 189         OCaml just uses the one keyword `type` for all of these purposes:
 190
 191                 type weight = int;;
 192                 type person = name * address;;
 193                 type 'a personal_data = PD of 'a;;
 194
 195 *       The type constructors discussed above took simple types as arguments. In Haskell, types are also allowed to take *type constructors* as arguments:
 196
 197                 data BarType t = Bint (t Integer) | Bstring (t string)
 198
 199         One does this for example when defining monad transformers---the type constructor `ReaderT` takes some base monad's type constructor as an argument.
 200
 201         The way to do this this in OCaml is less straightforward. [See here](/code/tree_monadize.ml) for an example.
 202
 203 *       Haskell has a notion of *type-classes*. They look like this:
 204
 205                 class Eq a where
 206                   (==)    :: a -> a -> Bool
 207
 208         This declares the type-class `Eq`; in order to belong to this class, a type `a` will have to supply its own implementation of the function ==, with the type a -> a -> Bool. Here is how the `Integer` class signs up to join the type-class:
 209
 210                 instance Eq Integer where
 211                   x == y  =  ...
 212
 213         Type expressions can be conditional on some of their parameters belonging to certain type-classes. For example:
 214
 215                 elem      :: (Eq a) => a -> [a] -> Bool
 216
 217         says that the function `elem` is only defined over types `a` that belong to the type-class `Eq`. For such types `a`, `elem` has the type `a -> [a] -> Bool`.
 218
 219         Similarly:
 220
 221                 instance (Eq a) => Eq (Tree a) where
 222                   Leaf a         == Leaf b          =  a == b
 223                   (Branch l1 r1) == (Branch l2 r2)  =  (l1==l2) && (r1==r2)
 224                   _              == _               =  False
 225
 226         says that if `a` belongs to the typeclass `Eq`, then so too does `Tree a`, and in such cases here is the implementation of `==` for `Tree a`...
 227
 228 *       OCaml doesn't have type-classes. You can do soemthing similar with OCaml modules that take are parameterized on other modules. Again, [see here](/code/tree_monadize.ml) for an example.
 229
 230
 231 *       Some specific differences in how certain types are expressed. This block in Haskell:
 232
 233                 Prelude> type Maybe a = Nothing | Just a
 234                 Prelude> let x = [] :: [Int]
 235                 Prelude> :t x
 236                 x :: [Int]
 237                 Prelude> let x = () :: ()
 238                 Prelude> let x = (1, True) :: (Int, Bool)
 239
 240 corresponds to this block in OCaml:
 241
 242                 # type 'a option = None | Some of 'a;;
 243                 type 'a option = None | Some of 'a
 244                 # let (x : int list) = [];;
 245                 val x : int list = []
 246                 # let (x : unit) = ();;
 247                 val x : unit = ()
 248                 # let (x : int * bool) = (1, true);;
 249                 val x : int * bool = (1, true)
 250
 251 *       Haskell has a plethora of numerical types, including the two types `Int` (integers limited to a machine-dependent range) and `Integer` (unbounded integers). The same arithmetic operators (`+` and so on) work for all of these. OCaml also has several different numerical types (though not as many). In OCaml, by default, one has to use a different numerical operator for each type:
 252
 253                 # 1 + 2;;
 254                 - : int = 3
 255                 # 1.0 + 2.0;;
 256                 Error: This expression has type float but an expression was expected of type int
 257                 # 1.0 +. 2.0;;
 258                 - : float = 3.
 259
 260         However the comparison operators are polymorphic. You can equally say:
 261
 262                 # 1 = 2;;
 263                 - : bool = false
 264                 # 1.0 = 2.0;;
 265                 - : bool = false
 266                 # 2 > 1;;
 267                 - : bool = true
 268                 # 2.0 > 1.0;;
 269                 - : bool = true
 270
 271         But you must still apply these operators to expressions of the same type:
 272
 273                 # 2.0 > 1;;
 274                 Error: This expression has type int but an expression was expected of type float
 275
 276 * We'll discuss differences between Haskell's and OCaml's record types below.
 277
 278
 279 #Lists, Tuples, Unit, Booleans#
 280
 281 *       As noted above, Haskell describes the type of a list of `Int`s as `[Int]`. OCaml describes it as `int list`. Haskell describes the type of a pair of `Int`s as `(Int, Int)`. OCaml describes it as `int * int`. Finally, Haskell uses `()` to express both the unit type and a value of that type. In OCaml, one uses `()` for the value and `unit` for the type.
 282
 283 *       Haskell describes the boolean type as `Bool` and its variants are `True` and `False`. OCaml describes the type as `bool` and its variants are `true` and `false`. This is an inconsistency in OCaml: other value constructors must always be capitalized.
 284
 285 *       As noted above, in Haskell one builds up a list by saying `1 : [2, 3]`. In OCaml one says `1 :: [2; 3]`. In Haskell, one can test whether a list is empty with either:
 286
 287                 lst == []
 288                 null lst
 289
 290         In OCaml, there is no predefined `null` or `isempty` function. One can still test whether a list is empty using the comparison `lst = []`.
 291
 292 *       In Haskell, the expression [1..5] is the same as [1,2,3,4,5], and the expression [0..] is a infinite lazily-evaluated stream of the natural numbers. In OCaml, there is no [1..5] shortcut, lists must be finite, and they are eagerly evaluated. It is possible to create lazy streams in OCaml, even infinite ones, but you have to use other techniques than the native list type.
 293
 294 *       Haskell has *list comprehensions*:
 295
 296                 [ x * x | x <- [1..10], odd x]
 297
 298         In OCaml, one has to write this out longhand:
 299
 300                 List.map (fun x -> x * x) (List.filter odd [1..10]);;
 301
 302 *       In Haskell, the expressions "abc" and ['a','b','c'] are equivalent. (Strings are just lists of chars. In OCaml, these expressions have two different types.
 303
 304         Haskell uses the operator `++` for appending both strings and lists (since Haskell strings are just one kind of list). OCaml uses different operators:
 305
 306                 "string1" ^ "string2"
 307                 ['s';'t'] @ ['r';'i';'n';'g']
 308                 (* or equivalently *)
 309                 List.append ['s';'t'] ['r';'i';'n';'g']
 310
 311
 312 #Let and Where#
 313
 314 *       Haskell permits both:
 315
 316                 foo x =
 317                   let result1 = x * x
 318                       result2 = x + 1
 319                   in result1 + result2
 320
 321         and:
 322
 323                 foo x = result1 + result2
 324                   where result1 = x * x
 325                         result2 = x + 1
 326
 327         OCaml permits only:
 328
 329                 let foo x =
 330                   let result1 = x * x
 331                   in let result2 = x + 1
 332                   in result1 + result2;;
 333
 334 #Patterns#
 335
 336 *       In OCaml:
 337
 338                 # let (x, y) as both = (1, 2)
 339                   in (both, x, y);;
 340                 - : (int * int) * int * int = ((1, 2), 1, 2)
 341
 342
 343         The same in Haskell:
 344
 345                 let both@(x,y) = (1, 2)
 346                   in (both, x, y)
 347
 348 *       In OCaml:
 349
 350                 match list_expression with
 351                   | y::_ when odd y -> result1
 352                   | y::_ when y > 5 -> result2
 353                   | y::_ as whole -> (whole, y)
 354                   | [] -> result4
 355
 356         The same in Haskell:
 357
 358                 case list_expression of
 359                   (y:_) | odd y -> result1
 360                         | y > 5 -> result2
 361                   whole@(y:_) -> (whole, y)
 362                   [] -> result4
 363
 364
 365 #Records#
 366
 367 Haskell and OCaml both have `records`, which are essentially just tuples with a pretty interface. The syntax for declaring and using these is a little bit different in the two languages.
 368
 369 *       In Haskell one says:
 370
 371                 -- declare a record type
 372                 data Color = C { red, green, blue :: Int }
 373                 -- create a value of that type
 374                 let c = C { red = 0, green = 127, blue = 255 }
 375
 376         In OCaml one says instead:
 377
 378                 type color = { red : int; green : int; blue : int};;
 379                 let c = { red = 0; green = 127; blue = 255 }
 380
 381         Notice that OCaml doesn't use any value constructor `C`. The record syntax `{ red = ...; green = ...; blue = ... }` is by itself the constructor. The record labels `red`, `green`, and `blue` cannot be re-used for any other record type.
 382
 383 *       In Haskell, one may have multiple constructors for a single record type, and one may re-use record labels within that type, so long as the labels go with fields of the same type:
 384
 385                 data FooType = Constructor1 {f :: Int, g :: Float} | Constructor2 {f :: Int, h :: Bool}
 386
 387 *       In Haskell, one can extract the field of a record like this:
 388
 389                 let c = C { red = 0, green = 127, blue = 255 }
 390                 in red c    -- evaluates to 0
 391
 392         In OCaml:
 393
 394                 let c = { red = 0; green = 127; blue = 255 }
 395                 in c.red    (* evaluates to 0 *)
 396
 397 *       In both languages, there is a special syntax for creating a copy of an existing record, with some specified fields altered. In Haskell:
 398
 399                 let c2 = c { green = 50, blue = 50 }
 400                 -- evaluates to C { red = red c, green = 50, blue = 50 }
 401
 402         In OCaml:
 403
 404                 let c2 = { c with green = 50; blue = 50 }
 405                 (* evaluates to { red = c.red; green = 50; blue = 50 }
 406
 407 *       One pattern matches on records in similar ways. In Haskell:
 408
 409                 let C { red = r, green = g } = c
 410                 in r
 411
 412         In OCaml:
 413
 414                 let { red = r; green = g } = c
 415                 in r
 416
 417         In Haskell:
 418
 419                 makegray c@(C { red = r} ) = c { green = r, blue = r }
 420
 421         is equivalent to:
 422
 423                 makegray c = let C { red = r } = c
 424                              in { red = r, green = r, blue = r }
 425
 426         In OCaml it's:
 427
 428                 # let makegray ({red = r} as c) = { c with green=r; blue=r };;
 429                 val makegray : color -> color = <fun>
 430                 # makegray { red = 0; green = 127; blue = 255 };;
 431                 - : color = {red = 0; green = 0; blue = 0}
 432
 433
 434 #Functions#
 435
 436 *       In Haskell functions are assumed to be recursive, and their types and applications to values matching different patterns are each declared on different lines. So we have:
 437
 438                 factorial    :: int -> int
 439                 factorial 0  =  1
 440                 factorial n  =  n * factorial (n-1)
 441
 442         In OCaml you must explicitly say when a function is recursive; and this would be written instead as:
 443
 444                 let rec factorial (n : int) : int =
 445                   match n with
 446                     | 0 -> 1
 447                     | x -> x * factorial (x-1)
 448
 449         or:
 450
 451                 let rec factorial : int -> int =
 452                   fun n -> match n with
 453                     | 0 -> 1
 454                     | x -> x * factorial (x-1)
 455
 456         or (though we recommend not using this last form):
 457
 458                 let rec factorial : int -> int =
 459                   function
 460                     | 0 -> 1
 461                     | x -> x * factorial (x-1)
 462
 463 *       Another example, in Haskell:
 464
 465                 length         :: [a] -> Integer
 466                 length []      =  0
 467                 length (x:xs)  =  1 + length xs
 468
 469         In OCaml:
 470
 471                 let rec length : 'a list -> int =
 472                   fun lst -> match lst with
 473                     | [] -> 0
 474                     | x::xs -> 1 + length xs
 475
 476 *       Another example, in Haskell:
 477
 478                 sign x | x >  0      = 1
 479                        | x == 0      = 0
 480                        | otherwise   = -1
 481
 482         In OCaml:
 483
 484                 let sign x = match x with
 485                   | x' when x' > 0 -> 1
 486                   | x' when x' = 0 -> 0
 487                   | _ -> -1
 488
 489 *       In Haskell the equality comparison operator is `==`, and the non-equality operator is `/=`. In OCaml, `==` expresses "physical identity", which has no analogue in Haskell because Haskell has no mutable types. See our discussion of "Four grades of mutable involvement" in the [[Week9]] notes. In OCaml the operator corresponding to Haskell's `==` is just `=`, and the corresponding non-equality operator is `<>`.
 490
 491 *       In both Haskell and OCaml, one can use many infix operators as prefix functions by parenthesizing them. So for instance:
 492
 493                 (+) 1 2
 494
 495         will work in both languages. One notable exception is that in OCaml you can't do this with the list constructor `::`:
 496
 497         # (::) 1 [1;2];;
 498         Error: Syntax error
 499         # (fun x xs -> x :: xs) 1 [1; 2];;
 500         - : int list = [1; 1; 2]
 501
 502 *       Haskell also permits two further shortcuts here that OCaml has no analogue for. In Haskell, in addition to writing:
 503
 504                 (>) 2 1
 505
 506         you can also write either of:
 507
 508                 (1 >) 2
 509                 (> 2) 1
 510
 511         In OCaml one has to write these out longhand:
 512
 513                 (fun y -> 1 > y) 2;;
 514                 (fun x -> x > 2) 1;;
 515
 516         Also, in Haskell, there's a special syntax for using what are ordinarily prefix functions into infix operators:
 517
 518                 Prelude> elem 1 [1, 2]
 519                 True
 520                 Prelude> 1 `elem` [1, 2]
 521                 True
 522
 523         In OCaml one can't do that. There's only:
 524
 525                 # List.mem 1 [1; 2];;
 526                 - : bool = true
 527
 528 *       In Haskell one writes anonymous functions like this:
 529
 530                 \x -> x + 1
 531
 532         In OCaml it's:
 533
 534                 fun x -> x + 1
 535
 536 *       Haskell uses the period `.` as a composition operator:
 537
 538                 g . f
 539                 -- same as
 540                 \x -> g (f x)
 541
 542         In OCaml one has to write it out longhand:
 543
 544                 fun x -> g (f x)
 545
 546 *       In Haskell, expressions like this:
 547
 548                 g $ f x y
 549
 550         are equivalent to:
 551
 552                 g (f x y)
 553
 554         (Think of the period in our notation for the untyped lambda calculus.)
 555
 556 *       The names of standard functions, and the order in which they take their arguments, may differ. In Haskell:
 557
 558                 Prelude> :t foldr
 559                 foldr :: (a -> b -> b) -> b -> [a] -> b
 560
 561         In OCaml:
 562
 563                 # List.fold_right;;
 564                 - : ('a -> 'b -> 'b) -> 'a list -> 'b -> 'b = <fun>
 565
 566 *       Some functions are predefined in Haskell but not in OCaml. Here are OCaml definitions for some common ones:
 567
 568         let id x = x;;
 569         let const x _ = x;;
 570         let flip f x y = f y x;;
 571         let curry (f : ('a, 'b) -> 'c) = fun x y -> f (x, y);;
 572         let uncurry (f : 'a -> 'b -> 'c) = fun (x, y) -> f x y;;
 573         let null lst = lst = [];;
 574
 575         `fst` and `snd` (defined only on pairs) are provided in both languages. Haskell has `head` and `tail` for lists; these will raise an exception if applied to []. In OCaml the corresponding functions are `List.hd` and `List.tl`. Many other Haskell list functions like `length` are available in OCaml as `List.length`, but OCaml's standard libraries are leaner that Haskell's.
 576
 577 *       The `until` function in Haskell is used like this:
 578
 579                 until (\l -> length l == 4) (1 : ) []
 580                 -- evaluates to [1,1,1,1]
 581
 582                 until (\x -> x == 10) succ 0
 583                 -- evaluates to 10
 584
 585         This can be defined in OCaml as:
 586
 587     let rec until test f z =
 588         if test z then z else until test f (f z)
 589
 590
 591 #Lazy or Eager#
 592
 593 *       As we've mentioned several times, Haskell's evaluation is by default *lazy* or "call-by-need" (that's an efficient version of "call-by-name" that avoids computing the same results again and again). In some places Haskell will force evaluation to be *eager* or "strict". This is done in several different ways; the symbols `!` and `seq` are signs that it's being used.
 594
 595 *       Like Scheme and most other languages, OCaml is by default eager. Laziness can be achieved either by using thunks:
 596
 597                 # let eval_later1 () = 2 / 2;;
 598                 val eval_later1 : unit -> int = <fun>
 599                 # let eval_later2 () = 2 / 0;;
 600                 val eval_later2 : unit -> int = <fun>
 601                 # eval_later1 ();;
 602                 - : int = 1
 603                 # eval_later2 ();;
 604                 Exception: Division_by_zero.
 605
 606         or by using the special forms `lazy` and `Lazy.force`:
 607
 608                 # let eval_later3 = lazy (2 / 2);;
 609                 val eval_later3 : int lazy_t = <lazy>
 610                 # Lazy.force eval_later3;;
 611                 - : int = 1
 612                 # eval_later3;;
 613                 - : int lazy_t = lazy 1
 614
 615         Notice in the last line the value is reported as being `lazy 1` instead of `<lazy>`. Since the value has once been forced, it won't ever need to be recomputed. The thunks are less efficient in this respect. Even though OCaml will now remember that `eval_later3` should be forced to, `eval_later3` is still type distinct from a plain `int`.
 616
 617
 618 #Monads#
 619
 620 Haskell has more built-in support for monads, but one can define the monads one needs in OCaml.
 621
 622 *       In our seminar, we've been calling one monadic operation `unit`, in Haskell the same operation is called `return`. We've been calling another monadic operation `bind`, used in prefix form, like this:
 623
 624                 bind u f
 625
 626         In Haskell, one uses the infix operator `>>=` to express bind instead:
 627
 628                 u >>= f
 629
 630         If you like this Haskell convention, you can define (>>=) in OCaml like this:
 631
 632                 let (>>=) = bind;;
 633
 634 *       Haskell also uses the operator `>>`, where `u >> v` means the same as `u >>= \_ -> v`.
 635
 636 *       In Haskell, one can generally just use plain `return` and `>>=` and the compiler will infer what monad you must be talking about from the surrounding type constraints. In OCaml, you generally need to be specific about which monad you're using. So in these notes, when mutiple monads are on the table, we've defined operations as `reader_unit` and `reader_bind`.
 637
 638 *       Haskell has a special syntax for working conveniently with monads. It looks like this. Assume `u` `v` and `w` are values of some monadic type `M a`. Then `x` `y` and `z` will be variables of type `a`:
 639
 640                 do
 641                   x <- u
 642                   y <- v
 643                   w
 644                   let z = foo x y
 645                   return z
 646
 647         This is equivalent in meaning to the following:
 648
 649                 u >>= \ x ->
 650                 v >>= \ y ->
 651                 w >>= \ _ ->
 652                 let z = foo x y
 653                 in unit z
 654
 655         which can be translated straightforwardly into OCaml.
 656
 657 *       If you like the Haskell do-notation, there's [a library](http://www.cas.mcmaster.ca/~carette/pa_monad/) you can compile and install to let you use something similar in OCaml.
 658
 659 *       In order to do any printing, Haskell has to use a special `IO` monad. So programs will look like this:
 660
 661                 main :: IO ()
 662                 main = do
 663                   let s = "hello world"
 664                   putStrLn s
 665
 666                 main :: IO String
 667                 main = do
 668                   let s = "hello world"
 669                   putStrLn s
 670                   return s
 671
 672                 main :: IO String
 673                 main = let s = "hello world"
 674                        in putStrLn s >> return s
 675
 676         OCaml permits you to mix side-effects with regular code, so you can just print, without needing to bring in any monad:
 677
 678                 let main =
 679                   let s = "hello world"
 680                   in let () = print_endline s
 681                   in s;;
 682
 683         or:
 684
 685                 let main =
 686                   let s = "hello world"
 687                   in print_endline s ; s;;
 688