Lambda Calculus is a universal model of computation (can be used to simulate any Turing Machine) based on function abstraction and application. It has a very simple semantic that can be used to study properties of computation.
The first thing to take note of is that EVERYTHING is a function (an algorithm, the input and the output are all functions).
Let's start with a very simple example in Scala.
def apply(f: Int => Int, x: Int): Int = f(x)
The first thing we need to adjust for the function to look more like Lambda Calculus is that since everything is a function, everything will be untyped.
def apply(f, x) = f(x)
An abstraction Lambda Calculus does is that it treats all functions as anonymous (it doesn't give them explicit names).
(f, x) => f(x)
Another abstraction is that all functions are curried (only take 1 input and return 1 output).
f => (x => f(x))
Now we can re-write the function using lambda calculus syntax (instead of Scala), and we will get a valid lambda expression.
$ \lambda f.\lambda x.(f \ x) $
Given a set of variables VARS, an expression under lambda calculus can be:
variable | $ x $ | $ x \in VARS $ |
function | $ \lambda x.e $ | $ x \in VARS $, $ e $ is a $ \lambda $-expression |
application | $ (e_1 \ e_2) $ | $ e_1, e_2 $ are $ \lambda $-expressions |
To evaluate $\lambda$-expressions, there are two types of reduction operations:
If a expression cannot be reduced further using $ \beta $-reductions, we say the expression is in $ \beta $-normal form.
Take the following Scala snippet as an example:
def add(x: Int) = x + y
We can say that the second occurence of $ x $ is bounded by the $ x $ that appears as a function parameter. When we call the function, the occurence of $ x $ is replaced by the argument that was provided to $ add $. In contrast, $ y $ is a free variable.
This code might look weird, where does $ y $ come from? What does it do? Why would we use a variable that we don't instantiate (i.e. is not bound to anything)? Well, the snippet actually comes from a broader context:
def add_all(x: List[Int], y: Int) = { def add(x: Int) = x + y x.map(add) }
In this new snippet we can see that all variables are bounded, the free variable from before is bounded by the outer function, but only the free variable, notice that $ x $ is still bounded by the inner function, and the $ x $ parameter of $ add\_all $ (that is a List[Int]
) is 'invisible' inside $ add $.
The importance of free variables is that only free occurences of a sub-expression can be bounded by the outer expression.
Translating to lambda calculus, when reducing $ (\lambda x.e_1 \ e_2) $ to $ e_1[x \ / \ e_2] $, only free occurences of $ x $ in $ e_1 $ will be replaced by $ e_2 $.
More generally, we say that:
Exercise
7.1.1. For every variable occurence, mention if it's a free or a bounded occurence:
Using what we learned from free and bounded variables, we can define a algorithm for $\beta$-reduction, given a expression $ e_1[x \ / \ e_2] $:
$ e_1 $ | $ e_1[x \ / \ e_2] $ | condition | |
---|---|---|---|
$ x $ | $ e_2 $ | ||
$ y $ | $ y $ | $ x \neq y $ | |
$ E_1 \ E_2 $ | $ E_1[x \ / \ e_2] \ E_2[x \ / \ e_2] $ | ||
$ \lambda x.e $ | $ \lambda x.e $ | ||
$ \lambda y.e $ | $ \lambda y.e[x \ / \ e_2] $ | $ x \neq y $, $ y $ does not appear free in $ e_2 $ | |
$ \lambda y.e $ | $ \{\lambda z.e[y \ / \ z]\}[x \ / \ e_2] $ | $ x \neq y $, $ y $ appears free in $ e_2 $ | ( $ z $ is a new variable that is not free in $ e $ or $ e_2 $ ) |
Q: If we have multiple redexes in a expression, which one do we evaluate?
A: We can evaluate any of them, and it is guaranteed by Church-Rosser theorem that if the expression is reducible, we will eventually get the same $ \beta $-normal form.
To not just randomly choose redexes, there exist reduction strategies, from which we will use the Normal Order and Applicative Order:
Exercise
7.2.1. Evaluate in both Normal Order and Applicative Order the following expressions:
The Church-Turing thesis asserts that any computable function can be computed using lambda calculus (or Turing Machines or equivalent models).
How can this be? Everything in Lambda Calculus is a function, there are no numbers to compute stuff with. Well, while there are not the numbers we are used to, we can define higher-order functions that are analogs for concepts we are familiar with and use them instead.
The representations we are going to present further are also called Church encodings, because they were first used by Alonzo Church, the inventor of Lambda Calculus.
We can encode boolean values TRUE and FALSE in lambda calculus as functions that take 2 values, x and y, and return the first (for TRUE) or second (for FALSE) value.
$ TRUE = \lambda x.\lambda y.x$
$ FALSE = \lambda x.\lambda y.y$
Some common operation on booleans (that were discussed during the lecture) are:
$ AND = \lambda x.\lambda y.((x \ y) \ x) $
$ OR = \lambda x.\lambda y.((x \ x) \ y) $
$ NOT = \lambda x.((x \ FALSE) \ TRUE) $
$ IF = \lambda c.\lambda t.\lambda e.((c t) e) $
Exercises
7.3.1. Define the $ XOR $ operations over booleans.
7.3.2. Define the $ NAND $ operations over booleans.
7.3.3. Define the $ NOR $ operations over booleans.
We can also encode data structures . We will only look at one of the simpler ones, the pair.
A pair encapsulates two variables together, that we can later access using $ FIRST $ and $ SECOND $ .
$ PAIR = \lambda a.\lambda b.\lambda z.((z \ a )\ b) $
$ FIRST = \lambda p.(p \ TRUE) $
$ SECOND = \lambda p.(p \ FALSE) $
Church numerals represent natural numbers as higher-order functions. Under this representation, the number n is a function that maps f to its n-fold composition.
$ N0 = \lambda f.\lambda x. x $
$ N1 = \lambda f.\lambda x. (f \ x) $
$ N2 = \lambda f.\lambda x. (f \ (f \ x)) $
…
You can also define operation on church numerals, some (that were discussed during the lecture) are:
$ SUCC = \lambda n.\lambda f.\lambda x.(f \ ((n \ f) \ x)) $
$ ISZERO = \lambda n.((n \ \lambda x.FALSE) \ TRUE) $
$ ADD = \lambda n.\lambda m.\lambda f.\lambda x.((n \ f) \ ((m \ f) \ x)) $
Exercises
7.4.1. Define multiplication under church numerals: $ MULT = \lambda n.\lambda m. \ \ldots $ (Hint: you can do it without the Y-Combinator)
7.4.2. Define exponentiation under church numerals: $ EXP = \lambda n.\lambda m. \ \ldots $
7.4.3. (*) Define the predecessor operator, that takes a number and returns the number prior to it. What's the predecessor of 0? Evaluate $ (PRED \ N0) $.
7.4.4. Define substraction under church numerals: $ SUB = \lambda n.\lambda m. \ \ldots $ (Hint: use $ PRED $). What happens if you try to substract a bigger number from a smaller one? Evaluate $ (SUB \ N1 \ N2 )$.
7.4.5. Define $ LEQ $ (less or equal). $ LEQ \ n \ m $ should return TRUE if $ n \leq m $ and FALSE if $ n > m $.
7.4.6. Define $ EQ $ (equality). $ EQ \ n \ m $ should return TRUE if $ n = m $ and FALSE otherwise.
In lambda calculus, recursion is achieved using the fixed-point combinator (or Y combinator, “Why” bird or Sage bird). A fixed-point combinator is a higher-order function that returns some fixed point of it's argument function (x is a fixed pointed for a function f if $ f(x) = x $). That means: $ f \ (fix \ f) = fix \ f $ . And by repeated application: $ fix \ f = f \ (f \ (\ldots f \ (fix \ f)\ldots)) $
Exercises
7.5.1. (*) Using the Y-Combinator, define a function that computes the factorial of a number n.
7.5.2. (*) Using the Y-Combinator, define a function $ FIB $ that computes the n-th fibonacci number.