382 lines
16 KiB
Markdown
382 lines
16 KiB
Markdown
---
|
|
title: "Generalizing Folds in Haskell"
|
|
date: 2022-04-22T12:19:22-07:00
|
|
draft: true
|
|
tags: ["Haskell"]
|
|
---
|
|
|
|
### Recursive Functions
|
|
Let's start off with a little bit of a warmup, and take a look at a simple recursive function:
|
|
`length`. Here's a straightforward definition:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 4 6 >}}
|
|
|
|
Haskell is nice because it allows for clean definitions of recursive functions; `length` can
|
|
just reference itself in its definition, and everything works out in the end. In the underlying
|
|
lambda calculus, though, a function definition doesn't come with a name -- you only get anonymous
|
|
functions via the lambda abstraction. There's no way for such functions to just refer to themselves by
|
|
their name in their body. But the lambda calculus is Turing complete, so something is making recursive
|
|
definitions possible.
|
|
|
|
The trick is to rewrite your recursive function in such a way that instead of calling itself by its name
|
|
(which, with anonymous functions, is hard to come by), it receives a reference to itself as an argument.
|
|
As a concrete example:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 8 10 >}}
|
|
|
|
This new function can easily me anonymous; if we enable the `LambdaCase` extension,
|
|
we can write it using only lambda functions as:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 12 14 >}}
|
|
|
|
This function is not equivalent to `length`, however. It expects "itself", or a function
|
|
which has type `[a] -> Int`, to be passed in as the first argument. Once fed this `rec`
|
|
argument, though, `lengthF` returns a length function. Let's try feed it something, then!
|
|
|
|
```Haskell
|
|
lengthF _something
|
|
```
|
|
|
|
But if `lengthF` produces a length function when given this _something_, why can't we feed
|
|
this newly-produced length function back to it?
|
|
|
|
```Haskell
|
|
lengthF (lengthF _something)
|
|
```
|
|
|
|
And again:
|
|
|
|
```Haskell
|
|
lengthF (lengthF (lengthF _something))
|
|
```
|
|
|
|
If we kept going with this process infinitely, we'd eventually have what we need:
|
|
|
|
{{< latex >}}
|
|
\text{length} = \text{lengthF}(\text{lengthF}(\text{lengthF}(...)))
|
|
{{< /latex >}}
|
|
|
|
But hey, the stuff inside the first set of parentheses is still an infinite sequence of applications
|
|
of the function $\\text{lengthF}$, and we have just defined this to be $\\text{length}$. Thus,
|
|
we can rewrite the above equation as:
|
|
|
|
{{< latex >}}
|
|
\text{length} = \text{lengthF}(\text{length})
|
|
{{< /latex >}}
|
|
|
|
What we have just discovered is that the actual function that we want, `length`, is a [fixed point](https://mathworld.wolfram.com/FixedPoint.html)
|
|
of the non-recursive function `lengthF`. Fortunately, Haskell comes with a function that can find
|
|
such a fixed point. It's defined like this:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 16 16 >}}
|
|
|
|
This definition is as declarative as can be; `fix` returns the \\(x\\) such that \\(x = f(x)\\). With
|
|
this, we finally write:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 18 18 >}}
|
|
|
|
Loading up the file in GHCi, and running the above function, we get exactly the expected results.
|
|
|
|
```
|
|
ghci> Main.length' [1,2,3]
|
|
3
|
|
```
|
|
|
|
You may be dissatisfied with the way we handled `fix` here; we went through and pretended that we didn't
|
|
have recursive function definitions, but then used a recursive `let`-expression in the body `fix`!
|
|
This is a valid criticism, so I'd like to briefly talk about how `fix` is used in the context of the
|
|
lambda calculus.
|
|
|
|
In the untyped typed lambda calculus, we can just define a term that behaves like `fix` does. The
|
|
most common definition is the \\(Y\\) combinator, defined as follows:
|
|
|
|
{{< latex >}}
|
|
Y = \lambda f. (\lambda x. f (x x)) (\lambda x. f (x x ))
|
|
{{< /latex >}}
|
|
|
|
When applied to a function, this combinator goes through the following evaluation steps:
|
|
|
|
{{< latex >}}
|
|
Y f = f (Y f) = f (f (Y f)) =\ ...
|
|
{{< /latex >}}
|
|
|
|
This is the exact sort of infinite series of function applications that we saw above with \\(\\text{lengthF}\\).
|
|
|
|
### Recursive Data Types
|
|
We have now seen how we can rewrite a recursive function as a fixed point of some non-recursive function.
|
|
Another cool thing we can do, though, is to transform recursive __data types__ in a similar manner!
|
|
Let's start with something pretty simple.
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 20 20 >}}
|
|
|
|
Just like we did with functions, we can extract the recursive occurrences of `MyList` into a parameter.
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 21 21 >}}
|
|
|
|
Just like `lengthF`, `MyListF` isn't really a list. We can't write a function `sum :: MyListF -> Int`.
|
|
`MyListF` requires _something_ as an argument, and once given that, produces a type of integer lists.
|
|
Once again, let's try feeding it:
|
|
|
|
```
|
|
MyListF a
|
|
```
|
|
|
|
From the definition, we can clearly see that `a` is where the "rest of the list" is in the original
|
|
`MyList`. So, let's try fill `a` with a list that we can get out of `MyListF`:
|
|
|
|
```
|
|
MyListF (MyListF a)
|
|
```
|
|
|
|
And again:
|
|
|
|
```
|
|
MyListF (MyListF (MyListF a))
|
|
```
|
|
|
|
Much like we used a `fix` function to turn our `lengthF` into `length`, we need a data type,
|
|
which we'll call `Fix` (and which has been [implemented before](https://hackage.haskell.org/package/data-fix-0.3.2/docs/Data-Fix.html)). Here's the definition:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 23 23 >}}
|
|
|
|
Looking past the constructors and accessors, we might write the above in pseudo-Haskell as follows:
|
|
|
|
```Haskell
|
|
newtype Fix f = f (Fix f)
|
|
```
|
|
|
|
This is just like the lambda calculus \\(Y\\) combinator above! Unfortunately, we _do_ have to
|
|
deal with the cruft induced by the constructors here. Thus, to write down the list `[1,2,3]`
|
|
using `MyListF`, we'd have to produce the following:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 25 26 >}}
|
|
|
|
This is actually done in practice when using some approaches to help address the
|
|
[expression problem](https://en.wikipedia.org/wiki/Expression_problem); however,
|
|
it's quite unpleasant to write code in this way, so we'll set it aside.
|
|
|
|
Let's go back to our infinite chain of type applications. We've a similar pattern before,
|
|
with \\(\\text{length}\\) and \\(\\text{lengthF}\\). Just like we did then, it seems like
|
|
we might be able to write something like the following:
|
|
|
|
{{< latex >}}
|
|
\begin{aligned}
|
|
& \text{MyList} = \text{MyListF}(\text{MyListF}(\text{MyListF}(...))) \\
|
|
\Leftrightarrow\ & \text{MyList} = \text{MyListF}(\text{MyList})
|
|
\end{aligned}
|
|
{{< /latex >}}
|
|
|
|
In something like Haskell, though, the above is not quite true. `MyListF` is a
|
|
non-recursive data type, with a different set of constructors to `MyList`; they aren't
|
|
_really_ equal. Instead of equality, though, we use the next-best thing: isomorphism.
|
|
|
|
{{< latex >}}
|
|
\text{MyList} \cong \text{MyListF}(\text{MyList})
|
|
{{< /latex >}}
|
|
|
|
Two types are isomorphic when there exist a
|
|
{{< sidenote "right" "fix-isomorphic-note" "pair of functions, \(f\) and \(g\)," >}}
|
|
Let's a look at the types of <code>Fix</code> and <code>unFix</code>, by the way.
|
|
Suppose that we <em>did</em> define <code>MyList</code> to be <code>Fix MyListF</code>.
|
|
Let's specialize the <code>f</code> type parameter of <code>Fix</code> to <code>MyListF</code>
|
|
for a moment, and check:<br><br>
|
|
In one direction, <code>Fix :: MyListF MyList -> MyList</code><br>
|
|
And in the other, <code>unFix :: MyList -> MyListF MyList</code><br><br>
|
|
The two mutual inverses \(f\) and \(g\) fall out of the definition of the <code>Fix</code>
|
|
data type! If we didn't have to deal with the constructor cruft, this would be more
|
|
ergonomic than writing our own <code>myIn</code> and <code>myOut</code> functions.
|
|
{{< /sidenote >}}
|
|
that take you from one type to the other (and vice versa), such that applying \\(f\\) after \\(g\\),
|
|
or \\(g\\) after \\(f\\), gets you right back where you started. That is, \\(f\\) and \\(g\\)
|
|
need to be each other's inverses. For our specific case, let's call the two functions `myOut`
|
|
and `myIn` (I'm matching the naming in [this paper](https://maartenfokkinga.github.io/utwente/mmf91m.pdf)). They are not hard to define:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 28 34 >}}
|
|
|
|
By the way, when a data type is a fixed point of some other, non-recursive type constructor,
|
|
this second type constructor is called a __base functor__. We can verify that `MyListF` is a functor
|
|
by providing an instance (which is rather straightforward):
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 36 38 >}}
|
|
|
|
### Recursive Functions with Base Functors
|
|
One neat thing you can do with a base functor is define recursive functions on the actual data type!
|
|
|
|
Let's go back to the very basics. When we write recursive functions, we try to think of it as solving
|
|
a problem, assuming that we are given solutions to the sub-problems that make it up. In the more
|
|
specific case of recursive functions on data types, we think of it as performing a given operation,
|
|
assuming that we know how to perform this operation on the smaller pieces of the data structure.
|
|
Some quick examples:
|
|
|
|
1. When writing a `sum` function on a list, we assume we know how to find the sum of
|
|
the list's tail (`sum xs`), and add to it the current element (`x+`). Of course,
|
|
if we're looking at a part of a data structure that's not recursive, we don't need to
|
|
perform any work on its constituent pieces.
|
|
```Haskell
|
|
sum [] = 0
|
|
sum (x:xs) = x + sum xs
|
|
```
|
|
2. When writing a function to invert a binary tree, we assume that we can invert the left and right
|
|
children of a non-leaf node. We might write:
|
|
```Haskell
|
|
invert Leaf = Leaf
|
|
invert (Node l r) = Node (invert r) (invert l)
|
|
```
|
|
|
|
What does this have to do with base functors? Well, recall how we arrived at `MyListF` from
|
|
`MyList`: we replaced every occurrence of `MyList` in the definition with a type parameter `a`.
|
|
Let me reiterate: wherever we had a sub-list in our definition, we replaced it with `a`.
|
|
The `a` in `MyListF` marks the locations where we _would_ have to use recursion if we were to
|
|
define a function on `MyList`.
|
|
|
|
What if instead of a stand-in for the list type (as it was until now), we use `a` to represent
|
|
the result of the recursive call on that sub-list? To finish computing the sum of the list, then,
|
|
the following would suffice:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 40 42 >}}
|
|
|
|
Actually, this is enough to define the whole `sum` function. First things first, let's use `myOut`
|
|
to unpack one level of the `Mylist` type:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 28 28 >}}
|
|
|
|
We know that `MyListF` is a functor; we can thus use `fmap sum` to compute the sum of the
|
|
remaining list:
|
|
|
|
```Haskell
|
|
fmap mySum :: MyListF MyList -> MyListF Int
|
|
```
|
|
|
|
Finally, we can use our `mySumF` to handle the last addition:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 40 40 >}}
|
|
|
|
Let's put all of these together:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 44 45 >}}
|
|
|
|
Notice, though, that the exact same approach would work for _any_ function
|
|
with type:
|
|
|
|
```Haskell
|
|
MyListF a -> a
|
|
```
|
|
|
|
We can thus write a generalized version of `mySum` that, instead of using `mySumF`,
|
|
uses some arbitrary function `f` with the aforementioned type:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 47 48 >}}
|
|
|
|
Let's use `myCata` to write a few other functions:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 50 60 >}}
|
|
|
|
#### It's just a `foldr`!
|
|
When you write a function with the type `MyListF a -> a`, you are actually
|
|
providing two things: a "base case" element of type `a`, for when you match `MyNilF`,
|
|
and a "combining function" with type `Int -> a -> a`, for when you match `MyConsF`.
|
|
We can thus define:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 64 66 >}}
|
|
|
|
We could also go in the opposite direction, by writing:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 68 69 >}}
|
|
|
|
Hey, what was it that we said about types with two functions between them, which
|
|
are inverses of each other? That's right, `MyListF a -> a` and `(a, Int -> a -> a)`
|
|
are isomorphic. The function `myCata`, and the "traditional" definition of `foldr`
|
|
are equivalent!
|
|
|
|
#### Base Functors for All!
|
|
We've been playing with `MyList` for a while now, but it's kind of getting boring:
|
|
it's just a list of integers! Furthermore, we're not _really_ getting anything
|
|
out of this new "generalization" procedure -- `foldr` is part of the standard library,
|
|
and we've just reinvented the wheel.
|
|
|
|
But you see, we haven't quite. This is because, while we've only been working with `MyListF`,
|
|
the base functor for `MyList`, our approach works for _any recursive data type_, provided
|
|
an `out` function. Let's define a type class, `Cata`, which pairs a data type `a` with
|
|
its base functor `f`, and specifies how to "unpack" `a`:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 71 72 >}}
|
|
|
|
We can now provide a more generic version of our `myCata`, one that works for all types
|
|
with a base functor:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 74 75 >}}
|
|
|
|
Clearly, `MyList` and `MyListF` are one instance of this type class:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 77 78 >}}
|
|
|
|
We can also write a base functor for Haskell's built-in list type, `[a]`:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 80 84 >}}
|
|
|
|
We can use our `cata` function for regular lists to define a generic `sum`:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 86 89 >}}
|
|
|
|
It works perfectly:
|
|
|
|
```
|
|
ghci> Main.sum [1,2,3]
|
|
6
|
|
ghci> Main.sum [1,2,3.0]
|
|
6.0
|
|
ghci> Main.sum [1,2,3.0,-1]
|
|
5.0
|
|
```
|
|
|
|
What about binary trees, which served as our second example of a recursive data structure?
|
|
We can do that, too:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 91 96 >}}
|
|
|
|
Given this, here's an implementation of that `invert` function we mentioned earlier:
|
|
|
|
{{< codelines "Haskell" "catamorphisms/Cata.hs" 98 101 >}}
|
|
|
|
#### What About `Foldable`?
|
|
If you've been around the Haskell ecosystem, you may know the `Foldable` type class.
|
|
Isn't this exactly what we've been working towards here? No, not at all. Take
|
|
a look at how the documentation describes [`Data.Foldable`](https://hackage.haskell.org/package/base-4.16.1.0/docs/Data-Foldable.html):
|
|
|
|
> The Foldable class represents data structures that can be reduced to a summary value one element at a time.
|
|
|
|
One at a time, huh? Take a look at the signature of `foldMap`, which is sufficient for
|
|
an instance of `Foldable`:
|
|
|
|
```Haskell
|
|
foldMap :: Monoid m => (a -> m) -> t a -> m
|
|
```
|
|
|
|
A `Monoid` is just a type with an associative binary operation that has an identity element.
|
|
Then, `foldMap` simply visits the data structure in order, and applies this binary operation
|
|
pairwise to each monoid produced via `f`. Alas,
|
|
this function is not enough to be able to implement something like inverting a binary tree;
|
|
there are different configurations of binary tree that, when visited in-order, result in the
|
|
same sequence of elements. For example:
|
|
|
|
```
|
|
ghci> fold (Node "Hello" Leaf (Node ", " Leaf (Node "World!" Leaf Leaf)))
|
|
"Hello, World!"
|
|
ghci> fold (Node "Hello" (Node ", " Leaf Leaf) (Node "World!" Leaf Leaf))
|
|
"Hello, World!"
|
|
```
|
|
|
|
As far as `fold` (which is just `foldMap id`) is concerned, the two trees are equivalent. They
|
|
are very much not equivalent for the purposes of inversion! Thus, whereas `Foldable` helps
|
|
us work with list-like data types, the `Cata` type class lets us express _any_ function on
|
|
a recursive data type similarly to how we'd do it with `foldr` and lists.
|
|
|
|
#### Catamorphisms
|
|
Why is the type class called `Cata`, and the function `cata`? Well, a function that
|
|
performs a computation by recursively visiting the data structure is called a catamorphism.
|
|
Indeed, `foldr f b`, for function `f` an "base value" `b` is an example of a list catamorophism.
|
|
It's a fancy word, and there are some fancier descriptions of what it is, especially when
|
|
you step into category theory (check out the [Wikipedia entry](https://en.wikipedia.org/wiki/Catamorphism)
|
|
if you want to know what I mean). However, for our purposes, a catamorphism is just a generalization
|
|
of `foldr` from lists to any data type!
|