Add draft of the first portion of day 8 Coq writeup.
This commit is contained in:
		
							parent
							
								
									250746e686
								
							
						
					
					
						commit
						60eb50737d
					
				
							
								
								
									
										276
									
								
								content/blog/01_aoc_coq.md
									
									
									
									
									
										Normal file
									
								
							
							
						
						
									
										276
									
								
								content/blog/01_aoc_coq.md
									
									
									
									
									
										Normal file
									
								
							@ -0,0 +1,276 @@
 | 
			
		||||
---
 | 
			
		||||
title: "Advent of Code in Coq - Day 8"
 | 
			
		||||
date: 2020-12-31T17:55:25-08:00
 | 
			
		||||
tags: ["Advent of Code", "Coq"]
 | 
			
		||||
draft: true
 | 
			
		||||
---
 | 
			
		||||
 | 
			
		||||
Huh? We're on day 8? What happened to days 2 through 7?
 | 
			
		||||
 | 
			
		||||
Well, for the most part, I didn't think they were that interesting from the Coq point of view.
 | 
			
		||||
Day 7 got close, but not close enough to inspire me to create a formalization. Day 8, on the other
 | 
			
		||||
hand, is
 | 
			
		||||
{{< sidenote "right" "pl-note" "quite interesting," >}}
 | 
			
		||||
Especially to someone like me who's interested in programming languages!
 | 
			
		||||
{{< /sidenote >}} and took quite some time to formalize.
 | 
			
		||||
 | 
			
		||||
As before, here's an (abridged) description of the problem:
 | 
			
		||||
 | 
			
		||||
> Given a tiny assembly-like language, determine the state of its accumulator
 | 
			
		||||
> when the same instruction is executed twice.
 | 
			
		||||
 | 
			
		||||
Before we start on the Coq formalization, let's talk about an idea from
 | 
			
		||||
Programming Language Theory (PLT), _big step operational semantics_.
 | 
			
		||||
 | 
			
		||||
### Big Step Operational Semantics
 | 
			
		||||
What we have in Advent of Code's Day 8 is, undeniably, a small programming language.
 | 
			
		||||
We are tasked with executing this language, or, in PLT lingo, defining its _semantics_.
 | 
			
		||||
There are many ways of doing this - at university, I've been taught of [denotational](https://en.wikipedia.org/wiki/Denotational_semantics), [axiomatic](https://en.wikipedia.org/wiki/Axiomatic_semantics), 
 | 
			
		||||
and [operational](https://en.wikipedia.org/wiki/Operational_semantics) semantics.
 | 
			
		||||
I believe that Coq's mechanism of inductive definitions lends itself very well
 | 
			
		||||
to operational semantics, so we'll take that route. But even "operational semantics"
 | 
			
		||||
doesn't refer to a concrete technique - we have a choice between small-step (structural) and
 | 
			
		||||
big-step (natural) operational semantics. The former describe the minimal "steps" a program
 | 
			
		||||
takes as it's being evaluated, while the latter define the final results of evaluating a program.
 | 
			
		||||
I decided to go with big-step operational semantics, since they're more intutive (natural!).
 | 
			
		||||
 | 
			
		||||
So, how does one go about "[defining] the final results of evaluating a program?" Most commonly,
 | 
			
		||||
we go about using _inference rules_. Let's talk about those next.
 | 
			
		||||
 | 
			
		||||
#### Inference Rules
 | 
			
		||||
Inference rules are a very general notion. The describe how we can determine (infer) a conclusion
 | 
			
		||||
from a set of assumption. It helps to look at an example. Here's a silly little inference rule:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac
 | 
			
		||||
{\text{I'm allergic to cats} \quad \text{My friend has a cat}}
 | 
			
		||||
{\text{I will not visit my friend very much}}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
It reads, "if I'm allergic to cats, and if my friend has a cat, then I will not visit my friend very much".
 | 
			
		||||
Here, "I'm allergic to cats" and "my friend has a cat" are _premises_, and "I will not visit my friend very much" is
 | 
			
		||||
a _conclusion_. An inference rule states that if all its premises are true, then its conclusion must be true.
 | 
			
		||||
Here's another inference rule, this time with some mathematical notation instead of words:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac
 | 
			
		||||
{n < m}
 | 
			
		||||
{n + 1 < m + 1}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
This one reads, "if \\(n\\) is less than \\(m\\), then \\(n+1\\) is less than \\(m+1\\)". We can use inference
 | 
			
		||||
rules to define various constructs. As an example, let's define what it means for a natural number to be even.
 | 
			
		||||
It takes two rules:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac
 | 
			
		||||
{}
 | 
			
		||||
{0 \; \text{is even}}
 | 
			
		||||
\quad
 | 
			
		||||
\frac
 | 
			
		||||
{n \; \text{is even}}
 | 
			
		||||
{n+2 \; \text{is even}}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
First of all, zero is even. We take this as fact - there are no premises for the first rule, so they
 | 
			
		||||
are all trivially true. Next, if a number is even, then adding 2 to that number results in another
 | 
			
		||||
even number. Using the two of these rules together, we can correctly determine whether any number
 | 
			
		||||
is or isn't even. We start knowing that 0 is even. Adding 2 we learn that 2 is even, and adding 2
 | 
			
		||||
again we see that 4 is even, as well. We can continue this to determine that 6, 8, 10, and so on
 | 
			
		||||
are even too. Never in this process will we visit the numbers 1 or 3 or 5, and that's good - they're not even!
 | 
			
		||||
 | 
			
		||||
Let's now extend this notion to programming languages, starting with a simple arithmetic language.
 | 
			
		||||
This language is made up of natural numbers and the \\(\square\\) operation, which represents the addition
 | 
			
		||||
of two numbers. Again, we need two rules:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac
 | 
			
		||||
{n \in \mathbb{N}}
 | 
			
		||||
{n \; \text{evaluates to} \; n}
 | 
			
		||||
\quad
 | 
			
		||||
\frac
 | 
			
		||||
{e_1 \; \text{evaluates to} \; n_1 \quad e_2 \; \text{evaluates to} \; n_2}
 | 
			
		||||
{e_1 \square e_2 \; \text{evaluates to} \; n_1 + n_2}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
First, let me explain myself. I used \\(\square\\) to demonstrate two important points. First, languages can be made of
 | 
			
		||||
any kind of characters we want; it's the rules that we define that give these languages meaning.
 | 
			
		||||
Second, while \\(\square\\) is the addition operation _in our language_, \\(+\\) is the _mathematical addition operator_.
 | 
			
		||||
They are not the same - we use the latter to define how the former works.
 | 
			
		||||
 | 
			
		||||
Finally, writing "evaluates to" gets quite tedious, especially for complex languages. Instead,
 | 
			
		||||
PLT people use notation to make their semantics more concise. The symbol \\(\Downarrow\\) is commonly
 | 
			
		||||
used to mean "evaluates to"; thus, \\(e \Downarrow v\\) reads "the expression \\(e\\) evaluates to the value \\(v\\).
 | 
			
		||||
Using this notation, our rules start to look like the following:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac
 | 
			
		||||
{n \in \mathbb{N}}
 | 
			
		||||
{n \Downarrow n}
 | 
			
		||||
\quad
 | 
			
		||||
\frac
 | 
			
		||||
{e_1 \Downarrow n_1 \quad e_2 \Downarrow n_2}
 | 
			
		||||
{e_1 \square e_2 \Downarrow n_1 + n_2}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
If nothing else, these are way more compact! Though these may look intimidating at first, it helps to
 | 
			
		||||
simply read each symbol as its English meaning.
 | 
			
		||||
 | 
			
		||||
#### Encoding Inference Rules in Coq
 | 
			
		||||
Now that we've seen what inference rules are, we can take a look at how they can be represented in Coq.
 | 
			
		||||
We can use Coq's `Inductive` mechanism to define the rules. Let's start with our "is even" property.
 | 
			
		||||
 | 
			
		||||
```Coq
 | 
			
		||||
Inductive is_even : nat -> Prop :=
 | 
			
		||||
    | zero_even : is_even 0
 | 
			
		||||
    | plustwo_even : is_even n -> is_even (n+2).
 | 
			
		||||
```
 | 
			
		||||
 | 
			
		||||
The first line declares the property `is_even`, which, given a natural number, returns proposition.
 | 
			
		||||
This means that `is_even` is not a proposition itself, but `is_even 0`, `is_even 1`, and `is_even 2`
 | 
			
		||||
are all propositions.
 | 
			
		||||
 | 
			
		||||
The following two lines each encode one of our aforementioned inference rules. The first rule, `zero_even`,
 | 
			
		||||
is of type `is_even 0`. The `zero_even` rule doesn't require any arguments, and we can use it to create
 | 
			
		||||
a proof that 0 is even. On the other hand, the `plustwo_even` rule _does_ require an argument, `is_even n`.
 | 
			
		||||
To construct a proof that a number `n+2` is even using `plustwo_even`, we need to provide a proof
 | 
			
		||||
that `n` itself is even. From this definition we can see a general principle: we encode each inference
 | 
			
		||||
rule as constructor of an inductive Coq type. Each rule encoded in this manner takes as arguments
 | 
			
		||||
the proofs of its premises, and returns a proof of its conclusion.
 | 
			
		||||
 | 
			
		||||
For another example, let's encode our simple addition language. First, we have to define the language
 | 
			
		||||
itself:
 | 
			
		||||
 | 
			
		||||
```Coq
 | 
			
		||||
Inductive tinylang : Type :=
 | 
			
		||||
    | number (n : nat) : tinylang
 | 
			
		||||
    | box (e1 e2 : tinylang) : tinylang.
 | 
			
		||||
```
 | 
			
		||||
 | 
			
		||||
This defines the two elements of our example language: `number n` corresponds to \\(n\\), and `box e1 e2` corresponds
 | 
			
		||||
to \\(e_1 \square e_2\\). Finally, we define the inference rules:
 | 
			
		||||
 | 
			
		||||
```Coq {linenos=true}
 | 
			
		||||
Inductive tinylang_sem : tinylang -> nat -> Prop :=
 | 
			
		||||
    | number_sem : forall (n : nat), tinylang_sem (number n) n
 | 
			
		||||
    | box_sem : forall (e1 e2 : tinylang) (n1 n2 : nat),
 | 
			
		||||
        tinylang_sem e1 n1 -> tinylang_sem e2 n2 ->
 | 
			
		||||
        tinylang_sem (box e1 e2) (n1 + n2).
 | 
			
		||||
```
 | 
			
		||||
 | 
			
		||||
When we wrote our rules earlier, by using arbitrary variables like \\(e_1\\) and \\(n_1\\), we implicitly meant
 | 
			
		||||
that our rules work for _any_ number or expression. When writing Coq we have to make this assumption explicit
 | 
			
		||||
by using `forall`. For instance, the rule on line 2 reads, "for any number `n`, the expression `n` evaluates to `n`".
 | 
			
		||||
 | 
			
		||||
#### Semantics of Our Language
 | 
			
		||||
 | 
			
		||||
We've now written some example big-step operational semantics, both "on paper" and in Coq. Now, it's time to take a look at
 | 
			
		||||
the specific semantics of the language from Day 8! Our language consists of a few parts.
 | 
			
		||||
 | 
			
		||||
First, there are three opcodes: \\(\texttt{jmp}\\), \\(\\texttt{nop}\\), and \\(\\texttt{add}\\). Opcodes, combined
 | 
			
		||||
with an integer, make up an instruction. For example, the instruction \\(\\texttt{add} \\; 3\\) will increase the
 | 
			
		||||
content of the accumulator by three. Finally, a program consists of a sequence of instructions; They're separated
 | 
			
		||||
by newlines in the puzzle input, but we'll instead separate them by semicolons. For example, here's a complete program.
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\texttt{add} \; 0; \; \texttt{nop} \; 2; \; \texttt{jmp} \; -2
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
Now, let's try evaluating this program. Starting at the beginning and with 0 in the accumulator,
 | 
			
		||||
it will add 0 to the accumulator (keeping it the same),
 | 
			
		||||
do nothing, and finally jump back to the beginning. At this point, it will try to run the addition instruction again,
 | 
			
		||||
which is not allowed; thus, the program will terminate.
 | 
			
		||||
 | 
			
		||||
Did you catch that? The semantics of this language will require more information than just our program itself (which we'll denote \\(p\\)).
 | 
			
		||||
* First, to evaluate the program we will need a program counter, \\(\\textit{c}\\). This program counter
 | 
			
		||||
will tell us the position of the instruction to be executed next. It can also point past the last instruction,
 | 
			
		||||
which means our program terminated successfully. 
 | 
			
		||||
* Next, we'll need the accumulator \\(a\\). Addition instructions can change the accumulator, and we will be interested
 | 
			
		||||
in the number that ends up in the accumulator when our program finishes executing.
 | 
			
		||||
* Finally, and more subtly, we'll need to keep track of the states we visited. For instance,
 | 
			
		||||
in the course of evaluating our program above, we encounter the \\((c, a)\\) pair of \\((0, 0)\\) twice: once
 | 
			
		||||
at the beginning, and once at the end. However, whereas at the beginning we have not yet encountered the addition
 | 
			
		||||
instruction, at the end we have, so the evaluation behaves differently. To make the proofs work better in Coq,
 | 
			
		||||
we'll use a set \\(v\\) of
 | 
			
		||||
{{< sidenote "right" "allowed-note" "allowed (valid) program counters (as opposed to visited program counters)." >}}
 | 
			
		||||
Whereas the set of "visited" program counters keeps growing as our evaluation continues,
 | 
			
		||||
the set of "allowed" program counters keeps shrinking. Because the "allowed" set never stops shrinking,
 | 
			
		||||
assuming we're starting with a finite set, our execution will eventually terminate.
 | 
			
		||||
{{< /sidenote >}}
 | 
			
		||||
 | 
			
		||||
Now we have all the elements of our evaluation. Let's define some notation. A program starts at some state,
 | 
			
		||||
and terminates in another, possibly different state. In the course of a regular evaluation, the program
 | 
			
		||||
never changes; only the state does. So I propose this (rather unorthodox) notation:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
(c, a, v) \Rightarrow_p (c', a', v')
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
This reads, "after starting at program counter \\(c\\), accumulator \\(a\\), and set of valid addresses \\(v\\),
 | 
			
		||||
the program \\(p\\) terminates with program counter \\(c'\\), accumulator \\(a'\\), and set of valid addresses \\(v'\\)".
 | 
			
		||||
Before creating the inference rules for this evaluation relation, let's define the effect of evaluating a single
 | 
			
		||||
instruction, using notation \\((c, a) \rightarrow_i (c', a')\\). An addition instruction changes the accumulator,
 | 
			
		||||
and increases the program counter by 1.
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac{}
 | 
			
		||||
{(c, a) \rightarrow_{\texttt{add} \; n} (c+1, a+n)}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
A no-op instruction does even less. All it does is increment the program counter.
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac{}
 | 
			
		||||
{(c, a) \rightarrow_{\texttt{nop} \; n} (c+1, a)}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
Finally, a jump instruction leaves the accumulator intact, but adds a number to the program counter itself!
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac{}
 | 
			
		||||
{(c, a) \rightarrow_{\texttt{jmp} \; n} (c+n, a)}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
None of these rules have any premises, and they really are quite simple. Now, let's define the rules
 | 
			
		||||
for evaluating a program. First of all, a program starting in a state that is not considered "valid"
 | 
			
		||||
is done evaluating, and is in a "failed" state.
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac{c \not \in v \quad c \not= \text{length}(p)}
 | 
			
		||||
{(c, a, v) \Rightarrow_{p} (c, a, v)}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
We use \\(\\text{length}(p)\\) to represent the number of instructions in \\(p\\). Note the second premise:
 | 
			
		||||
even if our program counter \\(c\\) is not included in the valid set, if it's "past the end of the program",
 | 
			
		||||
the program terminates in an "ok" state. Here's a rule for terminating in the "ok" state:
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac{c = \text{length}(p)}
 | 
			
		||||
{(c, a, v) \Rightarrow_{p} (c, a, v)}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
When our program counter reaches the end of the program, we are also done evaluating it. Even though
 | 
			
		||||
both rules {{< sidenote "right" "redundant-note" "lead to the same conclusion," >}}
 | 
			
		||||
In fact, if the end of the program is never included in the valid set, the second rule is completely redundant.
 | 
			
		||||
{{< /sidenote >}}
 | 
			
		||||
it helps to distinguish the two possible outcomes. Finally, if neither of the termination conditions are met,
 | 
			
		||||
our program can take a step, and continue evaluating from there.
 | 
			
		||||
 | 
			
		||||
{{< latex >}}
 | 
			
		||||
\frac{c \in v \quad p[c] = i \quad (c, a) \rightarrow_i (c', a') \quad (c', a', v - \{c\}) \Rightarrow_p (c'', a'', v'')}
 | 
			
		||||
{(c, a, v) \Rightarrow_{p} (c'', a'', v'')}
 | 
			
		||||
{{< /latex >}}
 | 
			
		||||
 | 
			
		||||
This is quite a rule. A lot of things need to work out for a program to evauate from a state that isn't
 | 
			
		||||
currently the final state:
 | 
			
		||||
 | 
			
		||||
* The current program counter \\(c\\) must be valid. That is, it must be an element of \\(v\\).
 | 
			
		||||
* This program counter must correspond to an instruction \\(i\\) in \\(p\\), which we write as \\(p[c] = i\\).
 | 
			
		||||
* This instruction must be executed, changing our program counter from \\(c\\) to \\(c'\\) and our
 | 
			
		||||
accumulator from \\(a\\) to \\(a'\\). The set of valid instructions will no longer include \\(c\\),
 | 
			
		||||
and will become \\(v - \\{c\\}\\).
 | 
			
		||||
* Our program must then finish executing, starting at state
 | 
			
		||||
\\((c', a', v - \\{c\\})\\), and ending in some (unknown) state \\((c'', a'', v'')\\).
 | 
			
		||||
 | 
			
		||||
If all of these conditions are met, our program, starting at \\((c, a, v)\\), will terminate in the state \\((c'', a'', v'')\\). This third rule completes our semantics; a program being executed will keep running instructions using the third rule, until it finally
 | 
			
		||||
hits an invalid program counter (terminating with the first rule) or gets to the end of the program (terminating with the second rule).
 | 
			
		||||
		Loading…
	
		Reference in New Issue
	
	Block a user