Move away from using Moniker for variable binding #181

brendanzab · 2018-11-19T02:12:40Z

Our autobinding library, Moniker, is currently not pulling its weight in terms of making it easy to understand the internals of Pikelet, and will make it harder to support a high-performance compiler in the future. In the words of @kleimkuhler:

I know the times I’ve explored enough into the codebase to get an idea of how you have implemented something, I usually end up at a point where a binding occurs and I lose track a little.

Also from conversations on Gitter, @boomshroom and others have struggled with grasping Moniker, so it's not an isolated issue! @jonsterling has also noted on Twitter that he is not really a fan of using ABTs (Abstract binding trees) in implementations:

What I found is that unfortunately, it is difficult to see the right way to code a specific thing unless I have hand-coded the syntax. Some kind of ABT thing that abstracted over binding-sensitive traversals would be nice, but I concluded that the main point of ABTs, which was to abstract over binding itself (providing some interface with names, and automatically freshening etc., providing substitution) is actually harmful for implementations.

It's not only this: Moniker is also standing in the way of using visitors in the compiler, and will cause us performance problems down the line as Pikelet codebases get larger. It could also make salsa/adapton-style incrementalism harder.

Requirements for name binding

The problems we will have to tackle when moving to a new variable binding scheme are:

- sound semantics
  - capture-avoiding substitution ((λy.x)[y/x] should not result in λy.y)
  - alpha equivalence (λx.x should be equivalent to λy.y)
  - reduce the chance of messing up variable binding when adding new features
- support existing and upcoming features
  - produce nice, stable, pretty names when pretty printing
  - recursive bindings (Recursive definitions #46)
  - support reflection (like in Idris, Agda, F*, Lean)
- performance optimizations
  - allow for stable names across incremental compilations (see adapton and salsa) (Move to an incremental/query driven architecture #103)
  - reduce unnecessary tree traversals and variable shifts
  - allow for visitor-based fusion (Refactor the compiler to use visitors #75), resulting in reduced allocations and as close to single-pass compilation as we can get
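To make the two soundness requirements concrete, here is a minimal sketch in Rust. It is not Pikelet or Moniker code: the `Named` and `Nameless` types and the `to_nameless`, `alpha_eq`, and `naive_subst` functions are hypothetical names made up for illustration. It decides alpha equivalence by converting to De Bruijn indices, and shows a naive substitution capturing a variable when y is substituted for x in λy.x.

```rust
/// Lambda terms with explicit names, as a parser might produce them.
#[derive(Debug, Clone, PartialEq)]
enum Named {
    Var(String),
    Lam(String, Box<Named>),
    App(Box<Named>, Box<Named>),
}

/// The same terms with bound variables replaced by De Bruijn indices.
#[derive(Debug, Clone, PartialEq)]
enum Nameless {
    /// Number of binders between this use and the binder it refers to.
    Bound(usize),
    /// A variable with no enclosing binder, kept by name.
    Free(String),
    Lam(Box<Nameless>),
    App(Box<Nameless>, Box<Nameless>),
}

/// Erase binder names. `scope` holds the names bound so far, innermost last.
fn to_nameless(term: &Named, scope: &mut Vec<String>) -> Nameless {
    match term {
        Named::Var(name) => match scope.iter().rev().position(|bound| bound == name) {
            Some(index) => Nameless::Bound(index),
            None => Nameless::Free(name.clone()),
        },
        Named::Lam(name, body) => {
            scope.push(name.clone());
            let body = to_nameless(body, scope);
            scope.pop();
            Nameless::Lam(Box::new(body))
        }
        Named::App(f, a) => Nameless::App(
            Box::new(to_nameless(f, scope)),
            Box::new(to_nameless(a, scope)),
        ),
    }
}

/// Two named terms are alpha equivalent if their nameless forms are identical.
fn alpha_eq(a: &Named, b: &Named) -> bool {
    to_nameless(a, &mut Vec::new()) == to_nameless(b, &mut Vec::new())
}

/// Substitution that respects shadowing but does NOT avoid capture:
/// free variables of `replacement` can end up bound by an enclosing lambda.
fn naive_subst(term: &Named, name: &str, replacement: &Named) -> Named {
    match term {
        Named::Var(n) if n.as_str() == name => replacement.clone(),
        Named::Var(_) => term.clone(),
        Named::Lam(n, _) if n.as_str() == name => term.clone(),
        Named::Lam(n, body) => {
            Named::Lam(n.clone(), Box::new(naive_subst(body, name, replacement)))
        }
        Named::App(f, a) => Named::App(
            Box::new(naive_subst(f, name, replacement)),
            Box::new(naive_subst(a, name, replacement)),
        ),
    }
}
```

With these definitions, `alpha_eq` treats λx.x and λy.y as equal, while `naive_subst` turns λy.x into λy.y when y is substituted for x, which is exactly the capture a sound scheme has to rule out.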

Possible solutions

We have a number of options open to us:

Nominal binding

Use nominal binding, like in David Christiansen's NBE tutorial. This could be compatible with visitor-based fusion, which might amortize some of the performance penalties. @pythonesque has expressed some concern over going down this route though, especially when it comes to recursive bindings.

Graph libraries

Use petgraph, like in Program Synthesis is Possible in Rust. Not sure how well this would support alpha equivalence though. This feels similar to the Scope Graph stuff from A Theory of Name Resolution. I'm not sure if this has ever been applied to dependently typed languages though, and it is off the beaten track in terms of the main line of research.

Locally Nameless

Continue to use a locally nameless approach as with Moniker, but bring it into Pikelet. This might result in lots of traversals, although Lean has workarounds for that. It is also not compatible with visitors.
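As a rough illustration of where those traversals come from, here is a minimal locally nameless sketch in Rust. It is not Moniker's actual API: `LnTerm`, `close`, `open`, and `lam` are hypothetical names. Free variables carry names and bound variables carry indices, so building a binder walks the body once (`close`) and going under a binder walks it again (`open`).

```rust
/// Locally nameless terms: free variables are named, bound ones are indexed.
#[derive(Debug, Clone, PartialEq)]
enum LnTerm {
    Free(String),
    Bound(usize),
    Lam(Box<LnTerm>),
    App(Box<LnTerm>, Box<LnTerm>),
}

/// Turn the free variable `name` into a bound variable.
/// `depth` is the number of binders passed so far. Traverses the whole term.
fn close(term: &LnTerm, name: &str, depth: usize) -> LnTerm {
    match term {
        LnTerm::Free(n) if n.as_str() == name => LnTerm::Bound(depth),
        LnTerm::Free(_) | LnTerm::Bound(_) => term.clone(),
        LnTerm::Lam(body) => LnTerm::Lam(Box::new(close(body, name, depth + 1))),
        LnTerm::App(f, a) => LnTerm::App(
            Box::new(close(f, name, depth)),
            Box::new(close(a, name, depth)),
        ),
    }
}

/// Replace the bound variable at `depth` with `replacement`.
/// Assumes `replacement` has no dangling bound variables, so no shifting
/// is needed. Traverses the whole term again.
fn open(term: &LnTerm, replacement: &LnTerm, depth: usize) -> LnTerm {
    match term {
        LnTerm::Bound(index) if *index == depth => replacement.clone(),
        LnTerm::Free(_) | LnTerm::Bound(_) => term.clone(),
        LnTerm::Lam(body) => LnTerm::Lam(Box::new(open(body, replacement, depth + 1))),
        LnTerm::App(f, a) => LnTerm::App(
            Box::new(open(f, replacement, depth)),
            Box::new(open(a, replacement, depth)),
        ),
    }
}

/// Smart constructor: bind `name` in `body` to build a lambda.
fn lam(name: &str, body: &LnTerm) -> LnTerm {
    LnTerm::Lam(Box::new(close(body, name, 0)))
}
```

Alpha equivalence comes for free as derived equality, since `lam("x", ..)` and `lam("y", ..)` build the same term for λx.x and λy.y. The cost is that every `open` and `close` is a full pass over the body.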

Explicit substitutions with a fully nameless representation

Apparently locally nameless has not been proven stable under substitution(?) for delayed substitutions, so we'd have to go with a fully nameless representation here, like in autosubst. This is well understood, but might be harder to get to work with visitors. The advantage would be that it would be quite close to what we would be doing in a theorem prover if we ever wanted to do a soundness proof of Pikelet.
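For reference, here is a minimal sketch of the fully nameless representation in Rust, using the textbook shift and substitute operations on De Bruijn indices. It is not autosubst's formulation, and `DbTerm`, `shift`, `subst`, and `beta` are hypothetical names. The substitution here is applied eagerly; an explicit-substitution design would instead store the pending shifts and substitutions as data and apply them lazily.

```rust
/// Fully nameless lambda terms: every variable is a De Bruijn index.
#[derive(Debug, Clone, PartialEq)]
enum DbTerm {
    Var(usize),
    Lam(Box<DbTerm>),
    App(Box<DbTerm>, Box<DbTerm>),
}

/// Add `amount` to every variable that points past `cutoff` enclosing binders.
fn shift(term: &DbTerm, amount: isize, cutoff: usize) -> DbTerm {
    match term {
        DbTerm::Var(index) if *index >= cutoff => {
            DbTerm::Var((*index as isize + amount) as usize)
        }
        DbTerm::Var(index) => DbTerm::Var(*index),
        DbTerm::Lam(body) => DbTerm::Lam(Box::new(shift(body, amount, cutoff + 1))),
        DbTerm::App(f, a) => DbTerm::App(
            Box::new(shift(f, amount, cutoff)),
            Box::new(shift(a, amount, cutoff)),
        ),
    }
}

/// Replace variable `target` with `replacement`. Going under a binder bumps
/// the target and shifts the replacement so its variables still point outward.
fn subst(term: &DbTerm, target: usize, replacement: &DbTerm) -> DbTerm {
    match term {
        DbTerm::Var(index) if *index == target => replacement.clone(),
        DbTerm::Var(index) => DbTerm::Var(*index),
        DbTerm::Lam(body) => {
            DbTerm::Lam(Box::new(subst(body, target + 1, &shift(replacement, 1, 0))))
        }
        DbTerm::App(f, a) => DbTerm::App(
            Box::new(subst(f, target, replacement)),
            Box::new(subst(a, target, replacement)),
        ),
    }
}

/// Beta-reduce `(λ. body) arg`: substitute for index 0, then drop the binder.
fn beta(body: &DbTerm, arg: &DbTerm) -> DbTerm {
    shift(&subst(body, 0, &shift(arg, 1, 0)), -1, 0)
}
```

Every call to `shift` is another traversal of the term, which is the kind of cost the "reduce unnecessary tree traversals and variable shifts" requirement is about.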

Use "semantic type checking"

This is what @jonsterling has advocated to me on Twitter:

I base my stuff on an algorithm that I think comes from Thierry Coquand, called "semantic type checking". The main idea is as follows:

Have a syntax based on De Bruijn indices. You don't even need to implement any operations on the syntax. This syntax will serve as the "unchecked" inputs to your judgments. (Like the M in G !- M : A.)

Have a semantic domain based on De Bruijn levels. Interpret binders as closures; environments are sequences of values.

In your bidirectional type checker, you have judgments like G !- M <= A and G !- M => A. in both cases, G and A are coming from the semantic domain, whereas M is syntax. In the mode-shift rule, you will check either definitional equivalence or subtyping (depending on the language), and this will be done structurally in the semantic domain -- this part has the structure of quotation from NbE, but let me observe that you actually don't ever need to quote anything. The algorithm has the same structure though.

Now, let me point out the epic Power Move that we executed so far.

What was important was the yoga of having indices in the syntax and levels in the semantics. It means that the parts of your judgment that have wellformedness presuppositions, which we already agreed to draw from the semantic domain, can be implicitly weakened, so there is never any need anywhere in the algorithm to unleash a De Bruijn shift, or any kind of operation on syntax. The only thing you ever do to syntax is check the head constructor, and evaluate it into the domain afterward.

I will have to think about this more in order to get my head around it!

Here is an example type checker that uses the technique: https://github.com/jozefg/nbe-for-mltt/
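To help get a feel for the idea, here is a small Rust sketch of the scheme described above. It is a simplified illustration under my own assumptions, not the nbe-for-mltt implementation: all names (`Syntax`, `Value`, `Closure`, `Context`, `eval`, `equal`, `check`, `infer`) are hypothetical, the universe is type-in-type, there are no eta rules, and there are no annotations, so lambdas can only be checked. The syntax uses indices, the semantic domain uses levels, and the context never needs shifting.

```rust
use std::rc::Rc;

/// Syntax: variables are De Bruijn indices (0 = innermost binder).
enum Syntax {
    Var(usize),
    Universe,
    Pi(Rc<Syntax>, Rc<Syntax>),
    Lam(Rc<Syntax>),
    App(Rc<Syntax>, Rc<Syntax>),
}

/// Semantic domain: variables are De Bruijn levels (0 = outermost binder).
#[derive(Clone)]
enum Value {
    Universe,
    Pi(Rc<Value>, Closure),
    Lam(Closure),
    Neutral(usize, Vec<Value>), // a variable applied to zero or more arguments
}

/// A binder body paired with the environment it was created in.
#[derive(Clone)]
struct Closure { env: Vec<Value>, body: Rc<Syntax> }

/// Typing context: the value and the type of each variable, both semantic.
#[derive(Default, Clone)]
struct Context { env: Vec<Value>, types: Vec<Value> }

fn var(level: usize) -> Value { Value::Neutral(level, Vec::new()) }

fn apply(closure: &Closure, arg: Value) -> Value {
    let env: Vec<Value> = closure.env.iter().cloned().chain(std::iter::once(arg)).collect();
    eval(&env, &closure.body)
}

/// Evaluate syntax into the domain. Only call this on checked terms.
fn eval(env: &[Value], term: &Syntax) -> Value {
    let close = |body: &Rc<Syntax>| Closure { env: env.to_vec(), body: body.clone() };
    match term {
        Syntax::Var(index) => env[env.len() - 1 - *index].clone(),
        Syntax::Universe => Value::Universe,
        Syntax::Pi(a, b) => Value::Pi(Rc::new(eval(env, a)), close(b)),
        Syntax::Lam(body) => Value::Lam(close(body)),
        Syntax::App(f, a) => match (eval(env, f), eval(env, a)) {
            (Value::Lam(closure), arg) => apply(&closure, arg),
            (Value::Neutral(level, mut args), arg) => { args.push(arg); Value::Neutral(level, args) }
            _ => panic!("ill-typed application"),
        },
    }
}

/// Structural equality in the domain. `depth` is the next fresh level.
fn equal(depth: usize, a: &Value, b: &Value) -> bool {
    match (a, b) {
        (Value::Universe, Value::Universe) => true,
        (Value::Pi(a1, b1), Value::Pi(a2, b2)) => {
            equal(depth, a1, a2) && equal(depth + 1, &apply(b1, var(depth)), &apply(b2, var(depth)))
        }
        (Value::Lam(c1), Value::Lam(c2)) => equal(depth + 1, &apply(c1, var(depth)), &apply(c2, var(depth))),
        (Value::Neutral(l1, xs), Value::Neutral(l2, ys)) => {
            l1 == l2 && xs.len() == ys.len() && xs.iter().zip(ys).all(|(x, y)| equal(depth, x, y))
        }
        _ => false,
    }
}

/// Add a fresh variable. Old entries need no shift: levels stay valid.
fn bind(ctx: &Context, ty: Value) -> Context {
    let mut next = ctx.clone();
    next.env.push(var(ctx.env.len()));
    next.types.push(ty);
    next
}

/// Checking mode: G !- M <= A. The fallback arm is the mode-shift rule.
fn check(ctx: &Context, term: &Syntax, expected: &Value) -> bool {
    match (term, expected) {
        (Syntax::Lam(body), Value::Pi(a, b)) => {
            check(&bind(ctx, (**a).clone()), body, &apply(b, var(ctx.env.len())))
        }
        _ => infer(ctx, term).map_or(false, |found| equal(ctx.env.len(), &found, expected)),
    }
}

/// Inference mode: G !- M => A.
fn infer(ctx: &Context, term: &Syntax) -> Option<Value> {
    match term {
        Syntax::Var(index) => ctx.types.iter().rev().nth(*index).cloned(),
        Syntax::Universe => Some(Value::Universe), // type-in-type, to keep this short
        Syntax::Pi(a, b) => (check(ctx, a, &Value::Universe)
            && check(&bind(ctx, eval(&ctx.env, a)), b, &Value::Universe))
            .then(|| Value::Universe),
        Syntax::App(f, arg) => match infer(ctx, f)? {
            Value::Pi(a, b) => check(ctx, arg, &a).then(|| apply(&b, eval(&ctx.env, arg))),
            _ => None,
        },
        Syntax::Lam(_) => None, // lambdas are only checked, never inferred
    }
}
```

Note that `bind` pushes onto the context without touching the types already stored there, and the only things ever done to `Syntax` are matching on its head constructor and evaluating it. For example, checking `Lam(Lam(Var(0)))` against the evaluated type `Pi(Universe, Pi(Var(0), Var(1)))`, that is (A : U) -> (x : A) -> A, succeeds in the empty context.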

Improve Moniker's performance and documentation

I feel like it still would be handy to have a nice way of doing binding, but perhaps this would require more experimentation using other techniques first. I'm not sure though.


brendanzab · 2018-11-19T08:24:13Z

Updated the description with a link to this example: https://github.com/jozefg/nbe-for-mltt/

glaebhoerl · 2018-11-19T08:37:04Z

I haven't tried to understand them very much, but here is some recent work in Haskell on nominal stuff, in case it comes in handy:

https://github.com/ekmett/name

https://hackage.haskell.org/package/nominal

brendanzab · 2018-11-19T09:50:00Z

Thanks for linking those @glaebhoerl, much appreciated! 👍

brendanzab · 2018-12-01T11:29:39Z

I have a Rust port of NbE using semantic type checking up at brendanzab/rust-nbe-for-mltt. I think this is the approach I am going to go with, but it will be a bit tricky to make!

brendanzab · 2018-12-03T00:34:57Z

@AndrasKovacs has put together a nice approach in smalltt. I don't really understand it yet though (the implementation is a little tricky to follow), so I might persist with the nbe-for-mltt approach for now, which seems more approachable. I think we'll be able to adapt smalltt more easily by that stage though, if we wanted to.

ratmice · 2020-02-09T15:38:38Z

While I haven't tried it yet, another thing worth looking into, similar to the graph approach, is using hypergraphs. Ueda has a number of papers on name binding and lambda terms as hypergraphs, e.g. Name Binding is Easy with Hypergraphs.

I'm not aware of any Rust libraries for dealing with hypergraphs being available yet though.

brendanzab · 2020-02-10T00:01:26Z

Oooh thanks! I've heard of hypergraphs before! Both from @sashaboyd and @robrix https://twitter.com/rob_rix/status/1224461727678509057. One of the issues I'm running into with the way I'm doing NbE in #197 is that it's hard to check the equivalence of dependent record types using this representation:

pub enum Value {
    ⋮
    /// Record type extension.
    RecordTypeExtend(String, Arc<Value>, Closure),
    /// Empty record types.
    RecordTypeEmpty,
    ⋮
}

brendanzab mentioned this issue Dec 1, 2018

Crate restructure in preparation for NbE #187

Merged

mikeday mentioned this issue Dec 6, 2018

2019: Open Areas of Investigation yeslogic/fathom#150

Closed

6 tasks

brendanzab mentioned this issue Feb 26, 2019

Look into using library for handling variables granule-project/granule#98

Open

brendanzab pinned this issue Mar 25, 2019

brendanzab mentioned this issue Apr 27, 2019

Normalisation by evaluation ollef/sixten#145

Open

This was referenced Jan 19, 2020

Merge next branch into master #196

Closed

Rebuild Pikelet using normalization by evaluation #197

Merged

brendanzab linked a pull request Jun 8, 2020 that will close this issue

Rebuild Pikelet using normalization by evaluation #197

Merged

brendanzab closed this as completed in #197 Dec 2, 2020

brendanzab unpinned this issue Dec 2, 2020
