jameysharp opened issue #5705:
Feature
Our mid-end optimization rules define a special constructor called
subsume
inprelude_opt.isle
. Its effect is to cause the e-graph optimization pass to ignore other results from the same call tosimplify
. Let's consider making it a feature of the ISLE language instead, applicable to any multi-term.Background
For a rule-driven multi-constructor term, ISLE currently generates a Rust function which accumulates results into a
Vec
. It returns the whole vector once all pattern matches have completed.The e-graph specific implementation of
subsume
records all values it's called on in the optimization context. After all pattern matches have finished,optimize_pure_enode
checks each value from the accumulated vector to see if it is one wheresubsume
was used. The first one it finds causes the rest of the vector to be skipped, and returns a singleton e-class. However any earlier values were already added to the e-graph; they're effectively hidden from elaboration but still taking memory.This is used for any rule where we know ahead of time that if the rule matches, it will produce a result which is at least as good as any other rule we might pick. It's primarily used for rules which rewrite an expression into a constant.
Benefit
If ISLE were aware of the semantics of
subsume
, it could apply two optimizations to the generated code:
The first rule which matches and is flagged as
subsume
could return early with a singleton vector. Then the caller doesn't have to process any results that it's going to ignore, and thesimplify
function doesn't have to evaluate later pattern matches whose results will be ignored anyway.Rules flagged
subsume
could be checked before other rules, if that allows us to check fewer patterns. This isn't trivial to get right but could be a nice compile-time win.Making
subsume
a part of the language is also useful for static analysis purposes, like #5704, or checking if such rules fully shadow other rules.This would allow
subsume
-like behavior in multi-terms other thansimplify
. I don't have a use-case in mind for that but it might be useful for something. Currently I think any multi-term could callsubsume
and maybe do something useful with it, but it would be difficult to use correctly anywhere except at the top-level of the right-hand side of rules insimplify
.Implementation
I'm imagining an optional keyword after
(rule
but before the left-hand side.Alternatives
We could define rule priorities in multi-terms to mean that all rules of the same priority have their results returned together, and only the highest priority which matches any rule is returned. That might be a useful generalization of "subsumes", or it might just make the rules harder to reason about.
cfallin commented on issue #5705:
This is really interesting to me because it diverges significantly from my mental model / layering -- worth digging into why, I think!
subsume
was at least originally meant to be an egraph-specific intrinsic, and I think this can still be seen today in behavior that islec could not replicate purely by itself ifsubsume
were just a way to alter multi-term behavior: the subsuming value replaces even the original input to the rewrite in the value-map as we build the egraph. In other words, if wesubsume
an expression with a constant due to cprop, it's not just that we return only thaticonst
Value
and nothing else; it's that the toplevel loop that would ordinarily build an eclass out of the input (rewritten) value and its various rewrites, instead takes just the one rewrite.Seen another way, this would mean that the conceptual
Iterator<Item = Value>
return from thesimplify
multi-term would instead need to turn into something likeIterator<Item = MaybeSubsumingValue>
whereenum MaybeSubsumingValue { Normal(Value), Subsuming(Value) }
. We could do something like that but...What I do like instead is that islec has special knowledge of a property of the
subsume
constructor, such that it tries to evaluate these rules first, as you suggest. I think it should be onsubsume
, and not on rules that use it (else we're liable to forget it); and should perhaps be something like(decl shortcut subsume (Value) Value)
whereshortcut
means "the RHS evaluating this ctor can return its value alone". We still need the external ctor call to actually make the above behavior happen though.All of that said, while writing the last paragraph, the sum-type approach is actually starting to grow on me. Perhaps what we want is a
subsume
, marked withshortcut
, that builds a(MaybeSubsumingValue.Subsuming x)
fromx
; and an implicit converter otherwise that builds a(MaybeSubsumingValue.Normal x)
fromx
. Thoughts?
jameysharp commented on issue #5705:
We discussed
subsume
yesterday and concluded that we should try treating it as an egraph-specific dynamic property of the result ofsimplify
, rather than something we annotate on individual rules. There could still be an advantage to having some way to makesimplify
return early, but until we have a clearer idea of exactly which shortcuts we want I don't think we should put too much thought into that aspect. So I'm retracting this idea, at least for now.
jameysharp closed issue #5705:
Feature
Our mid-end optimization rules define a special constructor called
subsume
inprelude_opt.isle
. Its effect is to cause the e-graph optimization pass to ignore other results from the same call tosimplify
. Let's consider making it a feature of the ISLE language instead, applicable to any multi-term.Background
For a rule-driven multi-constructor term, ISLE currently generates a Rust function which accumulates results into a
Vec
. It returns the whole vector once all pattern matches have completed.The e-graph specific implementation of
subsume
records all values it's called on in the optimization context. After all pattern matches have finished,optimize_pure_enode
checks each value from the accumulated vector to see if it is one wheresubsume
was used. The first one it finds causes the rest of the vector to be skipped, and returns a singleton e-class. However any earlier values were already added to the e-graph; they're effectively hidden from elaboration but still taking memory.This is used for any rule where we know ahead of time that if the rule matches, it will produce a result which is at least as good as any other rule we might pick. It's primarily used for rules which rewrite an expression into a constant.
Benefit
If ISLE were aware of the semantics of
subsume
, it could apply two optimizations to the generated code:
The first rule which matches and is flagged as
subsume
could return early with a singleton vector. Then the caller doesn't have to process any results that it's going to ignore, and thesimplify
function doesn't have to evaluate later pattern matches whose results will be ignored anyway.Rules flagged
subsume
could be checked before other rules, if that allows us to check fewer patterns. This isn't trivial to get right but could be a nice compile-time win.Making
subsume
a part of the language is also useful for static analysis purposes, like #5704, or checking if such rules fully shadow other rules.This would allow
subsume
-like behavior in multi-terms other thansimplify
. I don't have a use-case in mind for that but it might be useful for something. Currently I think any multi-term could callsubsume
and maybe do something useful with it, but it would be difficult to use correctly anywhere except at the top-level of the right-hand side of rules insimplify
.Implementation
I'm imagining an optional keyword after
(rule
but before the left-hand side.Alternatives
We could define rule priorities in multi-terms to mean that all rules of the same priority have their results returned together, and only the highest priority which matches any rule is returned. That might be a useful generalization of "subsumes", or it might just make the rules harder to reason about.
jameysharp commented on issue #5705:
We're tracking the new plan for
subsume
in #6105.
Last updated: Jan 24 2025 at 00:11 UTC