Stream: git-wasmtime

Topic: wasmtime / PR #5726 egraphs: fix accidental remat of call.


view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:12):

cfallin requested elliottt for a review on PR #5726.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:12):

cfallin opened PR #5726 from fix-5716 to main:

In the provided test case in #5716, the result of a call was then added to 0. We have a rewrite rule that sets the remat-bit on any add of a value and a constant, because these frequently appear (e.g. from address offset calculations) and this can frequently reduce register pressure (one long-lived base vs. many long-lived base+offset values). Separately, we have an algebraic rule that x+0 rewrites to x.

The result of this was that we had an eclass with the remat bit set on the add, but the add was also union'd into the call. We pick the latter during extraction, because it's cheaper not to do the add at all; but we still get the remat bit, and try to remat a call (!), which blows up later.

This PR fixes the logic to look up the "best value" for a value (i.e., whatever extraction determined), and look up the remat bit on that node, not the canonical node.

(Why did the canonical node become the iadd and not the call? Because the latter had a lower value-number, as an accident of IR construction; we don't impose any requirements on the input CLIF's value-number ordering, and I don't think this breaks any of the important acyclic properties, even though there is technically a dependence from a lower-numbered to a higher-numbered node. In essence one can think of them as having "virtual numbers" in any true topologically-sorted order, and the only place the actual integer indices matter should be in choosing the "canonical ID", which is just used for dedup'ing, modulo this bug.)

Fixes #5716.

<!--

Please ensure that the following steps are all taken care of before submitting
the PR.

Please ensure all communication adheres to the code of conduct.
-->

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:12):

cfallin requested fitzgen for a review on PR #5726.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:12):

cfallin requested jameysharp for a review on PR #5726.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:13):

cfallin updated PR #5726 from fix-5716 to main.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:13):

cfallin edited PR #5726 from fix-5716 to main:

In the provided test case in #5716, the result of a call was then added to 0. We have a rewrite rule that sets the remat-bit on any add of a value and a constant, because these frequently appear (e.g. from address offset calculations) and this can frequently reduce register pressure (one long-lived base vs. many long-lived base+offset values). Separately, we have an algebraic rule that x+0 rewrites to x.

The result of this was that we had an eclass with the remat bit set on the add, but the add was also union'd into the call. We pick the latter during extraction, because it's cheaper not to do the add at all; but we still get the remat bit, and try to remat a call (!), which blows up later.

This PR fixes the logic to look up the "best value" for a value (i.e., whatever extraction determined), and look up the remat bit on that node, not the canonical node.

(Why did the canonical node become the iadd and not the call? Because the former had a lower value-number, as an accident of IR construction; we don't impose any requirements on the input CLIF's value-number ordering, and I don't think this breaks any of the important acyclic properties, even though there is technically a dependence from a lower-numbered to a higher-numbered node. In essence one can think of them as having "virtual numbers" in any true topologically-sorted order, and the only place the actual integer indices matter should be in choosing the "canonical ID", which is just used for dedup'ing, modulo this bug.)

Fixes #5716.

<!--

Please ensure that the following steps are all taken care of before submitting
the PR.

Please ensure all communication adheres to the code of conduct.
-->

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:40):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:40):

jameysharp created PR review comment:

Should this lookup also switch to best_value instead of canonical_value? What about the later uses of canonical_value? Perhaps canonical_value should just be dead as soon as we've looked up best_value, maybe enforced by shadowing it?

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:49):

cfallin updated PR #5726 from fix-5716 to main.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:50):

cfallin submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 22:50):

cfallin created PR review comment:

This actually needs to be the canonical-value, to avoid unnecessary codegen!

The reason is that we record the elaborated values in the map with canonical-ID as key, and we want to dedup as much as possible here. Otherwise if we refer to a value with different IDs that point to different parts of the eclass, we'll have unnecessary misses and recomputations.

However it's good that you asked because I went over the whole function and saw that below, another canonical_value should be best_value, when we look up remat in the other case!

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 23:00):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 23:03):

cfallin updated PR #5726 from fix-5716 to main.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 23:05):

cfallin has enabled auto merge for PR #5726.

view this post on Zulip Wasmtime GitHub notifications bot (Feb 06 2023 at 23:36):

cfallin merged PR #5726.


Last updated: Jan 24 2025 at 00:11 UTC