Stream: git-wasmtime

Topic: wasmtime / PR #5490 Update Intel x86 CPU presets to match...


view this post on Zulip Wasmtime GitHub notifications bot (Dec 26 2022 at 19:04):

MozarellaMan opened PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main:

<!--

Please ensure that the following steps are all taken care of before submitting
the PR.

Please ensure all communication adheres to the code of conduct.
-->
This is for #5444

I decided to just start with the Intel CPUs for now just to get feedback on whether this is an alright way to sync up + update cranelift's to match LLVM's presets. I used this (thanks @bjorn3!) as a reference for the features and presets used by LLVM.

Changes

Notes

view this post on Zulip Wasmtime GitHub notifications bot (Dec 27 2022 at 13:06):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 04:19):

abrown created PR review comment:

Is this duplicated from up above?

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 04:19):

abrown submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 04:19):

abrown submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

Looks like LLVM has an alias for silvermont called slm which I guess we should mirror here.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

I think there's a mismatch between how we specify features and how LLVM does it.

Apparently LLVM's FeatureSSE41 implies FeatureSSSE3 which implies FeatureSSE3, for example. Our boolean settings don't imply other settings. Instead, if you look at the nehalem preset for example, it's specified as preset!(has_sse3 && has_ssse3 && has_sse41 && ...).

So these definitions for core2 and penryn are both missing has_sse3, and penryn is missing has_ssse3. Later presets are similarly missing implied features.

I'm not sure whether it's better to follow the existing pattern and duplicate these implied feature sets everywhere, or to introduce new presets for features which are related in this way:

let sse3 = settings.add_preset("sse3", "SSE3 and earlier.", preset!(has_sse3));
let ssse3 = settings.add_preset("ssse3", "SSSE3 and earlier.", preset!(sse3 && has_ssse3));
let sse41 = settings.add_preset("sse41", "SSE4.1 and earlier.", preset!(ssse3 && has_sse41));
let sse42 = settings.add_preset("sse42", "SSE4.2 and earlier.", preset!(sse41 && has_sse42));
...
settings.add_preset("nocona", "Nocona microarchitecture.", preset!(sse3));
settings.add_preset("core2", "Core 2 microarchitecture.", preset!(ssse3));
settings.add_preset("penryn", "Penryn microarchitecture.", preset!(sse41));
let nehalem = settings.add_preset("nehalem", "Nehalem microarchitecture.", preset!(sse42 && has_popcnt));

Either way the flattened list of features is included in comments in the generated Rust in target/*/build/cranelift-codegen-*/out/settings-x86.rs, which is nice for double-checking the results.

(@fitzgen, perhaps you have opinions on this?)

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

Let's match how LLVM defines Haswell. Apparently we were missing AVX/AVX2 and FMA.

        preset!(ivy_bridge && has_avx2 && has_bmi1 && has_bmi2 && has_fma && has_lzcnt),

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

Since LLVM claims Haswell included FMA, we can drop it from the preset for Broadwell.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

I appreciate that you're keeping the preset with the icelake name that we used previously. But let's do something more like this to emphasize that it's a compatibility alias, not from LLVM:

let icelake_client = settings.add_preset("icelake-client", ..., preset!(cannonlake && has_avx512bitalg));
// LLVM doesn't use the name "icelake" but Cranelift did in the past; alias it
settings.add_preset("icelake", ..., preset!(icelake_client));

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

Yeah, this copy of the alderlake definition should be deleted. But LLVM is confusing here because it defines Alder Lake between Sapphire Rapids and Raptor Lake, but uses it as a base for earlier microarchitectures. Maybe leave a comment here noting it was defined above, and a comment at the definition above noting that LLVM defines it later.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 04 2023 at 21:05):

jameysharp created PR review comment:

If I've read the LLVM tables correctly, Nocona is the first Intel 64-bit microarchitecture, so these are the 32-bit targets I'd like to remove.

    // Netburst

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:10):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:10):

MozarellaMan submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:10):

MozarellaMan created PR review comment:

Personally, I would prefer the features implying the others - it makes sense now that you pointed it out. The added description in the setting preset makes that more obivous.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:20):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:25):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:27):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 22:40):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 23:34):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 23:34):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 11 2023 at 23:34):

jameysharp created PR review comment:

If I'm reading the LLVM tables correctly, it exposes this as _both_ "slm" and "silvermont", but defines them identically. So we should do the same. I think it makes sense to use the longer name "silvermont" as the let-bound preset, and define the "slm" preset in terms of "silvermont".

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 19:39):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 20:29):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 20:29):

jameysharp created PR review comment:

I think the duplicate definition of alderlake is the last issue remaining here. Once that's fixed I think this PR is ready!

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 20:35):

MozarellaMan updated PR #5490 from update_x86_target_intel_cpu_presets_to_match_llvm to main.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 20:42):

jameysharp submitted PR review.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 20:43):

jameysharp has enabled auto merge for PR #5490.

view this post on Zulip Wasmtime GitHub notifications bot (Jan 13 2023 at 21:02):

jameysharp merged PR #5490.


Last updated: Nov 22 2024 at 17:03 UTC