Stream: git-wasmtime

Topic: wasmtime / PR #8633 Change the function alignment of x86 ...


view this post on Zulip Wasmtime GitHub notifications bot (May 16 2024 at 07:04):

hungryzzz opened PR #8633 from hungryzzz:fix-8573 to bytecodealliance:main:

<!--
Please make sure you include the following information:

Our development process is documented in the Wasmtime book:
https://docs.wasmtime.dev/contributing-development-process.html

Please ensure all communication follows the code of conduct:
https://github.com/bytecodealliance/wasmtime/blob/main/CODE_OF_CONDUCT.md
-->

The current default 16-bytes function alignment for x86-64 would cause suboptimal execution performance under some cases which are reported in https://github.com/bytecodealliance/wasmtime/issues/8573.

Based on the discussion "the CPU frontend grabs an aligned 32B or 64B chunk at a time" in https://github.com/bytecodealliance/wasmtime/issues/8573, this PR changes the default alignment from 16-bytes to 32-bytes for better performance.

Also rerun the cases reported in https://github.com/bytecodealliance/wasmtime/issues/8573 and the execution time will back to normal.

# After changes  case  wasmtime compile good.wasm -o good.cwasm
➜  case  wasmtime compile bad.wasm -o bad.cwasm
➜  case  time wasmtime run --allow-precompiled good.cwasm
~/wasmtime/target/release/wasmtime run  good.cwasm  4.67s user 0.00s system 100% cpu 4.674 total
➜  case  time wasmtime run --allow-precompiled bad.cwasm
~/wasmtime/target/release/wasmtime run  bad.cwasm  4.68 user 0.01s system 100% cpu 9.365 total


# Before changes  case  wasmtime compile good.wasm -o good.cwasm
➜  case  wasmtime compile bad.wasm -o bad.cwasm
➜  case  time wasmtime run --allow-precompiled good.cwasm
~/wasmtime/target/release/wasmtime run  good.cwasm  4.67s user 0.00s system 100% cpu 4.674 total
➜  case  time wasmtime run --allow-precompiled bad.cwasm
~/wasmtime/target/release/wasmtime run  bad.cwasm  9.36s user 0.01s system 100% cpu 9.365 total

view this post on Zulip Wasmtime GitHub notifications bot (May 16 2024 at 07:04):

hungryzzz requested elliottt for a review on PR #8633.

view this post on Zulip Wasmtime GitHub notifications bot (May 16 2024 at 07:04):

hungryzzz requested wasmtime-compiler-reviewers for a review on PR #8633.

view this post on Zulip Wasmtime GitHub notifications bot (May 16 2024 at 07:06):

hungryzzz edited PR #8633:

<!--
Please make sure you include the following information:

Our development process is documented in the Wasmtime book:
https://docs.wasmtime.dev/contributing-development-process.html

Please ensure all communication follows the code of conduct:
https://github.com/bytecodealliance/wasmtime/blob/main/CODE_OF_CONDUCT.md
-->

The current default 16-bytes function alignment for x86-64 would cause suboptimal execution performance under some cases which are reported in https://github.com/bytecodealliance/wasmtime/issues/8573.

Based on the discussion "the CPU frontend grabs an aligned 32B or 64B chunk at a time" in https://github.com/bytecodealliance/wasmtime/issues/8573, this PR changes the default alignment from 16-bytes to 32-bytes for better performance.

Also rerun the cases reported in https://github.com/bytecodealliance/wasmtime/issues/8573 and the execution time will back to normal.

# After changes  case  wasmtime compile good.wasm -o good.cwasm
➜  case  wasmtime compile bad.wasm -o bad.cwasm
➜  case  time wasmtime run --allow-precompiled good.cwasm
~/wasmtime/target/release/wasmtime run  good.cwasm  4.68s user 0.00s system 100% cpu 4.680 total
➜  case  time wasmtime run --allow-precompiled bad.cwasm
~/wasmtime/target/release/wasmtime run  bad.cwasm  4.67s user 0.01s system 99% cpu 4.681 total


# Before changes  case  wasmtime compile good.wasm -o good.cwasm
➜  case  wasmtime compile bad.wasm -o bad.cwasm
➜  case  time wasmtime run --allow-precompiled good.cwasm
~/wasmtime/target/release/wasmtime run  good.cwasm  4.67s user 0.00s system 100% cpu 4.674 total
➜  case  time wasmtime run --allow-precompiled bad.cwasm
~/wasmtime/target/release/wasmtime run  bad.cwasm  9.36s user 0.01s system 100% cpu 9.365 total

view this post on Zulip Wasmtime GitHub notifications bot (May 16 2024 at 14:27):

alexcrichton commented on PR #8633:

Thanks for this! It looks like this is causing changes in the disassembly of some existing tests. You can update the disassembly with WASMTIME_TEST_BLESS=1 cargo test --test disas locally and commit those changes to get pushed up here

view this post on Zulip Wasmtime GitHub notifications bot (May 17 2024 at 15:42):

hungryzzz requested wasmtime-core-reviewers for a review on PR #8633.

view this post on Zulip Wasmtime GitHub notifications bot (May 17 2024 at 15:42):

hungryzzz updated PR #8633.

view this post on Zulip Wasmtime GitHub notifications bot (May 17 2024 at 15:59):

alexcrichton submitted PR review:

Thanks!

view this post on Zulip Wasmtime GitHub notifications bot (May 17 2024 at 16:13):

alexcrichton merged PR #8633.


Last updated: Jan 24 2025 at 00:11 UTC