peterhuene opened PR #2811 from improve-store-registration
to main
:
This PR refactors the store frame information to eliminate the copying of
data out fromCompiledModule
.It also moves the population of a
BTreeMap
out of the frame information and
intoCompiledModule
where it is only ever calculated once rather than at
every new module instantiation into aStore
. The map is also lazy-initialized
so the cost of populating the map is incurred only when a trap occurs.This should help improve instantiation time of modules with a large number of
functions and functions with lots of instructions.
peterhuene requested alexcrichton for a review on PR #2811.
peterhuene submitted PR Review.
peterhuene created PR Review Comment:
I just realized this doesn't need to be
Arc
any more; will fix.
peterhuene edited PR Review Comment.
peterhuene updated PR #2811 from improve-store-registration
to main
.
alexcrichton submitted PR Review.
alexcrichton submitted PR Review.
alexcrichton created PR Review Comment:
The
is_wasm_pc
function is somewhat special in that it can primarily get invoked from a signal handler on Unix and Windows. Technically we're doing signal-unsafe things here by grabbing a lock as soon as we enter this function, but it's presumed fine in that we're pretty unlikely to segfault in this function itself. This reminds me, though, could you add some words to this function's documentation indicating that it's called from signal handlers and as a result the contents should be carefully written?I was originally worried slightly that this call to
func
would involve assembling theBTreeMap
which could involve allocations which could be tricky in a signal handler native-stack wise, but in thinking about it I'm not too worried about that.Thinking a bit more about this, though, I'm not actually sure if we need the
BTreeMap
any more. Sincefunc_map()
is only ever used for function lookup, we can probably just do a binary search on an array with the pc? I presume that the list of functions is sorted in address order in some array in the compiled artifacts anyway. If we use that I think we can actually remove the need for that whole map entirely?
alexcrichton created PR Review Comment:
I think this can avoid returning an
Option
? Most of the usage of indexing we have through things likeDefinedFuncIndex
we have panic if invalid
peterhuene submitted PR Review.
peterhuene created PR Review Comment:
I agree, we probably don't need a
BTreeMap
here. I'll improve this further.
peterhuene submitted PR Review.
peterhuene created PR Review Comment:
I think we can guarantee that
finished_functions
are in sort-order (they're based on the ELF symbols which we are writing in-order) so we should be able to just utilize that for the search (based on the start vs. the end).
fitzgen submitted PR Review.
fitzgen submitted PR Review.
fitzgen created PR Review Comment:
We could do a
with_capacity
here.
peterhuene submitted PR Review.
peterhuene created PR Review Comment:
Good eye! Thankfully I'm about to push up a change that removes this
BTreeMap
and just does a binary search on the finished function ranges directly.
peterhuene updated PR #2811 from improve-store-registration
to main
.
peterhuene requested fitzgen for a review on PR #2811.
peterhuene requested alexcrichton and fitzgen for a review on PR #2811.
alexcrichton submitted PR Review.
fitzgen submitted PR Review.
peterhuene edited PR #2811 from improve-store-registration
to main
:
This PR refactors the store frame information to eliminate the copying of
data out fromCompiledModule
.It also removes the population of a
BTreeMap
in favor of doing a binary
search on the finished functions in the associatedCompiledModule
.This should help improve instantiation time of modules with a large number of
functions and functions with lots of instructions.
peterhuene merged PR #2811.
Last updated: Jan 24 2025 at 00:11 UTC