Java with The Foreign Function and Memory (FFM) API · general

I am finally coming back to experiment with getting Wasmtime running in Java with the now stabilized FFM API and jextract. I'm currently running into an issue with getting the the hello.c C api demo working (in Java). It looks like I'm running into a poisoned RwLock in the TypeRegistry, clearing the poisoned value gets passed the issue, but then just runs into another panic after in RegisteredType::register_singleton_rec_group. This is all happening during the call to wasmtime_func_new matching the logic in hello.c. Anyone have any tips on how track down the issue and why this lock is getting poisoned? (this is in the current release-36.0.0 branch)

Pat Hickey (Sep 04 2025 at 22:57):

If a lock is poisioned it means the thread it was running on died somehow (unwound via an exception)

Pat Hickey (Sep 04 2025 at 22:59):

You can use the wasmtime c api from multiple threads safely, because internally wasmtime is thread safe, but if you unwind across wasmtime in one of your threads its quite possible to break everything for all threads

Benjamin Fry (Sep 04 2025 at 23:00):

I'm not currently spawning any additional threads. Is there a background process in Wasmtime?

Pat Hickey (Sep 04 2025 at 23:01):

Pat Hickey (Sep 04 2025 at 23:02):

and if you're using wasmtime-wasi in there, it will start its own multi-threaded tokio to perform io

Benjamin Fry (Sep 04 2025 at 23:02):

Benjamin Fry (Sep 04 2025 at 23:04):

I'll see if I cut out the junit test runner if that removes some of the potential issues.

Pat Hickey (Sep 04 2025 at 23:04):

sorry, I really don't have any other ideas. I don't know the jvm or FFM well enough to guess what might be getting you there.

Chris Fallin (Sep 04 2025 at 23:07):

If you're able to set an environment variable WASMTIME_LOG=trace, the trace-log output from Wasmtime might give some hints as to what's going wrong

Chris Fallin (Sep 04 2025 at 23:07):

(feel free to put that in a gist and link here -- I can't guarantee I'll see anything but someone might)

Benjamin Fry (Sep 04 2025 at 23:08):

ah, cool, I was wondering about that. I hacked up the code a bunch in wasmtime already to try and trace all of this in more detail.

Alex Crichton (Sep 04 2025 at 23:09):

wasmtime/crates/c-api/src/engine.rs at a631d20afa7a0154e63c2b8aa34a979864518991 · bytecodealliance/wasmtime

A lightweight WebAssembly runtime that is fast, secure, and standards-compliant - bytecodealliance/wasmtime

Alex Crichton (Sep 04 2025 at 23:09):

Alex Crichton (Sep 04 2025 at 23:10):

if you're running into a poisoned lock that means that something panicked earlier and that's the interesting backtrace in theory

Alex Crichton (Sep 04 2025 at 23:10):

Benjamin Fry (Sep 04 2025 at 23:14):

Poisoned RwLock in Wasmtime

Poisoned RwLock in Wasmtime. GitHub Gist: instantly share code, notes, and snippets.

Benjamin Fry (Sep 04 2025 at 23:14):

Alex Crichton (Sep 04 2025 at 23:14):

Alex Crichton (Sep 04 2025 at 23:15):

is this all you see, nothing else? There in theory should be some other backtrace b/c poisoning a lock should require a panic somewhere

Benjamin Fry (Sep 04 2025 at 23:24):

So far, the engine, store, and context all are created without errors. I can compile the wat -> wasm without error. so things are "working" up to that point. I had a theory that the func_ty I was creating for the C callback was bad, but that appears to be fine as well.

Benjamin Fry (Sep 04 2025 at 23:24):

Alex Crichton (Sep 04 2025 at 23:25):

wasmtime/examples/hello.c at a631d20afa7a0154e63c2b8aa34a979864518991 · bytecodealliance/wasmtime

A lightweight WebAssembly runtime that is fast, secure, and standards-compliant - bytecodealliance/wasmtime

Benjamin Fry (Sep 04 2025 at 23:27):

Benjamin Fry (Sep 04 2025 at 23:30):

Alex Crichton (Sep 04 2025 at 23:30):

Jacob Lifshay (Sep 04 2025 at 23:30):

you could try running wasmtime inside gdb and adding catch points for rust panics, that should show wherever the panic is happening

Benjamin Fry (Sep 04 2025 at 23:31):

running in gdb will be hard... I'm going to try and remove some of the maven and junit overhead to make it just a raw java execution. that will take me a minute...

Alex Crichton (Sep 04 2025 at 23:32):

how certain are you that the bindings are right? b/c this could also be random corruption of memory or something like that

Alex Crichton (Sep 04 2025 at 23:32):

Benjamin Fry (Sep 04 2025 at 23:33):

Oh, it totally could be corruption. I'm definitely new to this Java FFM API. I've done a lot of double checking, but definitely could be something there. I did try clearing the bit, but things are clearly in a bad state at that point.

Alex Crichton (Sep 04 2025 at 23:34):

a smoking gun for wasmtime would be a reproduction with just the C API (e.g. a C file repro)

Alex Crichton (Sep 04 2025 at 23:34):

Jacob Lifshay (Sep 04 2025 at 23:34):

GDB should break on panic · Issue #21102 · rust-lang/rust

Expected behavior: when I use gdb, gdb should catch the panic and I should be able to use bt to analyze the stack. when I use RUST_BACKTRACE=1 I should see source files and line numbers in the back...

Benjamin Fry (Sep 04 2025 at 23:35):

Yeah, I assume at this point that I'm screwing something up with Java, but I've not found that yet.

Alex Crichton (Sep 04 2025 at 23:36):

Benjamin Fry (Sep 04 2025 at 23:37):

I can post a gist of the demo code that I have in Java, if you want more I could push it...

Alex Crichton (Sep 04 2025 at 23:37):

Benjamin Fry (Sep 04 2025 at 23:39):

WasmtimeJavaTest.java

WasmtimeJavaTest.java. GitHub Gist: instantly share code, notes, and snippets.

Benjamin Fry (Sep 04 2025 at 23:41):

I have my old JNI stuff lying around this codebase, I'd prefer not to push all of that until I get it cleaned up.

Alex Crichton (Sep 04 2025 at 23:44):

unsure if this would affect things but helloCallbackDesc doesn't look quite right

Alex Crichton (Sep 04 2025 at 23:44):

Benjamin Fry (Sep 04 2025 at 23:47):

wasmtime/examples/hello.c at 1047b51183f5906ded5d82ec375f77e586485b5f · bytecodealliance/wasmtime

A lightweight WebAssembly runtime that is fast, secure, and standards-compliant - bytecodealliance/wasmtime

Benjamin Fry (Sep 04 2025 at 23:47):

Alex Crichton (Sep 04 2025 at 23:48):

no I think I'm just confused, it looks like the first parameter is actually the return type, then it's all the param types

Alex Crichton (Sep 04 2025 at 23:48):

I thought it was just the param types but then the return type wouldn't otherwise be specified anywhere

Alex Crichton (Sep 04 2025 at 23:49):

Alex Crichton (Sep 04 2025 at 23:50):

IIRC panicking/poisoning goes through TLS infrastructure in the rust standard library and maybe something about that is super broken in this context

Alex Crichton (Sep 04 2025 at 23:50):

so, e.g., when the lock is originally unlocked it mistakenly thinks the thread is panicking because the implementation of TLS is broken

Alex Crichton (Sep 04 2025 at 23:51):

to confirm/deny this since it looks like you have a custom build of Wasmtime already you might be able to print this function's result in various places throughout wasmtime

Alex Crichton (Sep 04 2025 at 23:51):

that should always return false but if it prints true then something is gone wrong

Benjamin Fry (Sep 04 2025 at 23:51):

Alex Crichton (Sep 04 2025 at 23:52):

how is wasmtime linked? I presume it's not statically linked so is java dlopen'ing the libwasmtime.so somewhere?

Benjamin Fry (Sep 04 2025 at 23:54):

Benjamin Fry (Sep 04 2025 at 23:57):

here's the hs_err log file from java that has a bunch of state captured, if you're interested... (which is a SEGFAULT that I thought was partially due to the panic and the poisoned lock as I dug deeper). https://gist.github.com/bluejekyll/760a232f39c651647552e095f2451e24

Wasmtime C API with Java Heap

Wasmtime C API with Java Heap. GitHub Gist: instantly share code, notes, and snippets.

Benjamin Fry (Sep 04 2025 at 23:59):

Alex Crichton (Sep 05 2025 at 00:02):

notably RegisteredType::new is on the stack which does rwlock things which hits tls

Alex Crichton (Sep 05 2025 at 00:06):

C++ 11 thread_local and "foreign" threads

I would like to use C++ 11 thread_local, but our application embeds a JVM, and sometimes C++ methods are called from Java-created thread via JNI. This is essentially the same problem as if an exter...

Benjamin Fry (Sep 05 2025 at 00:07):

Alex Crichton (Sep 05 2025 at 00:09):

Benjamin Fry (Sep 05 2025 at 00:11):

Yeah, I'll keep digging. I think some of these hints have been good so far, and at least give me some things to experiment with.

David Lloyd (Sep 05 2025 at 13:00):

I'd be interested in hearing more about how/if the JVM is trashing the TLS context if you find out anything specific

Benjamin Fry (Sep 05 2025 at 22:48):

I found something about some potential issues with signals for traps, disabling signals_based_traps (which btw, is not exposed to the c-api) seems to have "helped". I'm now getting to a more consistent failure at a slightly different location. But this is progress.

Jacob Lifshay (Sep 05 2025 at 22:58):

that would make sense since I'd expect both the JVM and wasmtime use SIGSEGV or similar for catching illegal memory accesses (null references for Java)

Chris Fallin (Sep 05 2025 at 23:01):

Wasmtime does have logic to forward on to an already-registered signal handler (see here); so this isn't a slam-dunk obvious conflict, at least, though there could still be weird interactions of course.

wasmtime/crates/wasmtime/src/runtime/vm/sys/unix/signals.rs at 9f47be2ed6b4ea99fd86f2592277a26d65eff5da · bytecodealliance/wasmtime

A lightweight WebAssembly runtime that is fast, secure, and standards-compliant - bytecodealliance/wasmtime

Alex Crichton (Sep 05 2025 at 23:15):

ok I know I'm a broken record but wasmtime's signal handler accesses TLS, and if we assume that the JVM sort of randomly gets signals for GC and whatnot and/or for other threads, and if we assume that accessing TLS in Rust is an issue, then that would explain a why a nondeterministic error with signal handling would be replaced by a deterministic error without signal handling. (but perhaps still point a smoking gun at tls...)

Benjamin Fry (Sep 05 2025 at 23:27):

Yeah, I'm continuing to try and track that down. But disabling the signal handling gives me a consistent failure scenario, whereas before it was hard to track down.

Benjamin Fry (Sep 05 2025 at 23:30):

Other things I need to double check somehow is if the Arena based allocations in the Java layer are somehow not playing nice in Rust, like somehow having different layouts or something.

Pat Hickey (Sep 05 2025 at 23:31):

Pat Hickey (Sep 05 2025 at 23:34):

both your libc and the jvm are going to be implementing their allocators by asking the OS for pages through mmap, i wouldnt be too suspicious about that compared to the red flags around TLS

Stream: general

Topic: Java with The Foreign Function and Memory (FFM) API

Benjamin Fry (Sep 04 2025 at 22:46):

Pat Hickey (Sep 04 2025 at 22:57):

Pat Hickey (Sep 04 2025 at 22:59):

Benjamin Fry (Sep 04 2025 at 23:00):

Pat Hickey (Sep 04 2025 at 23:01):

Pat Hickey (Sep 04 2025 at 23:01):

Pat Hickey (Sep 04 2025 at 23:02):

Benjamin Fry (Sep 04 2025 at 23:02):

Benjamin Fry (Sep 04 2025 at 23:04):

Pat Hickey (Sep 04 2025 at 23:04):

Chris Fallin (Sep 04 2025 at 23:07):

Chris Fallin (Sep 04 2025 at 23:07):

Benjamin Fry (Sep 04 2025 at 23:08):

Alex Crichton (Sep 04 2025 at 23:09):

Alex Crichton (Sep 04 2025 at 23:09):

Alex Crichton (Sep 04 2025 at 23:10):

Alex Crichton (Sep 04 2025 at 23:10):

Benjamin Fry (Sep 04 2025 at 23:14):

Benjamin Fry (Sep 04 2025 at 23:14):

Alex Crichton (Sep 04 2025 at 23:14):

Alex Crichton (Sep 04 2025 at 23:14):

Alex Crichton (Sep 04 2025 at 23:14):

Alex Crichton (Sep 04 2025 at 23:15):

Benjamin Fry (Sep 04 2025 at 23:24):

Benjamin Fry (Sep 04 2025 at 23:24):

Alex Crichton (Sep 04 2025 at 23:25):

Benjamin Fry (Sep 04 2025 at 23:27):

Benjamin Fry (Sep 04 2025 at 23:30):

Alex Crichton (Sep 04 2025 at 23:30):

Jacob Lifshay (Sep 04 2025 at 23:30):

Benjamin Fry (Sep 04 2025 at 23:31):

Alex Crichton (Sep 04 2025 at 23:32):

Alex Crichton (Sep 04 2025 at 23:32):

Alex Crichton (Sep 04 2025 at 23:32):

Benjamin Fry (Sep 04 2025 at 23:33):

Alex Crichton (Sep 04 2025 at 23:34):

Alex Crichton (Sep 04 2025 at 23:34):

Jacob Lifshay (Sep 04 2025 at 23:34):

Benjamin Fry (Sep 04 2025 at 23:35):

Alex Crichton (Sep 04 2025 at 23:36):

Alex Crichton (Sep 04 2025 at 23:36):

Benjamin Fry (Sep 04 2025 at 23:37):

Alex Crichton (Sep 04 2025 at 23:37):

Benjamin Fry (Sep 04 2025 at 23:39):

Benjamin Fry (Sep 04 2025 at 23:41):

Alex Crichton (Sep 04 2025 at 23:44):

Alex Crichton (Sep 04 2025 at 23:44):

Alex Crichton (Sep 04 2025 at 23:44):

Benjamin Fry (Sep 04 2025 at 23:47):

Benjamin Fry (Sep 04 2025 at 23:47):

Alex Crichton (Sep 04 2025 at 23:48):

Alex Crichton (Sep 04 2025 at 23:48):

Alex Crichton (Sep 04 2025 at 23:49):

Alex Crichton (Sep 04 2025 at 23:50):

Alex Crichton (Sep 04 2025 at 23:50):

Alex Crichton (Sep 04 2025 at 23:51):

Alex Crichton (Sep 04 2025 at 23:51):

Benjamin Fry (Sep 04 2025 at 23:51):

Benjamin Fry (Sep 04 2025 at 23:51):

Alex Crichton (Sep 04 2025 at 23:52):

Benjamin Fry (Sep 04 2025 at 23:54):

Benjamin Fry (Sep 04 2025 at 23:57):

Benjamin Fry (Sep 04 2025 at 23:59):

Alex Crichton (Sep 05 2025 at 00:02):

Alex Crichton (Sep 05 2025 at 00:02):

Alex Crichton (Sep 05 2025 at 00:06):

Benjamin Fry (Sep 05 2025 at 00:07):

Alex Crichton (Sep 05 2025 at 00:09):

Benjamin Fry (Sep 05 2025 at 00:11):

David Lloyd (Sep 05 2025 at 13:00):

Benjamin Fry (Sep 05 2025 at 22:48):

Jacob Lifshay (Sep 05 2025 at 22:58):

Chris Fallin (Sep 05 2025 at 23:01):

Alex Crichton (Sep 05 2025 at 23:15):

Benjamin Fry (Sep 05 2025 at 23:27):

Benjamin Fry (Sep 05 2025 at 23:30):

Pat Hickey (Sep 05 2025 at 23:31):

Pat Hickey (Sep 05 2025 at 23:34):