badeend requested fitzgen for a review on PR #9225.
badeend requested wasmtime-core-reviewers for a review on PR #9225.
badeend opened PR #9225 from badeend:graceful-shutdown
to bytecodealliance:main
:
- Fix Linux-specific peculiarity where recv continues to work even after
shutdown(sock, SHUT_RD)
has been called.- Calling
shutdown
_after_ data has been accepted bywrite
, but _before_ it has been fully flushed, caused the in-flight data to be lost. Now, the nativeshutdown
syscall is deferred until the background write finishes.To facilitate the parent/child resource communication, I've placed the TCP stream implementations behind an
Arc<Mutex<..>>
. WASI is single threaded, so in practice these Mutexes should never be contended. It does make async functions likeready
a bit awkward, though.Tangentially related:
- Renamed:
-abort_wait
->cancel
-LastWrite
->WriteState
-Done
->Ready
-Waiting
-> `Writing
badeend updated PR #9225.
alexcrichton edited PR #9225:
- Fix Linux-specific peculiarity where recv continues to work even after
shutdown(sock, SHUT_RD)
has been called.- Calling
shutdown
_after_ data has been accepted bywrite
, but _before_ it has been fully flushed, caused the in-flight data to be lost. Now, the nativeshutdown
syscall is deferred until the background write finishes.To facilitate the parent/child resource communication, I've placed the TCP stream implementations behind an
Arc<Mutex<..>>
. WASI is single threaded, so in practice these Mutexes should never be contended. It does make async functions likeready
a bit awkward, though.Tangentially related:
- Renamed:
-abort_wait
->cancel
-LastWrite
->WriteState
-Done
->Ready
-Waiting
->Writing
alexcrichton commented on PR #9225:
Thanks for this! Would it be possible to use
tokio::sync::Mutex
andtry_lock
to simplify some of thepoll_*
functions? That should enable holding a lock across.await
while still being able to return errors on contention which may make this a bit easier in a few places. (I think we usetokio::sync::Mutex
for stdio related things too)
badeend updated PR #9225.
badeend commented on PR #9225:
Certainly. Just pushed the changes.
alexcrichton submitted PR review:
If you'll indulge me a bit more, in thinking more about this I'm second-guessing the coupling where
TcpState
owns the reader/writer streams instead of just the socket. That to me seems possible indicative of future changes being required towasi:sockets
which wouldn't require such coupling, but we can't do that now so that's less the issue.In this situation though, if I'm understanding this right, the only reason for this split is so an active write isn't blocked out by accident when the tcp socket is shut down without going through the writer stream, right? If that's the case do you think it would be worthwhile to change the current storage of
Arc<TcpStream>
to just that plus aArc<Mutex<SomeWriterState>>
? That way it's clear that there's only one shared piece of state and it's just the internals of the writer.My hope is that it would be more obvious that the intention is to not hand out multiple readers/writers via the
Arc<Mutex<...>>
and also minimize the scope/breadth of the mutex in question.If you think the refactoring isn't worth it though then I think this is ok as-is.
alexcrichton created PR review comment:
Could this use the
try_lock_*
helpers instead oflock()
to consistently trap on contention? That might help prevent covering up bugs by accident where some contention is handled in some places but not others.
badeend submitted PR review.
badeend created PR review comment:
Subscribe::ready has no return type, so I can't return a trap
badeend commented on PR #9225:
Apologies. Maybe its getting too late for me, but I don't understand what you're saying. Could you maybe clarify with a simple code snippet?
alexcrichton submitted PR review.
alexcrichton created PR review comment:
Hm well actually as I look into looks like stdio is already inconsistent, so let's leave this as-is.
alexcrichton commented on PR #9225:
I'm wondering if we can change the previous
Connected(Arc<tokio::net::TcpStream>),
toConnected(Arc<tokio::net::TcpStream>, Arc<Mutex<WriteState>>)
, instead of having reader/writer streams. There's be no change for a read-level shutdown but for a write-level shutdown it would call shutdown onWriteState
which would handle "if there's a pending write wait for that first" and if there's nothing it would immediately do the shutdown.
badeend commented on PR #9225:
no change for a read-level shutdown
shutdown
needs access to the reader to fix the first issue:Linux-specific peculiarity where recv continues to work even after shutdown(sock, SHUT_RD) has been called.
A completely different way to go around this all is to revert the Mutex entirely and let the
shutdown
binding go through the resource table to mutate the child resources directly. E.g.fn shutdown(&mut self, this: Resource<tcp::TcpSocket>, shutdown_type: ShutdownType) -> SocketResult<()> { let table = self.table(); if let ShutdownType::Receive | ShutdownType::Both = shutdown_type { let input = table.iter_children(&this)?.find_map(|c| c.downcast_mut::<InputStream>()); input.shutdown(); } // etc. }
but this would require some kind of upcasting feature to get from
InputStream
back to aTcpReadStream
(in this case). I didn't want to pollute theHostInput/OutputStream
traits to fix a TCP-specific problem. If you're fine with it, I can go this direction too if you want.
alexcrichton submitted PR review:
Ah ok, nah if this is needed for the read bits anyway this is fine to land. Let's go ahead and land as-is and can always consider refactorings later if necessary, but I don't think it's urgent or really required.
alexcrichton merged PR #9225.
Last updated: Jan 24 2025 at 00:11 UTC