Stream: general

Topic: greetings


view this post on Zulip Austin Huang (Nov 16 2020 at 22:20):

Hi everyone.

I'm new here (via @Lin Clark)... already started a thread under #wasi on some initial pytorch/libtorch wasm experiments my collaborators and I are experimenting with (apologies for the basic questions at this point).

Hope to connect with others on efforts to bring machine learning to wasm.

Sounds like @Till Schneidereit @Mingqiu Sun and @Andrew Brown are the ones to connect with on this topic? Congrats on the 2208 PR - seems like the ball is rolling in this space.

view this post on Zulip Andrew Brown (Nov 17 2020 at 21:49):

@Austin Huang, nice to meet you; are you interested in trying out wasi-nn using Wasmtime?

view this post on Zulip Austin Huang (Nov 17 2020 at 21:53):

@Andrew Brown yes though i'm still learning the basics at this point. also probably need to catch up on the design discussions to date (do you have any links to check out so I don't rehash what's already been discussed?).

As I understand it it's focusing on a runtime for compute graph inference, right? What are the paths to get something compiled down to an artifact it can consume? is there a compilation path, from say, onnx or torchscript?

view this post on Zulip Andrew Brown (Nov 17 2020 at 21:55):

Well, there's an example script that should give you some idea how to perform the inference and all of those artifacts were generated from the instructions I documented here

Standalone JIT-style runtime for WebAssembly, using Cranelift - bytecodealliance/wasmtime
Test Fixtures for OpenVINO Inference. GitHub Gist: instantly share code, notes, and snippets.

view this post on Zulip Andrew Brown (Nov 17 2020 at 21:56):

I am working on a blog post that should be a better explainer for how to use wasi-nn so there's also the option to wait for that :smile:

view this post on Zulip Austin Huang (Nov 17 2020 at 22:06):

@Andrew Brown interesting. why is wasi-nn described as a library here? i was under the impression it's more of a standard the runtime implements.

i haven't used openvino but it sounds like it can consume onnx as a common denominator. could there be a more direct path (from say pytorch or tf? tbh i hear more about other IRs (xla, mlir, onnx ...) ... though i get the IR space is pretty fluid so gotta start somewhere i suppose.

view this post on Zulip Andrew Brown (Nov 17 2020 at 22:14):

Not sure where you see wasi-nn being described as a library (let me know and I will fix!). It is a specification so I should be using "spec" or "standard," though there might be confusion because wasi-nn is implemented in Wasmtime using the wasmtime-wasi-nn crate. That is the crate talks to OpenVINO to actually perform the inference.

view this post on Zulip Austin Huang (Nov 17 2020 at 22:15):

"Not all libraries (e.g. wasi-nn) will know how to decode images..." https://gist.github.com/abrown/c7847bf3701f9efbb2070da1878542c1 maybe i'm misreading

Test Fixtures for OpenVINO Inference. GitHub Gist: instantly share code, notes, and snippets.

view this post on Zulip Andrew Brown (Nov 17 2020 at 22:17):

Cool, let me fix that... As for whether OpenVINO supports other IRs, take a look at their model optimizer documentation--there is a tool that allows for converting one IR to another (not sure if it covers exactly what you're looking to do?)

view this post on Zulip Andrew Brown (Nov 17 2020 at 22:18):

E.g.: https://docs.openvinotoolkit.org/latest/openvino_docs_MO_DG_prepare_model_convert_model_Converting_Model.html

view this post on Zulip Austin Huang (Nov 17 2020 at 22:42):

will have a look. There's a lot of different things I'd like to do :) but my immediate aim is to wrap my head around the scope of wasi-nn (what's in and out of scope) and how to interface with it from a programming langauge runtime (C++ will be a focus for now since I use libtorch)

view this post on Zulip Andrew Brown (Nov 17 2020 at 22:54):

Sounds good, let me know if you run into issues; here's @Mingqiu Sun and I talking about wasi-nn: YouTube - Introducing WASI-NN.


Last updated: Nov 22 2024 at 17:03 UTC