How can I return a JavaScript string from a WebAssembly function
How can I return a JavaScript string from a WebAssembly function?
Can the following module be written in C(++) ?
export function foo() {
return 'Hello World!';
}
Also: Can I pass this to the JS engine to be garbage collected?
Solution 1:
WebAssembly doesn't natively support a string type, it rather supports i32
/ i64
/ f32
/ f64
value types as well as i8
/ i16
for storage.
You can interact with a WebAssembly instance using:
-
exports
, where from JavaScript you call into WebAssembly, and WebAssembly returns a single value type. -
imports
where WebAssembly calls into JavaScript, with as many value types as you want (note: the count must be known at Module compilation time, this isn't an array and isn't variadic). -
Memory.buffer
, which is anArrayBuffer
that can be indexed using (among others)Uint8Array
.
It depends on what you want to do, but it seems like accessing the buffer directly is the easiest:
const bin = ...; // WebAssembly binary, I assume below that it imports a memory from module "imports", field "memory".
const module = new WebAssembly.Module(bin);
const memory = new WebAssembly.Memory({ initial: 2 }); // Size is in pages.
const instance = new WebAssembly.Instance(module, { imports: { memory: memory } });
const arrayBuffer = memory.buffer;
const buffer = new Uint8Array(arrayBuffer);
If your module had a start
function then it got executed at instantiation time. Otherwise you'll likely have an export which you call, e.g. instance.exports.doIt()
.
Once that's done, you need to get string size + index in memory, which you would also expose through an export:
const size = instance.exports.myStringSize();
const index = instance.exports.myStringIndex();
You'd then read it out of the buffer:
let s = "";
for (let i = index; i < index + size; ++i)
s += String.fromCharCode(buffer[i]);
Note that I'm reading 8-bit values from the buffer, I'm therefore assuming the strings were ASCII. That's what std::string
would give you (index in memory would be what .c_str()
returns), but to expose something else such as UTF-8 you'd need to use a C++ library supporting UTF-8, and then read UTF-8 yourself from JavaScript, obtain the codepoints, and use String.fromCodePoint
.
You could also rely on the string being null-terminated, which I didn't do here.
You could also use the TextDecoder
API once it's available more widely in browsers by creating an ArrayBufferView
into the WebAssembly.Memory
's buffer
(which is an ArrayBuffer
).
If, instead, you're doing something like logging from WebAssembly to JavaScript, then you can expose the Memory
as above, and then from WebAssembly declare an import which calls JavaScript with size + position. You could instantiate your module as:
const memory = new WebAssembly.Memory({ initial: 2 });
const arrayBuffer = memory.buffer;
const buffer = new Uint8Array(arrayBuffer);
const instance = new WebAssembly.Instance(module, {
imports: {
memory: memory,
logString: (size, index) => {
let s = "";
for (let i = index; i < index + size; ++i)
s += String.fromCharCode(buffer[i]);
console.log(s);
}
}
});
This has the caveat that if you ever grow the memory (either through JavaScript using Memory.prototype.grow
, or using the grow_memory
opcode) then the ArrayBuffer
gets neutered and you need to create it anew.
On garbage collection: WebAssembly.Module
/ WebAssembly.Instance
/ WebAssembly.Memory
are all garbage collected by the JavaScript engine, but that's a pretty big hammer. You likely want to GC strings, and that's currently not possible for objects which live inside a WebAssembly.Memory
. We've discussed adding GC support in the future.
Solution 2:
2020 Update
Things have changed since the other answers were posted.
Today I would bet on the WebAssembly Interface Types - see below.
Since you asked specifically about C++, see:
- Nbind:
nbind - Magical headers that make your C++ library accessible from JavaScript
nbind is a set of headers that make your C++11 library accessible from JavaScript. With a single #include statement, your C++ compiler generates the necessary bindings without any additional tools. Your library is then usable as a Node.js addon or, if compiled to asm.js with Emscripten, directly in web pages without any plugins.
- Embind:
Embind is used to bind C++ functions and classes to JavaScript, so that the compiled code can be used in a natural way by “normal” JavaScript. Embind also supports calling JavaScript classes from C++.
See the following WebAssembly proposals:
- JS Types
- Reference Types
- Interface Types
The proposal adds a new set of interface types to WebAssembly that describe high-level values (like strings, sequences, records and variants) without committing to a single memory representation or sharing scheme. Interface types can only be used in the interfaces of modules and can only be produced or consumed by declarative interface adapters.
For more info and a great explanation, see:
- WebAssembly Interface Types: Interoperate with All the Things! by Lin Clark
You can already use it with some experimental features, see:
- https://www.youtube.com/watch?v=Qn_4F3foB3Q
For a good real world example using yet another approach, see:
- libsodium.js
libsodium.js - The sodium crypto library compiled to WebAssembly and pure JavaScript using Emscripten, with automatically generated wrappers to make it easy to use in web applications.
See also:
- Wasmer:
Wasmer is an open-source runtime for executing WebAssembly on the Server. Our mission is make all software universally available. We support running Wasm modules standalone in our runtime, but also can be embedded in multiple languages using our language integrations.
and specifically Wasmer-JS:
Wasmer-JS enables the use of server-side compiled WebAssembly Modules in Node.js and the Browser. The project is set up as mono-repo of multiple JavaScript packages.
There's also some good info in this article on Hacker News.
Solution 3:
Given:
-
mem
, theWebAssembly.Memory
object (from the module exports) -
p
, the address of the first character of the string -
len
, the length of the string (in bytes),
you can read the string using:
let str = (new TextDecoder()).decode(new Uint8Array(mem.buffer, p, len));
This assumes the string is UTF-8 encoded.