Show HN: (bits) of a Libc, Optimized for Wasm

(github.com)

76 points | by ncruces 1 day ago

5 comments

  • fuhsnn 1 day ago
    Wasm intrinsics look neat as a higher-level fixed size SIMD abstraction. I wonder how good the compilers can do if using them for AOT targets with libraries like simd-everywhere.

    string.h is missing strstr(), there's an algorithm of similar complexity you might consider: http://0x80.pl/notesen/2016-11-28-simd-strfind.html

    • ncruces 1 day ago
      Yeah, so far I did exactly the ones (my build of) SQLite needed and not others.

      If there's interest, the set of implemented functions can definitely be extended.

  • phickey 1 day ago
    This looks like a nice approach to making wasi-libc faster. Could you submit these changes upstream?
    • ncruces 1 day ago
      I'd like to be a little more sure that I'm not totally messing things up before doing that, but yes, eventually, that would be a nice outcome.

      I've also only really tested wazero. I can't know for sure that this is a straight improvement for other runtimes and architectures.

      For instance, the code delays using wasm_i8x16_bitmask as much as possible, because on Aarch64 it can be slower than not using SIMD at all, whereas it's plenty fast on x86-64.

      • phickey 1 day ago
        The maintainers of wasi-libc are some of the best people to review this, and I don’t think it would be wasting their time to ask them to look at a PR.
        • ncruces 1 day ago
          A PR is a significant investment from me. I'd have to figure out where something like this is supposed to fit, how the build infra works, etc.

          One of the nice things about Go is how much that's a solved issue out of the box, compared to almost everything else; certainly compared to C.

          Pinging them in an issue: https://github.com/WebAssembly/wasi-libc/issues/580

  • cedws 22 hours ago
    Would you consider writing some blog posts or other resources about WASM? I was experimenting recently with WIT, and ran into a mountain of issues. There's a lot of jargon that could do with some untangling.

    It took me a lot longer than it should have to put together this basic module, and even then there's this shared library I had to download to build it, and I couldn't figure out why this requires a libc:

    https://github.com/cedws/wasm-wit-test

    • ncruces 10 hours ago
      I'm not that great at long form writing to be honest, it's always a bit of a chore, and I'm never happy with the result.

      To answer your question, it needs a libc because you're including stdlib.h, and exporting and allocator (even if you're not otherwise using it). You need a libc for malloc.

      This is generally a good idea, if you need to send anything beyond numbers across the API (e.g. you need an allocator if you want to send strings as pointers).

      I never used WIT, so I have no idea if this a requirement for WIT.

  • forrestthewoods 21 hours ago
    What is SWAR?
    • ncruces 20 hours ago
      SIMD within a register: https://en.wikipedia.org/wiki/SWAR

      It's generally used for techniques that apply SIMD principles within general-purpose registers and instructions.

      Assume you've loaded a 64-bit register (a uint64_t) with 8 bytes (unsigned char) of data. Can you answer the question “is any of these 8 bytes zero (the NUL terminator)?”

      If you find a cheap way to do it, you can make strlen go faster by consuming 8 bytes at a time.

      Et voilà:

         #define ONES ((uint64_t)-1/UCHAR_MAX)
         #define HIGHS (ONES \* (UCHAR_MAX/2+1))
         #define HASZERO(x) ((x)-ONES & ~(x) & HIGHS)
  • nu11ptr 1 day ago
    It is still a bit early, but I'm majorly bullish on WASM for multiple use cases:

    1. Client side browser polyglot "applets" (Java applets were ahead of their time IMO)

    2. Server side polyglot "servlets" (Node.js, embedded runtimes, etc.)

    3. Language interop/FFI (Lang A -> WASM -> Lang B, like wasm2c)

    Why is #3 so interesting? The hardest thing in language conversion is the library calls. WASI standardizes that, so all the proprietary libs will eventually compile down to WASI as a sort of POSIX/libc like layer. In addition, WASM standardizes calling convention. The resulting new source code may not look like much, but it will solve the FFI calling convention/marshalling/library issues nicely.

    • frumplestlatz 1 day ago
      I’m not sure how it solves the FFI problem. Lowest common denominator calling conventions don’t make it any easier to bridge languages than it already is.

      C calling conventions are already the standard for FFI in native code, and that means dropping down to what can be expressed in C if you want to cross that boundary.

      • ncruces 1 day ago
        As far as Go is concerned, the Wasm sandbox makes the (addressable, C) stack explicit, which solves at least some of the issues CGO has to deal with.

        It's not a panacea, though; it introduces other issues.