Sp.h is the standard library that C deserves

(spader.zone)

60 points | by dboon 2 days ago

15 comments

  • nextaccountic 1 hour ago
    > Principles

    > Be extremely portable

    > sp.h is written in C99, and it compiles against any compiler and libc imaginable. It works on Linux, on Windows, on macOS. It works under a WASM host. It works in the browser. It works with MSVC, and MinGW, it works with or without libc, or with weird ones like Cosmopolitan. It works with the big compilers and it works with TCC.

    > And, best of all, it does all all of that because it’s small, not because it’s big.

    vs

    > Non-goals

    > Obscure architectures and OSes

    > I write code for x86_64 and aarch64. WASM is becoming more important, but is still secondary to native targets. I don’t care to bloat the library to support a tiny fraction of use cases.

    > That being said, if you’re interested in using the library on an unsupported platform, I’m more than happy to help, and if we can make the patch reasonable, to merge it.

    Those are contradictory. Either the code is extremely portable, or it can't support "obscure" platforms, but not both.

    • riedel 17 minutes ago
      I could not even find a mention what platform it supports. There is a Linux example on the bottom. Have never seem a libc implementation that does not even mention for which platforms it is meant.
    • ktpsns 1 hour ago
      Exactly. This shows that "extremely portable" is actually marketing for "It supports a number of platforms. In my opinion, this number is big".
      • noosphr 30 minutes ago
        We support extreme portability for sufficiently large values of two.
  • zzo38computer 51 minutes ago
    I agree with most of the criticisms they make.

    I agree that pointer and length is better than null-terminated strings (although it is difficult in C, and as they mention you will have to use a macro (or some additional functions) to work this in C).

    Making the C standard library directly against syscalls is also a good idea, although in some cases you might have an implementation that needs to not do this for some reason, generally it is better for the standard library directly against syscalls.

    FILE object is sometimes useful especially if you have functions such as fopencookie and open_memstream; but it might be useful (although probably not with C) to be able to optimize parts of a program that only use a single implementation of the FILE interface (or a subset of its functions, e.g. that does not use seeking).

    • alfiedotwtf 11 minutes ago
      Making every C call a system call is not a good idea at all - think about malloc() etc - the OS shouldn’t care about individual allocations and only worry about providing brk() etc. otherwise, performance will die if you’re doing a thousand system calls per second!
    • fithisux 40 minutes ago
      Null terminated strings have some merits but they should be a completely different data type like in Freebasic.
      • Sankozi 25 minutes ago
        Are there other merits than availability of literals in C?

        It seems like one of the worst data structures ever - lookup complexity of a linked list with a expansion complexity of an array list with security problems added as a bonus.

  • pjmlp 18 minutes ago
    We should have left C in the 90's already, but then FOSS happened,

    "Using a language other than C is like using a non-standard feature: it will cause trouble for users. Even if GCC supports the other language, users may find it inconvenient to have to install the compiler for that other language in order to build your program. So please write in C."

    The GNU Coding Standard in 1994, http://web.mit.edu/gnu/doc/html/standards_7.html#SEC12

  • Panzerschrek 9 minutes ago
    How does this library work in programs with parts still requiring libc?

    How does it deal with code executing before main? Libc does a bunch of necessary stuff, like calling initializers for global variables.

  • Panzerschrek 12 minutes ago
    This doesn't look good:

      c8 buf [SP_PATH_MAX] = sp_zero;
      sp_cstr_copy_to_n(path, len, buf, SP_PATH_MAX);
    
    since

      #define SP_PATH_MAX 4096
    
    There should be a fallback for very long paths.
  • Panzerschrek 15 minutes ago
    It's a disadvantage, that it's header-only. It needs to include <windows.h> and a bunch of other stuff, which slow-downs compilation. Splitting it into a couple of files (a header and an implementation) would be much better.
  • Retr0id 2 hours ago
    > Program directly against syscalls

    Works nicely on Linux where the syscall interface is explicitly stable, but on many (most?) other platforms this is not the case.

    > There Is No Heap

    I don't understand what this means, when it's followed by the definition of a heap allocation interface. The paragraph after the code block conveys no useful information.

    > Null-terminated strings are the devil’s work

    Agreed! I also find the stance regarding perf optimization agreeable.

    • Retr0id 1 hour ago
      Looks like the default allocator uses mmap(2) for every single allocation, which is horribly inefficient - you map a whole PAGE_SIZE worth of memory for every tiny string. Aside from just wasting memory this will make the TLB very unhappy.

      It looks like sp_log's string formatting is entirely unbuffered which results in lots of tiny write syscalls.

      • AlotOfReading 1 hour ago
        That seems to be a pretty consistent quality level for the entire library. Look at the implementations in sp_math, yikes.
        • 12_throw_away 37 minutes ago
          Oh man. Oof. I'm sure there must be some repository out there that has an AGENTS.md but isn't pure slopcode, but I haven't seen it yet. The number of people who can be trusted to vibe code "responsibly" is probably about the same as the number of people who can be trusted to write memory safe C.
        • locknitpicker 1 hour ago
          > That seems to be a pretty consistent quality level for the entire library. Look at the implementations in sp_math, yikes.

          That does spin the meaning of "Sp.h is the standard library that C deserves"

        • jcranmer 1 hour ago
          "How bad can it be, I mean I know that numerics are not many people's strong suit, but..."

          ... ... ... oh wow, the math functions are really bad implementations. The range reduction on the sin/cos functions are yikes-level. Like the wrong input gives you an infinite loop level of yikes.

    • zamadatix 1 hour ago
      > Works nicely on Linux where the syscall interface is explicitly stable, but on many (most?) other platforms this is not the case.

      There is a footnote on this saying as much:

      > 3. Where “syscall” means “the lowest level primitive available”. On Linux, it’s always actual syscalls. On Windows, that’s usually NT. On macOS, it’s usually the syscall-wrapper subset of libc because you’re forced to link libc and it’s not quite as open as Linux (although there is a rich “undocumented” set of APIs and syscalls that are very interesting).

    • quuxplusone 1 hour ago
      The "definition of a heap allocation interface" indicates that there is no standard heap. Instead, there's a standard interface for the use to define their own heaps. Any standard library function that needs to allocate will take a sp_allocator_t parameter, and use that to allocate. As opposed to e.g. strdup, which hard-codes a call to malloc internally. Sp.h's strdup-alike would take an sp_allocator_t as input and call into that to get the memory it needs.

      A C++ programmer might describe this as "PMR, but not default-constructible. And std::stable_sort takes a PMR allocator parameter. And PMR is the default, and there's no implementation of std::allocator (or new or delete)."

  • skybrian 2 hours ago
    My impression of the sample programs is that they're unreadably noisy, but maybe this would be a good compiler target if you're writing your own language?
  • KnuthIsGod 27 minutes ago
    "The library’s stance, to put it simply, that the juice ain’t worth the squeeze when it comes to low level, compute-bound performance.

    Designing software and data structures for performance against unknown use cases on unknown hardware is extremely difficult and the resulting code is much more complicated. Even then, it’s often better to use code written against your actual use case and hardware when performance is that critical.

    Things that are off the table might be:

    SIMD A highly optimized hash table rewrite Figuring out where inlining or LIKELY causes the compiler to produce better code."

    LOL...

    Classic vibe coder.

  • 504118318 13 minutes ago
    Just taking a quick look at the atomics section:

    First, (on unix) it's wrapping pthread mutex. That's part of libc! (Technically it might not be libc.so, but it's still the standard library.)

    Also, none of the atomics talk about the memory model. You don't _have_ to use the C11 memory model (Linux, for example, doesn't). But if you're not using the C11 memory model and letting the compiler insert fences for you, you definitely need to have fence instructions, yourself.

    While C11 atomics do rely on libgcc, so do the __sync* functions that this library uses (see https://godbolt.org/z/bW1f7xGas) for an example.

    Oops... apparently this is vibecoded. Welp, I just wasted ten minutes of my life reviewing slop that I'm not going to get back.

  • JSR_FDED 1 hour ago
    I love how hyper-opinionated this is.
    • smitty1e 58 minutes ago
      Family saying: "It ain't bragging if you can do it."

      When one is competent to work at this level, strong opinions are in order.

      Their correctness is something I cannot gage. I'm barely competent to follow the conversation.

      • rmunn 45 minutes ago
        Considering the first thing I saw in the thread was https://news.ycombinator.com/item?id=48244891 where the values returned from sp's sine function was compared to the correct values, I'm going to take any such opinions with a few grains of salt. Because the correct sine for the number they tested (31337 radians) is 0.3772 (0.3771522646 according to my calculator), sp's implementation returned 0.4385. That's not even close to right.
  • nektro 1 hour ago
    not one mention of Zig on the whole page?
  • Kab1r 2 hours ago
    Best library name.
  • TZubiri 2 hours ago
    > Every language that depends on third party libraries, like js and python, is getting massively infected with supply chain worms

    > Only couple of languages not affected are those that don't have a culture of downloading third party code, like C and C++

    > Ex js and python developer publishes a 'library'

    > Library is vibe coded

    > Published on github amidst GitHub being hit by supply chain attacks, had their source code leaked.

    The timing is terrible for starters, and I don't trust the vibe coded code at all. Imagine a pandemic and the cities are on fire, and you arrive to a rural town asking to kiss people.

    • redlewel 20 minutes ago
      Thanks for this comment, I was about to bookmark the repo for later you saved me the time.
  • KnuthIsGod 29 minutes ago
    Wonderful !

    Yet another slop coded library.

    What could possibly go wrong...