A header-only C vector database library
41 points - today at 5:45 PM
Sourceeatonphil
today at 7:22 PM
As data stores go go this is basically in memory only. The save and load process is manually triggered by the user and the save process isn't crash safe nor does it do any integrity checks.
I also don't think it has any indexes either? So search performance is a function of the number of entries.
kazinator
today at 6:49 PM
In the world of Kubernetes and languages where a one-liner brings in a graph of 1700 dependencies, and oceans of Yaml, it's suddently important for a C thing to be one file rather than two.
jasonpeacock
today at 9:27 PM
C libraries have advertised "header-only" for a long time, it's because there is no package manager/dependency management so you're literally copying all your dependencies into your project.
This is also why everyone implements their own (buggy) linked-list implementations, etc.
And header-only is more efficient to include and build with than header+source.
I never copied my dependencies into my C project, nor does it usually take more than a couple of seconds to add one.
fonheponho
today at 9:17 PM
Exactly; I can't understand this obsession with header-only C "libraries".
quotemstr
today at 9:16 PM
Writing new C code in 2026 is already an artisanal statement, so why not got all the way in making it?
Header-only C libraries are such an underappreciated pattern for embedding into larger projects. For vector search specifically, having something you can just drop into an existing C/C++ codebase without pulling in a whole database dependency is really appealing. Curious about the indexing strategy — is it brute force or does it support approximate nearest neighbor?
Useful for embedded devices? Crashes, disk updates not important for ephemeral process?
Would it work to replace the memory store with mmap?
Mikhail_Edoshin
today at 7:18 PM
Why to call it a header? Could be just a source file. Including sources is uncommon, but why not? Solid "amalgamation" builds are a thing too.