vs zstd

Zstandard is the strongest general-purpose byte-level compressor available. It’s fast, tunable across a wide ratio/speed range, and supports trained dictionaries. If you’re choosing between “smart byte compression” and Bindu, this is the comparison that matters most.

Model

zstd: LZ77 variant with FSE/Huffman entropy, configurable window up to 2 GB, support for pre-trained dictionaries up to 110 KB.
Bindu: symbolic pipeline computing a coordinate-keyed symbol table on the spot. The symbol table has no fixed cap and represents semantic structure rather than byte sequences.

Ratio (measured)

From the industry benchmark:

Aggregate	zstd	Bindu
All files compressed (% reduction)	73.51%	77.95%
Per-file wins	1/30	19/30

Selected workloads:

Workload	zstd best	Bindu
Silesia `webster`	4.90×	5.75×
Silesia `nci`	20.84×	24.79×
GOES-16 weather telemetry	15.93×	21.98×
OMNI solar wind timestamps	3.24×	2,349×
MMS mission status flags	29,103×	263,314×
Silesia `mozilla` (binaries)	3.42×	2.88×
MMS Epoch timestamps	1.98×	1.34×

Bindu wins most of the corpus. zstd holds a small edge on a few file types (notably some binary formats and certain int64 timestamp encodings), and on those the gap is small.

Throughput

zstd holds a real lead on raw decode throughput — it’s the streaming-decode champion at ~2 GB/s on the test rig. Bindu prioritizes operating on the compressed form rather than getting back to bytes as fast as possible. If your workload is “decompress and stream as fast as possible,” zstd is the right pick.

zstd dictionaries vs Bindu symbol tables

zstd supports trained dictionaries, which is the closest analogue to Bindu’s symbol table.

Aspect	zstd dict	Bindu symbol table
Size limit	110 KB	None
Built once and frozen?	Yes	No (can grow)
Carries semantic structure?	No (bytes)	Yes (coordinates)
Search compressed form?	No	Yes

A trained zstd dictionary is the right tool when you have a pile of similar small files and want a flat ratio improvement everywhere. A Bindu symbol table is the right tool when you want the meaning of the data to be reusable — searchable, editable, cross-file comparable.

When zstd wins

Streaming workloads where decode throughput is the bottleneck.
Mixed-content workloads where you want one well-known codec across everything.
You need ubiquitous tooling support (Linux kernel, btrfs, every modern language has a zstd binding).

When Bindu wins

Sequential telemetry / satellite — see the flagship use case.
Workloads where you’ll search, edit, or query the compressed form (the computable property).
Long-retention archives where the symbol table amortizes across many files.