Skip to content

Is Bindu a meta-compressor?

Is Bindu just a wrapper around existing codecs? Perhaps it’s merely a dispatcher that runs gzip on prose, xz on binaries, FLAC on audio, and aec on telemetry, picks whichever wins, and presents the result as one tool?

Such a tool is known as a meta-compressor, and Bindu is not one. On this page, we discuss how to prove that’s the case.

A “meta-compressor” usually means the tool has one or more of the following:

  1. A dispatcher. Looks at the input, picks the best of N classical codecs, and uses that one. The output ratio is bounded above by the best constituent codec on that input.
  2. A chain. Applies one codec after another (e.g. xz → bzip2 → zstd). In practice this worsens ratio: each lossless pass strips exploitable structure for the next.
  3. A bundle. Ships many codecs and lets the user pick. Same upper bound as the dispatcher case.

In all three flavors, the resulting ratio on any given file is no better than the best classical codec on that file. And the resulting output is the same kind of artifact: a dead byte stream that requires decompression before you can do anything with it.

Bindu produces ratios that exceed the dispatcher bound. And Bindu’s output is not a dead byte stream. Both observations rule out meta-compression.

Bindu does not embed gzip, brotli, zstd, xz, bzip2, FLAC, aec, ZFP, or any other compression library. There is no classical codec inside the binary that the system delegates to.

What’s inside is the cascade described in Architecture: a closed-form byte-signature router (Stage 1), several specialized pipelines that factor structural redundancy in different ways (Grammar, BWT, Dictionary, Stride, Constant, RLE, PAETH2D), a persistent rule pool, and a wire-format envelope check before any output ships. None of the classical codecs operate this way, and none of them produce the on-disk format Bindu produces.

You can verify this two ways:

  • Inspect the binary. There are no zstd, gzip, lzma, or other classical-codec symbols linked in. The cascade is self-contained.
  • Inspect a .bindu file. Its on-disk structure is the wire format described in the [file format spec](/reference/file-format/, including rule tables, residual streams, pipeline tags — not a wrapped LZ77/Huffman/BWT byte stream.

A meta-compressor produces the same kind of artifact its constituents produce: bytes that must be decompressed before you can read, search, or compute over them. That property is preserved by every classical codec, by any chain of them, and by any dispatcher that picks among them.

Bindu produces a different kind of artifact. From the same compressed file you can do all of the follwoing without decompression:

  • Read
  • Search
  • Edit

These are the computable properties (see Overview and Operating on compressed data). They are not achievable by any composition or selection of classical codecs, because the byte-stream output of those codecs is opaque by construction. If a system has these properties, it cannot be a wrapper around classical codecs, full stop.

A meta-compressor’s ratio on a given file is bounded above by the best of its constituent codecs on that file. Therefore, if Bindu’s ratio on some file exceeds the best classical codec we’ve measured on that file, Bindu cannot be a meta-compressor over those classical codecs.

See the industry benchmark for the numbers.