One verified stack. Pick your model and your trust boundary.

Every model here earned its place the same way: its output matched an independent referee, token for token, running on the same 8 Helix-built kernels powered by Helix. Click a card for the full recorded result.

Key: anything with this symbol is powered by Helix — compiled from Helix source by kovc.

THE MODELS

Four sizes, one stack — all verified 25/25

SmolLM2 brings 2024's Llama architecture onto the stack — and the whole GPT-2 family, from the 124M starter to the 1.5B flagship, runs on the exact same 8 kernels beneath it, only the dimensions changing. The Llama leg costs just 3 extra kernels.

TWO PATHS

Choose how far your trust has to reach

Both paths run the same model and pass the same token-for-token gates — they differ only in what you must trust beneath the auditable code.

WHY IT MATTERS

Same kernels at every size

124M → 774M → 1.5B ran through the identical 8 kovc-emitted kernels — zero new operations at scale, only dimension changes read from each model's config. That's the integration pitch in one line: bring your weights; the verified stack underneath doesn't change. (44,019 bytes of PTX, 8 entry points.)

Honest residuals: fp32 · verified to PTX, not SASS · single GPU (sm_86) · base models, not assistants · the oracle shares the model's spec. Every number on this page is a recorded, committed result. start here · guided run · expert · proof · models

One verified stack. Pick your model and your trust boundary.

Four sizes, one stack — all verified 25/25

The quick one

The scale proof

The flagship

The modern one

Choose how far your trust has to reach

Fast(er), one trusted-once step

Purest trust — no ptxas at all

Same kernels at every size

Recorded gate result

Recorded gate result

Recorded gate result

Recorded gate result

What's auditable vs trusted

The trade