Skip to content

Record: 11L + order-adaptive 11-gram (mean val_bpb=0.8881)#795

Open
hypery11 wants to merge 1 commit intoopenai:mainfrom
hypery11:submission/2026-03-26_champion_v2
Open

Record: 11L + order-adaptive 11-gram (mean val_bpb=0.8881)#795
hypery11 wants to merge 1 commit intoopenai:mainfrom
hypery11:submission/2026-03-26_champion_v2

Conversation

@hypery11
Copy link

Results

Seed val_bpb
42 0.8883
1337 0.8886
2024 0.8875
Mean 0.8881
Std 0.0006
  • Artifact: 13.99 MB
  • Train: 600s on 8xH100 SXM
  • Eval: ~160s

Method

11-layer XSA-all transformer with order-adaptive entropy-gated n-gram backoff (orders 2-11). Higher-order matches use lower entropy threshold. GPTQ-lite int6 + zstd-22. Score-first, deterministic, no TTT.

  • 8xH100 SXM, train <=600s
  • Eval <=600s (~160s)
  • Artifact <=16MB (13.99MB)
  • 3-seed validation (std 0.0006)

Seeds: 0.8883 / 0.8886 / 0.8875 (std 0.0006).
Order-adaptive entropy gating on 2-11 gram backoff.
13.99MB artifact. Train 600s, eval ~160s.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant