Add bitnet #3002

Draft
dhiaEddineRhaiem wants to merge 2 commits into NVIDIA:main from dhiaEddineRhaiem:add-bitnet

Conversation

@dhiaEddineRhaiem

This PR adds support for BitNet pretraining of GPT-like architectures:

  1. Integrates BitNet layer specs into the GPT model-building pipeline via the --use-bitnet flag (requires --transformer-impl local)
  2. Implements BitNetColumnParallelLinear and BitNetRowParallelLinear layers that apply 1-bit weight quantization
  3. Uses onebitllms Triton kernels for efficient activation and weight quantization during the forward pass
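
For readers unfamiliar with the idea, the quantized forward pass described in the list above can be sketched in plain Python. This is only an illustration under stated assumptions: the PR text does not spell out the quantization scheme, so the sketch assumes absmean ternary rounding for weights and per-tensor absmax 8-bit rounding for activations, as commonly described for BitNet b1.58. The function names are hypothetical and are not the onebitllms API or the PR's layer code.

```python
# Hypothetical sketch, NOT the PR's implementation or the onebitllms API.
# Assumption: weights use absmean ternary rounding, activations use absmax int8.

def quantize_weights_absmean(w, eps=1e-5):
    """Round a 2-D weight matrix to {-1, 0, +1} with one per-tensor scale."""
    magnitudes = [abs(v) for row in w for v in row]
    scale = max(sum(magnitudes) / len(magnitudes), eps)  # mean absolute value
    q = [[max(-1, min(1, round(v / scale))) for v in row] for row in w]
    return q, scale


def quantize_activations_absmax(x, bits=8, eps=1e-5):
    """Round an activation vector to signed integers using its max magnitude."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8 bits
    scale = max(max(abs(v) for v in x), eps) / qmax
    q = [max(-qmax - 1, min(qmax, round(v / scale))) for v in x]
    return q, scale


def bitnet_linear_forward(x, w):
    """Compute y = W x on the quantized values, then rescale the result."""
    xq, x_scale = quantize_activations_absmax(x)
    wq, w_scale = quantize_weights_absmean(w)
    return [
        sum(a * b for a, b in zip(xq, row)) * x_scale * w_scale
        for row in wq
    ]


if __name__ == "__main__":
    weights = [[0.4, -0.9, 0.05], [1.2, -0.1, 0.6]]
    activations = [0.5, -1.0, 0.25]
    print(quantize_weights_absmean(weights)[0])
    print(bitnet_linear_forward(activations, weights))
```

A real training setup would also need a gradient path through the rounding step (typically a straight-through estimator) and fused kernels for speed; both are outside the scope of this sketch.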

cc @sbhavani

@dhiaEddineRhaiem dhiaEddineRhaiem requested review from a team as code owners January 19, 2026 20:37
@copy-pr-bot

copy-pr-bot bot commented Jan 19, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ko3n1g ko3n1g requested a review from a team January 19, 2026 20:37
@chtruong814 chtruong814 added the needs-follow-up Issue needs follow-up label Jan 21, 2026
@janEbert
Contributor

Hey @dhiaEddineRhaiem, sorry for not getting to this earlier. I'm a bit unsure about the dependency you're introducing. It has not been updated for 9 months, so its future support seems quite uncertain.

@chtruong814 chtruong814 removed the needs-follow-up Issue needs follow-up label Feb 25, 2026
@janEbert
Contributor

I'll discuss the introduction of the dependency with our team. We may take some time to properly review, but we'll try to up the pace. :)

@Phlip79
Member

Phlip79 commented Mar 4, 2026

We are changing our review process and marking all open, unlabeled PRs as draft. This change will go into effect once #3659 is merged.

Moving forward, all PRs will be required to start as draft PRs. If you wish to get your PR merged, mark your PR as “Ready for review”. Read more about the new process at submit.md.

@Phlip79 Phlip79 marked this pull request as draft March 4, 2026 23:32
4 participants