Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
54 commits
Select commit Hold shift + click to select a range
2bb8969
Initial commit
abhinavg4 Sep 30, 2025
e023786
Fix few issues for bridge export (#738)
yaoyu-33 Sep 30, 2025
f9aad1f
chore: Add issue template for model requests (#826)
ko3n1g Sep 30, 2025
a73c1be
ci: Skip if `docs-only` label is attached (#833)
ko3n1g Oct 1, 2025
6db2d13
destroy process group at end of performance script (#772)
ananthsub Oct 1, 2025
ad151f3
ci(fix): pre-flight (#842)
ko3n1g Oct 1, 2025
4bba0e6
[docs] Add canonical lora docs (#821)
ananthsub Oct 2, 2025
7e2eeaa
ci: Bump pre-flight (#854)
ko3n1g Oct 2, 2025
a6cfa88
Gemma model provider + bridge (#394)
ananthsub Oct 2, 2025
1990938
[docs] Packed sequences (#822)
ananthsub Oct 2, 2025
a4912e7
Gemma2 provider + Bridge (#856)
ananthsub Oct 2, 2025
af6bc36
[docs] placeholder page for performance summary (#796)
ananthsub Oct 2, 2025
c149b2e
[checkpoint] save `latest_checkpointed_iteration.txt` for megatron-lm…
ananthsub Oct 3, 2025
bd9465e
fix: exit profiler context (#841)
ananthsub Oct 3, 2025
ad94387
support async saving for CI end to end testing (#804)
ananthsub Oct 3, 2025
ae707eb
ci: Run install check on self-hosted cpu runners (#857)
chtruong814 Oct 3, 2025
a5d7c58
docs: Revert 0.2.0 push (#865)
ko3n1g Oct 3, 2025
5d194b9
Remove model providers for different model sizes (Qwen, Llama) (#607)
yaoyu-33 Oct 3, 2025
96e7b4c
add tests for functor design
ananthsub Sep 26, 2025
4a750dd
improve typing for forward step func and add tests for functors
ananthsub Sep 27, 2025
e0e8611
update tests
ananthsub Sep 27, 2025
7f6ec50
make checks more robust
ananthsub Sep 27, 2025
d6b02c6
docstrings
ananthsub Sep 27, 2025
897da83
docstrings
ananthsub Sep 27, 2025
b7ad487
docstrings
ananthsub Sep 27, 2025
a6ae7a3
fix tests
ananthsub Sep 27, 2025
6883596
inject state once at the beginning of the loops
ananthsub Oct 3, 2025
23e9efc
cleanup
ananthsub Oct 3, 2025
ab4f32d
add tests
ananthsub Oct 3, 2025
ca2a3c5
Add pretraining script for Llama3 8B model with YAML and CLI configur…
abhinavg4 Oct 5, 2025
db1b812
Merge branch 'functor' of https://github.com/ananthsub/Megatron-Bridg…
abhinavg4 Oct 6, 2025
7a701f6
diffusion_energon_datamodule
abhinavg4 Oct 6, 2025
914ff80
Refactor configuration handling and update model parameters
abhinavg4 Oct 6, 2025
a86856a
runnanle mcore Wan inference
Oct 23, 2025
544ad75
clean inference code
Oct 23, 2025
e41b3d1
workable model implementation, inference, finetuning
Oct 30, 2025
74da525
add example commands
Oct 30, 2025
0189812
add example commands
Oct 30, 2025
a2a2580
runnable thd, without containers edits
Oct 31, 2025
77f2673
update commands
Oct 31, 2025
bf4b652
add example commands
Oct 31, 2025
2b4fd60
add example commands
Oct 31, 2025
a263c00
fix example_commands.sh
Oct 31, 2025
ea6bb12
vace
Nov 13, 2025
e8e30d2
hf verification
NeverMore960114 Nov 15, 2025
59d3e99
add support for tp and cp
NeverMore960114 Nov 18, 2025
afdd3c6
add profiling
NeverMore960114 Nov 19, 2025
5996456
fix memory issues
NeverMore960114 Nov 20, 2025
f25c81a
enable batch size more than 1
NeverMore960114 Nov 22, 2025
7eba845
add additional output for context branch and additional input for bas…
NeverMore960114 Nov 28, 2025
40e0e32
vace pretrain scripts
Tatiana21 Nov 18, 2025
661acb1
Vace I2V finetuning
Tatiana21 Dec 2, 2025
dccfce4
Finetuning for V2V
Tatiana21 Dec 10, 2025
c985676
add annotator
Tatiana21 Dec 10, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
28 changes: 28 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
---
name: Bug report
about: Create a report to help us improve the repository or project
title: ""
labels: bug
assignees: ''

---

**Describe the bug**

A clear and concise description of what the bug is.

**Steps/Code to reproduce bug**

Please list *minimal* steps or code snippet for us to be able to reproduce the bug.

A helpful guide on on how to craft a minimal bug report http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports.


**Expected behavior**

A clear and concise description of what you expected to happen.


**Additional context**

Add any other context about the problem here.
2 changes: 2 additions & 0 deletions .github/ISSUE_TEMPLATE/config.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
blank_issues_enabled: false

20 changes: 20 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
---
name: Feature request
about: Suggest an idea for this project
title: ""
labels: enhancement
assignees: ''

---

**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

**Describe the solution you'd like**
A clear and concise description of what you want to happen.

**Describe alternatives you've considered**
A clear and concise description of any alternative solutions or features you've considered.

**Additional context**
Add any other context or screenshots about the feature request here.
31 changes: 31 additions & 0 deletions .github/ISSUE_TEMPLATE/model-support-request.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
---
name: Model Support Request
about: Request conversion support and training recipes for a new model
title: "<Model name> Model Support"
labels: ''
assignees: ''

---

Add support for \<model name\> model:

**Please include a link to the model's HuggingFace repo**
HF repo:

**These checklist items are required for all models in Megatron Bridge**

- [ ] Model providers
- [ ] Model bridge for HF conversion
- [ ] Unit tests (config and bridge)
- [ ] Model conversion functional tests

**For flagship models, these items are also needed**

- [ ] Optimal pretraining recipe
- [ ] Optimal finetuning recipe
- [ ] Recipe unit tests
- [ ] Recipe functional tests
- [ ] End to end CI tests

**Additional context**
Add any other context or screenshots about the model request here.
2 changes: 1 addition & 1 deletion .github/workflows/build-docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ on:

jobs:
pre-flight:
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.53.0
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.64.2

build-docs:
needs: [pre-flight]
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/build-test-publish-wheel.yml
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ permissions:

jobs:
pre-flight:
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.53.0
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.64.2

build-test-publish-wheel:
needs: [pre-flight]
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/cicd-main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License
# limitations under the License.
name: CICD NeMo
on:
schedule:
Expand All @@ -31,7 +31,7 @@ permissions:

jobs:
pre-flight:
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.53.0
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.64.2

lint-check:
name: Lint check
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/copyright-check.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ on:

jobs:
pre-flight:
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.53.0
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.64.2

copyright-check:
needs: [pre-flight]
Expand Down
7 changes: 3 additions & 4 deletions .github/workflows/install-test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -26,20 +26,19 @@ on:

jobs:
pre-flight:
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.53.0
uses: NVIDIA-NeMo/FW-CI-templates/.github/workflows/_cicd_preflight.yml@v0.64.2

pip-test-bare-metal:
needs: [pre-flight]
if: |
!(needs.pre-flight.outputs.docs_only == 'true'
|| needs.pre-flight.outputs.is_deployment_workflow == 'true')
runs-on: ${{ matrix.arch }}
name: Pip - Python${{ matrix.python-version }} - ${{ matrix.arch == 'ubuntu-latest' && 'AMD64/Linux' || 'ARM64/Darwin' }} - Bare Metal
runs-on: linux-amd64-cpu16
name: Pip - Python${{ matrix.python-version }} - AMD64/Linux - Bare Metal
container: ubuntu:24.04
strategy:
fail-fast: false
matrix:
arch: ["ubuntu-latest"]
python-version: ["3.10", "3.11", "3.12"]
steps:
- name: Checkout repository
Expand Down
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -182,3 +182,5 @@ slurm*.out

# UV package manager
.uv/

*.mp4
Loading