Add NVIDIA driver update scripts and GPU verification#1031
Draft
GeorgePearse wants to merge 18 commits intomasterfrom
Draft
Add NVIDIA driver update scripts and GPU verification#1031GeorgePearse wants to merge 18 commits intomasterfrom
GeorgePearse wants to merge 18 commits intomasterfrom
Conversation
- Rename all visengine references to visdet/visdet.engine throughout codebase - Fix YAML config handling: convert [1333, 800] list to tuple for img_scale - Fix COCO dataset config: use img_path instead of img for data_prefix - Fix collect_results for non-distributed mode (world_size=1) - Fix collect_results signature (remove incorrect collect_device param) - Fix scale_factor handling in FCNMaskHead for 2-element arrays - Skip out-of-range category predictions in CocoMetric - Add visualization hooks to engine.hooks exports - Fix circular import in visualization_hook with TYPE_CHECKING - Improve SimpleRunner pipeline handling for train/val - Add justfile with common development commands
- Create comprehensive roadmap.md documenting SPDL integration phases - Add references to roadmap in index.md - Update data_pipeline.md with SPDL future enhancement note - Update quick-start.md with SimpleRunner API and visdet imports SPDL (Meta's Scalable and Performant Data Loading) offers: - 74% faster data iteration vs PyTorch DataLoader - 38% less CPU usage - 50GB less memory footprint - Thread-based execution instead of multiprocessing - Additional 33% speedup with Python 3.13t (nogil) Integration planned in 4 phases through 2025: - Phase 1 (Q1): Adapter layer and optional dependency - Phase 2 (Q2): Detection-specific optimizations - Phase 3 (Q3): Distributed training support - Phase 4 (Q4): Python 3.13t and advanced features
Add download_coco2017_to_volume.py that downloads COCO 2017 dataset (train2017, val2017, annotations) to a Modal persistent volume for cloud-based training. Includes checkpointing to handle interruptions. Also update training docs to reference cloud/Modal data preparation. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
feat(tools): Add Modal script for COCO 2017 dataset download
Remove zuban from dev dependencies and pre-commit hooks to unblock CI pipeline. Other pre-commit hooks and tests continue to pass. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Import visdet.cv.transforms in builder to ensure registry registration (fixes CI failures where transforms weren't found in PIPELINES) - Restore AssertionError for list-of-integers img_scale in Resize (test expects [1333, 800] to raise, not auto-convert to tuple) - Fix RandomFlip to flip all images in img_fields, not just 'img' (test expects both img and img2 to be flipped) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
chore: Remove zuban type checker temporarily
chore: scoped zuban typecheck for visdet/structures
chore: scoped zuban typecheck for visdet/apis
docs: add tests status badge
chore: scoped zuban typecheck for visdet/core/mask
Make load_yaml_config available from visdet.engine.config for preset loading.
Re-export Config class for easier access in training scripts.
2bb42a1 to
df0dbfc
Compare
Contributor
|
✅ Skylos Scan: No dead code or security issues detected. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
This PR adds comprehensive NVIDIA driver update and GPU verification tools to the project.
Changes
update-nvidia-driver.sh: Main script with auto-detection for NVIDIA driver installation
verify-pytorch-gpu.py: Script to verify PyTorch GPU functionality
NVIDIA_DRIVER_UPDATE.md: Complete documentation and usage guide
Testing
These scripts have been tested with NVIDIA driver installation and PyTorch GPU verification workflows.