
Conversation

@RETR0-OS
Member

This pull request introduces a new code generation orchestration system for neural network models, supporting both PyTorch and TensorFlow backends. It adds a modular, extensible architecture for generating code from graph-based model definitions, including support for group blocks (nested, reusable blocks), and provides a shared base for framework-specific code generation. The changes include new orchestrator base classes, utility functions for graph processing, and a group block generator abstraction. A CodeQL workflow is also added for security scanning.

Core code generation architecture:

  • Added BaseCodeOrchestrator abstract class (base_orchestrator.py) to encapsulate shared orchestration logic for code generation, including topological sorting, edge mapping, code spec generation, and rendering for both PyTorch and TensorFlow backends. This class provides extensible hooks for framework-specific implementations (a minimal sketch of these pieces follows this list).
  • Introduced base.py with shared utility functions for topological sorting, input variable mapping, and node type/config extraction, to be used by orchestrators and code generators.
  • Created __init__.py in the codegen package to expose the orchestrators for PyTorch and TensorFlow, establishing a clear import interface.
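
As a point of reference, here is a minimal sketch of how these pieces could fit together, assuming the graph arrives as plain dicts of `nodes` and `edges`. Apart from the `BaseCodeOrchestrator` name and the responsibilities listed above (topological sort, edge mapping, spec generation, rendering), all names and structures below are illustrative assumptions, not the merged code:

```python
# Sketch only: helper signatures and the graph schema are assumptions.
from abc import ABC, abstractmethod
from collections import defaultdict, deque


def topological_sort(nodes, edges):
    """Kahn's algorithm: order node ids so every edge source precedes its target."""
    indegree = {node["id"]: 0 for node in nodes}
    successors = defaultdict(list)
    for edge in edges:
        successors[edge["source"]].append(edge["target"])
        indegree[edge["target"]] += 1
    queue = deque(nid for nid, deg in indegree.items() if deg == 0)
    order = []
    while queue:
        nid = queue.popleft()
        order.append(nid)
        for succ in successors[nid]:
            indegree[succ] -= 1
            if indegree[succ] == 0:
                queue.append(succ)
    if len(order) != len(indegree):
        raise ValueError("model graph contains a cycle")
    return order


def build_edge_map(edges):
    """Map each target node id to the ordered list of its input node ids."""
    edge_map = defaultdict(list)
    for edge in edges:
        edge_map[edge["target"]].append(edge["source"])
    return dict(edge_map)


class BaseCodeOrchestrator(ABC):
    """Shared pipeline: sort the graph, build code specs, render templates."""

    def generate(self, graph):
        order = topological_sort(graph["nodes"], graph["edges"])
        edge_map = build_edge_map(graph["edges"])
        nodes_by_id = {node["id"]: node for node in graph["nodes"]}
        specs = [self.get_code_spec(nodes_by_id[nid]) for nid in order]
        return self.render(specs, edge_map)

    @abstractmethod
    def get_code_spec(self, node):
        """Framework-specific hook: node definition -> layer code spec."""

    @abstractmethod
    def render(self, specs, edge_map):
        """Framework-specific hook: specs + edge map -> generated source."""
```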

Group block (nested block) support:

  • Added GroupBlockGenerator abstract base class (group_block_generator.py) to handle code generation for group (composite) blocks, including parsing port mappings, sorting internal nodes, detecting nested groups, and preparing template contexts for rendering (sketched below). This enables reusable, hierarchical model components across frameworks.
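
A similarly hedged sketch of the group block generator, reusing the `topological_sort` helper from the sketch above; the `children` and `ports` field names are invented for illustration:

```python
# Sketch only: the group-node schema ("children", "ports") is an assumption.
from abc import ABC, abstractmethod


class GroupBlockGenerator(ABC):
    """Generate code for a group (composite) block from its internal subgraph."""

    def build_context(self, group_node):
        internal = group_node["children"]            # nodes nested inside the group
        edges = group_node.get("edges", [])
        order = topological_sort(internal, edges)    # shared helper from base.py
        # Port mappings tie the group's external ports to internal node ids.
        inputs = {p["name"]: p["node_id"] for p in group_node["ports"]["inputs"]}
        outputs = {p["name"]: p["node_id"] for p in group_node["ports"]["outputs"]}
        # Nested groups are detected here and rendered recursively.
        nested = [node for node in internal if node["type"] == "group"]
        return {"order": order, "inputs": inputs,
                "outputs": outputs, "nested_groups": nested}

    @abstractmethod
    def render(self, context):
        """Framework-specific hook: template context -> group block source."""
```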

DevOps and security:

  • Added a CodeQL workflow file (.github/workflows/codeql.yml) to enable advanced static analysis and security scanning for Python and JavaScript/TypeScript code in the repository.

claude and others added 25 commits December 27, 2025 19:37
- Add LayerCodeSpec dataclass for structured code specifications
- Create TemplateManager for Jinja2 template loading and caching (LayerCodeSpec and TemplateManager are sketched after the commit list)
- Add get_pytorch_code_spec() and get_tensorflow_code_spec() methods to NodeDefinition
- Create template directory structure for PyTorch and TensorFlow
- Add base layer templates for both frameworks
- Create layer templates for conv2d, linear, maxpool, flatten, relu, softmax, dropout, batchnorm, attention, add, concat
- Create file templates for model.py, train.py, dataset.py, config.py
- Templates preserve good coding practices: reusable classes, comprehensive documentation, proper shape annotations
- Add base utilities (topological sort, edge map builder)
- Create PyTorchCodeOrchestrator class
- Implement template-driven code generation
- Support skip connections in forward pass (see the generated-code example after the commit list)
- Preserve all existing features (adaptive hyperparameters, test code generation)
- Return same API format as original codegen
- Modify generate_pytorch_code() to delegate to PyTorchCodeOrchestrator
- Preserve exact same function signature and return type
- Maintain 100% backward compatibility with views
- Mark legacy code as preserved for reference
- Complete refactor: template-based, extensible, non-monolithic
- Create layer templates for conv2d, linear, maxpool, flatten, dropout, batchnorm, add, concat
- Create file templates for model.py, train.py, dataset.py, config.py
- Adapt templates for TensorFlow/Keras API and NHWC format
- Maintain same documentation quality as PyTorch
- Add get_tensorflow_code_spec() to Conv2D, Linear, MaxPool2D, Flatten, Dropout, BatchNorm2D, Add, Concat
- Adapt for TensorFlow/Keras API (filters vs channels, NHWC format)
- Mirror PyTorch implementation pattern for consistency
- Add TensorFlowCodeOrchestrator class mirroring PyTorch structure
- Implement template-driven code generation for TensorFlow/Keras
- Support NHWC format and TensorFlow-specific patterns
- Handle skip connections in forward pass
- Return same API format as original TensorFlow codegen
- Modify generate_tensorflow_code() to delegate to TensorFlowCodeOrchestrator
- Preserve exact same function signature and return type
- Maintain 100% backward compatibility with views
- Mark legacy code as preserved for reference
- Complete TensorFlow refactor: template-based, extensible, non-monolithic
- Fix: Remove explicit .forward() and .call() method calls (use __call__ instead; illustrated after the commit list)
- Fix: PyTorch layers should be called directly, not via .forward()
- Fix: TensorFlow layers should be called directly, not via .call()
- Add: Base orchestrator class to eliminate WET code (DRY principle)
- Prepare: Foundation for refactoring both orchestrators to inherit from base
- Fix: Change import from .pytorch_codegen to .enhanced_pytorch_codegen
- The actual file is named enhanced_pytorch_codegen.py, not pytorch_codegen.py
- Remove imports of classes that don't exist (GroupBlockShapeComputer, safe_get_shape_data, etc.)
- These were only used in legacy code that never executes due to early delegation
- Fixes ModuleNotFoundError on server startup
…age and enhance edge mapping for skip connections
security fixes + codegen improvement
…-ZROiF
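
For context on the LayerCodeSpec and TemplateManager commits above, a minimal sketch of that pattern follows; the exact fields, template names, and directory layout are assumptions:

```python
# Sketch only: field names and template paths are illustrative.
from dataclasses import dataclass, field
from jinja2 import Environment, FileSystemLoader


@dataclass
class LayerCodeSpec:
    """Structured description of one layer, consumed by the Jinja2 templates."""
    layer_name: str                        # e.g. "conv1"
    template: str                          # e.g. "pytorch/layers/conv2d.py.j2"
    params: dict = field(default_factory=dict)


class TemplateManager:
    """Loads Jinja2 templates from disk and caches compiled templates."""

    def __init__(self, template_dir):
        self._env = Environment(loader=FileSystemLoader(template_dir))
        self._cache = {}

    def render(self, template_name, context):
        if template_name not in self._cache:
            self._cache[template_name] = self._env.get_template(template_name)
        return self._cache[template_name].render(**context)
```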
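
The skip-connection support amounts to the orchestrator consulting the edge map when a node (such as Add) has multiple inputs, so the emitted forward pass can look roughly like this hypothetical output (layer sizes and variable names invented):

```python
# Hypothetical generated output, not the refactor's actual emission.
import torch
import torch.nn as nn


class GeneratedModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 16, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(16, 16, kernel_size=3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        x_conv1 = self.relu(self.conv1(x))   # (N, 16, H, W)
        x_conv2 = self.conv2(x_conv1)        # (N, 16, H, W)
        # The edge map gives the Add node two inputs: conv1 and conv2 outputs.
        x_add = x_conv1 + x_conv2            # skip connection
        return x_add
```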
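
The `.forward()`/`.call()` fix reflects how both frameworks expect modules to be invoked: calling the object (i.e. `__call__`) runs registered hooks in PyTorch, and the build/tracing machinery in Keras, before dispatching to the user-defined method. A small PyTorch illustration:

```python
import torch
import torch.nn as nn

layer = nn.Linear(4, 2)
x = torch.randn(1, 4)

# Wrong: calling forward() directly skips hooks registered on the module
# (the analogous mistake in TensorFlow is calling layer.call(x) directly).
y_bad = layer.forward(x)

# Right: __call__ runs the framework machinery, then dispatches to forward().
y_good = layer(x)
```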

Claude/refactor pytorch codegen zr oi f
@RETR0-OS RETR0-OS merged commit d3fe9dc into deploy Dec 28, 2025
5 checks passed