update super endpoints to build.nvidia#152
update super endpoints to build.nvidia#152AjayThorve merged 10 commits intoNVIDIA-AI-Blueprints:developfrom
Conversation
Greptile SummaryThis PR migrates all Nemotron Super 120B references from a temporary NVCF invocation endpoint ( Key changes:
Confidence Score: 4/5
Important Files Changed
Flowchart%%{init: {'theme': 'neutral'}}%%
flowchart TD
A[User Query] --> B[clarifier_agent\nnow uses nemotron_nano_llm]
B --> C{Tool calls\nparallel_tool_calls=True}
C -- "parallel tools" --> D[ToolNode\nmultiple ToolMessages]
D --> B
C -- "clarification needed" --> E[ask_for_clarification node]
E --> B
C -- "complete" --> F{enable_plan_approval?}
F -- yes --> G[plan_preview node]
F -- no --> H[END]
G --> H
B2[deep_research_agent\nresearcher_llm] --> I{API Endpoint}
I -- "default\nnemotron_nano_llm" --> J["integrate.api.nvidia.com\nnvidia/nemotron-3-nano-30b-a3b"]
I -- "optional\nnemotron_super_llm\ncommented out" --> K["integrate.api.nvidia.com\nnvidia/nemotron-3-super-120b-a12b\n⚠️ limited availability"]
|
…nemotron-3-super-120b-a12b and new base URL for improved integration.
…ron Super model - Introduced a new Jupyter notebook for getting started with the NVIDIA AI-Q Blueprint, detailing installation, environment setup, and usage instructions. - Added a script to install and serve the Nemotron Super model with tensor parallelism, including configuration options and usage instructions. - Updated troubleshooting documentation to address potential issues with the Nemotron Super build endpoint stability and recommended self-hosting solutions.
raykallen
left a comment
There was a problem hiding this comment.
lgtm. we'll patch update for super when ready.
No description provided.