Skip to content

Changes to allow user to enter host GPU for component to get potential nims to use; README changes#10

Open
jtcasablanca wants to merge 42 commits intoNVIDIA:mainfrom
jtcasablanca:main
Open

Changes to allow user to enter host GPU for component to get potential nims to use; README changes#10
jtcasablanca wants to merge 42 commits intoNVIDIA:mainfrom
jtcasablanca:main

Conversation

@jtcasablanca
Copy link

No description provided.

nv-twhitehouse and others added 30 commits February 28, 2025 16:45
- Created app 'Visual Studio Code'
- Updated Requested GPUs to '0'
- Modified files: .project/configpacks, code/chatui/utils/database.py,
code/output.log
- Modified files: code/chatui/pages/converse.py,
code/chatui/utils/database.py, variables.env

 Added LLAMA and MISTRAL constants at top of file to centralize model identifiers
- Updated model_list to use the new constants
- Kept match/case pattern for maintainable model selection logic
- Ensures consistent model identifier usage with API_PREFIX throughout code

This change improves maintainability by centralizing model identifiers and makes it easier to add new models in the future.
…ented.

- Modified files: code/chatui/pages/converse.py, code/output.log,
variables.env
- Fixed API_PREFIX to properly handle trailing slashes with rstrip()
- Updated pattern matching to use guard clauses for model comparison
- Changed from direct variable matching to str() type with equality guards

APPLICATION SUCCESSFULLY STARTED.
- Modified files: code/chatui/pages/converse.py,
code/chatui/utils/database.py, code/output.log, variables.env

- Fix incorrect Llama model endpoint string (llama3-70b-instruct -> llama-3.1-70b-instruct)
- Simplify API prefix logic by using INTERNAL_API environment variable
- Move model identifiers to top-level constants for better maintainability
- Update model string construction to handle internal API paths more cleanly

This change fixes endpoint resolution issues and improves code organization
around API path handling.
- Modified files: code/output.log
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
Update README structure and content

- Reorder workflow description for logical flow
- Adjust agentic workflow explanation for clarity
- Add details about prompt configuration and pipeline routing
- Update router LLM evaluation process description
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
- Updated storage in layout 'data/'
- Deleted layout 'data/scratch/'
- Modified files: .gitignore, code/chatui/utils/database.py,
code/output.log
still changes needed to deep dive.
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md, code/output.log
- Modified files: README.md
- Added files: code/nim_disk_size.json, code/nim_gpu_support_matrix.json
- Added files: code/chatui/utils/gpu_compatibility.py
- Modified files: .gitignore, code/chatui/pages/converse.py,
code/chatui/utils/graph.py, code/chatui/utils/nim.py
- Add GPU type and count selection UI for all NIM components
- Implement GPU compatibility checking and validation
- Add dynamic model selection based on GPU configuration
- Update state management to include GPU settings
- Add error handling for GPU configuration issues
- Improve API key handling in NIM client

The changes allow users to:
- Select GPU type and count for each NIM endpoint
- See compatible models based on GPU configuration
- Get warnings for incompatible configurations
- Validate GPU settings before making API calls

Technical changes:
- Add gpu_compatibility module for GPU support matrix
- Update CustomChatOpenAI to handle GPU configuration
- Add GPU fields to GraphState
- Add GPU validation in NIM client
- Implement dynamic UI updates based on GPU selection
…nted yet.

- Modified files: README.md, code/output.log
- Modified files: README.md
- Clarify that each pipeline component can be independently configured for self-hosting
- List all available components that support self-hosting (Router, Generator, Retrieval, etc.)
- Emphasize ability to mix between hosted and self-hosted components
- Restructure setup steps to better explain component-specific configuration
- Remove outdated GPU sizing guide
- Modified files: README.md
- Modified files: README.md
- Modified files: README.md
… buttons)

- Modified files: README.md, code/chatui/pages/converse.py,
code/output.log
- Modified files: code/chatui/pages/converse.py
- Modified files: code/chatui/pages/converse.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants