-
Notifications
You must be signed in to change notification settings - Fork 20
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomershelp wantedExtra attention is neededExtra attention is needed
Description
Description
Enhance fmperf to support various pre-deployed distributed inference stacks, enabling users to benchmark different production-grade deployment solutions.
Target Stacks
- Dynamo AIBrix
- vLLM Production Stack
- LLM-D
- Other distributed inference solutions
Implementation Requirements
-
Stack Configuration
- Define stack specifications for each deployment type
- Support for distributed deployment configurations
- Handle different service discovery mechanisms
-
Deployment Management
- Integration with existing deployment orchestration
- Support for multi-node deployments
- Handle different scaling configurations
-
Monitoring & Metrics
- Handle logging of existing stacks
Expected Benefits
- Enable benchmarking of production-grade distributed deployments
- Support for more realistic deployment scenarios
- Better comparison between different deployment solutions
Related Components
fmperf/Cluster.pyfmperf/StackSpec.py- Deployment configuration files
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomershelp wantedExtra attention is neededExtra attention is needed