Skip to content

Improve model loading performance and add caching enhancements#65

Open
george1-adel wants to merge 1 commit intoR-s0n:mainfrom
george1-adel:main
Open

Improve model loading performance and add caching enhancements#65
george1-adel wants to merge 1 commit intoR-s0n:mainfrom
george1-adel:main

Conversation

@george1-adel
Copy link
Copy Markdown

This pull request improves the model loading process by optimizing GPU/CPU memory handling and adding caching and timeout improvements.

Changes include:

Improved logging for CUDA and CPU loading

Added retries for model initialization

Enhanced resource cleanup and error handling

These changes reduce load time and improve inference stability on both CPU and GPU devices.

…quests retries, BeautifulSoup cleaning, and resource cleanup; improve caching and timeout handling
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant