Skip to content

Optimize Platform + Restructure#3

Draft
AndreSlavescu wants to merge 5 commits intoSafeAILab:mainfrom
AndreSlavescu:optimize
Draft

Optimize Platform + Restructure#3
AndreSlavescu wants to merge 5 commits intoSafeAILab:mainfrom
AndreSlavescu:optimize

Conversation

@AndreSlavescu
Copy link

@AndreSlavescu AndreSlavescu commented Nov 12, 2023

  • Restructure Makefile (automate detection of compute capability)
  • Optimize existing kernels

@jvhs0706 jvhs0706 self-requested a review November 13, 2023 16:57
@jvhs0706 jvhs0706 self-assigned this Nov 13, 2023
@jvhs0706
Copy link
Contributor

Hi Andre, your optimizations seem useful! Do you want us to merge now, or do you want to do that after you further optimize existing kernels?

@AndreSlavescu
Copy link
Author

Hi Andre, your optimizations seem useful! Do you want us to merge now, or do you want to do that after you further optimize existing kernels?

Hi Haochen,

I will be working on optimizing the kernels as well. For now the changes are for automatically detecting compute capability which eliminates the need for filling that in manually.

I am also trying to discover what CUDA versions work with this setup, because I was having problems with 11.7+

For now, I will just leave this as a draft PR and promptly make the updates to speedup the kernels.

@AndreSlavescu AndreSlavescu marked this pull request as draft November 13, 2023 17:07
@hongyanz
Copy link
Contributor

hongyanz commented Jan 6, 2024

@AndreSlavescu Hi Andre, can you contact me by my email?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants