Async copy of self-energy from Device to Host #9
Merged
36 commits
c4f0c74
new async self-energy update in qkpt streams
gauravharsha 1e76013
fix minor bugs
gauravharsha b386b6d
more bugs
gauravharsha 5fa2212
update definition of the cleanup function
gauravharsha f8bc8ec
add profile arguments
gauravharsha c15e7b4
modify profile colors
gauravharsha 90a761e
adding different handles for each qkpt stream to encourage concurrency
gauravharsha 7f7ff16
add profile statement for polarization retrieval
gauravharsha 48e610c
typo in nvtx3 function
gauravharsha 1118a67
use push/pop range for Pqk_tQP function
gauravharsha 304e158
remove nvtx tags
gauravharsha 5dd3a8e
add comments and clean up function signatures
gauravharsha 259d3f5
remove unnecessary stream synchronization calls from obtain_Pq functions
gauravharsha 0592ace
more refactoring
gauravharsha d2fffab
redo Pqk_tQP function -- checking stream readiness has some effect on…
gauravharsha 1aa3a28
Update src/green/gpu/cugw_qpt.h
gauravharsha 552b7e2
Update src/green/gpu/cugw_qpt.h
gauravharsha 06cc9ac
Update src/green/gpu/cugw_qpt.h
gauravharsha fb6a62c
address Copilot reviews
gauravharsha 3666025
rename share coulomb allocation event
gauravharsha 91ebc4b
clean up and modify estimation of flops achieved
gauravharsha 2c68663
fix typo in wait_and_clean_qkpts
gauravharsha a7f6141
rename qkpt_handles
gauravharsha 6ad23d9
naming and documentation updates; use reset events at end of solve cycle
gauravharsha 8e0e0ef
fix function call for flops_achieved()
gauravharsha 422947e
documentation fixes from copilot review
gauravharsha d22fb80
documentation and logic fix thanks to copilot
gauravharsha d07819e
optimize host pinned memory buffers
gauravharsha 25acf9c
use only specific cuda arch for pauli-master gpu quadro p1000
gauravharsha c01e1c8
undo update for test.yaml
gauravharsha 1ff204a
fix mpi_reduce for performance metrics
gauravharsha c36ee81
fix sigmak_stij_buffer allocation - same memory as Gk is conflicting
gauravharsha 7bca915
modify complexity estimation and performance metrics
gauravharsha 86bc13c
minor fix in complexity estimation
gauravharsha 1bbc908
Update src/cugw_qpt.cu
gauravharsha 69c13e5
Update src/cugw_qpt.cu
gauravharsha