Skip to content

Conversation

@apeyser
Copy link
Owner

@apeyser apeyser commented Aug 28, 2018

No description provided.

bcumming and others added 16 commits July 26, 2018 13:51
Use a compat::fma wrapper for std::fma to avoid a bug in the tree optimizer in GCC version < 8.2.

See: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87046
Fixes arbor-sim#568.
…sim#566)

* Add gpu_context as part of execution context containing information about GPU availability, managed_memory synchronization, and atomic double availability.
* Choose between ON and OFF for ARB_GPU in CMake. If ON compile for K20, K80, and P100

Note that we still need compile time information about the GPU in cuda_atomic.hpp for atomicAdd(double*, double*). This is because the function is only defined when the program is compiled  for sm_60 or more.
Move implementation of `gpu_context` from header to `cpp` file, so that `ARB_WITH_CUDA` doesn't leak from library implementation.
* Add new ring benchmark to examples.
* Refactored common functionality for reading miniapp parameters from a json file to `aux` (used by both bench and ring).

Fixes arbor-sim#516.
if std::sort is used with a < operator that does not result in an ordering the result is wrong and contains bad data.
sort the results as the order in which the threads push their data is unknown.
@qres qres force-pushed the tree_solve_multi_block branch from ccf9d96 to b061b47 Compare August 28, 2018 11:59
qres and others added 6 commits August 28, 2018 18:13
Fixes arbor-sim#575.

* Guard CPU architecture option for nvcc with generator expression.
Make the execution context presented to users an opaque handle, moving all implementation of the gpu, thread and distributed contexts into the back end.

* move `execution_context` and `distributed_context` definitions to the back end
* create `execution_context` handle called `context` in the public API
* provide `make_context` helper functions that build different context configurations (default, user-specified local resources, with MPI)
* update documentation for all parts of the public API that touch contexts
* move `distributed_context` docs to the developer documentation (from the public API docs)
@qres qres force-pushed the tree_solve_multi_block branch from 096d985 to e5a8429 Compare September 3, 2018 12:45
@qres qres force-pushed the tree_solve_multi_block branch from ac367a4 to aef710f Compare September 12, 2018 08:39
@qres qres force-pushed the tree_solve_multi_block branch from 89a4496 to 8bf1f92 Compare October 1, 2018 13:31
@qres qres force-pushed the tree_solve_multi_block branch from 8bf1f92 to 5f18da2 Compare October 11, 2018 12:04
@qres qres force-pushed the tree_solve_multi_block branch from 87e54fa to a50b9b4 Compare October 15, 2018 09:59
@qres qres force-pushed the tree_solve_multi_block branch from 13a009e to 624be7f Compare November 1, 2018 17:27
@qres qres force-pushed the tree_solve_multi_block branch from 624be7f to e7a6e99 Compare November 1, 2018 17:27
@qres qres force-pushed the tree_solve_multi_block branch from e7a6e99 to 2a92e14 Compare November 2, 2018 13:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants