`VariableCellArrayReal::synchronize`: parallelize on item and second dimension when pack/unpack messages

Consider a variable of type `VariableCellArrayReal` where the second dimension is high (for example 38400).

When we want to synchronize this variable between GPUs, the messages are packed and unpacked on GPU using the Accelerator API: https://github.com/arcaneframework/framework/blob/74aa8336ad10bd5863c43aad0dc4c261d0be7fb9/arcane/src/arcane/accelerator/MemoryCopier.cc#L80.

In the case where the number of items (`nb_index`) is low and the second dimension (`sub_size`) is really high, the `_copyFrom` and `_copyTo` methods are expansive (because not enough parallelism).

Is it possible to parallelize both on `nb_index` and `sub_size` thanks to a `RUNCOMMAND_LOOP2`?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`VariableCellArrayReal::synchronize`: parallelize on item and second dimension when pack/unpack messages #1981

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

VariableCellArrayReal::synchronize: parallelize on item and second dimension when pack/unpack messages #1981

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`VariableCellArrayReal::synchronize`: parallelize on item and second dimension when pack/unpack messages #1981