When a job is composed of a bunch of MPI processes, the efficiency you are computing is wrong #37

@amalkhabouHQ

Hello,
I am using your tool reportseff. I wanted to mention that when a job is composed of x MPI processes (36 in my example below), the computed CPU efficiency is wrong:

reportseff 6196869
    JobID    State          Elapsed  TimeEff   CPUEff   MemEff
  6196869  COMPLETED    12-14:16:39    ---      5.0%     0.9%
 sacct -P -n -a --format JobID,State,AllocCPUS,REQMEM,TotalCPU,Elapsed,MaxRSS,ExitCode,NNodes,NTasks -j  6196869
6196869|COMPLETED|720|191846Mn|451-06:00:24|12-14:16:39||0:0|20|
6196869.batch|COMPLETED|36|191846Mn|451-06:00:24|12-14:16:39|33824748K|0:0|1|1

My CPUs are running at 100%, yet you are reporting 5%. The CPUEff should be TotalCPU / Elapsed / 36, i.e. about 451.25 days / 12.6 days / 36, which is close to 100%, since the 36 MPI processes are running in parallel.
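
For reference, both figures can be reproduced from the sacct output above with a short Python sketch. The helper names `slurm_seconds` and `cpu_efficiency` are hypothetical and are not taken from reportseff's code, and the parser assumes durations of the form `[D-]HH:MM:SS` only (no fractional seconds).

```python
def slurm_seconds(duration: str) -> int:
    """Convert a Slurm duration such as '12-14:16:39' to seconds.

    Assumption: only the '[D-]HH:MM:SS' form is handled.
    """
    days, _, clock = duration.rpartition("-")
    hours, minutes, seconds = (int(part) for part in clock.split(":"))
    return int(days or 0) * 86400 + hours * 3600 + minutes * 60 + seconds


def cpu_efficiency(total_cpu: str, elapsed: str, cpus: int) -> float:
    """CPU efficiency in percent: TotalCPU / (Elapsed * number of CPUs)."""
    return 100 * slurm_seconds(total_cpu) / (slurm_seconds(elapsed) * cpus)


# Values copied from the sacct output for job 6196869.
total_cpu, elapsed = "451-06:00:24", "12-14:16:39"

# Dividing by the 720 CPUs allocated to the whole job gives the reported value.
print(f"{cpu_efficiency(total_cpu, elapsed, 720):.1f}%")  # 5.0%

# Dividing by the 36 CPUs of the .batch step gives the value I expect.
print(f"{cpu_efficiency(total_cpu, elapsed, 36):.1f}%")  # 99.5%
```

The only difference between the two results is the CPU count used as the divisor: AllocCPUS of the parent job (720) versus AllocCPUS of the `.batch` step (36).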
