A new Fonda scripting/launching approach #162

## Motivation

There are two crucial drawbacks in the current Fonda implementation:

- deadlocks
- idle resources

Both problems stem from the way scripts are launched. Scripts are executed in two ways:

1. The first group of scripts (e.g. alignment/post alignment scripts) is launched by Fonda, all at the same time.
2. The second group of scripts (e.g. featureCount, cufflinks) is launched from within the alignment/post alignment scripts.

All scripts are coordinated by checking the log files that the scripts produce. However, a log file is never created if its script was never invoked (e.g. a script that is supposed to be launched from an alignment script that failed earlier). In this case, the post-process scripts that wait for that log will run forever.

![image](https://user-images.githubusercontent.com/23356251/86462320-1a92d180-bd34-11ea-9df4-25f5aeaece87.png)

**Picture 1. The current Fonda launching approach**

> For example, post-process scripts (_qcsummary.sh, cufflinks_cohort.sh_ - see **Picture 1**) are launched 
> simultaneously with the alignment scripts. The post-process _cufflinks_cohort.sh_ script waits for the result of the _cufflinks.sh_ 
> script by checking for the _cufflinks.log_ file. However, the _alignment_ script can fail before it invokes _cufflinks.sh_. 
> In that case _cufflinks_cohort.sh_ is never notified and runs indefinitely.
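
The failure mode can be illustrated with a minimal sketch of log-based waiting. The function name `wait_for_log`, its arguments and the retry bound are hypothetical and are not taken from Fonda's generated scripts; the bound exists only so that the example terminates. Without it, a log file that is never created keeps the loop running forever.

```shell
#!/usr/bin/env bash
# Hypothetical illustration of log-based coordination; not Fonda's actual code.

# Poll until the log file exists, giving up after max_attempts checks.
# Arguments: log file path, maximum number of checks, seconds between checks.
wait_for_log() {
    local log_file=$1 max_attempts=$2 interval=${3:-1}
    local attempt
    for (( attempt = 1; attempt <= max_attempts; attempt++ )); do
        if [ -f "$log_file" ]; then
            return 0    # the upstream script was invoked and wrote its log
        fi
        sleep "$interval"
    done
    return 1            # the log never appeared within the allowed checks
}
```

A caller such as `wait_for_log cufflinks.log 60 10 || exit 1` would give up after ten minutes instead of hanging, but it still cannot tell a slow upstream script from one that was never launched.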

**Deadlocks** are specific to launching on an SGE cluster. Each script is submitted as an SGE cluster job. 
A job has specific resource requirements - the number of slots defined by the user in the Fonda global config file 
(_NUMTHREADS_ parameter in _Queue_Parameters_ section). The total number of slots is equal to the number of processors in the cluster. 
The user can request so many slots per job that the cluster is too small to run all the jobs that depend on each other. In this case, a job hangs in the pending state (qw). 

> For example, the cluster has 8 CPUs and the user sets _NUMTHREADS=4_. Fonda first launches 3 scripts: _alignment.sh_, _qcsummary.sh_ and _cufflinks_cohort.sh_. Two of them (_alignment.sh_, _qcsummary.sh_) start running, while the _cufflinks_cohort.sh_ job stays in the qw state (queued and waiting). In turn, _alignment.sh_ invokes _cufflinks.sh_ and _featureCount.sh_ and waits for their results. But the cluster has no free slots, so _cufflinks.sh_ and _featureCount.sh_ remain pending and the _alignment.sh_ job waits for their results forever.
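
The arithmetic behind this example can be written out as a short sketch. It assumes that every job occupies exactly _NUMTHREADS_ slots; the helper `max_concurrent_jobs` and the variable names are hypothetical.

```shell
# Hypothetical helper: how many jobs of a given width fit into the cluster at once.
max_concurrent_jobs() {
    local cluster_slots=$1 slots_per_job=$2
    echo $(( cluster_slots / slots_per_job ))
}

cluster_slots=8   # cluster size from the example
numthreads=4      # NUMTHREADS from the example
submitted=3       # alignment.sh, qcsummary.sh, cufflinks_cohort.sh

capacity=$(max_concurrent_jobs "$cluster_slots" "$numthreads")
running=$(( submitted < capacity ? submitted : capacity ))
pending=$(( submitted - running ))
free_slots=$(( cluster_slots - running * numthreads ))

echo "running=$running pending=$pending free_slots=$free_slots"
# prints: running=2 pending=1 free_slots=0
```

With zero free slots, the jobs that _alignment.sh_ submits can only start once a running job finishes, and _alignment.sh_ itself cannot finish until they do.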

Thus, from the very beginning, post-process jobs occupy resources without performing useful work. In the case of an autoscaling cluster, this leads instead to idle resources. 

## Approach

We propose a new approach to scripts launching.

![image](https://user-images.githubusercontent.com/23356251/86462428-4910ac80-bd34-11ea-84ef-873fa3822cc0.png)

**Picture 2. The new proposed approach**

As shown in Picture 2, we create an additional orchestrator script, **master.sh**, to manage all scripts. 
Fonda will only run the **master.sh** script directly. Initially, the master script starts all _alignment.sh_ scripts and waits for their results. After the alignment step completes successfully, the _cufflinks.sh_, _featureCount.sh_ and similar scripts are launched if they are needed.
Note that we intend to remove the launching of other scripts from the alignment/post alignment scripts. 
After the per-sample scripts have completed successfully, the **master.sh** script launches the post-process scripts.
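
The staged flow described above could be sketched as follows. This is a minimal illustration and not the actual **master.sh**: the function names `run_stage` and `run_pipeline` and the arrays `ALIGNMENT`, `PER_SAMPLE` and `POST_PROCESS` are hypothetical, and each array entry is assumed to be a command line that blocks until its script has finished.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a master.sh-style orchestrator; not Fonda's actual script.

# Run every command of one stage in parallel and wait for all of them.
# Returns non-zero if any command of the stage failed.
run_stage() {
    local pids=() pid cmd failed=0
    for cmd in "$@"; do
        bash -c "$cmd" &          # each entry must block until its script finishes
        pids+=("$!")
    done
    for pid in "${pids[@]}"; do
        wait "$pid" || failed=1   # collect the exit status of every command
    done
    return "$failed"
}

# Run the stages one after another and stop at the first stage that fails.
# ALIGNMENT, PER_SAMPLE and POST_PROCESS are arrays defined by the caller.
run_pipeline() {
    run_stage "${ALIGNMENT[@]}"    || { echo "alignment stage failed" >&2; return 1; }
    run_stage "${PER_SAMPLE[@]}"   || { echo "per-sample stage failed" >&2; return 1; }
    run_stage "${POST_PROCESS[@]}" || { echo "post-process stage failed" >&2; return 1; }
}
```

On an SGE cluster, one way to make an entry block would be a synchronous submit such as `qsub -sync y alignment.sh`, which waits for the job to complete; whether **master.sh** would use this or another mechanism is an assumption here. Because a later stage starts only after the previous one has succeeded, no job holds slots while waiting for a script that has not been submitted yet.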

To sum up, the proposed changes are:

- create a new **master.sh** script to manage all scripts
- remove the launching of scripts from the alignment/post alignment scripts
- launch the pipeline stages sequentially

This approach is intended to eliminate the problems described above and to make the process of launching scripts more transparent.
At the same time, it preserves the parallelization of processes where possible.
