Slow cache recovery after sample removal and resume #187

@NureSharafNowerSamia

Describe the bug
If one sample causes a problem while the pipeline is running, the entire pipeline fails. After removing the problematic sample and resuming the run, the pipeline takes significantly longer to recover and reuse cached results than earlier versions did. Although the cache is flagged, the pipeline appears to redo parts of the analysis instead of using the cached results.

Impact
This issue is causing much longer run times and inefficient resource usage when recovering from an error. It slows down the workflow considerably, especially when working with large datasets.

To Reproduce

Environment: Local (Docker profile used)

Pipeline version: cdcgov/phoenix v2.1.1

Command run: nextflow run main.nf -profile docker -entry PHOENIX --input <samplesheet.csv> --kraken2db

Custom config file: Not used

Text of the error:
No specific error text upon resuming. After removing the faulty sample, the resumption process is significantly slower, and re-analysis of already completed parts occurs even though the cache is flagged.

Files that caused the error:
Available upon request if needed.

Expected behavior
After removing the problematic sample and resuming, the pipeline should quickly reuse the cached results and only rerun necessary steps, similar to behavior in earlier versions.

Screenshots
N/A

Logs
Logs can be provided if needed. (.nextflow.log and relevant .command.err / .command.out files)
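
If it helps triage, one rough way to quantify how much work is redone on resume is to tally task lines in .nextflow.log. The sketch below is not part of the pipeline. It assumes the log contains lines shaped like `[ab/123456] Cached process > NAME (tag)` and `[ab/123456] Submitted process > NAME (tag)`, which may differ between Nextflow versions, and `summarize_resume` is a hypothetical helper name.

```python
import re
from collections import Counter

# Assumed line shape (may vary by Nextflow version):
#   "... [ab/123456] Cached process > PROCESS_NAME (tag)"
#   "... [ab/123456] Submitted process > PROCESS_NAME (tag)"
TASK_LINE = re.compile(
    r"\[[0-9a-f]{2}/[0-9a-f]{6}\] (Cached|Submitted) process > (\S+)"
)


def summarize_resume(log_text):
    """Count cached vs. submitted (re-executed) tasks per process name."""
    counts = {"Cached": Counter(), "Submitted": Counter()}
    for line in log_text.splitlines():
        match = TASK_LINE.search(line)
        if match:
            status, process = match.groups()
            counts[status][process] += 1
    return counts
```

Calling `summarize_resume` on the contents of .nextflow.log from a resumed run would give, per process, how many tasks were taken from the cache and how many were submitted again. A process with a high submitted count after the sample removal would be a candidate for the slow recovery.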

Additional context
In previous versions of the pipeline, resuming after error recovery was much faster and more efficient. This change in behavior might be related to recent updates in the pipeline structure or Nextflow version handling.
