Issue with loading trusted k-mers #27

@standage

Hi @mourisl,

It was exciting to meet you at CSHL and I appreciate that you took some time to implement the feature we discussed.

I gave it a try this morning, and saving the trusted k-mers file seemed to work fine. At least no error was reported.

$ time lighter \
    -r mother-reads-1.fq -r mother-reads-2.fq \
    -r father-reads-1.fq -r father-reads-2.fq \
    -r proband-reads-1.fq -r proband-reads-2.fq \
    -saveTrustedKmers trusted-kmers \
    -k 25 25000000 0.233 -t 4
[2017-11-06 10:54:06] =============Start====================
[2017-11-06 10:54:07] Bad quality threshold is "7"
[2017-11-06 10:54:54] Finish sampling kmers
[2017-11-06 10:54:54] Bloom filter A's false positive rate: 0.162178
[2017-11-06 10:54:54] The error rate is high. Lighter adjusts -maxcor to 5 and bad quality threshold to "8".
[2017-11-06 10:56:30] Finish storing trusted kmers
[2017-11-06 10:56:30] The trusted kmers are saved in file trusted-kmers.

real	2m23.730s
user	7m45.252s
sys	0m12.727s

There was a problem when I tried to run the error correction though.

$ time lighter -r mother-reads-1.fq -r mother-reads-2.fq  -k 25 25000000 0.233 -t 4 -loadTrustedKmers trusted-kmers -od .
[2017-11-06 11:07:27] =============Start====================
[2017-11-06 11:07:28] Bad quality threshold is "7"
[2017-11-06 11:07:28] Finish loading trusted kmers from file trusted-kmers.
Segmentation fault: 11

real	0m1.548s
user	0m1.385s
sys	0m0.288s

I figured I would try without threading. That avoided the segmentation fault, but Lighter then reported no error-free reads and made no corrections.

$ time lighter -r mother-reads-1.fq -r mother-reads-2.fq  -k 25 25000000 0.233 -loadTrustedKmers trusted-kmers -od .
[2017-11-06 11:07:52] =============Start====================
[2017-11-06 11:07:53] Bad quality threshold is "7"
[2017-11-06 11:07:53] Finish loading trusted kmers from file trusted-kmers.
[2017-11-06 11:17:43] Finish error correction
Processed 6000000 reads:
	0 are error-free
	Corrected 0 bases(0.000000 corrections for reads with errors)
	Trimmed 0 reads with average trimmed bases 0.000000
	Discard 0 reads

real	9m50.509s
user	9m42.973s
sys	0m6.276s

Is it possible I'm invoking the software incorrectly?


I gzipped this data set and uploaded it to AWS, in case that helps to reproduce the error. Please let me know when you've downloaded the data files so I can revoke public access.
