Skip to content

k-mer error while running strainscan #25

@AK-25FL

Description

@AK-25FL

Hi,

Thank you for developing this pipeline. This is very helpful for strain-level identification.
I used it for strain-level identification within the species. It worked very well. Then, I tried to identify strains from three highly related species. After building the database while running the strainscan pipeline it gave the following error. I don't know if this has something to do with the update or any other error related to k-mer.
Will appreciate your help.

Thanks,

Traceback (most recent call last):
File "/blue/env/bin/strainscan", line 10, in
sys.exit(main())
File "/blue/env/lib/python3.7/site-packages/StrainScan/StrainScan.py", line 198, in main
cls_dict=identify.identify_cluster(in_fq,db_dir+'/Tree_database',[0.1,0.4,1])
File "/blue/env/lib/python3.7/site-packages/StrainScan/library/identify.py", line 406, in identify_cluster
match_results = jellyfish_count(fq_path, db_dir)
File "/blue/env/lib/python3.7/site-packages/StrainScan/library/identify.py", line 98, in jellyfish_count
match_results[kmer_index_dict[temp[0]]] = int(temp[1])
KeyError: 'TGCGCACCTTTGCGCTGCTGGTGGCGGTGCA'

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions