-
Notifications
You must be signed in to change notification settings - Fork 7
Description
Hi,
Thank you for developing this pipeline. This is very helpful for strain-level identification.
I used it for strain-level identification within the species. It worked very well. Then, I tried to identify strains from three highly related species. After building the database while running the strainscan pipeline it gave the following error. I don't know if this has something to do with the update or any other error related to k-mer.
Will appreciate your help.
Thanks,
Traceback (most recent call last):
File "/blue/env/bin/strainscan", line 10, in
sys.exit(main())
File "/blue/env/lib/python3.7/site-packages/StrainScan/StrainScan.py", line 198, in main
cls_dict=identify.identify_cluster(in_fq,db_dir+'/Tree_database',[0.1,0.4,1])
File "/blue/env/lib/python3.7/site-packages/StrainScan/library/identify.py", line 406, in identify_cluster
match_results = jellyfish_count(fq_path, db_dir)
File "/blue/env/lib/python3.7/site-packages/StrainScan/library/identify.py", line 98, in jellyfish_count
match_results[kmer_index_dict[temp[0]]] = int(temp[1])
KeyError: 'TGCGCACCTTTGCGCTGCTGGTGGCGGTGCA'