Hi,
We are using scythe to trim 3' adapters, but we found very strange behavior with this
sequence (in.fq):
@014_1000001169_x1
AAAAAAGATGCCAGTTGAAGAACTGATGGAATTCTCGGGTGCCAAAGAACTAAAG
+014_1000001169_x1
BBBB>>1111B1B1BBBBF1BF1BB1B11BBBBAD3A00A0BBDB00BB0D1AB1
and this adapter FASTA file (adapt.fa):
>RPI10
TGGAATTCTCGGGTGCCAAGGAACTCCAGTCACTAGCTTATCTCGTATGCCGTCTTCTGCTTG
Running this command:
scythe -o out.fq -m match.txt -a adapt.fa in.fq
produces this FASTQ file (out.fq):
@014_1000001169_x1
N
+
B
Why does scythe trim the entire read? The match file (match.txt) contains:
p(c|s): 1.000000; p(!c|s): 0.000000; adapter: RPI10
014_1000001169_x1
TGGAATTCTCGGGTGCCAAGGAACTCCAG
||||||||||||||||||| |||||  ||
TGGAATTCTCGGGTGCCAAAGAACTAAAG
B11BBBBAD3A00A0BBDB00BB0D1AB1
[1.00, 0.97, 0.97, 1.00, 1.00, 1.00, 1.00, 1.00, 1.00, 0.98, 1.00, 0.97, 0.97, 1.00, 0.97, 1.00, 1.00, 1.00, 1.00, 0.97, 0.97, 1.00, 1.00, 0.97, 1.00, 0.97, 1.00, 1.00, 0.97]
So it should trim only the 3' region, like this (based on the matched region):
@014_1000001169_x1
AAAAAAGATGCCAGTTGAAGAACTGA
+014_1000001169_x1
BBBB>>1111B1B1BBBBF1BF1BB1
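
To make the expected result concrete, here is a minimal Python sketch that derives the trim position from the matched region reported in match.txt and cuts the read and its quality string there. It is only an illustration of the output we expect, not scythe's actual algorithm; the helper names (mismatch_positions, trim_at) are made up for this example.

```python
def mismatch_positions(adapter, read, start):
    """Positions (relative to the adapter start) where the adapter prefix
    and the read suffix beginning at `start` disagree."""
    suffix = read[start:]
    return [i for i, (a, b) in enumerate(zip(adapter, suffix)) if a != b]


def trim_at(seq, qual, start):
    """Keep only the bases and qualities located before `start`."""
    return seq[:start], qual[:start]


# Data copied from in.fq, adapt.fa and match.txt above.
seq = "AAAAAAGATGCCAGTTGAAGAACTGATGGAATTCTCGGGTGCCAAAGAACTAAAG"
qual = "BBBB>>1111B1B1BBBBF1BF1BB1B11BBBBAD3A00A0BBDB00BB0D1AB1"
adapter = "TGGAATTCTCGGGTGCCAAGGAACTCCAGTCACTAGCTTATCTCGTATGCCGTCTTCTGCTTG"
matched = "TGGAATTCTCGGGTGCCAAAGAACTAAAG"  # read segment shown in match.txt

# The matched segment is the 3' end of the read, so the adapter
# contamination starts at len(seq) - len(matched).
assert seq.endswith(matched)
start = len(seq) - len(matched)

mismatches = mismatch_positions(adapter, seq, start)
trimmed_seq, trimmed_qual = trim_at(seq, qual, start)

print("adapter starts at 0-based position:", start)
print("mismatch positions within the match:", mismatches)
print(trimmed_seq)
print(trimmed_qual)
```

With this input the match starts at position 26 of the 55-base read, so 26 bases should remain rather than a single N.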