Hello,
We built an index over RefSeq genomes. The downloaded files are named like this:
/path/GCF_000019125.1_ASM1912v1_genomic.fna.gz
/path/GCF_000019165.1_ASM1916v1_genomic.fna.gz
...
When searching the index, the reference names in the result are cut off at the first dot:
*query1 XXX
GCF_000019125 XXX
GCF_000019165 XXX
...
Luckily for us, the truncated names are still unique, so with some effort we should be able to map the output back to the full reference names.
However, this output format is lossy: if the names were not unique before the first dot, the references could no longer be told apart, which might even lead to severe false negatives if the user does not notice.
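For anyone hitting the same problem, here is a minimal Python sketch of the workaround described above. It assumes the reported name is simply the file's basename cut at the first dot, which matches what we see but is not confirmed from the tool's source. The helper names `build_lookup` and `find_collisions` are made up for this illustration.

```python
from collections import defaultdict
from pathlib import Path


def build_lookup(paths):
    """Map each truncated name (basename up to the first dot) to its full paths.

    Assumption: the search output truncates names at the first dot.
    """
    lookup = defaultdict(list)
    for p in paths:
        truncated = Path(p).name.split(".", 1)[0]
        lookup[truncated].append(p)
    return lookup


def find_collisions(lookup):
    """Return the truncated names that map to more than one input file."""
    return {name: files for name, files in lookup.items() if len(files) > 1}


paths = [
    "/path/GCF_000019125.1_ASM1912v1_genomic.fna.gz",
    "/path/GCF_000019165.1_ASM1916v1_genomic.fna.gz",
]
lookup = build_lookup(paths)
collisions = find_collisions(lookup)
```

Running `find_collisions` on the input list before building the index would show whether the truncated names are ambiguous. If it returns an empty dict, `lookup` can be used to translate each reported name back to its full path.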
Best,
Svenja