We need to try and resolve and identify unresolved taxa in the global reference library.
The idea would be to create a spreadsheet that lists problematic accessions (and taxonomy associate with these accessions) for each sequence read so that we can yank them out right after the ecoPCR step and before any unique sequences get merged.
We can start with any of these problematic records that pick up hits with our samples from Fray Jorge, YNP, etc. That way we can at least build the step into the pipeline and clean up one or a few of the worst offenders.
We can also tackle them systematically, which would allow us to distribute effort across lab members (especially if we have grad students / postdocs facilitating contributions by students who want to begin getting more involved).
We need to try and resolve and identify unresolved taxa in the global reference library.
The idea would be to create a spreadsheet that lists problematic accessions (and taxonomy associate with these accessions) for each sequence read so that we can yank them out right after the ecoPCR step and before any unique sequences get merged.
We can start with any of these problematic records that pick up hits with our samples from Fray Jorge, YNP, etc. That way we can at least build the step into the pipeline and clean up one or a few of the worst offenders.
We can also tackle them systematically, which would allow us to distribute effort across lab members (especially if we have grad students / postdocs facilitating contributions by students who want to begin getting more involved).