GitHub - karajones/beeDB: Metabarcoding database curation

R Scripts for Nearctic Bee Database (beeDB)

Caution

This repository is currently under development. This repository is not currently intended for use by anyone other than me.

The purpose of these scripts is to create a comprehensive metabarcoding reference sequence database for all known bee species in the Nearctic (i.e., most of the US and Canada) for all primer pairs that could work on bees. The end goals are to identify how many bees have reference sequences and which primer pairs are most likely to be effective in amplifying bees so that researchers can make more informed choices for metabarcoding.

Currently uploaded scripts (need to finish documentation):

Compile a checklist of all known bee species in the Nearctic and their possible synonyms
Identify sequence data available for those bee species from NCBI's nucleotide database
Clean up and add taxonomic info to NCBI sequence data
Prepare data downloaded from BOLD and merge with NCBI data
Use simplified in silico PCR to extract homologous seed sequences with primers
BLAST seed sequences against database to identify homologous sequences that lack primer regions
Clean up resulting database to remove duplicates and identify potential taxonomic conflicts

Scripts for the next steps are still being validated:

Validate sequence homology using trees
Calculate barcode gap
Summarize results
Shiny interface for visualizing results and accessing beeDB

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
01_taxa_list_cleaning.R		01_taxa_list_cleaning.R
02_get_NCBI_sequences.R		02_get_NCBI_sequences.R
03_clean_NCBI_data.R		03_clean_NCBI_data.R
04_clean_BOLD_data.R		04_clean_BOLD_data.R
05_beeDB_in_silico_PCR.R		05_beeDB_in_silico_PCR.R
06_beeDB_summarize_seed_sequences.R		06_beeDB_summarize_seed_sequences.R
07_beeDB_blast_seed_sequences.R		07_beeDB_blast_seed_sequences.R
DESCRIPTION		DESCRIPTION
NAMESPACE		NAMESPACE
README.md		README.md
beeDB.Rproj		beeDB.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

R Scripts for Nearctic Bee Database (beeDB)

About

Uh oh!

Releases

Packages

Uh oh!

Languages

karajones/beeDB

Folders and files

Latest commit

History

Repository files navigation

R Scripts for Nearctic Bee Database (beeDB)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages