BioPython Library #1
Open
Just a simple script that demonstrates the use of Biopython to read sequences from FASTA (.fna) files.
In the test script, which I ran on only one data file, the file had only one sequence in it. From my understanding, other FASTA files can contain more than one sequence in a single .fna file, but that's not the case for our files.
I also found that the Biopython library can import datasets directly from the NCBI website using the Entrez module. I did have an issue finding the official way to sign up for an Entrez account to be able to use it.
While looking at some resources, I found a tutorial on using the Biopython library to do machine learning with two existing models, Logistic Regression and K-Nearest Neighbors, although the official Biopython documentation has no mention of them. I attempted to import Bio.LogisticRegression but got a warning message saying that the module is deprecated.
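For reference, here is a minimal sketch of reading FASTA records with `Bio.SeqIO.parse`, which handles files with one sequence or several. It is not the script from this PR: the helper name `read_fasta` and the in-memory two-record input are made up for illustration, and a real run would pass the path to a .fna file instead.

```python
from io import StringIO

from Bio import SeqIO


def read_fasta(handle):
    """Return (id, sequence) pairs for every record in a FASTA handle or path."""
    return [(record.id, str(record.seq)) for record in SeqIO.parse(handle, "fasta")]


# Hypothetical two-record input, standing in for a .fna file on disk.
example = StringIO(">seq1 first example\nACGTAC\nGTTA\n>seq2 second example\nGGCC\n")
records = read_fasta(example)
for seq_id, sequence in records:
    print(seq_id, len(sequence))
```

Since our files hold a single sequence each, `SeqIO.read` would also work; it expects exactly one record and raises a `ValueError` otherwise, so `SeqIO.parse` is the safer choice if that assumption might not hold.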