Skip to content

Conversation

@AymanBx
Copy link
Collaborator

@AymanBx AymanBx commented Mar 15, 2025

Just a simple script that demonstrates the use of biopython to read sequences from fasta (.fna) files.

In the test script that I run on only one data file the file had only one sequence in it. From my understanding, there are other types of sequences in which more that one sequence can be found in one fna file but that's not the case for our files.

I also found that The BioPython library has abilities to import datasets directly from the ncbi website using the Entrez module. I did have an issue finding the official way to signup for an entrez account to be able to use it.

While looking at some resources I found a tutorial on using the BioPython library to do machine learning with two existing models, Logistic Regression and K-Nearest Neighbors. Although biopython official documentation has no mention of them. I attempted to import Bio.LogisticRegression but got a warning message saying the that module is deprecated.

@AymanBx AymanBx requested a review from nick-rommel March 15, 2025 05:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants