python-bio-utils

A collection of Python scripts for bioinformatics tasks, developed for coursework and personal exploration.
These tools demonstrate practical use of file parsing, sequence analysis, and command-line scripting with a focus on biological data formats like FASTA, GFF, and SAM.

📁 Contents

Script	Description
`seq_tools.py`	Utilities to load FASTA files, compute reverse complements, find ORFs, and translate DNA to protein.
`SAMParser.py`	Extracts transcript-level read counts from aligned SAM/BAM files.
`homology.py`	Identifies reciprocal best homologs based on pairwise BLAST XML comparisons.
`gene_annotation_search.py`	CLI tool to search an SQLite gene annotation database by keyword.
`gffparser.py`	Extracts gene names from GFF files by chromosome and coordinate range.
`test.py`, `test_debug.py`	Miscellaneous or scratch code for testing ideas.

🧪 Features

Manual parsing of GFF, FASTA, and BLAST formats
Command-line interfaces using argparse
Clean class-based sequence utilities
Spike-in normalization and data extraction workflows

🚀 Getting Started

Each script is standalone. You can run any tool from the command line. Example:

python gffparser.py -i example.gff -c Chr1 -s 10000 -e 50000

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

python-bio-utils

📁 Contents

🧪 Features

🚀 Getting Started

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
SAMParser.py		SAMParser.py
gene_annotation_search.py		gene_annotation_search.py
gffparser.py		gffparser.py
homology.py		homology.py
seq_tools.py		seq_tools.py
test.py		test.py
test_debug.py		test_debug.py

Folders and files

Latest commit

History

Repository files navigation

python-bio-utils

📁 Contents

🧪 Features

🚀 Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages