👋 Hi, I’m Hanaan
🧬 I’m passionate about leveraging computational tools (Python, R) to solve biological problems, especially in genomics and transcriptomics.
🔬 I’m currently sharpening my skills in RNA-seq analysis and building robust data pipelines.
🤝 I’m looking to collaborate on open-source bioinformatics projects or genomics research.
📫 Reach me at: aminahanaan0310@gmail.com
Tech Stack: Nextflow, BWA-MEM, GATK, fastp, Picard, bcftools, Conda
Developed a scalable pipeline for whole-exome sequencing (WES) analysis that automates the workflow from raw reads to filtered variants and handles multiple cohort samples. Built with Nextflow, following the GATK Best Practices documentation.
Tech Stack: Python, Scikit-learn, Logistic Regression, Random Forest, XGBoost, Cross-validation, Hyperparameter tuning, SHAP, PCA, Top features
Developed and documented a machine learning workflow that goes from raw gene expression data to the identification of potential biomarkers in Alzheimer's disease. The dataset contains 206 samples and 19,297 genes, with each sample labeled as either control or condition.
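As a rough illustration of this kind of workflow, here is a minimal sketch of cross-validated classification followed by feature ranking. It is not the project's actual code: the data are synthetic (`make_classification` stands in for an expression matrix), the 500-feature size and the `C=0.1` setting are arbitrary assumptions, and ranking by absolute logistic-regression coefficient is a simple stand-in for the SHAP-based ranking listed in the stack.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for a samples x genes expression matrix (assumption).
X, y = make_classification(
    n_samples=206, n_features=500, n_informative=20, random_state=0
)

# Scaling lives inside the pipeline so it is re-fit on each training fold,
# which avoids leaking information from the held-out fold.
model = make_pipeline(StandardScaler(), LogisticRegression(C=0.1, max_iter=2000))

# Stratified folds keep the control/condition ratio similar in every split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")

# Fit on all samples and rank features by absolute coefficient size.
model.fit(X, y)
coefs = model.named_steps["logisticregression"].coef_.ravel()
top_features = np.argsort(np.abs(coefs))[::-1][:10]

print("ROC AUC per fold:", np.round(scores, 3))
print("Top feature indices:", top_features)
```

With real expression data, `X` would be the normalized expression matrix and the indices in `top_features` would map back to gene identifiers.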
Tech Stack: Bash scripting, Python, R, FastQC, Trimmomatic, HISAT2, SAMtools, featureCounts, DESeq2, GSEA
Implemented and documented a complete bulk RNA-seq workflow, from raw SRA data to differential expression and pathway enrichment analysis, for 8 samples under 4 conditions (~78,894 genes).
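To give a feel for the differential expression step, the sketch below computes counts-per-million and a per-gene log2 fold change between two groups using only the Python standard library. This is a deliberately simplified stand-in, not DESeq2: it has no dispersion modeling or statistical testing, and the gene names, counts, and the `cpm` and `log2_fold_change` helpers are hypothetical.

```python
import math


def cpm(counts):
    """Convert one sample's raw counts {gene: count} to counts per million."""
    total = sum(counts.values())
    return {gene: count * 1e6 / total for gene, count in counts.items()}


def log2_fold_change(control, treated, pseudocount=1.0):
    """Per-gene log2 ratio of mean treated CPM to mean control CPM.

    The pseudocount keeps the ratio defined when a gene has zero counts.
    """
    result = {}
    for gene in control[0]:
        mean_control = sum(sample[gene] for sample in control) / len(control)
        mean_treated = sum(sample[gene] for sample in treated) / len(treated)
        result[gene] = math.log2(
            (mean_treated + pseudocount) / (mean_control + pseudocount)
        )
    return result


# Hypothetical toy counts: two control and two treated samples.
control = [
    cpm({"geneA": 100, "geneB": 300, "geneC": 600}),
    cpm({"geneA": 100, "geneB": 300, "geneC": 600}),
]
treated = [
    cpm({"geneA": 400, "geneB": 300, "geneC": 300}),
    cpm({"geneA": 400, "geneB": 300, "geneC": 300}),
]

lfc = log2_fold_change(control, treated)
for gene, value in sorted(lfc.items()):
    print(f"{gene}\t{value:+.3f}")
```

In a real analysis, the counts would come from featureCounts output, and DESeq2 would handle normalization, shrinkage, and significance testing.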
Tech Stack: R, Seurat
Tech Stack: Python, Scikit-learn, Logistic Regression, Decision Tree, Random Forest, Hyperparameter tuning
A series of tasks covering core bioinformatics skills.
