ferygood/README.md

Hi there 👋

I'm Yao, a backend and data engineer with a PhD background, based in Berlin.

I build data processing systems and backend services, mainly using Python and cloud-native tools. My work focuses on turning complex data into reliable, production-ready pipelines and APIs.

What I work on

  • Backend services (FastAPI, REST APIs)
  • Data processing pipelines and workflow automation
  • Cloud-based systems (GCP, Docker)
  • Applied machine learning and LLM-powered data workflows

Open-source & Projects

  • TEKRABber - Open-source data analysis package with 8,000+ downloads (GitHub repo)
  • Backend services for sequencing data integration

Pinned

  1. TEKRABber Public

    An R bioinformatics package for differential expression (DE) and correlation analysis, with comparisons between species.

    R 3

  2. preTEKRABber_pipe Public

    A Snakemake pipeline for RNA-seq analysis, designed to prepare datasets for use with TEKRABber.

    Python 1

  3. antifungal-linguist Public

    A language model developed for antifungal medicine discovery. It is a pre-trained T5 base model fine-tuned using step-by-step distillation.

    Jupyter Notebook 1 1

  4. spatial-transcriptomic-visum-pipeline Public

    A multimodal pipeline for integrating spatial transcriptomics data using Scanpy.

    Jupyter Notebook

  5. nf-STAR-TEtranscript Public

    This project aims to develop an nf-core Nextflow pipeline to analyze genes and transposable elements from RNA-seq data.

    Nextflow 1 1

  6. shortURL Public

    An API that shortens long URLs for users.

    Python