GenomeScratch

GenomeScratch is a user-friendly tool designed for effortlessly generating virtual reference genomes, accompanied by GFF and GTF files. Tailored for testing workflows and training purposes, it enables users to simulate genomic scenarios and optimize bioinformatics processes in a controlled, lightweight environment. This tool is especially valuable when high-performance computing resources are scarce, allowing researchers to refine their methodologies efficiently before transitioning to real data. Whether for educational use or practical testing, GenomeScratch provides a convenient solution for genomics and bioinformatics applications.

Installation:

git clone https://github.com/saifeldeen-bio/GenomeScratch.git
cd GenomeScratch/
sudo mv GenomeScratch ../path-to/usr/bin

Usage:

GenomeScratch -h [manual] -l <nline> -d/-r <DNA/RNA> -n <fileName> -o <outDir> [optional: -t/-f <GTF/GFF> -ng <nGenes> -gl <geneLength>]

Parameters:

<nline>: Choose the desired number of sequence lines, each line containing 60 nucleotides.
<DNA/RNA>: Specify the sequence type (DNA or RNA).
<fileName>: Provide a name for the output files.
<outDir>: Specify the output directory for the virtual sequences.

Optional Parameters:

-t/--GTF -f/--GFF: Choose either GTF or GFF format for the output (default: none).
-ng/--nGenes: Specify the number of genes (mandatory if using the -t option).
-gl/--geneLength: Specify the length of the genes (default: 500).

Options:

-l/--nline: Specify the number of sequence lines (nucleotides) for each line.
-d/--DNA -r/--RNA: Choose between DNA and RNA as the sequence type.
-n/--name: Specify the name for the output files.
-o/--out: Specify the output directory for the virtual sequences.

Example Usage:

GenomeScratch -l 100 -d -n myGenome -o output_directory -t -ng 50 -gl 800

This software provides a convenient solution for genomics and bioinformatics applications, offering a controlled and efficient environment for testing and training purposes. Researchers can utilize GenomeScratch to simulate various genomic scenarios and optimize their workflows before applying them to real data.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
GenomeScratch		GenomeScratch
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenomeScratch

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GenomeScratch

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages