🧪 Metadoon

User-friendly graphical interface and pipeline for amplicon-based metagenomic data analysis.

Metadoon automates the workflow from FASTQ preprocessing to statistical analysis and visualization in R, using tools such as VSEARCH and phyloseq. It features a streamlined 5-step interface and runs either in Docker or natively via Conda.

📦 What's Included

The environment includes:

| Component | Purpose |
| --- | --- |
| Python 3.10 | GUI interface (Tkinter) and pipeline logic |
| R (Latest) | Statistical analysis and plotting |
| VSEARCH | FASTQ processing (merge, filter, cluster) |
| Libraries | `phyloseq`, `DESeq2`, `ggplot2`, `vegan`, etc. |

🚀 Option 1: One-Click Launchers

Easy start scripts for all platforms.

⚠️ First-Time Setup (Permissions)

For macOS (.command), Linux (.sh), and WSL users only: before running the scripts for the first time, you must grant execution permissions from a terminal.

Open a terminal inside the Metadoon folder.
Run the command:
```
chmod +x *
```

Note: Windows users (.bat) DO NOT need this step. You can run the file directly.

1. Prerequisites by OS

Windows and Linux: Docker must be installed (on Windows, enable WSL 2).
macOS: Conda must be installed.
- The macOS .command launcher runs the native Conda version, not Docker.

2. How to Run

Just double-click the launcher for your OS:

🪟 Windows: Double-click Windows_Run.bat (Runs Docker).
🍎 macOS: Double-click MacOS_Run.command (Runs Conda/Native).
🐧 Linux: Run ./Linux_Run.sh (Runs Docker).

🐍 Option 2: Manual Installation (Terminal)

Recommended for Linux/WSL users or advanced users who prefer manual control.

Follow these steps to run Metadoon directly on your system without the one-click scripts.

1. Prerequisites

Conda (Anaconda or Miniconda) must be installed.

2. Installation & Execution

Open your terminal and run the following commands in order:

Step 1: Clone the repository

git clone https://github.com/rdo-adan/Metadoon.git

Step 2: Enter the directory

cd Metadoon/

Step 3: Grant execution permissions (essential to ensure all scripts can run)

chmod +x *

Step 4: Install dependencies (this script creates the metadoon environment and installs R, Python, and VSEARCH)

bash setup.sh

Step 5: Activate environment & Run

conda activate metadoon
python metadoon.py
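
If a step above fails, a quick check of which tools are visible on your PATH can narrow down the cause. The following is a minimal sketch, not part of Metadoon: the helper name `find_tools` is hypothetical, and it assumes the executables carry their usual names (`conda`, `vsearch`, `Rscript`). Note that `vsearch` and `Rscript` would normally only be found after `conda activate metadoon`.

```python
import shutil


def find_tools(names=("conda", "vsearch", "Rscript")):
    """Return {tool_name: full_path_or_None} for each executable looked up on PATH."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    for tool, path in find_tools().items():
        status = f"found at {path}" if path else "NOT FOUND"
        print(f"{tool:10s} {status}")
```

A tool reported as NOT FOUND after activating the environment suggests that `bash setup.sh` did not complete.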

🖥️ Interface & Workflow

The new interface guides you through 5 simple steps:

1. Load FASTQ Files: Select your raw data (file names must contain _R1_ and _R2_).
2. Configure Parameters: Adjust threads, max errors, and databases (optional).
3. RUN PIPELINE: Starts the analysis (Merge -> Filter -> Cluster -> Taxonomy -> Stats).
4. Generate Report: Creates the final HTML summary after the run finishes.
5. Save Results: Exports all tables, plots, and reports to a clean folder.
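
To make the "max errors" parameter in step 2 more concrete: the quality filter works on the expected number of errors in a read (MaxEE), which is the sum of the per-base error probabilities implied by the Phred quality scores. The sketch below illustrates that calculation only. It assumes Phred+33 encoding, the function names are hypothetical, and the threshold of 1.0 is an illustrative value rather than Metadoon's documented default.

```python
def expected_errors(quality, offset=33):
    """Sum of per-base error probabilities for a Phred-encoded quality string.

    Each character encodes Q = ord(char) - offset, and the error
    probability of that base is 10 ** (-Q / 10).
    """
    return sum(10 ** (-(ord(ch) - offset) / 10) for ch in quality)


def passes_maxee(quality, max_ee=1.0):
    """True if the read's expected errors do not exceed max_ee (illustrative threshold)."""
    return expected_errors(quality) <= max_ee


if __name__ == "__main__":
    print(expected_errors("IIII"))  # four bases at Q40
    print(passes_maxee("!!!!"))     # four bases at Q0
```

Lowering the threshold keeps fewer but cleaner reads; raising it keeps more reads at the cost of more sequencing errors entering the clustering step.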

📂 Handling Files (Docker Users)

If using Docker (Windows/Linux script), Metadoon maps your local folders:

/workspace ⮕ Metadoon folder (Results saved here).
/app/YOUR_DATA ⮕ User Profile (Documents, Downloads).
/app/C_Drive ⮕ C: Drive (Windows only).

💡 Native/macOS Users: You have direct access to your entire file system.
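
When browsing for files inside the Docker version, it may help to know where a host path appears in the container. The sketch below translates a Windows host path using the mount points listed above. It is an illustration only: the helper `to_container_path`, the user name `alice`, and the exact host folders behind each mount are assumptions, so check the launcher script for the real bind mounts.

```python
from pathlib import PurePosixPath, PureWindowsPath


def to_container_path(host_path, mounts):
    """Translate a Windows host path into its location inside the container.

    `mounts` is an ordered list of (host_prefix, container_prefix) pairs;
    the first matching prefix wins, so list more specific prefixes first.
    Returns None when no mount covers the path.
    """
    path = PureWindowsPath(host_path)
    for host_prefix, container_prefix in mounts:
        try:
            relative = path.relative_to(PureWindowsPath(host_prefix))
        except ValueError:
            continue  # this mount does not contain the path
        return str(PurePosixPath(container_prefix, *relative.parts))
    return None


# Hypothetical host folders; the container side comes from the list above.
EXAMPLE_MOUNTS = [
    (r"C:\Users\alice", "/app/YOUR_DATA"),
    ("C:\\", "/app/C_Drive"),
]

if __name__ == "__main__":
    print(to_container_path(r"C:\Users\alice\Documents\run1", EXAMPLE_MOUNTS))
    print(to_container_path(r"C:\data\reads", EXAMPLE_MOUNTS))
```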

⚙️ Pipeline Details

Merge Pairs: Merges R1 and R2 using VSEARCH.
Quality Filter: Filters reads based on MaxEE.
Dereplication: Identifies unique sequences.
Clustering: OTU (97%) or ASV (Denoising).
Chimera Removal: De novo + Reference-based.
Taxonomy: SINTAX algorithm.
Statistics (R): Alpha/Beta Diversity, Rarefaction, DESeq2, ANCOM-BC.
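
For readers who want to see roughly what the stages above look like as VSEARCH calls, here is a hedged sketch that only builds the command lines (it does not run them). It is not Metadoon's actual invocation. The function name, the per-sample file names, and the default values are assumptions. It covers the OTU path with de novo chimera removal only and skips the concatenation, reference-based chimera, and ASV denoising steps. The VSEARCH options used are standard ones from its manual.

```python
def build_vsearch_commands(r1, r2, sample, db, threads=4, max_ee=1.0, identity=0.97):
    """Return a list of VSEARCH command lines (as argument lists), in pipeline order."""
    merged = f"Merged/{sample}.fastq"          # hypothetical file name
    filtered = f"Filtered/{sample}.fasta"      # hypothetical file name
    derep = f"Dereplicated/{sample}.fasta"     # hypothetical file name
    centroids = "OTUs/centroids.fasta"
    otus = "OTUs/otus.fasta"
    taxonomy = "Taxonomy/taxonomy_raw.txt"
    return [
        # 1. Merge paired-end reads
        ["vsearch", "--fastq_mergepairs", r1, "--reverse", r2,
         "--fastqout", merged, "--threads", str(threads)],
        # 2. Quality filter on expected errors (MaxEE)
        ["vsearch", "--fastq_filter", merged,
         "--fastq_maxee", str(max_ee), "--fastaout", filtered],
        # 3. Dereplicate, keeping abundance annotations
        ["vsearch", "--derep_fulllength", filtered,
         "--output", derep, "--sizeout"],
        # 4. Cluster into OTUs at the given identity
        ["vsearch", "--cluster_size", derep, "--id", str(identity),
         "--centroids", centroids, "--sizein", "--sizeout",
         "--threads", str(threads)],
        # 5. De novo chimera removal
        ["vsearch", "--uchime_denovo", centroids, "--nonchimeras", otus],
        # 6. Taxonomic assignment with SINTAX
        ["vsearch", "--sintax", otus, "--db", db, "--tabbedout", taxonomy],
    ]


if __name__ == "__main__":
    for cmd in build_vsearch_commands("S1_R1_.fastq", "S1_R2_.fastq", "S1", "DB/ref.fasta"):
        print(" ".join(cmd))
```

Each inner list can be passed to `subprocess.run(cmd, check=True)` if you want to execute the steps by hand.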

📁 Project Structure

Metadoon automatically manages file organization.

Core Files (Before Run)

Metadoon/
│
├── metadoon.py              # Main GUI script
├── Analise.R                # Statistical analysis script (R)
├── generate_report.R        # Report generation script
├── Metadoon_Report.Rmd      # RMarkdown template
├── pipeline_params.json     # Configuration file
├── metadoon_env.yaml        # Conda environment definition
├── setup.sh                 # Native installation script (Linux)
├── LICENSE                  # License file
├── Readme.md                # Project documentation
├── Windows_Run.bat          # One-click launchers (Docker on Windows/Linux, Conda on macOS)
├── MacOS_Run.command
├── Linux_Run.sh
└── Example_Data.txt         # Links to download a test dataset

Generated Directories (After Run)

Once the pipeline runs, Metadoon creates specific folders to organize the workflow:

Metadoon/
│
├── DB/                      # Downloaded reference databases (RDP, Silva, etc.)
├── Metadata File/           # Stores the uploaded metadata file
├── Tree File/               # Stores the phylogenetic tree (if provided)
│
├── Merged/                  # Paired-end reads merged by VSEARCH
├── FullFiles/               # Concatenated merged reads
├── Filtered/                # Quality filtered sequences
├── Dereplicated/            # Unique sequences (dereplication)
│
├── OTUs/                    # Clustering results
│   ├── centroids.fasta      # Representative sequences
│   ├── otus.fasta           # Final OTUs/ASVs (non-chimeric)
│   └── otutab.txt           # Abundance table
│
├── Taxonomy/                # Taxonomic classification results
│   ├── taxonomy_raw.txt     # Raw output from SINTAX
│   └── taxonomy.txt         # Cleaned taxonomy table for R
│
└── Output/                  # FINAL RESULTS
    ├── Plots (Alpha/Beta diversity, Heatmaps, Rarefaction)
    ├── Statistical Tables (DESeq2, ANCOM-BC, PERMANOVA)
    └── Metadoon_Report.html # Complete HTML Summary
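
After a run, you may want to confirm that the key files from the layout above were produced. This is a small sketch under the assumption that the file names match the tree exactly; the helper `missing_outputs` is hypothetical and not shipped with Metadoon.

```python
from pathlib import Path

# Paths taken from the directory tree above, relative to the Metadoon folder.
EXPECTED_OUTPUTS = (
    "OTUs/centroids.fasta",
    "OTUs/otus.fasta",
    "OTUs/otutab.txt",
    "Taxonomy/taxonomy_raw.txt",
    "Taxonomy/taxonomy.txt",
    "Output/Metadoon_Report.html",
)


def missing_outputs(root="."):
    """Return the expected output files that do not exist under `root`."""
    root = Path(root)
    return [rel for rel in EXPECTED_OUTPUTS if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_outputs(".")
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected outputs are present.")
```

`Metadoon_Report.html` only appears after the Generate Report step, so it is expected to be missing right after the pipeline itself finishes.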

⚠️ Input Data Requirements

Format: Illumina paired-end reads in .fastq files.
Naming: File names must contain _R1_ and _R2_.
No Special Characters: Avoid spaces or extra hyphens in sample names.
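
The naming rules above can be checked before loading files into the GUI. The sketch below pairs files by their _R1_/_R2_ tag and reports obvious problems. It is an illustration, not Metadoon's own validation: the function name is hypothetical, and treating everything before the tag as the sample name is an assumption.

```python
import re
from pathlib import Path


def pair_fastq(filenames):
    """Group FASTQ files into {sample: {"R1": name, "R2": name}} and list problems."""
    pairs = {}
    problems = []
    for name in filenames:
        base = Path(name).name
        match = re.search(r"_(R[12])_", base)
        if match is None:
            problems.append(f"{base}: missing _R1_ or _R2_")
            continue
        if " " in base:
            problems.append(f"{base}: contains a space")
        sample = base[: match.start()]  # assumed: sample name precedes the read tag
        pairs.setdefault(sample, {})[match.group(1)] = base
    # Every sample needs both a forward and a reverse file.
    for sample, reads in pairs.items():
        for read in ("R1", "R2"):
            if read not in reads:
                problems.append(f"{sample}: no {read} file")
    return pairs, problems


if __name__ == "__main__":
    files = ["A_S1_L001_R1_001.fastq", "A_S1_L001_R2_001.fastq", "B_R1_001.fastq"]
    pairs, problems = pair_fastq(files)
    print(pairs)
    print(problems)
```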

📬 Contact

For issues or questions: 📧 rdo.adan@gmail.com
