Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
7 changes: 6 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -35,4 +35,9 @@ data/D105heartW632_361712_originally_labeled_D120/
Users
data/D105skinW6_32_305183/
hla_data.js
hla_data.json
hla_data.json

#ignore extracted data
data/ref/*

uMAP.py
2 changes: 1 addition & 1 deletion .vscode/settings.json
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
{
"liveServer.settings.port": 5501
"liveServer.settings.port": 5502
}
63 changes: 35 additions & 28 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -40,11 +40,6 @@ cd HLA-PepClust/
.\hlapepclust-env\Scripts\activate
```

3. **Upgrade `pip`**
```bash
pip install --upgrade pip
```

## Installing Dependencies

1. **Navigate to the project directory** (if not already in it)
Expand Down Expand Up @@ -87,32 +82,36 @@ clust-search <input_data_path> <reference_data_path> \

```bash
clust-search data/D90_HLA_3844874 data/ref_data/Gibbs_motifs_human/output_matrices_human \
--hla_types A0201,A0101,B1302,B3503,C0401 \
--n_clusters 6 \
--species human \
--output test_results \
--processes 4 \
--threshold 0.6
--hla_types A0201,A0101,B1302,B3503,C0401 \ # Specify list of HLA alleles to search
--n_clusters 6 \ # Restrict analysis to 6 Gibbs clusters
--species human \ # Species to evaluate [human, mouse]
--output My_results_directory \ # Directory where results will be saved
--processes 4 \ # Number of parallel processes to use
--threshold 0.7 \ # Correlation threshold for motif matching
--topNHits 3 # Report top-N HLA matches for each Gibbs motif
```

## Command Line Arguments
| Argument | Type | Description | Default |
|------------------|---------|-----------------------------------------------------------------------------|--------------------------|
| `gibbs_folder` | `str` | Path to test folder containing matrices. | *Required* |
| `reference_folder` | `str` | Path to reference folder containing matrices. | *Required* |
| `-o, --output` | `str` | Path to output folder. | `"output"` |
| `-hla, --hla_types` | `list` | List of HLA types to search. | *All* |
| `-p, --processes` | `int` | Number of parallel processes to use. | `4` |
| `-n, --n_clusters` | `str` | Number of clusters to search for. | `"all"` |
| `-t, --threshold` | `float` | Motif similarity threshold. | `0.70` |
| `-s, --species` | `str` | Species to search [Human, Mouse]. | `"human"` |
| `-db, --database` | `str` | Generate a motif database from a configuration file. | `"data/config.json"` |
| `-st, --Searchtype` | `str` | Type of search to perform [Numba, IO]. | `"Numba"` |
| `-k, --best_KL` | `bool` | Find the best KL divergence only. | `False` |
| `--topNHits` | `int` | Number of top hits to retain per Gibbs matrix. | `3` |
| `-l, --log` | `bool` | Enable logging. | `False` |
| `-im, --immunolyser` | `bool` | Enable immunolyser output. | `False` |
| `-npDB, --NumbaDB` | `str` | Path to the Array database folder. | `"data/ref_data/human_db"` |
| `-c, --credits` | `bool` | Show credits for the motif database pipeline. | `False` |
| `-v, --version` | `bool` | Show the version of the pipeline. | `False` |

| Argument | Type | Description | Default |
|----------|------|-------------|---------|
| `gibbs_folder` | `str` | Path to test folder containing matrices. | *Required* |
| `reference_folder` | `str` | Path to reference folder containing matrices. | *Required* |
| `-o, --output` | `str` | Path to output folder. | `"output"` |
| `-hla, --hla_types` | `list` | List of HLA types to search. | *All* |
| `-p, --processes` | `int` | Number of parallel processes to use. | `4` |
| `-n, --n_clusters` | `int` | Number of clusters to search for. | `"all"` |
| `-t, --threshold` | `float` | Motif similarity threshold. | `0.5` |
| `-s, --species` | `str` | Species to search [Human, Mouse]. | `"human"` |
| `-db, --database` | `str` | Generate a motif database from a configuration file. | `"data/config.json"` |
| `-k, --best_KL` | `bool` | Find the best KL divergence only. | `False` |
| `-l, --log` | `bool` | Enable logging. | `False` |
| `-im, --immunolyser` | `bool` | Enable immunolyser output. | `False` |
| `-c, --credits` | `bool` | Show credits for the motif database pipeline. | `False` |
| `-v, --version` | `bool` | Show the version of the pipeline. | `False` |

## Example Output

Expand Down Expand Up @@ -144,7 +143,15 @@ Example of `input` folder path:

![Example Output](assets/img/google-colab.png)

### Citation

If you use **MHC-TP** in your research, please cite:

**Immunolyser 2.0: an advanced computational pipeline for comprehensive analysis of immunopeptidomic data**
Prithvi Raj Munday¹,†, Sanjay S.G. Krishna¹,†, Joshua Fehring¹, Nathan P. Croft¹, Anthony W. Purcell¹, Chen Li¹,², and Asolina Braun¹
¹Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Clayton, VIC, 3800, Australia
²Department of Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC 3168, Australia

More detailed instructions coming soon... 🚀
*Computational and Structural Biotechnology Journal*

More detailed instructions coming soon...
Loading