From 976d0e649a6a6b31802b2b698632e6b6c3faa179 Mon Sep 17 00:00:00 2001 From: ziadbkh Date: Tue, 1 Jul 2025 16:17:57 +1000 Subject: [PATCH 1/9] update participants --- participants/tsi.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/participants/tsi.md b/participants/tsi.md index 6224541..5e8d259 100755 --- a/participants/tsi.md +++ b/participants/tsi.md @@ -2,7 +2,7 @@ title: Threatened Species Initiative description: Bioinformatics analyses for the Threatened Species Initiative. toc: false -type: ABLeS Participant +type: ABLeS Participant - Completed --- ## Project title From 007de86699b1f69c172f5e805eb7b3fd73a7c3c9 Mon Sep 17 00:00:00 2001 From: Ziad Al-Bkhetan Date: Wed, 2 Jul 2025 16:08:50 +1000 Subject: [PATCH 2/9] Update thyroid-cancer.md --- participants/thyroid-cancer.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/participants/thyroid-cancer.md b/participants/thyroid-cancer.md index a9a083e..3b1196b 100644 --- a/participants/thyroid-cancer.md +++ b/participants/thyroid-cancer.md @@ -1,13 +1,14 @@ --- title: Thyroid Cancer Research Group, The University of Sydney -description: +description: Applying single cell RNA sequencing to understand the immune landscape of thyroid cancer, with the goal of identifying therapeutic targets and biomarkers. toc: false type: ABLeS Participant --- ## Project title -Applying single cell RNA sequencing to understand the immune landscape of thyroid cancer, with the goal of identifying therapeutic targets and biomarkers. +Thyroid Autoimmunity and the Immune Landscape in Thyroid Cancer + ## Collaborators and funding From 12fe8c5e55ace0243335c473827929a28d0a33d9 Mon Sep 17 00:00:00 2001 From: Hiruna Samarakoon Date: Thu, 7 Aug 2025 16:53:46 +1000 Subject: [PATCH 3/9] add dfam database entry --- if89.md | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/if89.md b/if89.md index 2835a64..4041099 100755 --- a/if89.md +++ b/if89.md @@ -221,6 +221,14 @@ A list of the currently available (`as of 25 Jan 2023`) databases is included be Kraken2 pre-built index for RefSeq database (archaea, bacteria, viral, plasmid, human, protozoa & fungi) plus UniVec_Core. + + Dfam + Dfam + 04 Aug 2025 + dfam/04082025/dfam39 + Dfam 3.9; FamDB Format 2.0; Partition 7 [dfam39_full.7.h5]: Mammalia (57 GB) More info - https://www.dfam.org/releases/Dfam_3.9/families/FamDB/README.txt. Available with RepeatMasker/4.2.0 (if89 module) + + @@ -234,3 +242,4 @@ A list of the currently available (`as of 25 Jan 2023`) databases is included be ## if89 Contributors {% include contributor-tiles-all.html custom="Hardip Patel, Ziad Al Bkhetan, J King Chang, Andre Martins Reis, Hasindu Gamaarachchi, Kyle Drover, Tim Amos, Kisaru Liyanage, Terry Bertozzi, Javed Shaikh, Kirat Alreja, Leah Kemp, Andrey Bliznyuk, Hyungtaek Jung, Wenjing Xue, Johan Gustafsson, Dale Roberts" sort=false%} + From 63ed6d0cb9a69d0c4b13c7bf32253e16ce6d8dae Mon Sep 17 00:00:00 2001 From: Johan Gustafsson Date: Wed, 13 Aug 2025 09:50:34 +0930 Subject: [PATCH 4/9] update acknowledgements.md --- acknowledgements.md | 22 ++++++++++++++++++++-- 1 file changed, 20 insertions(+), 2 deletions(-) diff --git a/acknowledgements.md b/acknowledgements.md index f111f18..cd56043 100755 --- a/acknowledgements.md +++ b/acknowledgements.md @@ -9,11 +9,29 @@ The ABLeS program should be both cited and acknowledged in any publication, pres 1. Use the following citation: -> Gustafsson, Ove Johan Ragnar, Al Bkhetan, Ziad, Francis, Rhys & Manos, Steven. (2023). *Enabling national step changes in bioinformatics through ABLeS, the Australian BioCommons Leadership Share (3.0).* Zenodo. [https://doi.org/10.5281/zenodo.10139651](https://doi.org/10.5281/zenodo.10139651) +``` +Gustafsson, Ove Johan Ragnar, Al Bkhetan, Ziad, Francis, Rhys & Manos, Steven. (2023). *Enabling national step changes in bioinformatics through ABLeS, the Australian BioCommons Leadership Share (3.0).* Zenodo. [https://doi.org/10.5281/zenodo.10139651](https://doi.org/10.5281/zenodo.10139651) +``` +``` +@misc{gustafsson_2023_10139651, + author = {Gustafsson, Ove Johan Ragnar and Al Bkhetan, Ziad and Francis, Rhys and Manos, Steven}, + title = {Enabling national step changes in bioinformatics through ABLeS, the Australian BioCommons Leadership Share}, + month = Nov, + year = 2023, + publisher = {Zenodo}, + version = {3.0}, + doi = {10.5281/zenodo.10139651}, + url = {https://doi.org/10.5281/zenodo.10139651}, +} +``` + +{:start="2"} 2. Use the following acknowledgement statement: ->"The authors acknowledge the provision of computing and data resources provided by the Australian BioCommons Leadership Share (ABLeS) program. This program is co-funded by Bioplatforms Australia (enabled by NCRIS), the National Computational Infrastructure and Pawsey Supercomputing Research Centre." +``` +"The authors acknowledge the provision of computing and data resources provided by the Australian BioCommons Leadership Share (ABLeS) program. This program is co-funded by Bioplatforms Australia (enabled by NCRIS), the National Computational Infrastructure and Pawsey Supercomputing Research Centre." +``` ## ABLeS co-authorship policy From b03b08a9512c42816df5b167c57cf7473007a0a4 Mon Sep 17 00:00:00 2001 From: GiorgiaMori Date: Fri, 15 Aug 2025 12:20:36 +1000 Subject: [PATCH 5/9] add ABLeS participants --- participants/bpa.md | 47 +++++++++++++++++++++++++++++++++++++ participants/minderoo.md | 38 ++++++++++++++++++++++++++++++ participants/platypus.md | 50 ++++++++++++++++++++++++++++++++++++++++ participants/pmcc.md | 35 ++++++++++++++++++++++++++++ participants/scu.md | 40 ++++++++++++++++++++++++++++++++ participants/sih.md | 42 +++++++++++++++++++++++++++++++++ participants/usyd-lo.md | 47 +++++++++++++++++++++++++++++++++++++ participants/uwa-hp.md | 35 ++++++++++++++++++++++++++++ 8 files changed, 334 insertions(+) create mode 100644 participants/bpa.md create mode 100644 participants/minderoo.md create mode 100644 participants/platypus.md create mode 100644 participants/pmcc.md create mode 100644 participants/scu.md create mode 100644 participants/sih.md create mode 100644 participants/usyd-lo.md create mode 100644 participants/uwa-hp.md diff --git a/participants/bpa.md b/participants/bpa.md new file mode 100644 index 0000000..f7ef516 --- /dev/null +++ b/participants/bpa.md @@ -0,0 +1,47 @@ +--- +title: Bioplatforms Australia +description: The Avian Genomics Initiative will fill genomic data gaps for investigating and managing unique Australian bird species. +toc: false +type: ABLeS Participant +--- + +## Project title + +Australian Avian Genomics Initiative + +## Collaborators and funding + +Funding led by Bioplatforms Australia, additional funding and collaboration from the Australian research community. + +## Contact(s) + +- Sophie Mazard, Bioplatforms Australia, +- Anna Kearns, CSIRO, + +## Project description and aims + +Australia is home to approximately 830 species of birds, of which 43% are endemics found nowhere else. Despite substantial international efforts in avian genomics and phylogenomics, significant gaps remain in reference data for Australian bird species. +An audit of existing data found 687 reference genomes for Australian birds. Out of the 107 families of birds found in Australia, 41 families of native or endemic birds lack Australian representative reference genomes. Filling these gaps in referential data would significantly enhance our understanding of genomics, ecology, and behaviour for species and functional traits unique to Australia. + +The Australian Avian Genomics Initiative, established in 2023, aims to: + +- Build genomic data for bird conservation: develop data on phylogenomics, reference genomes, and population genetics to support the understanding and conservation of Australia’s unique bird species. +- Advance fundamental bird genomics research: explore key species traits relevant to Australia, including migration, nectarivory, drought tolerance, cooperative breeding, and more. +- Address critical biodiversity needs: use genomics to complement fundamental research and meet the needs identified by society, government, and industry." + +## How is ABLeS supporting this work? + +Infrastructure choice: Unsure (Pawsey?) +Quarterly Service Units (estimate): ? +Storage: requirements for long-term (during the project) and scratch: yes? +Is access to GPUs required?: Yes? +Expected number of users: ? 19 projects have been supported through the initiative so far +Expected duration of the project: 3 years + +## Expected outputs enabled by participation in ABLeS + +Datasets (Umbrella Bioproject ID: PRJNA1098052, [https://data.bioplatforms.com/organization/aus-avian](https://data.bioplatforms.com/organization/aus-avian)) publications ([https://doi.org/10.25953/740m-0320](https://doi.org/10.25953/740m-0320)). + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ diff --git a/participants/minderoo.md b/participants/minderoo.md new file mode 100644 index 0000000..8a7d819 --- /dev/null +++ b/participants/minderoo.md @@ -0,0 +1,38 @@ +--- +title: Bioplatforms Australia, Minderoo Foundation, Fish Genomics Consortium +description: The Australian Fish Genomics Initiative seeks to address critical deficiencies in available genomic data for Australian fishes. By generating high-quality genomic data, this work will provide a foundation for improved population monitoring, environmental assessment, and the sustainable management of fisheries and aquatic ecosystems. +toc: false +type: ABLeS Participant +--- + +## Project title + +Australian Fish Genomics Initiative + +## Collaborators and funding + +- Bioplatforms Australia +- Minderoo Foundation + +## Contact(s) + +- Amy Tims, UniMelb, +- Sophie Mazard, Bioplatforms Australia, +- Luciano Beheregaray, Flinders University, +- Shannon Corrigan, Minderoo Foundation, + +## Project description and aims + +[https://bioplatforms.com/project/australian-fish-genomics-initiative/](https://bioplatforms.com/project/australian-fish-genomics-initiative/) + +## How is ABLeS supporting this work? + +Pawsey, ?, ?, ?, ~50 users, 2 years + +## Expected outputs enabled by participation in ABLeS + +Reference genomes for ~80 species, transcriptomes, short read data for population genetics. Raw data stored on Bioplatforms data portal. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ diff --git a/participants/platypus.md b/participants/platypus.md new file mode 100644 index 0000000..854a48a --- /dev/null +++ b/participants/platypus.md @@ -0,0 +1,50 @@ +--- +title: Australian Proteome Analysis Facility, Macquarie University +description: We will use protein mass spectrometry and transcriptomics to identify the constituent proteins of platypus venom. Candidate proteins identified will then be analysed and screened in assays, to determine their function, associated envenomation symptoms, and potential as novel therapeutics. +toc: false +type: ABLeS Participant +--- + +## Project title + +Characterization of the Platypus Venom Proteome for Novel Proteins and Therapeutic Candidates + +## Collaborators and funding + +Australasian Wildlife Genomics Group, University of Sydney + +## Contact(s) + +- Adele Gonsalvez, Australasian Wildlife Genomics Group, +- Emma Peel, Australasian Wildlife Genomics Group, +- Carolyn Hogg, Australasian Wildlife Genomics Group, +- Sophie Mazard, Bioplatforms Australia, +- Meena Mikhael, Australian Proteome Analysis Facility, +- Natalie Saez, Institute for Molecular Bioscience, + +## Project description and aims + +This project will employ a comprehensive proteogenomic strategy to identify and evaluate novel therapeutic proteins from platypus venom. We will integrate RNA sequencing (RNA-seq) data with liquid chromatography-tandem mass spectrometry (LC-MS/MS) to create a detailed map of the venom's protein composition. Venom-derived RNA from this study, supplemented with 61 publicly available platypus tissue samples, will be assembled against the high-quality reference genome to generate a custom, tissue-specific protein database. This database will enable high-confidence identification of proteins from LC-MS/MS analysis of venom fluid. + +The aims include: +1) *Develop a Platypus Proteome Database*: To assemble transcript sequences from both novel and publicly available RNA-seq data and generate a comprehensive protein sequence database using the Pawsey Setonix cluster. +2) *Identify Venom Proteins*: To analyze platypus venom fluid using LC-MS/MS and identify its constituent proteins by searching against the custom-generated proteome database. This proteomic database aims to provide novel insights into platypus venom composition, and our understanding of the platypus venom system. +3) *Evaluate Functionality and Therapeutic Potential*: To clone, express, and purify the identified venom proteins for use in functional assays, to attribute their functionality, associated envenomation symptoms, and suitability for therapeutic development. + +## How is ABLeS supporting this work? + +Infrastructure choice: Pawsey +Quarterly Service Units (estimate): 75 kSU +Storage: 3 Tb storage and 1 Tb long term storage +Is access to GPUs required?: No +Expected number of users: 2 +Expected duration of the project: 4 months +We are ready to start using the resources + +## Expected outputs enabled by participation in ABLeS + +It is expected that novel platypus venom proteins will be identified through this proteogenomic strategy, and results will subsequently be published in a peer-reviewed academic journal. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ diff --git a/participants/pmcc.md b/participants/pmcc.md new file mode 100644 index 0000000..3e26271 --- /dev/null +++ b/participants/pmcc.md @@ -0,0 +1,35 @@ +--- +title: Transcriptome Methods Development at Peter MacCallum Cancer Centre +description: The dysregulation of splicing is a hallmark of cancer. Comparing local RNA-seq cohort data to large-scale global splicing databases remains challenging due to incompatible and specialised processing pipelines. This project aims to uniformly process local reference cohort data to allow efficient comparison with major international pre-computed datasets. +toc: false +type: ABLeS Participant +--- + +## Project title + +Standardised processing of cancer cohort RNA-seq data for streamlined analysis and discovery + +## Collaborators and funding + +- [Peter MacCallum Cancer Centre](https://www.petermac.org/) +- [Children’s Cancer Institute](https://www.ccia.org.au/) + +## Contact(s) + +Andrew Lonsdale, Peter MacCallum Cancer Centre, + +## Project description and aims + +Leveraging these summarised databases effectively with patient and research cohorts requires private RNA-seq samples to be processed using the same pipelines. The Snapcount and Recount3 common backend workflow differs from clinical and standard pipelines, and requires raw data to be reanalyzed and processed to directly comparable with the pre-computed public databases. The goal of this project is to develop a systematic method to: transfer files to national compute, process private samples using a common workflow, summarise at cohort level, and export results back for analysis. + +## How is ABLeS supporting this work? + +Quarterly allocation of 50000 SU and 5TB of long term storage. + +## Expected outputs enabled by participation in ABLeS + +The major outcome of this project would be a Snapcount3 compatible resource via a Snaptron server. This will give an API for querying splicing and metadata across across. paediatric caner cohorts for researchers to use. This will result in both a web accessible database for querying, and an associated paper disseminating key findings. A Snaptron web service running under Docker will be deployed on institutional virtual machines to store and disseminate the data. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ diff --git a/participants/scu.md b/participants/scu.md new file mode 100644 index 0000000..6652500 --- /dev/null +++ b/participants/scu.md @@ -0,0 +1,40 @@ +--- +title: Tobias Kretzschmar, Southern Cross University +description: This project aims to contribute to the improvement of hemp as a crop, particularly in terms of seed production, through quantitative genetic and functional genomic techniques. +toc: false +type: ABLeS Participant +--- + +## Project title + +Determination of genetic basis of sex expression in hemp (Cannabis sativa) + +## Collaborators and funding + +This project is funded by the Australian Research Council (ARC) Linkage project LP240200616 (Swinging both ways – the genetic control of sex expression in hemp) + +## Contact(s) + +- Stephen Siazon, Southern Cross University, +- Locedie Mansueto, Southern Cross University, +- Tobias Kretzschmar, Southern Cross University, + +## Project description and aims + +Hemp (low THC Cannabis sativa) is an emerging Australian crop that produces high-quality edible oils and plant-based protein from seeds. Hemp typically has separate male and female plants, with 50% of the crop being males that don’t produce seed, causing low and variable yields. + +This project will characterize novel sex-determining genetic factors in hemp, using quantitative genetic and functional genomic approaches. This includes Quantitative Trait Locus (QTL) mapping, Bulk Segregant Analysis via sequencing (BSAseq), transcriptomic analysis via RNA sequencing, DNA resequencing and potentially de-novo assemblies of Cannabis genomes. + +Project outcomes include enhanced knowledge on hemp sex expression, novel hemp crop technologies and associated germplasm that will deliver significant increases to seed yields. + +## How is ABLeS supporting this work? + +NCI; Quarterly Service Units: 20kSUs; 2TB long term storage; 5 TB scratch; GPU: yes; number of users: 2-3; project duration: 6-12 months + +## Expected outputs enabled by participation in ABLeS + +Outputs will be published in peer-reviewed journals and genomics data will be submitted in appropriate public repositories. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ diff --git a/participants/sih.md b/participants/sih.md new file mode 100644 index 0000000..ac4d679 --- /dev/null +++ b/participants/sih.md @@ -0,0 +1,42 @@ +--- +title: Sydney Informatics Hub (SIH) +description: Testing and development of exercises for Nextflow for HPC workshop, to demonstrate how beginners can scale their Nextflow pipelines. +toc: false +type: ABLeS Participant +--- + +## Project title + +Nextflow for HPC Workshop: Testing and Development + +## Collaborators and funding + +- Sydney Informatics Hub (SIH) +- Australian BioCommons +- Pawsey +- NCI + +## Contact(s) + +- Fred Jaya, SIH, +- Michael Geaghan, SIH, + +## Project description and aims + +"The ""Nextflow for HPC"" workshop will be co-delivered in November 2025 on Setonix and Gadi. The workshop will focus on how users can scale and optimise workflows on these systems. The aim of this project is to test and develop follow-along exercises that demonstrate how users can configure Nextflow pipelines to run with an increasing number of samples, and resources (CPU, RAM, wall time). We expect trial and error with different configuration settings, and toy data sets to ensure that the workshop builds upon introduced concepts, and how users can apply these to their own pipelines. + +Analyses conducted within this project should not require a lot of walltime or memory, to fit the time restrictions of a workshop. It will require jobs with multiple CPUs to demonstrate parallelisation and scalability of Nextflow. Test data used will be subset RNA-seq paired-end reads. + +GitHub repository: [https://github.com/Sydney-Informatics-Hub/nextflow-hpc-workshop](https://github.com/Sydney-Informatics-Hub/nextflow-hpc-workshop) + +## How is ABLeS supporting this work? + +Both NCI and Pawsey required; 10 kSU/quarter; GPUs not required; Number of users: 5-10; Duration: Until the end of 2025; long term object storage not required, scratch space will be required to store sequencing files (~10 GB) and temporary outputs of nextflow runs (~5GB). + +## Expected outputs enabled by participation in ABLeS + +All BioCommons training is archived on Zenodo, and the workshop material will be publicly available on the Sydney Informatics Hub GitHub and GitHub Pages. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ \ No newline at end of file diff --git a/participants/usyd-lo.md b/participants/usyd-lo.md new file mode 100644 index 0000000..d91235e --- /dev/null +++ b/participants/usyd-lo.md @@ -0,0 +1,47 @@ +--- +title: Nathan Lo, University of Sydney +description: The proposed project will investigate the genomic architecture, evolutionary history and population structure of multiple Blattodean species. This has substantial implications for our understanding of chromosome evolution, inbreeding and parallel evolution, as well as useful results for the conservation of insect biodiversity in Australia and Japan. +toc: false +type: ABLeS Participant +--- + +## Project title + +Testing links between life-history and genome evolution + +## Collaborators and funding + +- [The University of Sydney](https://meep.sydney.edu.au/) +- [The University of Melbourne](https://www.wehi.edu.au/researcher/aaron-jex/) +- Australian Research Council Discovery Project DP240102805; [http://www.arc.gov.au/](http://www.arc.gov.au/) + +## Contact(s) + +- Maxim Adams, +- Nathan Lo, + +## Project description and aims + +Our aims are to: +- Assemble high-quality genomes from species across the subfamily Panesthiinae with which to investigate the presence of genome-wide parallel evolution associated with soil-burrowing behaviour. If detected, this would represent among the first evidence that complex behavioural traits can emerge via parallel molecular trajectories. +- Assemble a high-quality reference genome for the Japanese termite Glyptotermes nakajimai against which to align and call genome-wide SNPs in samples across the species’ range. These data will then be used to investigate population structure, inbreeding and genetic diversity in different colonies and populations. When combined with karyotype data, we will novelly test for an association between inbreeding and neo-sex chromosome formation. +- Perform maximum-likelihood and Bayesian phylogenetic analyses to contextualise the previous results against the lineages’ evolutionary histories. These methods will also shed light into the biogeographic origins of the species. + +We request >= 128 GB of RAM (due to the computational demands of these bioinformatic pipelines) and >= 2 TB of storage (due to the large size of raw genomic data). Requested dependencies include: GATK, PLINK, StringTie, Flye, bedtools, samtools, MrBayes, BEAST, IQ-TREE and STRUCTURE. + +## How is ABLeS supporting this work? + +Infrastructure choice: Pawsey +Quarterly Service Units (estimate): 100 kSUs +Storage: >= 2 Tb +Is access to GPUs required?: Not initially, but we may need them in due course (in a month or so, but it will depend on the approach our new computational specialist wants to use for analyses). +Expected number of users: 4 +Expected duration of the project: 3 years + +## Expected outputs enabled by participation in ABLeS + +Publications in scientific journals such as Current Biology, Molecular Biology and Evolution, PNAS, Evolution. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ diff --git a/participants/uwa-hp.md b/participants/uwa-hp.md new file mode 100644 index 0000000..980e678 --- /dev/null +++ b/participants/uwa-hp.md @@ -0,0 +1,35 @@ +--- +title: Harry Perkins Institute of Medical Research, The University of Western Australia +description: This project develops a novel approach using high-resolution confocal imaging of chromatin and epigenetics in single cells to derive cell-type-specific signatures of cellular identity and state, such as biological age, and to approximate organ and organism function. The predictive value of these signatures, demonstrated in mice, is now being expanded to human tissues. +toc: false +type: ABLeS Participant +--- + +## Project title + +Biomarkers of Aging and Function + +## Collaborators and funding + +[FHRIF](https://fhrifund.health.wa.gov.au) + +## Contact(s) + +- Dr Kenta Ninomiya, Harry Perkins Institute of Medical Research, The University of Western Australia, +- Dr Alexey Terskikh, Harry Perkins Institute of Medical Research, The University of Western Australia, + +## Project description and aims + +This project introduces a novel method to study aging and age-related functional decline by imaging chromatin and epigenetic modifications in single cells. Using high-resolution confocal microscopy, we extract 3D patterns of epigenetic marks to generate cell-type-specific signatures of cellular identity and states, including biological age. We have demonstrated that these signatures can effectively approximate organ and organismal function. Having validated the predictive power of these chromatin and epigenetic signatures in mouse models, our current efforts focus on extending these findings to a diverse range of human tissues and cell types. + +## How is ABLeS supporting this work? + +This work is supported through the Production Bioinformatics scheme provided by ABLeS. The support includes 4 TB long term storage and 50 KSUs per quarter. + +## Expected outputs enabled by participation in ABLeS + +We aim to publish our findings in high-impact journals such as Nature, Nature Aging, or Cell. The research data will be made publicly available through platforms like The Open Science Framework. Developed analysis methods and corresponding codes developed during this project will be shared on GitHub. + +
+ +> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ From d6dfec22c5eed19d986dd9c306c136555ff58d41 Mon Sep 17 00:00:00 2001 From: Ziad Al-Bkhetan Date: Fri, 15 Aug 2025 14:57:13 +1000 Subject: [PATCH 6/9] Testing case sensitivity --- _config.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/_config.yml b/_config.yml index 2d82220..bc9e505 100755 --- a/_config.yml +++ b/_config.yml @@ -2,7 +2,7 @@ title: ABLeS topnav_title: ABLeS description: ABLeS -remote_theme: ELIXIR-Belgium/elixir-toolkit-theme@2.4.0 +remote_theme: ELIXIR-Belgium/elixir-toolkit-theme@typo_case_senstitive permalink: pretty gtag: G-T8GWK081T9 From 3a2af71726cca383796a9e3f89ce0a3ec1509e56 Mon Sep 17 00:00:00 2001 From: Mitchob Date: Mon, 25 Aug 2025 14:16:02 +1000 Subject: [PATCH 7/9] Add updated kraken2 databases --- if89.md | 16 +++++++++++++++- 1 file changed, 15 insertions(+), 1 deletion(-) diff --git a/if89.md b/if89.md index 4041099..21ce992 100755 --- a/if89.md +++ b/if89.md @@ -122,7 +122,7 @@ The list of tools available through the Australian BioCommons Shared Tools and W Some of the databases required by different bioinformatics software tools are made available through the if89 project. They are located at /g/data/if89/data_library. You can request other databases to be included by [contacting us](https://australianbiocommons.github.io/ables/contact-us/). -A list of the currently available (`as of 25 Jan 2023`) databases is included below: +A list of the currently available (`as of 25 Aug 2025`) databases is included below:
@@ -221,6 +221,20 @@ A list of the currently available (`as of 25 Jan 2023`) databases is included be Kraken2 pre-built index for RefSeq database (archaea, bacteria, viral, plasmid, human, protozoa & fungi) plus UniVec_Core. + + Kraken2 + Kraken 2, KrakenUniq and Bracken indexes + 9 Jun 2025 + kraken2/09062025/k2_core_nt + Kraken2 Very large collection, inclusive of GenBank, RefSeq, TPA and PDB. + + + Kraken2 + Kraken 2, KrakenUniq and Bracken indexes + 9 Jun 2025 + kraken2/09062025/k2_standard + Kraken2 pre-built "standard" database, includes RefSeq archaea, bacteria, viral, plasmid, human, plus UniVec_Core. + Dfam Dfam From 768615a2dd861df5f590414febd4db2d1632e9d3 Mon Sep 17 00:00:00 2001 From: ziadbkh Date: Wed, 27 Aug 2025 09:14:11 +1000 Subject: [PATCH 8/9] update participants info --- participants/bpa.md | 7 +------ participants/minderoo.md | 2 +- participants/platypus.md | 8 +------- participants/pmcc.md | 2 +- participants/scu.md | 2 +- participants/sih.md | 42 ---------------------------------------- participants/usyd-lo.md | 7 +------ 7 files changed, 6 insertions(+), 64 deletions(-) delete mode 100644 participants/sih.md diff --git a/participants/bpa.md b/participants/bpa.md index f7ef516..856fe95 100644 --- a/participants/bpa.md +++ b/participants/bpa.md @@ -31,12 +31,7 @@ The Australian Avian Genomics Initiative, established in 2023, aims to: ## How is ABLeS supporting this work? -Infrastructure choice: Unsure (Pawsey?) -Quarterly Service Units (estimate): ? -Storage: requirements for long-term (during the project) and scratch: yes? -Is access to GPUs required?: Yes? -Expected number of users: ? 19 projects have been supported through the initiative so far -Expected duration of the project: 3 years +This project is supported by the Reference Data Generation scheme provided by ABLeS, providing access to compute and storage resources at the National Computational Infrastructure (NCI) and the Pawsey Supercomputing Research Centre. ## Expected outputs enabled by participation in ABLeS diff --git a/participants/minderoo.md b/participants/minderoo.md index 8a7d819..0503b76 100644 --- a/participants/minderoo.md +++ b/participants/minderoo.md @@ -27,7 +27,7 @@ Australian Fish Genomics Initiative ## How is ABLeS supporting this work? -Pawsey, ?, ?, ?, ~50 users, 2 years +This project is supported by the Reference Data Generation scheme provided by ABLeS, providing access to compute and storage resources at the National Computational Infrastructure (NCI) and the Pawsey Supercomputing Research Centre. ## Expected outputs enabled by participation in ABLeS diff --git a/participants/platypus.md b/participants/platypus.md index 854a48a..c99e8e2 100644 --- a/participants/platypus.md +++ b/participants/platypus.md @@ -33,13 +33,7 @@ The aims include: ## How is ABLeS supporting this work? -Infrastructure choice: Pawsey -Quarterly Service Units (estimate): 75 kSU -Storage: 3 Tb storage and 1 Tb long term storage -Is access to GPUs required?: No -Expected number of users: 2 -Expected duration of the project: 4 months -We are ready to start using the resources +This project is supported by the Production Bioinformatics scheme provided by ABLeS, providing access to compute and storage resources at the Pawsey Supercomputing Research Centre. ## Expected outputs enabled by participation in ABLeS diff --git a/participants/pmcc.md b/participants/pmcc.md index 3e26271..8b95b15 100644 --- a/participants/pmcc.md +++ b/participants/pmcc.md @@ -24,7 +24,7 @@ Leveraging these summarised databases effectively with patient and research coho ## How is ABLeS supporting this work? -Quarterly allocation of 50000 SU and 5TB of long term storage. +This project is supported by the Production Bioinformatics scheme provided by ABLeS, providing access to compute and storage resources at the Pawsey Supercomputing Research Centre. ## Expected outputs enabled by participation in ABLeS diff --git a/participants/scu.md b/participants/scu.md index 6652500..e6a1bc9 100644 --- a/participants/scu.md +++ b/participants/scu.md @@ -29,7 +29,7 @@ Project outcomes include enhanced knowledge on hemp sex expression, novel hemp c ## How is ABLeS supporting this work? -NCI; Quarterly Service Units: 20kSUs; 2TB long term storage; 5 TB scratch; GPU: yes; number of users: 2-3; project duration: 6-12 months +This project is supported by the Production Bioinformatics scheme provided by ABLeS, providing access to compute and storage resources at the National Computational Infrastructure (NCI). ## Expected outputs enabled by participation in ABLeS diff --git a/participants/sih.md b/participants/sih.md deleted file mode 100644 index ac4d679..0000000 --- a/participants/sih.md +++ /dev/null @@ -1,42 +0,0 @@ ---- -title: Sydney Informatics Hub (SIH) -description: Testing and development of exercises for Nextflow for HPC workshop, to demonstrate how beginners can scale their Nextflow pipelines. -toc: false -type: ABLeS Participant ---- - -## Project title - -Nextflow for HPC Workshop: Testing and Development - -## Collaborators and funding - -- Sydney Informatics Hub (SIH) -- Australian BioCommons -- Pawsey -- NCI - -## Contact(s) - -- Fred Jaya, SIH, -- Michael Geaghan, SIH, - -## Project description and aims - -"The ""Nextflow for HPC"" workshop will be co-delivered in November 2025 on Setonix and Gadi. The workshop will focus on how users can scale and optimise workflows on these systems. The aim of this project is to test and develop follow-along exercises that demonstrate how users can configure Nextflow pipelines to run with an increasing number of samples, and resources (CPU, RAM, wall time). We expect trial and error with different configuration settings, and toy data sets to ensure that the workshop builds upon introduced concepts, and how users can apply these to their own pipelines. - -Analyses conducted within this project should not require a lot of walltime or memory, to fit the time restrictions of a workshop. It will require jobs with multiple CPUs to demonstrate parallelisation and scalability of Nextflow. Test data used will be subset RNA-seq paired-end reads. - -GitHub repository: [https://github.com/Sydney-Informatics-Hub/nextflow-hpc-workshop](https://github.com/Sydney-Informatics-Hub/nextflow-hpc-workshop) - -## How is ABLeS supporting this work? - -Both NCI and Pawsey required; 10 kSU/quarter; GPUs not required; Number of users: 5-10; Duration: Until the end of 2025; long term object storage not required, scratch space will be required to store sequencing files (~10 GB) and temporary outputs of nextflow runs (~5GB). - -## Expected outputs enabled by participation in ABLeS - -All BioCommons training is archived on Zenodo, and the workshop material will be publicly available on the Sydney Informatics Hub GitHub and GitHub Pages. - -
- -> _These details have been provided by project members at project initiation. For more information on the project, please consult the contact(s) or project links above._ \ No newline at end of file diff --git a/participants/usyd-lo.md b/participants/usyd-lo.md index d91235e..6d235ec 100644 --- a/participants/usyd-lo.md +++ b/participants/usyd-lo.md @@ -31,12 +31,7 @@ We request >= 128 GB of RAM (due to the computational demands of these bioinform ## How is ABLeS supporting this work? -Infrastructure choice: Pawsey -Quarterly Service Units (estimate): 100 kSUs -Storage: >= 2 Tb -Is access to GPUs required?: Not initially, but we may need them in due course (in a month or so, but it will depend on the approach our new computational specialist wants to use for analyses). -Expected number of users: 4 -Expected duration of the project: 3 years +This project is supported by the Production Bioinformatics scheme provided by ABLeS, providing access to compute and storage resources at the Pawsey Supercomputing Research Centre. ## Expected outputs enabled by participation in ABLeS From 049c02d16257b9f9f206f079c0d7d718f7ae673f Mon Sep 17 00:00:00 2001 From: ziadbkh Date: Wed, 27 Aug 2025 09:19:18 +1000 Subject: [PATCH 9/9] update theme --- _config.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/_config.yml b/_config.yml index bc9e505..5334632 100755 --- a/_config.yml +++ b/_config.yml @@ -2,7 +2,7 @@ title: ABLeS topnav_title: ABLeS description: ABLeS -remote_theme: ELIXIR-Belgium/elixir-toolkit-theme@typo_case_senstitive +remote_theme: ELIXIR-Belgium/elixir-toolkit-theme@5.0.0 permalink: pretty gtag: G-T8GWK081T9