Releases: hazelian0619/protian-entity
Releases · hazelian0619/protian-entity
RNA L1/L2 v2
RNA L1/L2 v2
This release publishes the RNA dataset artifacts that are intentionally excluded from git (data/output/**, pipelines/**/reports/**).
Included
- 16 compressed RNA tables (
*.tsv.gz) - 15 non-sample validation reports (all PASS)
- key audit artifacts for tRNA v2 and ENST↔URS coverage
manifest.json(row counts, checksums, commit)SHA256SUMS.txt
Validation summary
- non-sample validations: 15/15 PASS
Notes
- Structural low-coverage items (ENST↔URS, Rfam full coverage, PDB sparsity) are data-source constraints.
- Active optimization line remains tRNA anticodon (v2 included in this release).
Interaction L2 v1
Interaction L2 v1
Interaction-only release aligned with PR #8 merge scope.
Included evidence tables (compressed .tsv.zst):
- ppi_method_context_v2
- ppi_function_context_v2
- psi_activity_context_v2
- psi_structure_evidence_v2
- rpi_site_context_v2
- rpi_domain_context_v2
- rpi_function_context_v2
- interaction_cross_validation_v2
- interaction_aggregate_score_v2
- interaction_ontology_mapping_v2
Quality summary (all PASS):
- A PPI semantic enrichment QA: PASS
- B PSI activity/structure enrichment QA: PASS
- C RPI site/domain/function gates: PASS
- D cross-validation + aggregate score QA: PASS
- E ontology mapping gates: PASS
- interaction_release_local package: PASS
See manifest for sha256 and exact file sizes.
Molecule L1/L2 v2
Molecule L1/L2 v2
This release publishes the molecule v2 data snapshot built from the newly merged pipelines:
molecule_xref_enrichment_v2molecule_3d_registrymolecule_semantic_layermolecule_activity_fusion
Quality gates (local full run)
- xref v2 QA: PASS
chembl_iddelta vs v1: +464pubchem_ciddelta vs v1: +87
- 3D registry v1 QA: PASS
- backlink rate: 1.0
- semantic tags v1 QA: PASS
- ChEBI expandable rate among mapped rows: 0.999858
- activity fusion v2 QA: PASS
- evidence rows: 3,223,952
- edge rows: 2,410,389
- hard gate
chembl_hard_gate_not_degraded: PASS - conflict audit groups: 46,821
Asset integrity
manifest.jsonandSHA256SUMS.txtare included.- Large TSV outputs are distributed as
.tsv.gzrelease assets.
Notes
- Per repository rule,
data/output/**andpipelines/**/reports/**are not committed to git history. - This release is the data-distribution snapshot corresponding to merged PR #7.
RNA L1 v1
RNA L1 v1 (miRNA + mRNA)
Artifacts (download):
- rna_master_v1.tsv.gz
- rna_master_mirna_v1.tsv.gz
- rna_master_mrna_v1.tsv.gz
QA:
- rna_master_v1.validation.json (quality gates)
- manifest.json (sha256 + sizes)
Repo code/docs:
- Branch: feat/rna-l1-v1