Add and update resource descriptors using bioregistry conventions#335
Add and update resource descriptors using bioregistry conventions#335
Conversation
Add new entries for biocyc, cazy, kegg.pathway, merops.family, and wikipathways. Update existing TCDB entry to use bioregistry prefix, pattern, and URL. All new entries use bioregistry.io resolution URLs and identifier patterns from https://bioregistry.io. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
Note: the old (existing) TCDB entry was not based on bioregistry, and its gid pattern only allowed full 4-character IDs. The new pattern comes from bioregistry and allows for xrefs to 1-, 2- or 3- character IDs as well as full 4-character IDs. |
|
Another note: There is one existing KEGG entry in the yaml file that looks like this: That entry doesn't look right, so I asked Claude:Can you tell what the pre-exisitng KEGG entry was meant to represent? The details don't look good to me. Looking at the existing entry:
It looks like it was meant to represent KEGG pathways, based on:
But there are several problems:
It seems like the new kegg.pathway entry we just added is a proper replacement. Would you like to remove this old KEGG entry? This PR doesn't delete that existing KEGG entry, because I wasn't sure exactly what "ontology_provided_cross_reference" might be using it. But I suspect Claude is right, and it should be deleted (and my new kegg.pathway entry used instead. |
Summary
tcdb), identifier pattern, and resolution URLbioregistry.ioresolution URLs and identifier patterns sourced from https://bioregistry.ioTest plan
gid_patternmatches its correspondingexample_giddefault_urlresolves correctly when the example identifier is substituted🤖 Generated with Claude Code