An extension, much like the digests one, that lists common identifier formats and how they are stripped for use in storage path generation (RegEx?).
This could also provide mappings from variable length or non-uniformly distributed identifiers to a form better suited for path generation - for example, by using digests/bit scrambling. Or maybe that another extension in its own right.