Address spec gaps identified from cdx-pandoc analysis #15
Merged
gvonness-apolitical merged 2 commits into main on Feb 1, 2026
- Add semantic.schema.json for validating semantic extension blocks/marks
- Extend citation mark with CSL-compatible fields (prefix, suffix, locator, suppressAuthor)
- Add source field to entity mark for knowledge graph origin
- Add creators field to Dublin Core for structured author data with ORCID
- Document bibliography block inline entries with renderedText support
- Clarify linebreak semantics (soft breaks vs hard breaks)
- Document tableCell simplified form for text node children
- Document core vs semantic measurement type relationship
- Update CI workflow to validate semantic schema

- Add package.json with npm test for schema and example validation
- Add scripts/validate-schemas.ts to compile all JSON schemas
- Add scripts/validate-examples.ts to validate example documents
- Simplify CI workflow to use npm test
- Use Ajv 2020-12 draft for JSON Schema validation
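To make the CSL-compatible citation fields above more concrete, here is a minimal TypeScript sketch of how a consumer might apply them when rendering an author-date citation. Only the field names `prefix`, `suffix`, `locator`, and `suppressAuthor` come from this PR; the `type` and `ref` properties, the `CitedSource` shape, and the `renderCitation` helper are hypothetical and are not taken from the spec or from cdx-pandoc.

```typescript
// Hypothetical citation mark shape. Only prefix, suffix, locator and
// suppressAuthor are named in the PR; the other properties are assumed.
interface CitationMark {
  type: "citation";
  ref: string; // assumed: key of the cited bibliography entry
  prefix?: string;
  suffix?: string;
  locator?: string;
  suppressAuthor?: boolean;
}

// Assumed minimal view of a cited source, for illustration only.
interface CitedSource {
  author: string;
  year: number;
}

// Render a simple author-date citation, honouring the four CSL-style fields.
function renderCitation(mark: CitationMark, source: CitedSource): string {
  // suppressAuthor drops the author name and keeps only the year.
  const core = mark.suppressAuthor
    ? String(source.year)
    : source.author + " " + String(source.year);

  const parts: string[] = [];
  if (mark.prefix) parts.push(mark.prefix);
  // The locator (e.g. a page reference) follows the core, comma-separated.
  parts.push(mark.locator ? core + ", " + mark.locator : core);
  if (mark.suffix) parts.push(mark.suffix);

  return "(" + parts.join(" ") + ")";
}

const smith: CitedSource = { author: "Smith", year: 2020 };

console.log(
  renderCitation(
    { type: "citation", ref: "smith2020", prefix: "see", locator: "p. 42" },
    smith
  )
); // (see Smith 2020, p. 42)

console.log(
  renderCitation(
    { type: "citation", ref: "smith2020", locator: "p. 42", suppressAuthor: true },
    smith
  )
); // (2020, p. 42)
```

A real implementation would normally hand these fields to a CSL processor instead of concatenating strings; the sketch only shows what each field contributes.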
Summary
This PR addresses specification gaps identified by analyzing the cdx-pandoc implementation against the current codex-file-format-spec. The changes ensure better alignment between the spec and real-world implementations.
Priority 1: High Impact
- `schemas/semantic.schema.json` - New JSON schema for validating semantic extension blocks and marks (citation, footnote, entity, glossary, bibliography, term, ref, measurement)
- Citation mark CSL fields - `prefix`, `suffix`, `locator`, `suppressAuthor`
- Bibliography `entries` field - Support inline CSL JSON entries with `renderedText` for citeproc output
Priority 2: Medium Impact
- Entity mark `source` field - Indicate knowledge graph origin (e.g., "wikidata", "dbpedia")
- `creators` field in Dublin Core - Structured author data with ORCID, affiliation, and email support
Priority 3: Low Impact
- Linebreak semantics - soft breaks (`\n` in text) vs hard breaks (`break` block)
- Measurement types - `measurement` (metrology) vs `semantic:measurement` (linked data)
Test plan
ajv compile --spec=draft2020
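
Schema compilation checks structure only. As an illustration of the kind of check a consumer might layer on top of the Dublin Core creators data described in this PR, the TypeScript sketch below validates an ORCID iD using the ISO 7064 MOD 11-2 check character that ORCID iDs carry. The `Creator` interface and its property names (`name`, `orcid`, `affiliation`, `email`) are assumptions based on the PR description, not copied from the schema, and the helper functions are hypothetical.

```typescript
// Hypothetical shape of one structured creator entry; property names are
// assumed from the PR description, not taken from the schema.
interface Creator {
  name: string;
  orcid?: string; // e.g. "0000-0002-1825-0097"
  affiliation?: string;
  email?: string;
}

// Compute the ISO 7064 MOD 11-2 check character over the first 15 digits.
function orcidCheckChar(first15: string): string {
  let total = 0;
  for (let i = 0; i < first15.length; i++) {
    total = (total + Number(first15.charAt(i))) * 2;
  }
  const result = (12 - (total % 11)) % 11;
  return result === 10 ? "X" : String(result);
}

// Accept only the hyphenated 16-character form with a matching check character.
function isValidOrcid(orcid: string): boolean {
  if (!/^\d{4}-\d{4}-\d{4}-\d{3}[\dX]$/.test(orcid)) return false;
  const compact = orcid.replace(/-/g, "");
  return orcidCheckChar(compact.slice(0, 15)) === compact.charAt(15);
}

// Collect human-readable problems for a creator entry (empty array = OK).
function creatorProblems(creator: Creator): string[] {
  const problems: string[] = [];
  if (creator.name.trim() === "") problems.push("name is empty");
  if (creator.orcid !== undefined && !isValidOrcid(creator.orcid)) {
    problems.push("orcid fails format or checksum");
  }
  return problems;
}

console.log(isValidOrcid("0000-0002-1825-0097")); // true
console.log(isValidOrcid("0000-0002-1825-0098")); // false
console.log(creatorProblems({ name: "Example Author", orcid: "0000-0002-1694-233X" })); // []
```

A JSON Schema `pattern` can enforce the hyphenated format, but the checksum needs code like this, which is why the check is kept separate from schema validation in this sketch.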