Three skill files from the Opening the Ottoman Archive Ottoman Turkish HTR/OCR project, demonstrating a two-stage pipeline for transcribing Ottoman Turkish documents using large language models (LLMs).
-
V3-S-Minimal — Visual capture protocol for use with Google Gemini 3 Pro Preview. Pure script-to-Unicode conversion with zero semantic interpretation. Designed for Perso-Arabic script documents.
-
V3-T-Government-Gazette — Semantic processing protocol for use with Claude Opus 4.5 or 4.6. Takes the visual capture output and performs transcription, transliteration, translation, and named entity recognition.
- S0-V1.2-Layout-Analysis — Document layout diagnosis. Detects and classifies distinct regions of a document image. Results inform which variant of the V3-S and V3-T files to use or how to adapt them.
Additional skill files are under development in collaboration with Ottoman Turkish scholars. Once they have been tested and validated they will be made publicly available with full documentation. These skill files will cover a wide range of script, document type, and genre from the C16th to the early C20th.
You can find a summary description of all our skill files under development in our public wiki.
For research purposes, skill files under development can be requested by contacting Colin Greenstreet at colin.greenstreet@gmail.com.
Part of the Opening the Ottoman Archive initiative. For methodology and documentation, see the project wiki.
