skill-files-demo

This repository contains three skill files from the Opening the Ottoman Archive Ottoman Turkish HTR/OCR project. They demonstrate a two-stage pipeline for transcribing Ottoman Turkish documents using large language models (LLMs).

Core Pipeline

The core pipeline has two stages: (1) visual capture and (2) semantic processing.

V3-S-Minimal — Visual capture protocol for use with Google Gemini 3 Pro Preview. Pure script-to-Unicode conversion with zero semantic interpretation. Designed for Perso-Arabic script documents.
V3-T-Government-Gazette — Semantic processing protocol for use with Claude Opus 4.5 or 4.6. Takes the visual capture output and performs transcription, transliteration, translation, and named entity recognition.
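
As a rough illustration of how the two stages chain together, the Python sketch below passes a page image through a visual-capture step and then hands the resulting text to a semantic-processing step. Everything in it apart from the two skill file names is an assumption made for this example: `run_pipeline`, `PipelineResult`, and the stub model functions are hypothetical, and no real Gemini or Claude API call is shown.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PipelineResult:
    """Outputs of both stages for one page (hypothetical container)."""
    visual_capture: str   # stage 1: Unicode text in Perso-Arabic script
    semantic_output: str  # stage 2: transcription, transliteration, etc.


def run_pipeline(
    page_image: bytes,
    visual_skill: str,
    semantic_skill: str,
    visual_model: Callable[[str, bytes], str],
    semantic_model: Callable[[str, str], str],
) -> PipelineResult:
    """Run stage 1 on the image, then feed its text output to stage 2."""
    captured = visual_model(visual_skill, page_image)
    processed = semantic_model(semantic_skill, captured)
    return PipelineResult(visual_capture=captured, semantic_output=processed)


# Stub models stand in for real LLM calls so the sketch runs offline.
def fake_visual_model(skill: str, image: bytes) -> str:
    return "عثمانلی"  # placeholder Unicode output, not a real transcription


def fake_semantic_model(skill: str, text: str) -> str:
    return f"processed: {text}"


result = run_pipeline(
    b"placeholder-image-bytes",
    "V3-S-Minimal",
    "V3-T-Government-Gazette",
    fake_visual_model,
    fake_semantic_model,
)
```

Keeping the two model calls as injected functions means the stage order stays fixed while the underlying models can be swapped or mocked for testing.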

Optional Preparatory Step

S0-V1.2-Layout-Analysis — Document layout diagnosis. Detects and classifies distinct regions of a document image. Results inform which variant of the V3-S and V3-T files to use or how to adapt them.
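
One way to picture how layout results could guide the choice of skill files is a simple lookup from region class to a pair of skill files. This is only a sketch: the region labels (`gazette_text`, `marginal_note`), the `Region` type, and the `choose_skills` helper are hypothetical and are not taken from S0-V1.2-Layout-Analysis. Only the two skill file names come from this README.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass
class Region:
    """One detected region of a document image (hypothetical shape)."""
    label: str                       # hypothetical region class
    bbox: Tuple[int, int, int, int]  # left, top, right, bottom in pixels


# Hypothetical mapping: region class -> (visual skill, semantic skill).
SKILLS_BY_LABEL: Dict[str, Tuple[str, str]] = {
    "gazette_text": ("V3-S-Minimal", "V3-T-Government-Gazette"),
}


def choose_skills(region: Region) -> Optional[Tuple[str, str]]:
    """Return the skill pair for a region, or None if none is mapped."""
    return SKILLS_BY_LABEL.get(region.label)


body = Region(label="gazette_text", bbox=(0, 0, 800, 1200))
note = Region(label="marginal_note", bbox=(810, 100, 900, 400))

body_skills = choose_skills(body)
note_skills = choose_skills(note)  # None: no variant mapped, adapt by hand
```

Returning `None` for unmapped regions, rather than falling back to a default, makes it explicit when a skill file would need to be adapted.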

Other Skill Files

Additional skill files are under development in collaboration with Ottoman Turkish scholars. Once they have been tested and validated, they will be made publicly available with full documentation. These skill files will cover a wide range of scripts, document types, and genres from the sixteenth to the early twentieth century.

You can find a summary description of all our skill files under development in our public wiki.

For research purposes, skill files under development can be requested by contacting Colin Greenstreet at colin.greenstreet@gmail.com.

Project

Part of the Opening the Ottoman Archive initiative. For methodology and documentation, see the project wiki.
