Available languages: English | Ελληνικά
This repository provides a curated collection of open datasets and ontologies developed within the TALOS AI4SSH project (University of Crete, ERA Chair).
These resources support Digital Humanities, Computational Philology, and Semantic Web research, with a focus on modeling, annotating, and analyzing ancient and modern Greek textual, historical, and cultural data.
The repository currently hosts 10 open datasets/ontologies:
-
Open Dataset 1 – Modeling an Archaeological Site (Göbekli Tepe, Turkey)
Event-centric ontology for archaeological site representation. -
Open Dataset 2 – Archaic Lyric Poetry Ontology (ALyrA)
Ontoterminology for archaic lyric poets (799–430 BC). -
Open Dataset 3 – Modeling Events (Classical Period)
Event ontologies for the Classical period. -
Open Dataset 4 – Modeling Events (Hellenistic Period)
Event ontologies for the Hellenistic period. -
Open Dataset 5 – Ancient Greek and Chinese Philosophers Ontology
Cross-cultural ontology modeling philosophers, their works, and intellectual traditions. -
Open Dataset 6 – Collection of Datasets on Greek NLU (OYXOY)
Benchmarking suite for Greek Natural Language Understanding. -
Open Dataset 7 – Greek Dialect Corpus
A large-scale corpus of Greek dialects (Cypriot, Cretan, Pontic, Northern Greek). -
Open Dataset 8 – Modern Greek Literature
Digitized corpus of interwar poetry and prose. -
Open Dataset 9 – Ancient Oratory Ontology
Ontology of legal proceedings in Athenian courts (419–323 BC). -
Open Dataset 10 – Ontology of Legal Bodies in Classical Athens
Ontology defining legal bodies presiding over/judging cases in Classical Athens.
Each dataset includes:
- Ontology files (OWL/RDF) for use in semantic web applications.
- Ontodictionaries (HTML) with terms and proper names.
- Visualization options (WebVOWL, Protégé).
- SPARQL queries and competency questions.
- Zenodo DOI links for citation and reproducibility.
To explore a dataset:
- Navigate to its corresponding folder.
- Follow the instructions in the dataset-specific README.
- Cite using the provided Zenodo DOI.
If you use resources from this repository, please cite the relevant dataset(s) individually (see each README).
For general reference to the collection:
TALOS AI4SSH Project (2024). TALOS Open Data Repository. University of Crete.
All datasets are distributed under the Creative Commons Attribution–NonCommercial–NoDerivatives (CC BY-NC-ND 4.0) license.
You are free to share and redistribute the material under the following conditions:
- BY: Credit must be given to the creator(s).
- NC: Only non-commercial uses are permitted.
- ND: No derivatives or adaptations are allowed.
Contributions, feedback, and collaborations are warmly welcomed.
Please contact the corresponding authors listed in each dataset’s README or open an issue in this repository.
- TALOS Project Website: talos-ai4ssh.uoc.gr
- Publications and supplementary material available via linked Zenodo DOIs in each dataset.