Build ontology-aware, Wikidata-aligned knowledge graphs from raw text using LLMs
Knowledge Graphs (KGs) provide structured, verifiable representations of knowledge, enabling fact grounding and empowering large language models (LLMs) with up-to-date, real-world information. However, creating high-quality KGs from open-domain text is challenging due to issues like redundancy, inconsistency, and lack of alignment with formal ontologies.
Wikontic is a multi-stage pipeline for constructing ontology-aligned KGs from unstructured text using LLMs and Wikidata. It extracts candidate triples from raw text, then refines them through ontology-based typing, schema validation, and entity deduplication—resulting in compact, semantically coherent graphs.
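
For intuition, the first stage amounts to prompting an LLM for structured triples. The sketch below uses the OpenAI Python client with an assumed prompt, model name, and JSON output schema; it is illustrative only, not the exact prompt used by Wikontic's `LLMTripletExtractor`.

```python
# Minimal sketch of candidate-triple extraction with an LLM (illustrative only).
# The prompt, model name, and JSON schema below are assumptions, not Wikontic's
# actual prompts from utils/openai_utils.py.
import json

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def extract_triples(text: str, model: str = "gpt-4o-mini") -> list[dict]:
    """Ask the model for (subject, relation, object) triples as a JSON object."""
    prompt = (
        "Extract knowledge-graph triples from the text below. Respond with a "
        'JSON object of the form {"triples": [{"subject": "...", '
        '"relation": "...", "object": "..."}]}.\n\n' + text
    )
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        response_format={"type": "json_object"},  # request valid JSON back
    )
    return json.loads(response.choices[0].message.content)["triples"]

triples = extract_triples(
    "Marie Curie was born in Warsaw and won the Nobel Prize in Physics."
)
# e.g. [{"subject": "Marie Curie", "relation": "place of birth",
#        "object": "Warsaw"}, ...]
```

The later stages then type, validate, and deduplicate these raw candidates against the Wikidata ontology.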
The repository is organized as follows:

- `preprocessing/constraint-preprocessing.ipynb`: Jupyter notebook for collecting constraint rules from Wikidata (the validation sketch after this list illustrates how such rules are used).
- `utils/`: utilities for LLM-based triple extraction and alignment with Wikidata ontology rules.
- `utils/openai_utils.py`: `LLMTripletExtractor` class for LLM-based triple extraction.
- `utils/ontology_mappings/`: JSON files containing ontology mappings from Wikidata.
- `utils/structured_inference_with_db.py`: `StructuredInferenceWithDB` class with triple extraction and QA functions.
- `utils/structured_aligner.py`: `Aligner` class for ontology alignment and entity name refinement.
- `utils/inference_with_db.py`: `InferenceWithDB` class with triple extraction and QA functions.
- `utils/dynamic_aligner.py`: `Aligner` class for entity and relation name refinement.
- `inference_and_eval/`: scripts for building KGs for the MuSiQue and HotpotQA datasets and evaluating QA performance.
- `analysis/`: notebooks with downstream analysis of the resulting KGs.
- `pages/` and `Wikontic.py`: code for the web service for knowledge graph extraction and visualization.
- `Dockerfile`: for building a containerized web service.
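
Conceptually, the collected constraint rules act as type checks on candidate triples. The sketch below is a toy illustration assuming a simplified rule format; the actual rules and their schema come from `preprocessing/constraint-preprocessing.ipynb`, and the helper here is hypothetical.

```python
# Toy illustration of ontology-based triple validation. This rule format is
# hypothetical; the real constraint rules are collected from Wikidata by
# preprocessing/constraint-preprocessing.ipynb.

# Wikidata-style type constraints per property (cf. Wikidata's
# "subject type constraint" and "value type constraint").
CONSTRAINTS = {
    "P19": {                          # place of birth
        "subject_types": {"Q5"},      # human
        "object_types": {"Q486972"},  # human settlement
    },
}

def is_valid(triple: dict, entity_types: dict[str, set]) -> bool:
    """Keep a candidate triple only if its subject and object carry
    types allowed by the property's constraint rule."""
    rule = CONSTRAINTS.get(triple["property"])
    if rule is None:
        return False  # no rule known for this property: reject the candidate
    subject_ok = entity_types.get(triple["subject"], set()) & rule["subject_types"]
    object_ok = entity_types.get(triple["object"], set()) & rule["object_types"]
    return bool(subject_ok and object_ok)

# "Marie Curie (Q7186) -- place of birth (P19) --> Warsaw (Q270)"
types = {"Q7186": {"Q5"}, "Q270": {"Q486972", "Q5119"}}
print(is_valid({"subject": "Q7186", "property": "P19", "object": "Q270"}, types))  # True
```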
To run the web service locally:

- Set up the ontology and KG databases: `./setup_db.sh`
- Launch the web service: `streamlit run Wikontic.py`
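
Alternatively, the bundled `Dockerfile` lets you run the service in a container. The image tag below is illustrative, and the port mapping assumes Streamlit's default port 8501.

```bash
# Build the image and run the containerized web service.
# "wikontic" is an illustrative tag; Streamlit listens on 8501 by default.
docker build -t wikontic .
docker run -p 8501:8501 wikontic
```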
Enjoy building knowledge graphs with Wikontic!


