Doc Layout API

FastAPI service for document layout detection based on doclayout_yolo (YOLOv10).

It exposes two endpoints:

POST /detect — accepts an image and returns an annotated PNG.
POST /evaluate — accepts an image and COCO annotations, returns a PNG overlay (GT green, predictions red) and evaluation metrics in the HTTP header.

Environment (`.env`)

LOG_LEVEL=INFO
LOG_TYPE=TEXT            # TEXT | JSON

MODEL_PATH=models/doclayout_yolo_docstructbench_imgsz1024.pt
MODEL_DEVICE=cpu         # cpu | cuda:<index>  (e.g., cuda:0)
MODEL_IMGSZ=1024
MODEL_CONF=0.20
MODEL_WARMUP=true
MODEL_CONCURRENT=2

Build & Run

Using Makefile

CPU (default if MODEL_DEVICE=cpu):

make build
make run

GPU (set in .env):

MODEL_DEVICE=cuda:0

make build
make run

Useful commands:

make help
make logs
make stop
make rm
make rebuild

Local development (without Docker)

uv sync --locked
uv run uvicorn src.app:app --host 0.0.0.0 --port 49494

API

Base URL: http://localhost:49494 Docs: /docs (Swagger) or /redoc

`POST /detect`

Request: multipart/form-data
- image: PNG/JPG/WebP
Response: 200 OK → image/png (annotated image or original if no detections)

Example:

curl -s -X POST "http://localhost:49494/detect"   -F "image=@test/example/academic.jpg"   --output test/results/out_detect.png

`POST /evaluate`

Request: multipart/form-data
- image: PNG/JPG/WebP
- annotations: COCO JSON

Response:

200 OK → image/png (GT green, predictions red)

HTTP header X-Metrics with JSON:

{
  "metrics": {"precision": 0.91, "recall": 0.88, "f1": 0.895, "mIoU": 0.73, "mAP@0.5": 0.81},
  "counts": {"tp": 42, "fp": 3, "fn": 6}
}

Example:

curl -s -X POST "http://localhost:49494/evaluate"   -F "image=@test/example/ppt.jpg"   -F "annotations=@test/example/ppt.json"   -D test/results/headers.txt   --output test/results/out_eval.png
grep -i '^X-Metrics:' test/results/headers.txt

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
models		models
src		src
test		test
.codespell.ignorewords		.codespell.ignorewords
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Doc Layout API

Environment (`.env`)

Build & Run

Using Makefile

Local development (without Docker)

API

`POST /detect`

`POST /evaluate`

About

Uh oh!

Releases

Packages

Languages

yyeliseyenka/doc-layout-forge

Folders and files

Latest commit

History

Repository files navigation

Doc Layout API

Environment (.env)

Build & Run

Using Makefile

Local development (without Docker)

API

POST /detect

POST /evaluate

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Environment (`.env`)

`POST /detect`

`POST /evaluate`

Packages