GUI interaction capture -- production-ready event streams with time-aligned media
-
Updated
Jan 29, 2026 - Python
GUI interaction capture -- production-ready event streams with time-aligned media
OpenAdapt’s open-source ML toolkit for training and evaluating general multimodal GUI-action models.
Multimodal demo retrieval for GUI automation
Temporal smoothing for UI element detection with OmniParser integration
HTML viewer components for ML dashboards and benchmarks
PII/PHI detection and redaction for GUI automation data (text, images, dicts)
Evaluation infrastructure for GUI agent benchmarks
Add a description, image, and links to the openadapt topic page so that developers can more easily learn about it.
To associate your repository with the openadapt topic, visit your repo's landing page and select "manage topics."