The repo provides the marketplace environment and agent implementations but does not publish any agent trajectory data. The infrastructure to capture traces already exists (database logging, extract_agent_llm_traces.py), but there is no pipeline to export complete, structured trajectories suitable for sharing.
We should generate trajectory datasets from simulation runs across different LLM providers and configurations, and publish them on Hugging Face Datasets. This would fill a gap — most agent benchmarks release environments but not traces.
Related: #156 (static input data release)