Change the repository type filter
All
Repositories list
10 repositories
toolkit-eval-harness
PublicAI/ML evaluation harness -- versioned golden test suites, deterministic scoring, regression detection, and CI/CD-integrated reporting.- Red-team and compliance test harness for LLM outputs -- detects PII leakage, secret exposure, and policy violations with CI-friendly reporting.
toolkit-inference-mesh
PublicDistributed LLM inference mesh for heterogeneous clusters -- pipeline-parallel sharding with SGLang, MLX-LM, and P2P networking. Fork of Parallax.toolkit-ml-provenance
PublicML provenance and SBOM generator -- deterministic manifests with integrity verification and optional cryptographic signing for datasets, configs, code, and mode…enterprise-crypto
PublicOpen-source multi-agent crypto trading system with institutional-grade risk controls, multi-exchange support, and autonomous strategy execution.toolkit-cost-optimizer
Publictoolkit-llm-gateway
Publictoolkit-data-contracts
PublicData contract management and drift detection for ML/LLM pipelines -- automatic schema inference, validation, and statistical profiling with CI/CD integration.toolkit-mmqa
PublicMultimodal dataset QA -- directory scanning, file hashing, and exact duplicate detection for text, image, and audio datasets.toolkit-rag-quality
PublicDeterministic RAG evaluation toolkit -- retrieval metrics (recall, precision, MRR), corpus overlap detection, and CI regression gating without model calls.