Our journey began on September 1st, 2011, and since then we have made significant strides in the world of continuous integration and deployment (CI/CD) 🌟. We are proud to empower developers and businesses all over the globe 🌍 by helping them automate their build, test, and deploy processes.
Popular repositories Loading
-
-
evalbench
evalbench PublicForked from petmal/MindTrial
Evaluate LLMs side-by-side. Benchmark AI models and coding agents across providers like OpenAI, Google, Anthropic, DeepSeek, and more. Supports custom tasks, structured JSON responses, tool use, an…
HTML
Repositories
Showing 2 of 2 repositories
- evalbench Public Forked from petmal/MindTrial
Evaluate LLMs side-by-side. Benchmark AI models and coding agents across providers like OpenAI, Google, Anthropic, DeepSeek, and more. Supports custom tasks, structured JSON responses, tool use, and LLM-as-judge validation. Originally created by Petr Malik as MindTrial.
CircleCI-Research/evalbench’s past year of commit activity - .github Public
CircleCI-Research/.github’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…
