Popular repositories Loading
-
dynamic-cheatsheet
dynamic-cheatsheet PublicForked from suzgunmirac/dynamic-cheatsheet
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
-
deep_research_bench
deep_research_bench PublicForked from Ayanami0730/deep_research_bench
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
-
lm-evaluation-harness-generations
lm-evaluation-harness-generations PublicForked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python
-
gorilla-jayr
gorilla-jayr PublicForked from ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Python 1
-
gepa
gepa PublicForked from gepa-ai/gepa
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
Jupyter Notebook
-
terminal-bench
terminal-bench PublicForked from laude-institute/terminal-bench
A benchmark for LLMs on complicated tasks in the terminal
Python
If the problem persists, check the GitHub status page or contact support.
