bwshen-mi

Follow

bwshen-mi

Follow

30 followers · 3 following

Achievements

Achievements

Organizations

Popular repositories Loading

reward-bench reward-bench Public

Forked from allenai/reward-bench

RewardBench: the first evaluation tool for reward models.

Python
llama.cpp llama.cpp Public

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++
claw-eval claw-eval Public

Forked from claw-eval/claw-eval

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python