suyoumo/clawprobench
ClawProBench is a live-first benchmark harness for evaluating LLM agents in the OpenClaw runtime, with deterministic grading and repeated-trial reliability. Hugging Bay hosted release.
- License: apache-2.0
- Scan status: pending
- Hosting status: external
- Upstream: suyoumo/ClawProBench