Documentation
Core Capabilities

Multi-Model Benchmarking

Comparison

Compare multiple models under the same conditions with side-by-side evaluation, normalized scoring, model ranking, and performance insights.

Ranking

1. mistral   (0.91)
2. llama3    (0.87)
3. llama2    (0.84)
OpenVals Docs