Test Space of the LLM Leaderboard
🏅 LLM Benchmark
🏅 LLM Benchmark
Textbox
Select columns to show
T
Model
Average ⬆️
ARC
HellaSwag
MMLU
TruthfulQA
Winogrande
GSM8K
Type
Architecture
Precision
Merged
Hub License
#Params (B)
Hub ❤️
Model sha
model_name_for_query
Model types
🔶
💬
🤝
🟢
Precision
float16
bfloat16
4bit
Model sizes (in billions of parameters)
?
~1.5
~3
~7
~13
~35
~60
70+
T
⋮
Model
⋮
Average ⬆️
⋮
ARC
⋮
HellaSwag
⋮
MMLU
⋮
TruthfulQA
⋮
Winogrande
⋮
GSM8K
⋮
model_name_for_query
⋮
🔶
eren23/ogno-monarch-jaskier-merge-7b-OH-PREF-DPO-v4-test
📑
81.22
79.78
91.15
77.95
75.18
87.85
76.12
eren23/ogno-monarch-jaskier-merge-7b-OH-PREF-DPO-v4-test