Polish EQ-Bench Leaderboard
Leaderboard was created as part of an open-science project SpeakLeash.org
Polish Emotional Intelligence Benchmark for LLMs
Help us develop Polish Large Language Model Bielik by using Arena.
We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2024/016951.
|  Model   |  Params   |  Benchmark Score   |  Percentage Questions Parseable   |  Error   | 
|---|---|---|---|---|
|  22.2  |  59.579532163742684  |  51.461988304093566  |  133.0 questions were parseable (min is 83%)  | 
Authors:
- Automatic translation: Remigiusz Kinas
- Translation proofreading and localization: Maria Filipkowska, Zuzanna Dabić
- Preparing dataset: Kacper Milan
- Running benchmark and leaderboard: Krzysztof Wróbel
Based on: EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models, Samuel J. Paech, 2023
| output .csv | 39.8 KB ⇣ |