Polish EQ-Bench Leaderboard
This leaderboard was created as part of the open-science project SpeakLeash.org
Polish Emotional Intelligence Benchmark for LLMs
Help us develop the Polish large language model Bielik by using the Arena.
We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2024/016951.
| Model | Params | Benchmark Score | Percentage Questions Parseable | Error |
|---|---|---|---|---|
|  | 22.2 | 59.579532163742684 | 51.461988304093566 | 133.0 questions were parseable (min is 83%) |
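
The Error column reports a result when too few answers could be parsed, with 83% given as the minimum. A minimal sketch of how such a check might work is shown below. The function names and the question counts are hypothetical and are not taken from the benchmark's own code; only the 83% threshold comes from the table.

```python
def parseable_percentage(parseable: int, total: int) -> float:
    """Return the share of questions whose answers could be parsed, in percent."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * parseable / total


def meets_minimum(parseable: int, total: int, min_percent: float = 83.0) -> bool:
    """Return True when the parseable share reaches the required minimum."""
    return parseable_percentage(parseable, total) >= min_percent


# Hypothetical counts, chosen only for illustration.
print(round(parseable_percentage(88, 171), 2))  # 51.46
print(meets_minimum(88, 171))   # False: below the 83% minimum
print(meets_minimum(150, 171))  # True: about 87.7% parseable
```

A run that falls below the minimum would still have a score computed from the answers that did parse, so the score and the error message should be read together.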
Authors:
- Automatic translation: Remigiusz Kinas
- Translation proofreading and localization: Maria Filipkowska, Zuzanna Dabić
- Dataset preparation: Kacper Milan
- Running the benchmark and leaderboard: Krzysztof Wróbel
Based on: EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models, Samuel J. Paech, 2023