Polish EQ-Bench Leaderboard

Leaderboard was created as part of an open-science project SpeakLeash.org

Polish Emotional Intelligence Benchmark for LLMs

Help us develop Polish Large Language Model Bielik by using Arena.

We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2024/016951.

Model
Params
Benchmark Score
Percentage Questions Parseable
Error
46.7
59.579532163742684
51.461988304093566
141.0 questions were parseable (min is 83%)

Authors:

Based on: EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models, Samuel J. Paech, 2023