Eye of Judgement: Dissecting the Evaluation of Russian-speaking LLMs with POLLUX Paper • 2505.24616 • Published May 30, 2025
Multimodal Evaluation of Russian-language Architectures Paper • 2511.15552 • Published Nov 19, 2025 • 78