The 2nd LLMs4Subjects Shared Task: LLM-based Subject Tagging for the TIB
Technical Library's Open-Access Catalog
Theme: The Development of Energy- and Compute-Efficient LLM Systems
Organized as part of the German Evaluation (GermEval 2025) Shared Task
Series
10. - 12. September, 2025
Hildesheim, Germany
(co-located with KONVENS 2025 - Conference on Natural Language Processing)
2nd LLMs4Subjects Shared Task:
https://sites.google.com/view/llms4subjects-germeval/
KONVENS 2025: https://konvens-2025.hs-hannover.de/about/
Task Overview
LLMs4Subjects challenges the research community to develop cutting-edge
LLM-based solutions for subject tagging of technical records from Leibniz
University’s Technical Library (TIBKAT). Participants are tasked with
leveraging large language models (LLMs) to tag technical records using the
GND taxonomy. The task involves bilingual language modeling, as systems
must process technical documents in both German and English. Successful
solutions may be integrated into the operational workflows of TIB, the
Leibniz Information Centre for Science and Technology.
With the rapid advancements in LLMs, the focus is shifting toward making
these models more energy- and compute-efficient while maintaining high
performance. Recent innovations, such as the DeepSeek series, have
demonstrated how techniques like mixture-of-experts (MoE) and model
distillation can significantly reduce computational costs without
sacrificing effectiveness.
The 2nd LLMs4Subjects shared task highlights the importance of efficiency
in LLMs, encouraging participants to explore strategies that enhance model
performance while optimizing for energy consumption and inference speed. We
welcome approaches (but not limited to) that leverage model compression,
quantization, efficient fine-tuning, and adaptive computation techniques to
push the boundaries of sustainable AI development.
Subtasks
The 2nd LLMs4Subjects shared task organizes the following two subtasks:
Subtask 1 - Multi-Domain Classification of Library Records
Subtask 2 - Large-scale Multilabel Subject Indexing of Library Records
Important Dates
· Release of training data: March 8, 2025 · Release of testing
data: May 23, 2025· Deadline for system submissions: June 2, 2025·
Evaluation end: June 27, 2025 · Paper submission deadline: July
7, 2025 · Notification of acceptance: June 28, 2025 · Camera-ready
paper due: August 15, 2025 · Workshop/KONVENS: September 10 - 12,
2025 (TBA)
|