Abstract
[Purpose/Significance] Research on disciplinary classification of academic short texts can effectively enhance the bibliometric analysis of scholarly papers.[Method/Process]This paper proposed an academic short-text classification framework that integrates Retrieval-Augmented Generation(RAG)with large language model fine-tuning. By dynamically retrieving discipline-related information and combining parameter-efficient fine-tuning techniques, the framework enhanced the semantic representation of model inputs while achieving deep adaptation to domain-specific tasks. [Result/Conclusion]Experiments demonstrate that, compared to traditional deep learning models and general large language models, the synergistic paradigm integrating LoRA fine-tuning and RAG significantly enhances multi-label classification performance. The classification error rate decreases by 36.8%, while accuracy, coverage, and top-label error rate all achieve optimal levels. The classification framework integrating RAG and large language model fine-tuning exhibits synergistic advantages in multi-label classification of academic short texts. Its modular architecture provides a technical pathway for cross-disciplinary knowledge transfer, holding significant academic value and practical implications.
| Translated title of the contribution | Research on Academic Short Text Subject Classification Combining RAG and Large Language Model Fine-Tuning |
|---|---|
| Original language | Chinese (Traditional) |
| Pages (from-to) | 18-29 |
| Number of pages | 12 |
| Journal | Journal of Modern Information |
| Volume | 46 |
| Issue number | 3 |
| DOIs | |
| State | Published - Mar 2026 |
Fingerprint
Dive into the research topics of 'Research on Academic Short Text Subject Classification Combining RAG and Large Language Model Fine-Tuning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver