跳到主要导航 跳到搜索 跳到主要内容

Machine translationese of large language models: Dependency triplets, text classification, and SHAP analysis

科研成果: 期刊稿件文章同行评审

摘要

This study addresses the challenge of distinguishing human translations from those generated by Large Language Models (LLMs) by utilizing dependency triplet features and evaluating 16 machine learning classifiers. Using 10-fold cross-validation, the SVM model achieves the highest mean F1-score of 93%, while all other classifiers consistently differentiate between human and machine translations. SHAP analysis helps identify key dependency features that distinguish human and machine translations, improving our understanding of how LLMs produce translationese. The findings provide practical insights for enhancing translation quality assessment and refining translation models across various languages and text genres, contributing to the advancement of natural language processing techniques.

源语言英语
文章编号e0339769
期刊PLoS ONE
21
1 January
DOI
出版状态已出版 - 1月 2026

指纹

探究 'Machine translationese of large language models: Dependency triplets, text classification, and SHAP analysis' 的科研主题。它们共同构成独一无二的指纹。

引用此