Abstract
Pronoun anaphora ambiguity is very common in natural language descriptions, especially in specialized fields such as computing, medicine and aerospace. When multiple candidate antecedents appear before a pronoun, readers with different background knowledge often interpret the same pronoun in completely different ways. To reduce the misunderstandings that such ambiguity causes as documents are propagated, referential ambiguity is usually checked manually, which cannot keep up with the growing demand for detection as the various disciplines develop. In this paper, we propose a method to automatically detect sentences with anaphora ambiguity. We first identify criteria for ambiguous sentences and then use word embeddings to further detect ambiguity. Specifically, we propose a pronoun substitution strategy based on coreference resolution, and combine this strategy with word embedding techniques to produce a cross-domain anaphora ambiguity detection method. Finally, we carry out experiments on aerospace documents, which verify the effectiveness of the proposed method in anaphora ambiguity detection.
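The abstract does not spell out the algorithm, so the following is only a minimal sketch of how pronoun substitution and word embeddings might be combined, not the authors' implementation. It assumes the candidate antecedents have already been produced by a coreference resolver (here they are passed in by hand), uses invented three-dimensional toy vectors in place of pretrained embeddings, and flags a sentence as ambiguous when the two best-fitting candidates score within a hypothetical `margin` of each other. All function names, the vectors, and the threshold are illustrative assumptions.

```python
import math

# Toy 3-dimensional vectors invented for illustration only; a real system
# would load pretrained word embeddings instead (assumption).
TOY_VECTORS = {
    "engine": (0.9, 0.1, 0.0),
    "valve": (0.8, 0.2, 0.1),
    "pilot": (0.0, 0.1, 0.9),
    "overheated": (0.85, 0.15, 0.05),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def substitute(tokens, pronoun_index, candidate):
    """Return a copy of the sentence with the pronoun replaced by a candidate."""
    return tokens[:pronoun_index] + [candidate] + tokens[pronoun_index + 1:]


def fit_score(tokens, position, candidates, vectors):
    """Average similarity between the substituted word and its context.

    Context words are tokens that have a vector and are not themselves
    candidate antecedents (a simplifying assumption of this sketch).
    """
    target = vectors[tokens[position]]
    context = [
        vectors[tok]
        for i, tok in enumerate(tokens)
        if i != position and tok in vectors and tok not in candidates
    ]
    if not context:
        return 0.0
    return sum(cosine(target, vec) for vec in context) / len(context)


def detect_ambiguity(tokens, pronoun_index, candidates, margin=0.1,
                     vectors=TOY_VECTORS):
    """Substitute each candidate for the pronoun and compare the fit scores.

    Returns (is_ambiguous, scores). The sentence is flagged when the two
    best candidates fit the context almost equally well.
    """
    scores = {}
    for cand in candidates:
        replaced = substitute(tokens, pronoun_index, cand)
        scores[cand] = fit_score(replaced, pronoun_index, candidates, vectors)
    ranked = sorted(scores.values(), reverse=True)
    ambiguous = len(ranked) >= 2 and (ranked[0] - ranked[1]) < margin
    return ambiguous, scores


# "it" could plausibly refer to either the engine or the valve.
s1 = "the engine and the valve failed after it overheated".split()
flag1, scores1 = detect_ambiguity(s1, s1.index("it"), ["engine", "valve"])

# Here only the engine fits the context word "overheated".
s2 = "the pilot restarted the engine after it overheated".split()
flag2, scores2 = detect_ambiguity(s2, s2.index("it"), ["pilot", "engine"])
```

With these toy vectors the first sentence is flagged as ambiguous and the second is not; with real embeddings the margin would need to be tuned on labelled data.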
| Original language | English |
|---|---|
| Pages (from-to) | 646-649 |
| Number of pages | 4 |
| Journal | Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE |
| Volume | 2023-July |
| State | Published - 2023 |
| Event | 35th International Conference on Software Engineering and Knowledge Engineering, SEKE 2023 - Hybrid, San Francisco, United States. Duration: 1 Jul 2023 → 10 Jul 2023 |
Keywords
- cross-domain
- ambiguity detection
- anaphora ambiguity
- natural language processing
- pronoun substitution