TY - GEN
T1 - Chinese Clinical Named Entity Recognition with Word-Level Information Incorporating Dictionaries
AU - Lu, Ningjie
AU - Zheng, Jun
AU - Wu, Wen
AU - Yang, Yan
AU - Chen, Kaiwei
AU - Hu, Wenxin
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/7
Y1 - 2019/7
N2 - Electronic Medical Records (EMRs) are the digital equivalent of paper records, which include treatment and medical history about a patient. At present, the main research goal of Chinese EMRS is to accurately recognize the body parts, drugs, illnesses and other information in the Chinese medical process. Implementing EMRs can boost both the quality and safety of patient care. In Chinese EMRs, how to accurately recognize named entities is important because it is useful to predict the disease risk, therapeutic method and recovery probability. This paper proposes a novel deep learning framework, which uses character-word joint embedding and combines different feature information based on the dictionary. Compared with the predecessors, we incorporate word-level information based on the basic Bi-LSTM model. In addition, we propose an improved n-gram feature encoding method and compare it with PDET feature and PIET feature. Our experimental results demonstrate that our proposed model performs the best in predicting named entities in Chinese EMRs.
AB - Electronic Medical Records (EMRs) are the digital equivalent of paper records, which include treatment and medical history about a patient. At present, the main research goal of Chinese EMRS is to accurately recognize the body parts, drugs, illnesses and other information in the Chinese medical process. Implementing EMRs can boost both the quality and safety of patient care. In Chinese EMRs, how to accurately recognize named entities is important because it is useful to predict the disease risk, therapeutic method and recovery probability. This paper proposes a novel deep learning framework, which uses character-word joint embedding and combines different feature information based on the dictionary. Compared with the predecessors, we incorporate word-level information based on the basic Bi-LSTM model. In addition, we propose an improved n-gram feature encoding method and compare it with PDET feature and PIET feature. Our experimental results demonstrate that our proposed model performs the best in predicting named entities in Chinese EMRs.
KW - Chinese EMR
KW - Deep learning
KW - Feature engineering
KW - Mixed embedding
KW - Named entity recognition
UR - https://www.scopus.com/pages/publications/85073215599
U2 - 10.1109/IJCNN.2019.8852113
DO - 10.1109/IJCNN.2019.8852113
M3 - 会议稿件
AN - SCOPUS:85073215599
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2019 International Joint Conference on Neural Networks, IJCNN 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 International Joint Conference on Neural Networks, IJCNN 2019
Y2 - 14 July 2019 through 19 July 2019
ER -