Research article Special Issues

Medical assertion classification in Chinese EMRs using attention enhanced neural network

  • Received: 18 December 2018 Accepted: 17 February 2019 Published: 08 March 2019
  • Electronic medical records (EMRs), such as hospital discharge summaries, contain a wealth of information only expressed in natural language. Automated methods for extracting information from these records must be able to recognize medical concepts in text and their semantic context. A contextual property critical to reason on information from EMRs is the doctor's belief status or assertion of the patient's medical problem. Research on the medical assertion classification (MAC) can establish the foundation for various health data analyses and clinical applications. However, previous MAC studies are mainly based on traditional machine learning methods which mostly require manually constructed features and the original unlabeled data cannot be easily and effectively applied to classification or classification tasks. Furthermore, external medical knowledge such as various medical dictionary bases, which provides rich explain and definition information about medical entity, is rarely utilized in existing neural network models of medical information extraction. In this study, we propose a deep neural network architecture enhanced by medical knowledge attention layer through combining GRU neural network with CNN model to classify the assertion type of medical problem such as disease and symptom in Chinese EMRs. The attention layer in the model is applied to integrate entity representations learned from medical dictionary bases as query for encoding. Experimental results on own manually annotated corpus indicate our approach achieves better performance compared to existing methods.

    Citation: Zhichang Zhang, Yu Zhang, Tong Zhou, Yali Pang. Medical assertion classification in Chinese EMRs using attention enhanced neural network[J]. Mathematical Biosciences and Engineering, 2019, 16(4): 1966-1977. doi: 10.3934/mbe.2019096

    Related Papers:

  • Electronic medical records (EMRs), such as hospital discharge summaries, contain a wealth of information only expressed in natural language. Automated methods for extracting information from these records must be able to recognize medical concepts in text and their semantic context. A contextual property critical to reason on information from EMRs is the doctor's belief status or assertion of the patient's medical problem. Research on the medical assertion classification (MAC) can establish the foundation for various health data analyses and clinical applications. However, previous MAC studies are mainly based on traditional machine learning methods which mostly require manually constructed features and the original unlabeled data cannot be easily and effectively applied to classification or classification tasks. Furthermore, external medical knowledge such as various medical dictionary bases, which provides rich explain and definition information about medical entity, is rarely utilized in existing neural network models of medical information extraction. In this study, we propose a deep neural network architecture enhanced by medical knowledge attention layer through combining GRU neural network with CNN model to classify the assertion type of medical problem such as disease and symptom in Chinese EMRs. The attention layer in the model is applied to integrate entity representations learned from medical dictionary bases as query for encoding. Experimental results on own manually annotated corpus indicate our approach achieves better performance compared to existing methods.


    加载中


    [1] W. W. Chapman, W. Bridewell and P. Hanbur, et al., A simple algorithm for identifying negated findings and diseases in discharge summaries, J. Biomed. Inform., 34(2001), 301–310.
    [2] H. Harkema, J. N. Dowling and T. Thornblade, et al., Con Text: an algorithm fordetermining negation, experiencer, and temporal status from clinicalreports, J. Biomed. Inform., 42(2009), 839–851.
    [3] M. Jiang, Y. Chen and M. Liu, et al., Hybrid approaches to concept extraction and assertion classification-vanderbilt's systems for 2010 I2B2 NLP Challenge, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data, Boston, MA, USA: i2b2, (2010).
    [4] K. Roberts, B. Rink and S. Harabagiu, Extraction of medical concepts, assertions, and relations from discharge summaries for the fourth i2b2/VA shared task, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data, Boston, MA, USA: i2b2, (2010).
    [5] B. de Bruijn, C. Cherry and S. Kiritchenko, et al., NRC at i2b2: one challenge, three practical tasks, nine statistical systems, hundreds of clinical records, millions of useful features, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data, Boston, MA, USA: i2b2, (2010).
    [6] D. Demner-Fushman, E. Apostolova and R. Islamaj Dogan, et al., NLM's system description for the fourth i2b2/VA challenge, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, (2010).
    [7] C. Grouin, A. B. Abacha and D. Bernhard, et al., CARAMBA: concept, assertion, and relation annotation using machine-learning based approaches, Proceedings of the 2010 i2b2/ VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, (2010).
    [8] G. Divita, O. Z. Treitler and Y. J. Kim, et al., Salt lake city VA's challenge submissions. Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data, Boston, MA, USA: i2b2, (2010).
    [9] A. M. Cohen, K. Ambert and J. Yang, et al., OHSU/portland VAMC team participation in the 2010 i2b2/VA challenge tasks, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, (2010).
    [10] P. Anick, P. Hong and N. Xue, et al., I2B2 2010 challenge: machine learning for information extraction from patient records. Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, (2010).
    [11] E. Chang, Y. Xu and K. Hong, et al., A hybrid approach to extract structured information from narrative clinical discharge summaries, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, (2010).
    [12] C. Clark, J. Aberdeen and M. Coarr, et al., Determining assertion status for medical problems in clinical records, Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, (2010).
    [13] K. Cho, B. Van Merrienboer and D. Bahdanau, et al., On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, Computer Science, (2014), arXivpreprintarXiv:1409.1259
    [14] D. Nadeau and S. Sekine, A survey of named entity recognition and classification, LingvisticaeInvestigationes, 30.1(2007), 3–26.
    [15] X. Ma and E. Hovy, End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, (2016), arXiv preprint arXiv:1603.01354.
  • Reader Comments
  • © 2019 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(4300) PDF downloads(682) Cited by(5)

Article outline

Figures and Tables

Figures(4)  /  Tables(4)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog