Contaminants are the critical targets of food safety supervision and risk assessment. In existing research, food safety knowledge graphs are used to improve the efficiency of supervision since they supply the relationship between contaminants and foods. Entity relationship extraction is one of the crucial technologies of knowledge graph construction. However, this technology still faces the issue of single entity overlap. This means that a head entity in a text description may have multiple corresponding tail entities with different relationships. To address this issue, this work proposes a pipeline model with neural networks for multiple relations enhanced entity pairs extraction. The proposed model can predict the correct entity pairs in terms of specific relations by introducing the semantic interaction between relation identification and entity extraction. We conducted various experiments on our own dataset FC and on the open public available data set DuIE2.0. The results of experiments show our model reaches the state-of-the-art, and the case study indicates our model can correctly extract entity-relationship triplets to release the problem of single entity overlap.
Citation: Yuanyuan Cai, Hao Liang, Qingchuan Zhang, Haitao Xiong, Fei Tong. Food safety in health: a model of extraction for food contaminants[J]. Mathematical Biosciences and Engineering, 2023, 20(6): 11155-11175. doi: 10.3934/mbe.2023494
Contaminants are the critical targets of food safety supervision and risk assessment. In existing research, food safety knowledge graphs are used to improve the efficiency of supervision since they supply the relationship between contaminants and foods. Entity relationship extraction is one of the crucial technologies of knowledge graph construction. However, this technology still faces the issue of single entity overlap. This means that a head entity in a text description may have multiple corresponding tail entities with different relationships. To address this issue, this work proposes a pipeline model with neural networks for multiple relations enhanced entity pairs extraction. The proposed model can predict the correct entity pairs in terms of specific relations by introducing the semantic interaction between relation identification and entity extraction. We conducted various experiments on our own dataset FC and on the open public available data set DuIE2.0. The results of experiments show our model reaches the state-of-the-art, and the case study indicates our model can correctly extract entity-relationship triplets to release the problem of single entity overlap.
[1] | W. Guo, B. Pan, S. Sakkiah, G. Yavas, W. Ge, W. Zou, et al., Persistent organic pollutants in food: contamination sources, health effects and detection methods, Int. J. Environ. Res. Public Health, 16 (2019), 4361. https://doi.org/10.3390/ijerph16224361 doi: 10.3390/ijerph16224361 |
[2] | F. Yeni, S. Yavaş, H. Alpas, Y. Soyer, Most common foodborne pathogens and mycotoxins on fresh produce: a review of recent outbreaks, Crit. Rev. Food Sci. Nutr., 56 (2016), 1532–1544. https://doi: 10.1080/10408398.2013.777021 doi: 10.1080/10408398.2013.777021 |
[3] | C. A. Damalas, I. G. Eleftherohorinos, Pesticide exposure, safety issues, and risk assessment indicators, Int. J. Environ. Res. Public Health, 8 (2011), 1402–1419. https://doi.org/10.3390/ijerph8051402 doi: 10.3390/ijerph8051402 |
[4] | P. Bertail, S. Clémençon, J. Tressou, A storage model with random release rate for modeling exposure to food contaminants, Math. Biosci. Eng., 5 (2008), 35–60. https://doi.org/10.3934/mbe.2008.5.35 doi: 10.3934/mbe.2008.5.35 |
[5] | W. Min, C. Liu, L. Xu, S. Jiang, Applications of knowledge graphs for food science and industry, Patterns, 3 (2022), 100484. https://doi: 10.1016/j.patter.2022.100484 doi: 10.1016/j.patter.2022.100484 |
[6] | C. Li, K. Ma, Entity recognition of Chinese medical text based on multi-head self-attention combined with BILSTM-CRF, Math. Biosci. Eng., 19 (2022), 2206–2218. https://doi.org/10.3934/mbe.2022103 doi: 10.3934/mbe.2022103 |
[7] | H. Yu, H. Li, D. Mao, Q. Cai, A domain knowledge graph construction method based on Wikipedia, J. Inf. Sci, 47 (2021), 783–793. https://doi.org/10.1177/0165551520932510 doi: 10.1177/0165551520932510 |
[8] | H. Yu, H. Li, D. Mao, Q. Cai, A relationship extraction method for domain knowledge graph construction, World Wide Web, 23 (2020), 735–753. https://doi.org/10.1007/s11280-019-00765-y doi: 10.1007/s11280-019-00765-y |
[9] | K. Hashimoto, M. Miwa, Y. Tsuruoka, T. Chikayama, Simple customization of recursive neural networks for semantic relation classification, in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (ACL), (2013), 1372–1376. |
[10] | Q. Li, H. Ji, Incremental joint extraction of entity mentions and relations, in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL), (2014), 402–412. https://doi.org/10.3115/v1/P14-1038 |
[11] | X. Yu, W. Lam, Jointly identifying entities and extracting relations in encyclopedia text via a graphical model approach, in International Conference on Computational Linguistics, (2010), 1399–1407. Available from: https://aclanthology.org/C10-2160. |
[12] | H. Chang, H. Zan, T. Guan, K. Zhang, Z. Sui, Application of cascade binary pointer tagging in joint entity and relation extraction of Chinese medical text, Math. Biosci. Eng., 19 (2022), 10656–10672. https://doi:10.3934/mbe.2022498 doi: 10.3934/mbe.2022498 |
[13] | Z. Liang, Z. Zhang, H. Chen, Z. Zhang, Disease prediction based on multi-type data fusion from Chinese electronic health record, Math. Biosci. Eng, 19 (2022), 13732–13746. https://doi:10.3934/mbe.2022640 doi: 10.3934/mbe.2022640 |
[14] | Z. Zhong, D. Chen, A frustratingly easy approach for entity and relation extraction, in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies (NAACL), (2021), 50–61. https://doi.org/10.18653/v1/2021.naacl-main.5 |
[15] | X. Zeng, D. Zeng, S. He, K. Liu, J. Zhao, Extracting relational facts by an end-to-end neural model with copy mechanism, in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), (2018), 506–514. https://doi.org/10.18653/v1/P18-1047 |
[16] | Z. Wei, J. Su, Y. Wang, Y. Tian, Y. Chang, A novel cascade binary tagging framework for relational triple extraction, arXiv preprint, (2010), arXiv: 1909.03227. https://doi.org/10.48550/arXiv.1909.03227 |
[17] | Y. Zhang, X. Li, Y. Yang, T. Wang, Disease- and drug-related knowledge extraction for health management from online health communities based on BERT-BiGRU-ATT, Int. J. Environ. Res. Public Health, 19 (2022), 16590. https://doi.org/10.3390/ijerph192416590. doi: 10.3390/ijerph192416590 |
[18] | Q. Pan, C. Huang, D. Chen, A method based on multi-standard active learning to recognize entities in electronic medical record, Math. Biosci. Eng., 18 (2021), 1000–1021. https://doi.org/10.3934/mbe.2021054 doi: 10.3934/mbe.2021054 |
[19] | G. Zhou, J. Su, J. Zhang, M. Zhang, Exploring various knowledge in relation extraction, in Proceedings of the Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics (ACL), (2005), 427–434. https://doi.org/10.3115/1219840.1219893 |
[20] | S. Brin, Extracting patterns and relations from the World Wide Web, in the World Wide Web and Databases, Springer, (1999), 172–183. https://doi.org/10.1007/10704656_11 |
[21] | M. Craven, J. Kumlien, Constructing biological knowledge bases by extracting information from text Sources, Proc. Int. Conf. Intell. Syst. Mol. Biol., 1999 (1999), 77–86. |
[22] | T. Hasegawa, S. Sekine, R. Grishman, Discovering relations among named entities from large corpora, in Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, (2004), 415–422. https://doi.org/10.3115/1218955.1219008 |
[23] | D. Zeng, K. Liu, S. Lai, G. Zhou, J. Zhao, Relation classification via convolutional deep neural network, in Proceedings of COLING 2014 the 25th International Conference on Computational Linguistics: Technical Papers (COLING), (2014), 2335–2344. |
[24] | R. Socher, B. Huval, C. D. Manning, A. Y. Ng, Semantic compositionality through recursive matrix-vector spaces, in Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP), (2012), 1201–1211. |
[25] | J. R. Barr, P. Shaw, F. N. Abu-Khzam, S. Yu, H. Yin, T. Thatcher, Combinatorial code classification & vulnerability rating, in Second International Conference on Transdisciplinary AI (TransAI), (2020), 80–83, https://doi:10.1109/TransAI49837.2020.00017" target="_blank">10.1109/TransAI49837.2020.00017">https://doi:10.1109/TransAI49837.2020.00017 |
[26] | K. T. Chui, B. B. Gupta, P. Vasant, A genetic algorithm optimized RNN-LSTM model for remaining useful life prediction of turbofan engine, Electronics, 10 (2021), 285. https://doi.org/10.3390/electronics10030285 doi: 10.3390/electronics10030285 |
[27] | Y. Xu, L. Mou, G. Li, Y. Chen, H. Peng, Z. Jin, Classifying relations via long short term memory networks along shortest dependency paths, in Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2015), 785–1794. https://doi.org/10.18653/v1/D15-1206 |
[28] | P. Shi, J. Lin, Simple BERT models for relation extraction and semantic role labeling, arXiv preprint, (2019), arXiv: 1904.05255. https://doi.org/10.48550/arXiv.1904.05255 |
[29] | K. Xu, Y. Feng, S. Huang, D. Zhao, Semantic relation classification via convolutional neural networks with simple negative sampling, arXiv preprint, (2015), arXiv: 1506.07650. https://doi.org/10.48550/arXiv.1506.07650 |
[30] | C. Santos, B. Xiang, B. Zhou, Classifying relations by ranking with convolutional neural networks, in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, (2015), 626–634. https://doi.org/10.48550/arXiv.1504.06580 |
[31] | Y. Lin, S. Shen, Z. Liu, H. Luan, M. Sun, Neural relation extraction with selective attention over instances, in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), 1 (2016), 2124–2133. https://doi.org/10.18653/v1/P16-1200 |
[32] | S. Zhang, D. Zheng, X. Hu, M. Yang, Bidirectional long short-term memory networks for relation classification, in Proceedings of the 29th Pacific Asia Conference on Language Information and Computation(PACLIC), (2015), 73–78. |
[33] | M. Miwa, M. Bansal, End-to-end relation extraction using LSTMs on sequences and tree structures, in Proceedings of the Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL), 1 (2016), 1105–1116. https://doi.org/10.18653/v1/P16-1105 |
[34] | S. Zheng, Y. Hao, D. Lu, H. Bao, J. Xu, H. Hao, et al., Joint entity and relation extraction based on a hybrid neural network, Neurocomputing, 257 (2017), 59–66. https://doi.org/10.1016/j.neucom.2016.12.075 doi: 10.1016/j.neucom.2016.12.075 |
[35] | K. Xue, Y. Zhou, Z. Ma, T. Ruan, H. Zhang, P. He, Fine-tuning BERT for joint entity and relation extraction in Chinese medical text, in 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), (2019), 892–897. https://dx.doi.org/10.1109/bibm47256.2019.8983370 |
[36] | G. Bekoulis, J. Deleu, T. Demeester, C. Develder, Joint entity recognition and relation extraction as a multi-head selection problem, Expert Syst. Appl., 114 (2018): 34–45. https://dx.doi.org/10.1016/j.eswa.2018.07.032 |
[37] | S. Zheng, F. Wang, H. Bao, Y. Hao, P. Zhou, B. Xu, et al., Joint extraction of entities and relations based on a novel tagging scheme, in Proceedings of the Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), (2017), 1227–1236. https://dx.doi.org/10.18653/v1/P17-1113 |
[38] | A. Katiyar, C. Cardie, Going out on a limb: Joint extraction of entity mentions and relations without dependency trees, in Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), (2017), 917–928. https://dx.doi.org/10.18653/v1/P17-1085 |
[39] | X. Li, F. Yin, Z. Sun, X. Li, A. Yuan, D. Chai, et al., Entity-relation extraction as multi-turn question answering, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), (2019). https://dx.doi.org/10.18653/v1/P19-1129 |
[40] | D. Dai, X. Xiao, Y. Lyu, S. Dou, Q. She, H. Wang, Joint extraction of entities and overlapping relations using position-attentive sequence labeling, in Proceedings of the AAAI Conference on Artificial Intelligence, 33 (2019), 6300–6308. https://doi.org/10.1609/aaai.v33i01.33016300 |
[41] | T. J. Fu, P. H. Li, W. Y. Ma, Graphrel: Modeling text as relational graphs for joint entity and relation extraction, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), (2019), 1409–1418. https://doi.org/10.18653/v1/P19-1136 |
[42] | M. Eberts, A. Ulges, Span-based joint entity and relation extraction with transformer pre-training, arXiv preprint, (2019), arXiv: 1909.07755. https://doi.org/10.48550/arXiv.1909.07755 |
[43] | Z. Lan, M. Chen, S. Goodman, K. Gimpel, P. Sharma, R. Soricut, ALBERT: A lite BERT for self-supervised learning of language representations, arXiv preprint, (2019), arXiv: 1909.11942. https://doi.org/10.48550/arXiv.1909.11942 |
[44] | T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient estimation of word representations in vector space, arXiv preprint, (2013), arXiv: 1301.3781. https://doi.org/10.48550/arXiv.1301.3781 |
[45] | Y. Kim, Convolutional neural networks for sentence classification, in Proceedings of the Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2014), 1746–1751. https://doi.org/10.3115/v1/D14-1181 |
[46] | Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, et al., RoBERTa: A robustly optimized BERT pretraining approach, arXiv preprint, (2019), arXiv: 1907.11692. https://doi.org/10.48550/arXiv.1907.11692 |
[47] | J. Devlin, M. W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint, (2019), arXiv: 1810.04805. https://doi.org/10.48550/arXiv.1810.04805 |
[48] | A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. Gomez, et al., Attention is all you need, arXiv preprint, (2017), arXiv: 1706.03762. https://doi.org/10.48550/arXiv.1706.03762 |
[49] | H. Yan, B. Deng, X. Li, X. Qiu, TENER: Adapting transformer encoder for named entity recognition, arXiv preprint, (2019), arXiv: 1911.04474. https://doi.org/10.48550/arXiv.1911.04474 |
[50] | D. Hendrycks, K. Gimpel, Gaussian Error Linear Units (GELUs), arXiv preprint, (2016), arXiv: 1606.08415. https://doi.org/10.48550/arXiv.1606.08415 |
[51] | Y. Zhang, H. Zhao, B. Li, Semantic slot filling based on BERT and BiLSTM, Comput. Sci., 48 (2021), 247–252. https://doi.org/10.11896/jsjkx.191200088 doi: 10.11896/jsjkx.191200088 |
[52] | J. Lafferty, A. McCallum, F. C. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, in Proceedings of the Eighteenth International Conference on Machine Learning, (2001), 282–289. https://dl.acm.org/doi/10.5555/645530.655813 |
[53] | D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, arXiv preprint, (2014), arXiv: 1412.6980. https://doi.org/10.48550/arXiv.1412.6980 |
[54] | A. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Trans. Inf. Theory, 13 (1967), 260–269. https://doi.org/10.1109/TIT.1967.1054010. doi: 10.1109/TIT.1967.1054010 |
[55] | E. Strubell, P. Verga, D. Belanger, A. McCallum, Fast and accurate entity recognition with iterated dilated convolutions, in Proceedings of the Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2017), 2670–2680. https://doi.org/10.18653/v1/D17-1283 |
[56] | Z. Huang, W. Xu, K. Yu, Bidirectional LSTM-CRF models for sequence tagging, arXiv preprint, (2015), arXiv: 1508.01991. https://doi.org/10.48550/arXiv.1508.01991 |
[57] | X. Jin, J. Zhang, J. Kong, T. Su, Y. Bai, A reversible automatic selection normalization (RASN) deep network for predicting in the smart agriculture system, Agronomy, 12 (2022), 591. https://doi.org/10.3390/agronomy12030591 doi: 10.3390/agronomy12030591 |
[58] | B. Gupta, A. Gaurav, P. Panigrahi, V. Arya, Analysis of artificial intelligence-based technologies and approaches on sustainable entrepreneurship, Technol. Forecasting Social Change, 186 (2023), 122152. https://doi.org/10.1016/j.techfore.2022.122152 doi: 10.1016/j.techfore.2022.122152 |