Research article

Temporal fact extraction of fruit cultivation technologies based on deep learning


  • Received: 16 November 2022 Revised: 09 January 2023 Accepted: 22 January 2023 Published: 10 February 2023
  • There are great differences in fruit planting techniques due to different regional environments. Farmers can't use the same standard in growing fruit. Most of the information about fruit planting comes from the Internet, which is characterized by complexity and heterogeneous multi-source. How to deal with such information to form the convenient facts becomes an urgent problem. Information extraction could automatically extract fruit cultivation facts from unstructured text. Temporal information is especially crucial for fruit cultivation. Extracting temporal facts from the corpus of cultivation technologies for fruit is also vital to several downstream applications in fruit cultivation. However, the framework of ordinary triplets focuses on handling static facts and ignores the temporal information. Therefore, we propose Basic Fact Extraction and Multi-layer CRFs (BFE-MCRFs), an end-to-end neural network model for the joint extraction of temporal facts. BFE-MCRFs describes temporal knowledge using an improved schema that adds the time dimension. Firstly, the basic facts are extracted from the primary model. Then, multiple temporal relations are added between basic facts and time expressions. Finally, the multi-layer Conditional Random Field are used to detect the objects corresponding to the basic facts under the predefined temporal relationships. Experiments conducted on public and self-constructed datasets show that BFE-MCRFs achieves the best current performance and outperforms the baseline models by a significant margin.

    Citation: Xinliang Liu, Lei Ma, Tingyu Mao, Yanzhao Ren. Temporal fact extraction of fruit cultivation technologies based on deep learning[J]. Mathematical Biosciences and Engineering, 2023, 20(4): 7217-7233. doi: 10.3934/mbe.2023312

    Related Papers:

  • There are great differences in fruit planting techniques due to different regional environments. Farmers can't use the same standard in growing fruit. Most of the information about fruit planting comes from the Internet, which is characterized by complexity and heterogeneous multi-source. How to deal with such information to form the convenient facts becomes an urgent problem. Information extraction could automatically extract fruit cultivation facts from unstructured text. Temporal information is especially crucial for fruit cultivation. Extracting temporal facts from the corpus of cultivation technologies for fruit is also vital to several downstream applications in fruit cultivation. However, the framework of ordinary triplets focuses on handling static facts and ignores the temporal information. Therefore, we propose Basic Fact Extraction and Multi-layer CRFs (BFE-MCRFs), an end-to-end neural network model for the joint extraction of temporal facts. BFE-MCRFs describes temporal knowledge using an improved schema that adds the time dimension. Firstly, the basic facts are extracted from the primary model. Then, multiple temporal relations are added between basic facts and time expressions. Finally, the multi-layer Conditional Random Field are used to detect the objects corresponding to the basic facts under the predefined temporal relationships. Experiments conducted on public and self-constructed datasets show that BFE-MCRFs achieves the best current performance and outperforms the baseline models by a significant margin.



    加载中


    [1] J. Yan, C. Wang, W. Cheng, M. Gao, A. Zhou, A retrospective of knowledge graphs, Front. Comput. Sci., 12 (2018), 55–74. https://doi.org/10.1007/s11704-016-5228-9. doi: 10.1007/s11704-016-5228-9
    [2] T. Mitchell, W. Cohen, E. Hruschka, P. Talukdar, B. Yang, J. Betteridge, et al., Never-ending learning, Commun. ACM. 61 (2018), 103–115. https://doi.org/10.1145/3191513
    [3] W. Wu, H. Li, H. Wang, K. Q. Zhu, Probase: a probabilistic taxonomy for text understanding, in Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, 2012,481–492. https://doi.org/10.1145/2213836.2213891
    [4] T. Rebele, F. Suchanek, J. Hoffart, J. Biega, E. Kuzey, G. Weikum, YAGO: A multilingual knowledge base from wikipedia, wordnet, and geonames, in The Semantic Web – ISWC 2016, (2016), 177–185. https://doi.org/10.1007/978-3-319-46547-0_19
    [5] I. Mani, Recent developments in temporal information extraction, in Recent Advances in Natural Language Processing III, 2003. https://doi.org/10.1075/cilt.260.06man
    [6] C. Lim, Y. Jeong, H. Choi, Survey of temporal information extraction, J. Inf. Process. Sys., 15 (2019), 931–956.
    [7] Y. Cao, W. Groves, T. K. Saha, J. Tetreault, A. Jaimes, H. Peng, et al., XLTime: A cross-lingual knowledge transfer framework for temporal expression extraction, in Findings of the Association for Computational Linguistics: NAACL 2022, (2022), 1931–1942. http://doi.org/10.18653/v1/2022.findings-naacl.148
    [8] X. Ling, D. S. Weld, Temporal information extraction, in Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, (2010), 1385–1390.
    [9] H. Li, J. Strötgen, J. Zell, M. Gertz, Chinese temporal tagging with heidelTime, in Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers, (2014), 133–137. http://doi.org/10.3115/v1/E14-4026
    [10] W. Li, K. Wong, C. Yuan, Toward automatic Chinese temporal information extraction, J. Am. Soc. Inf. Sci. Technol., 52 (2001), 748–762. https://doi.org/10.1002/asi.1126 doi: 10.1002/asi.1126
    [11] H. Tanev, J. Piskorski, M. Atkinson, Real-time news event extraction for global crisis monitoring, in Natural Language and Information Systems, 207–218, https://doi.org/10.1007/978-3-540-69858-6_21
    [12] J. Strötgen, M. Gertz, P. Popov, Extraction and exploration of spatio-temporal information in documents, in Proceedings of the 6th Workshop on Geographic Information Retrieval, (2010), 1–8. https://doi.org/10.1145/1722080.1722101
    [13] N. Kannen, U. Sharma, S. Neelam, D. Khandelwal, S. Ikbal, H. Karanam, et al., Targeted extraction of temporal facts from textual resources for improved temporal question answering over knowledge bases, preprint, arXiv: 2203.11054.
    [14] I. Mani, B. Schiffman, J. Zhang, Inferring temporal ordering of events in news, in Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003, 2 (2003), 55–57. https://doi.org/10.3115/1073483.1073502
    [15] J. Pustejovsky, P. Hanks, R. Saurí, A. See, R. Gaizauskas, A. Setzer, et al., The TIMEBANK Corpus, Nat. Lang. Process. Inf. Syst., 4592 (2002), 647–656,
    [16] Y. Wang, M. Zhu, L. Qu, M. Spaniol, G. Weikum, Timely YAGO: harvesting, querying, and visualizing temporal knowledge from Wikipedia, in Proceedings of the 13th International Conference on Extending Database Technology, (2010), 697–700, https://doi.org/10.1145/1739041.1739130
    [17] D. Vrandečić, M. Krötzsch, Wikidata: A free collaborative knowledgebase, Commun. ACM, 57 (2014), 78–85, https://dl.acm.org/doi/10.1145/2629489
    [18] J. Hoffart, F. M. Suchanek, K. Berberich, G. Weikum, YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia, Artif. Intell., 194 (2013), 28–61. https://doi.org/10.1016/j.artint.2012.06.001 doi: 10.1016/j.artint.2012.06.001
    [19] E. Kuzey, G. Weikum, Extraction of temporal facts and events from Wikipedia, in Proceedings of the 2nd Temporal Web Analytics Workshop, (2012), 25–32. https://doi.org/10.1145/2169095.2169101
    [20] Y. Liu, W. Hua, X. Zhou, Temporal knowledge extraction from large-scale text corpus, World Wide Web, 24 (2021), 135–156, https://doi.org/10.1007/s11280-020-00836-5 doi: 10.1007/s11280-020-00836-5
    [21] B. Tang, Y. Wu, M. Jiang, Y. Chen, J. C. Denny, H. Xu, A hybrid system for temporal information extraction from clinical text, J. Am. Med. Inform. Assoc., 20 (2013), 828–835. https://doi.org/10.1136/amiajnl-2013-001635 doi: 10.1136/amiajnl-2013-001635
    [22] G. Moharasan, T. B. Ho, Extraction of temporal information from clinical narratives, J. Healthc. Inform. Res., 3 (2019), 220–244. https://doi.org/10.1007/s41666-019-00049-0 doi: 10.1007/s41666-019-00049-0
    [23] R. Han, Q. Ning, N. Peng, Joint event and temporal relation extraction with shared representations and structured prediction, in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), (2019), 434–444. http://doi.org/10.18653/v1/D19-1041
    [24] R. Han, Y. Zhou, N. Peng, Domain knowledge empowered structured neural net for end-to-end event temporal relation extraction, in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2020), 5717–5729. http://doi.org/10.18653/v1/2020.emnlp-main.461
    [25] C. Lin, T. Miller, D. Dligach, S. Bethard, G. Savova, Representations of time expressions for temporal relation extraction with convolutional neural networks, in BioNLP 2017, (2017), 322–327. http://doi.org/10.18653/v1/W17-2341
    [26] P. Cao, X. Zuo, Y. Chen, K. Liu, J. Zhao, W. Bi, Uncertainty-aware self-training for semi-supervised event temporal relation extraction, in Proceedings of the 30th ACM International Conference on Information & Knowledge Management, (2021), 2900–2904. https://doi.org/10.1145/3459637.3482207
    [27] H. Wen, H. Ji, Utilizing relative event time to enhance event-event temporal relation extraction, in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, (2021), 10431–10437. http://doi.org/10.18653/v1/2021.emnlp-main.815
    [28] K. Ma, Extraction of temporal information from social media messages using the BERT model, Earth. Sci. Inform., 15 (2022), 573–584. https://doi.org/10.1007/s12145-021-00756-6 doi: 10.1007/s12145-021-00756-6
    [29] A. Uzun, A. C. Tantuğ, ITUTime: Turkish temporal expression extraction and normalization, in Distributed Computing and Artificial Intelligence, 2 (2021), 74–85. https://doi.org/10.1007/978-3-030-86887-1_7
    [30] J. Devlin, M. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (2019), 4171–4186. http://doi.org/10.18653/v1/N19-1423
    [31] D. Kingma, J. Ba, Adam: A method for stochastic optimization, preprint, arXiv: 1412.6980.
    [32] T. Fu, P. Li, W. Ma, GraphRel: Modeling text as relational graphs for joint entity and relation extraction, in Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, (2019), 1409–1418. http://doi.org/10.18653/v1/P19-1136
    [33] L. Mingyi, T. Zhiying, Z. Tong, S. Tonghua, X. Xiaofei, Z. Wang, Ltp: A new active learning strategy for crf-based named entity recognition, Neural Process. Lett., 54 (2022), 2433–2454. https://doi.org/10.1007/s11063-021-10737-x doi: 10.1007/s11063-021-10737-x
    [34] N. Deng, F. Hao, C. Xu, Named entity recognition of traditional chinese medicine patents based on bilstm-crf, Wireless Commun. Mobile Comput., 2021 (2021), 6696205. https://doi.org/10.1155/2021/6696205 doi: 10.1155/2021/6696205
  • Reader Comments
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1509) PDF downloads(58) Cited by(0)

Article outline

Figures and Tables

Figures(5)  /  Tables(5)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog