Extractive text summarization model based on advantage actor-critic and graph matrix methodology

Senqi Yang; Xuliang Duan; Xi Wang; Dezhao Tang; Zeyan Xiao; Yan Guo; Senqi Yang; Xuliang Duan; Xi Wang; Dezhao Tang; Zeyan Xiao; Yan Guo

doi:10.3934/mbe.2023067

Mathematical Biosciences and Engineering

2023, Volume 20, Issue 1: 1488-1504. doi: 10.3934/mbe.2023067

Previous Article Next Article

Research article Special Issues

Extractive text summarization model based on advantage actor-critic and graph matrix methodology

1.
College of Information and Engineering, Sichuan Agricultural University, Ya'an, China
2.
The Lab of Agricultural Information Engineering, Sichuan Key Laboratory, Ya'an, China

Received: 09 July 2022 Revised: 22 September 2022 Accepted: 16 October 2022 Published: 31 October 2022

The automatic text summarization task faces great challenges. The main issue in the area is to identify the most informative segments in the input text. Establishing an effective evaluation mechanism has also been identified as a major challenge in the area. Currently, the mainstream solution is to use deep learning for training. However, a serious exposure bias in training prevents them from achieving better results. Therefore, this paper introduces an extractive text summarization model based on a graph matrix and advantage actor-critic (GA2C) method. The articles were pre-processed to generate a graph matrix. Based on the states provided by the graph matrix, the decision-making network made decisions and sent the results to the evaluation network for scoring. The evaluation network got the decision results of the decision-making network and then scored them. The decision-making network modified the probability of the action based on the scores of the evaluation network. Specifically, compared with the baseline reinforcement learning-based extractive summarization (Refresh) model, experimental results on the CNN/Daily Mail dataset showed that the GA2C model led on Rouge-1, Rouge-2 and Rouge-A by 0.70, 9.01 and 2.73, respectively. Moreover, we conducted multiple ablation experiments to verify the GA2C model from different perspectives. Different activation functions and evaluation networks were used in the GA2C model to obtain the best activation function and evaluation network. Two different reward functions (Set fixed reward value for accumulation (ADD), Rouge) and two different similarity matrices (cosine, Jaccard) were combined for the experiments.

Keywords:

Citation: Senqi Yang, Xuliang Duan, Xi Wang, Dezhao Tang, Zeyan Xiao, Yan Guo. Extractive text summarization model based on advantage actor-critic and graph matrix methodology[J]. Mathematical Biosciences and Engineering, 2023, 20(1): 1488-1504. doi: 10.3934/mbe.2023067

Related Papers:

[1]	Xiaoming Su, Jiahui Wang, Adiya Bao . Stability analysis and chaos control in a discrete predator-prey system with Allee effect, fear effect, and refuge. AIMS Mathematics, 2024, 9(5): 13462-13491. doi: 10.3934/math.2024656
[2]	Kottakkaran Sooppy Nisar, G Ranjith Kumar, K Ramesh . The study on the complex nature of a predator-prey model with fractional-order derivatives incorporating refuge and nonlinear prey harvesting. AIMS Mathematics, 2024, 9(5): 13492-13507. doi: 10.3934/math.2024657
[3]	Nehad Ali Shah, Iftikhar Ahmed, Kanayo K. Asogwa, Azhar Ali Zafar, Wajaree Weera, Ali Akgül . Numerical study of a nonlinear fractional chaotic Chua's circuit. AIMS Mathematics, 2023, 8(1): 1636-1655. doi: 10.3934/math.2023083
[4]	A. Q. Khan, Ibraheem M. Alsulami . Complicate dynamical analysis of a discrete predator-prey model with a prey refuge. AIMS Mathematics, 2023, 8(7): 15035-15057. doi: 10.3934/math.2023768
[5]	Xiao-Long Gao, Hao-Lu Zhang, Xiao-Yu Li . Research on pattern dynamics of a class of predator-prey model with interval biological coefficients for capture. AIMS Mathematics, 2024, 9(7): 18506-18527. doi: 10.3934/math.2024901
[6]	Weili Kong, Yuanfu Shao . The effects of fear and delay on a predator-prey model with Crowley-Martin functional response and stage structure for predator. AIMS Mathematics, 2023, 8(12): 29260-29289. doi: 10.3934/math.20231498
[7]	Asharani J. Rangappa, Chandrali Baishya, Reny George, Sina Etemad, Zaher Mundher Yaseen . On the existence, stability and chaos analysis of a novel 4D atmospheric dynamical system in the context of the Caputo fractional derivatives. AIMS Mathematics, 2024, 9(10): 28560-28588. doi: 10.3934/math.20241386
[8]	Yao Shi, Zhenyu Wang . Bifurcation analysis and chaos control of a discrete fractional-order Leslie-Gower model with fear factor. AIMS Mathematics, 2024, 9(11): 30298-30319. doi: 10.3934/math.20241462
[9]	Guilin Tang, Ning Li . Chaotic behavior and controlling chaos in a fast-slow plankton-fish model. AIMS Mathematics, 2024, 9(6): 14376-14404. doi: 10.3934/math.2024699
[10]	Xuyang Cao, Qinglong Wang, Jie Liu . Hopf bifurcation in a predator-prey model under fuzzy parameters involving prey refuge and fear effects. AIMS Mathematics, 2024, 9(9): 23945-23970. doi: 10.3934/math.20241164

Abstract

1. Introduction

Throughout the paper, we work over an algebraically closed field $\Bbbk$ of characteristic zero. Let $C$ be a nonsingular projective curve of genus $g\geq 0$ , and $L$ be a very ample line bundle on $C$ . The complete linear system $|L|$ embeds $C$ into a projective space $\mathbb{P}^r: = \mathbb{P}(H^0(C,L))$ . For an integer $k\geq 0$ , the $k$ -th secant variety

$\Sigma_k = \Sigma_k(C,L) \subseteq \mathbb{P}^r$

of $C$ in $\mathbb{P}^r$ is the Zariski closure of the union of $(k+1)$ -secant $k$ -planes to $C$ .

Assume that $\deg L\geq 2g+2k+1$ . Then the $k$ -th secant variety $\Sigma_k$ can be defined by using the secant sheaf $E_{k+1, L}$ and the secant bundle $B^k(L)$ as follows. Denote by $C_m$ the $m$ -th symmetric product of $C$ . Let

$\sigma_{k+1} \colon C_{k} \times C\longrightarrow C_{k+1}$

be the morphism sending $(\xi,x)$ to $\xi+x$ , and $p \colon C_k \times C \rightarrow C$ the projection to $C$ . The secant sheaf $E_{k+1,L}$ on $C_{k+1}$ associated to $L$ is defined by

$E_{k+1,L}: = \sigma_{k+1,*}p^*L,$

which is a locally free sheaf of rank $k+1$ . Notice that the fiber of $E_{k+1,L}$ over $\xi \in C_{k+1}$ can be identified with $H^0(\xi, L|_{\xi})$ . The secant bundle of $k$ -planes over $C_{k+1}$ is

$B^{k}(L): = \mathbb{P}(E_{k+1,L})$

equipped with the natural projection $\pi_k \colon B^{k}(L) \rightarrow C_{k+1}$ . We say that a line bundle $\mathcal{L}$ on a variety $X$ separates $m+1$ points if the natural restriction map $H^0(X, \mathcal{L}) \to H^0(\xi, \mathcal{L}|_{\xi})$ is surjective for any effective zero-cycle $\xi \subseteq X$ with $\rm{length}(\xi) = m+1$ . Notice that a line bundle $\mathcal{L}$ is globally generated if and only if $\mathcal{L}$ separates $1$ point, and $\mathcal{L}$ is very ample if and only if $\mathcal{L}$ separates $2$ points. Since $\deg L \geq 2g+k$ , it follows from Riemann–Roch that $L$ separates $k+1$ points. Then the tautological bundle $\mathscr{O}_{B^k(L)}(1)$ is globally generated. We have natural identifications

$H^0(B^k(L), \mathscr{O}_{B^k(L)}(1)) = H^0(C_{k+1}, E_{k+1,}) = H^0(C, L),$

and therefore, the complete linear system $| \mathscr{O}_{B^k(L)}(1)|$ induces a morphism

$\beta_k \colon B^k(L)\longrightarrow \mathbb{P}^r = \mathbb{P}(H^0(C,L)).$

The $k$ -th secant variety $\Sigma_k = \Sigma_k(C, L)$ of $C$ in $\mathbb{P}^r$ can be defined to be the image $\beta_k(B^k(L))$ . Bertram proved that $\beta_k \colon B^k(L) \to \Sigma_k$ is a resolution of singularities (see [1,Section 1]).

It is clear that there are natural inclusions

$C = \Sigma_0 \subseteq \Sigma_1\subseteq \cdots \subseteq \Sigma_{k-1} \subseteq \Sigma_k\subseteq \mathbb{P}^r.$

The preimage of $\Sigma_{k-1}$ under the morphism $\beta_k$ is actually a divisor on $B^k(L)$ . Thus there exits a natural morphism from $B^k(L)$ to the blowup of $\Sigma_k$ along $\Sigma_{k-1}$ . Vermeire proved that $B^1(L)$ is indeed the blowup of $\Sigma_1$ along $\Sigma_0 = C$ ([3,Theorem 3.9]). In the recent work [2], we showed that $B^k(L)$ is the normalization of the blowup of $\Sigma_k$ along $\Sigma_{k-1}$ ([2,Proposition 5.13]), and raised the problem asking whether $B^k(L)$ is indeed the blowup itself ([2,Problem 6.1]). The purpose of this paper is to give an affirmative answer to this problem by proving the following:

Theorem 1.1. Let $C$ be a nonsingular projective curve of genus $g$ , and $L$ be a line bundle on $C$ . If $\deg L \geq 2g+2k+1$ for an integer $k \geq 1$ , then the morphism $\beta_k \colon B^k(L) \to \Sigma_k(C, L)$ is the blowup of $\Sigma_k(C,L)$ along $\Sigma_{k-1}(C,L)$ .

To prove the theorem, we utilize several line bundles defined on symmetric products of the curve. Let us recall the definitions here and refer the reader to [2] for further details. Let

$C^{k+1} = \underbrace{C\times \cdots \times C}_{\rm{ k+1 times}}$

be the $(k+1)$ -fold ordinary product of the curve $C$ , and $p_i \colon C^{k+1} \to C$ be the projection to the $i$ -th component. The symmetric group $\mathfrak{{S}}_{k+1}$ acts on $p_1^*L \otimes \cdots\otimes p_{k+1}^*L$ in a natural way: a permutation $\mu\in \mathfrak{{S}}_k$ sends a local section $s_1\otimes \cdots \otimes s_{k+1}$ to $s_{\mu(1)}\otimes \cdots \otimes s_{\mu(k+1)}$ . Then $p_1^*L \otimes \cdots\otimes p_{k+1}^*L$ is invariant under the action of $\mathfrak{{S}}_{k+1}$ , so it descends to a line bundle $T_{k+1}(L)$ on the symmetric product $C_{k+1}$ via the quotient map $q \colon C^{k+1} \to C_{k+1}$ . We have $q^*T_{k+1}(L) = p_1^*L \otimes \cdots\otimes p_{k+1}^*L$ . Define a divisor $\delta_{k+1}$ on $C_{k+1}$ such that the associated line bundle $\mathscr{O}_{C_{k+1}}(\delta_{k+1}) = \det \big(\sigma_{k+1,*}( \mathscr{O}_{C_k\times C})\big)^*$ . Let

$A_{k+1,L} : = T_{k+1}(L)(-2\delta_{k+1})$

be a line bundle on $C_{k+1}$ . When $k = 0$ , we use the convention that $T_1(L) = E_{1,L} = L$ and $\delta_1 = 0$ .

The main ingredient in the proof of Theorem 1.1 is to study the positivity of the line bundle $A_{k+1,L}$ . Some partial results and their geometric consequences have been discussed in [2,Lemma 5.12 and Proposition 5.13]. Along this direction, we establish the following proposition to give a full picture in a general result describing the positivity of the line bundle $A_{k+1,L}$ . This may be of independent interest.

Proposition 1.2. Let $C$ be a nonsingular projective curve of genus $g$ , and $L$ be a line bundle on $C$ . If $\deg L \geq 2g+2k+\ell$ for integers $k, \ell \geq 0$ , then the line bundle $A_{k+1,L}$ on $C_{k+1}$ separates $\ell+1$ points.

In particular, if $\deg L \geq 2g+2k$ , then $A_{k+1, L}$ is globally generated, and if $\deg L \geq 2g+2k+1$ , then $A_{k+1,L}$ is very ample.

2. Proof of the main theorem

In this section, we prove Theorem 1.1. We begin with showing Proposition 1.2.

Proof of Proposition 1.2. We proceed by induction on $k$ and $\ell$ . If $k = 0$ , then $A_{1,L} = L$ and $\deg L\geq 2g+\ell$ . It immediately follows from Riemann–Roch that $L$ separates $\ell+1$ points. If $\ell = 0$ , then $\deg L \geq 2g+2k$ . By [2,Lemma 5.12], $A_{k+1,L}$ separates 1 point.

Assume that $k\geq 1$ and $\ell \geq 1$ . Let $z$ be a length $\ell+1$ zero-dimensional subscheme of $C_{k+1}$ . We aim to show that the natural restriction map

$r_{z, k+1, L} \colon H^0(C_{k+1}, A_{k+1,L}) \longrightarrow H^0(z, A_{k+1, L}|_z)$

is surjective. We can choose a point $p \in C$ such that $X_p$ contains a point in the support of $z$ , where $X_p$ is the divisor on $C_{k+1}$ defined by the image of the morphism $C_k\rightarrow C_{k+1}$ sending $\xi$ to $\xi+p$ . Let $y: = z\cap X_p$ be the scheme-theoretic intersection, and $\mathscr{I}_x: = ( \mathscr{I}_z: \mathscr{I}_{X_p})$ , which defines a subscheme $x$ of $z$ in $C_{k+1}$ , where $\mathscr{I}_z$ and $\mathscr{I}_{X_p}$ are ideal sheaves of $z$ and $X_p$ in $C_{k+1}$ , respectively. We have the following commutative diagram

$r_{z, k+1, L} \colon H^0(C_{k+1}, A_{k+1,L}) \longrightarrow H^0(z, A_{k+1, L}|_z)$

where all rows and columns are short exact sequences. By tensoring with $A_{k+1,L}$ and taking the global sections of last two rows, we obtain the commutative diagram with exact sequences

$r_{z, k+1, L} \colon H^0(C_{k+1}, A_{k+1,L}) \longrightarrow H^0(z, A_{k+1, L}|_z)$

in which we use the fact that $H^1(A_{k+1,L}(-X_p)) = 0$ (see the proof of [2,Lemma 5.12]). Note that $A_{k+1,L}(-X_p) = A_{k+1, L(-p)}$ and $A_{k+1,L}|_{X_p}\cong A_{k,L(-2p)}$ , where we identify $X_p = C_k$ .

Since $\rm{length}(y) \leq \rm{length}(z) = \ell+1$ and $\deg L(-2p) \geq 2g+2(k-1)+\ell$ , the induction hypothesis on $k$ implies that $r_{y,k,L(-2p)}$ is surjective. On the other hand, if $x = \emptyset$ , which means that $z$ is a subscheme of $X_p$ , then trivially $r_{x,k+1,L(-p)}$ is surjective. Otherwise, suppose that $x\neq \emptyset$ . By the choice of $X_p$ , we know that $y$ is not empty, and therefore, we have $\rm{length}(x) \leq \rm{length}(z)-1 = \ell$ . Now, $\deg L(-p) \geq 2g + 2k + (\ell -1)$ , so the induction hypothesis on $\ell$ implies that $L(-p)$ separates $\ell$ points. In particular, $r_{x,k+1,L(-p)}$ is surjective. Hence $r_{z,k+1,L}$ is surjective as desired.

Lemma 2.1. Let $\varphi \colon X \to Y$ be a finite surjective morphism between two varieties. If $\varphi^{-1}(q)$ is scheme theoretically a reduced point for each closed point $q \in Y$ , then $\varphi$ is an isomorphism.

Proof. Note that $\varphi$ is proper, injective, and unramifield. Then it is indeed a classical result that $\varphi$ is an isomorphism. Here we give a short proof for reader's convenience. The problem is local. We may assume that $X = {\rm{Spec}}\; B$ and $Y = {\rm{Spec}}\; A$ for some rings $A,B$ . We may regard $A$ as a subring of $B$ . For any $q \in Y$ , let $p: = \varphi^{-1}(q) \in X$ . It is enough to show that the localizations $A': = A_{\mathfrak{m}_q}$ and $B': = B_{\mathfrak{m}_p}$ are isomorphic. Let $\mathfrak{m}_q', \mathfrak{m}_p'$ be the maximal ideals of the local rings $A',B'$ , respectively. The assumption says that $\mathfrak{m}_q'B' = \mathfrak{m}_p'$ . We have

$B'/A' \otimes_{A'} A'/\mathfrak{m}_q' = B'/(\mathfrak{m}_q'B'+A') = B'/(\mathfrak{m}_p' + A') = 0.$

By Nakayama lemma, we obtain $B'/A' = 0$ .

We keep using the notations used in the introduction. Recall that $C$ is a nonsingular projective curve of genus $g \geq 0$ , and $L$ is a very ample line bundle on $C$ . Consider $\xi_{k} \in C_{k}$ and $x \in C$ , and let $\xi: = \xi_{k} + x \in C_{k+1}$ . The divisor $\xi_{k}$ spans a $k$ -secant $(k-1)$ -plane $\mathbb{P}(H^0(\xi_{k}, L|_{\xi_{k}}))$ to $C$ in $\mathbb{P}(H^0(C,L))$ , and it is naturally embedded in the $(k+1)$ -secant $k$ -plane $\mathbb{P}(H^0(\xi,L|_{\xi}))$ spanned by $\xi$ . This observation naturally induces a morphism

$\alpha_{k,1} \colon B^{k-1}(L)\times C\longrightarrow B^k(L).$

To see it in details, we refer to [1,p.432,line –5]. We define the relative secant variety $Z = Z_{k-1}$ of $(k-1)$ -planes in $B^k(L)$ to be the image of the morphism $\alpha_{k,1}$ . The relative secant variety $Z$ is a divisor in the secant bundle $B^k(L)$ , and it is the preimage of $(k-1)$ -th secant variety $\Sigma_{k-1}$ under the morphism $\beta_k$ . It plays the role of transferring the codimension two situation $(\Sigma_k,\Sigma_{k-1})$ into the codimension one situation $(B^k(L),Z)$ . We collect several properties of $Z$ .

Proposition 2.2. ([2,Proposition 3.15,Theorem 5.2,and Proposition 5.13]) Recall the situation described in the diagram

$\alpha_{k,1} \colon B^{k-1}(L)\times C\longrightarrow B^k(L).$

Let $H$ be the pull back of a hyperplane divisor of $\mathbb{P}^r$ by $\beta_k$ , and let $I_{\Sigma_{k-1}|\Sigma_k}$ be the ideal sheaf on $\Sigma_k$ defining the subvariety $\Sigma_{k-1}$ . Then one has

1. $\mathscr{O}_{B^k(L)}((k+1)H-Z) = \pi^*_kA_{k+1,L}$ .

2. $R^i\beta_{k,*} \mathscr{O}_{B^k(L)}(-Z) = \begin{cases} I_{\Sigma_{k-1}|\Sigma_k}& \mathit{if \; i = 0} \\ 0 & \mathit{if \; i >0}. \end{cases}$

3. $I_{\Sigma_{k-1}|\Sigma_k}\cdot \mathscr{O}_{B^k(L)} = \mathscr{O}_{B^k(L)}(-Z)$ .

As a direct consequence of the above proposition, we have an identification

$H^0(C_{k+1},A_{k+1,L}) = H^0(\Sigma_k,I_{\Sigma_{k-1}|\Sigma_k}(k+1)).$

We are now ready to give the proof of Theorem 1.1.

Proof of Theorem 1.1. Let

$b \colon \widetilde{\Sigma}_k: = {\rm{Bl}}_{\Sigma_{k-1}}\Sigma_{k} \longrightarrow \Sigma_k$

be the blowup of $\Sigma_k$ along $\Sigma_{k-1}$ with exceptional divisor $E$ . As $I_{\Sigma_{k-1}|\Sigma_k}\cdot \mathscr{O}_{B^k(L)} = \mathscr{O}_{B^k(L)}(-Z)$ (see Proposition 2.2), there exists a morphism $\alpha$ from $B^k(L)$ to the blowup $\widetilde{\Sigma}_k$ fitting into the following commutative diagram

$b \colon \widetilde{\Sigma}_k: = {\rm{Bl}}_{\Sigma_{k-1}}\Sigma_{k} \longrightarrow \Sigma_k$

We shall show that $\alpha$ is an isomorphism.

Write $V: = H^0(\Sigma_{k}, I_{\Sigma_{k-1}|\Sigma_k}(k+1))$ . As proved in [2,Theorem 5.2], $I_{\Sigma_{k-1}|\Sigma_k}(k+1)$ is globally generated by $V$ . This particularly implies that on the blowup $\widetilde{\Sigma}_k$ one has a surjective morphism $V\otimes \mathscr{O}_{\widetilde{\Sigma}_k}\rightarrow b^* \mathscr{O}_{\Sigma_{k}}(k+1)(-E)$ , which induces a morphism

$\gamma \colon \widetilde{\Sigma}_k\longrightarrow \mathbb{P}(V).$

On the other hand, one has an identification $V = H^0(C_{k+1}, A_{k+1,L})$ by Proposition 2.2. Recall from Proposition 1.2 that $A_{k+1,L}$ is very ample. So the complete linear system $|V| = |A_{k+1,L}|$ on $C_{k+1}$ induces an embedding

$\psi \colon C_{k+1}\longrightarrow \mathbb{P}(V).$

Also note that $\alpha^*(b^* \mathscr{O}_{\Sigma_{k}}(k+1)(-E)) = \beta_k^* \mathscr{O}_{\Sigma_k}(k+1)(-Z) = \pi_k^*A_{k+1,L}$ by Proposition 2.2. Hence we obtain the following commutative diagram

$\psi \colon C_{k+1}\longrightarrow \mathbb{P}(V).$

Take an arbitrary closed point $x \in \widetilde{\Sigma}_k$ , and consider its image $x': = b(x)$ on $\Sigma_{k}$ . There is a nonnegative integer $m \leq k$ such that $x'\in \Sigma_m \setminus \Sigma_{m-1} \subseteq \Sigma_k$ . In addition, the point $x'$ uniquely determines a degree $m+1$ divisor $\xi_{m+1,x'}$ on $C$ in such a way that $\xi_{m+1,x'} = \Lambda \cap C$ , where $\Lambda$ is a unique $(m+1)$ -secant $m$ -plane to $C$ with $x' \in \Lambda$ (see [2,Definition 3.12]). By [2,Proposition 3.13], $\beta_k^{-1}(x') \cong C_{k-m}$ and $\pi_k(\beta_k^{-1}(x')) = \xi_{m+1,x'} + C_{k-m} \subseteq C_{k+1}$ . Consider also $x'': = \gamma(x)$ which lies in the image of $\psi$ . As $\psi$ is an embedding, we may think $x''$ as a point of $C_{k+1}$ . Now, through forming fiber products, we see scheme-theoretically

$\alpha^{-1}(x) \subseteq \pi^{-1}_k(x'') \cap \beta_k^{-1}(x').$

However, the restriction of the morphism $\pi_k$ on $\beta_k^{-1}(x')$ gives an embedding of $C_{k-m}$ into $C_{k+1}$ . This suggests that $\pi_k^{-1}(x'') \cap \beta_k^{-1}(x')$ is indeed a single reduced point, and so is $\alpha^{-1}(x)$ . Finally by Lemma 2.1, $\alpha$ is an isomorphism as desired.

References

[1]	G. Erkan, D. R. Radev, Lexrank, Graph-based lexical centrality as salience in text summarization, J. Artif. Intell. Res., 22 (2004), 457–479. https://doi.org/10.1613/jair.1523 doi: 10.1613/jair.1523
[2]	D. R. Radev, H. Jing, M. Styś, D. Tam, Centroid-based summarization of multiple documents, Inf. Process. Manage., 40 (2004), 919–938. https://doi.org/10.1016/j.ipm.2003.10.006 doi: 10.1016/j.ipm.2003.10.006
[3]	S. Li, D. Lei, P. Qin, W. Y. Wang, Deep reinforcement learning with distributional semantic rewards for abstractive summarization, preprint, arXiv: 1909.00141. https://doi.org/10.48550/arXiv.1909.00141
[4]	A. See, P. J. Liu, C. D. Manning, Get to the point: summarization with pointer-generator networks, preprint, arXiv: 1704.04368. https://doi.org/10.48550/arXiv.1704.04368
[5]	H. P. Luhn, The automatic creation of literature abstracts, IBM J. Res. Dev., 2 (1958), 159–165. https://doi.org/10.1147/rd.22.0159 doi: 10.1147/rd.22.0159
[6]	D. Radev, T. Allison, S. Blair-Goldensohn, J. Blitzer, Z. Zhang, MEAD—a platform for multidocument multilingual text summarization, in 4th International Conference on Language Resources and Evaluation, (2004), 699–702.
[7]	R. Mihalcea, P. Tarau, E. Figa, PageRank on semantic networks, with application to word sense disambiguation, in COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics, (2004), 1126–1132.
[8]	S. Ma, Z. H. Deng, Y. Yang, An unsupervised multi-document summarization framework based on neural document model, in Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, (2016), 1514–1523.
[9]	J. Cheng, L. Dong, M. Lapata, Long short-term memory-networks for machine reading, preprint, arXiv: 1601.06733. https://doi.org/10.48550/arXiv.1601.06733
[10]	R. Nallapati, F. Zhai, B. Zhou, Summarunner: A recurrent neural network based sequence model for extractive summarization of documents, in Thirty-first AAAI Conference on Artificial Intelligence, 2017.
[11]	A. Jadhav, V. Rajan, Extractive summarization with swap-net: Sentences and words from alternating pointer networks, in Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 1 (2018), 142–151, https://doi.org/10.18653/v1/P18-1014.
[12]	Q. Zhou, N. Yang, F. Wei, S. Huang, M. Zhou, T. Zhao, Neural document summarization by jointly learning to score and select sentences, preprint, arXiv: 1807.02305. https://doi.org/10.48550/arXiv.1807.02305
[13]	D. Wang, P. Liu, Y. Zheng, X. Qiu, X. Huang, Heterogeneous graph neural networks for extractive document summarization, preprint, arXiv: 2004.12393. https://doi.org/10.48550/arXiv.2004.12393
[14]	M. Zhong, P. Liu, Y. Chen, D. Wang, X. Qiu, X. Huang, Extractive summarization as text matching, preprint, arXiv: 2004.08795. https://doi.org/10.48550/arXiv.2004.08795
[15]	Y. Dong, Z. Li, M. Rezagholizadeh, J. C. K. Cheung, EditNTS: An neural programmer-interpreter model for sentence simplification through explicit editing, preprint, arXiv: 1906.08104. https://doi.org/10.48550/arXiv.1906.08104
[16]	M. A. Ranzato, S. Chopra, M. Auli, W. Zaremba, Sequence level training with recurrent neural networks, preprint, arXiv: 1511.06732. https://doi.org/10.48550/arXiv.1511.06732
[17]	D. Bahdanau, P. Brakel, K. Xu, A. Goyal, R. Lowe, J. Pineau, et al., An actor-critic algorithm for sequence prediction, preprint, arXiv: 1607.07086. https://doi.org/10.48550/arXiv.1607.07086
[18]	R. Paulus, C. Xiong, R. Socher, A deep reinforced model for abstractive summarization, preprint, arXiv: 1705.04304. https://doi.org/10.48550/arXiv.1705.04304
[19]	S. Narayan, S. B. Cohen, M. Lapata, Ranking sentences for extractive summarization with reinforcement learning, preprint, arXiv: 1802.08636. https://doi.org/10.48550/arXiv.1802.08636
[20]	Y. Mao, Y. Qu, Y. Xie, X. Ren, J. Han, Multi-document summarization with maximal marginal relevance-guided reinforcement learning, preprint, arXiv: 2010.00117. https://doi.org/10.48550/arXiv.2010.00117
[21]	L. Page, S. Brin, R. Motwani, T. Winograd, The Pagerank Citation Ranking: Bringing Order to the Web, Technical Report, Stanford InfoLab, 1998.
[22]	P. Zhang, X. Huang, Y. Wang, C. Jiang, S. He, H. Wang, Semantic similarity computing model based on multi model fine-grained nonlinear fusion, IEEE Access, 9 (2021), 8433–8443. https://doi.org/10.1109/ACCESS.2021.3049378 doi: 10.1109/ACCESS.2021.3049378
[23]	G. Malik, M. Cevik, D. Parikh, A. Basar, Identifying the requirement conflicts in SRS documents using transformer-based sentence embeddings, preprint, arXiv: 2206.13690. https://doi.org/10.48550/arXiv.2206.13690
[24]	Y. Kim, Convolutional neural networks for sentence classification, in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), (2014), 1746–1751. https://doi.org/10.3115/v1/D14-1181
[25]	Y. Zhang, B. Wallace, A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification, preprint, arXiv: 1510.03820. https://doi.org/10.48550/arXiv.1510.03820
[26]	C. Y. Lin, F. Och, Looking for a few good metrics: ROUGE and its evaluation, in Ntcir workshop, 2004.
[27]	T. Ma, H. Wang, Y. Zhao, Y. Tian, N. Al-Nabhan, Topic-based automatic summarization algorithm for Chinese short text, Math. Biosci. Eng., 17 (2020), 3582–3600. https://doi.org/10.3934/mbe.2020202 doi: 10.3934/mbe.2020202
[28]	T. Zhang, I. C. Irsan, F. Thung, D. Han, D. Lo, L. Jiang, iTiger: An automatic issue title generation tool, preprint, arXiv: 2206.10811. https://doi.org/10.48550/arXiv.2206.10811
[29]	A. Mullick, A. Nandy, M. N. Kapadnis, S. Patnaik, R. Raghav, R. Kar, An evaluation framework for legal document summarization, preprint, arXiv: 2205.08478. https://doi.org/10.48550/arXiv.2205.08478
[30]	S. Li, Y. Yan, J. Ren, Y. Zhou, Y. Zhang, A sample-efficient actor-critic algorithm for recommendation diversification, Chin. J. Electron., 29 (2020), 89–96. https://doi.org/10.1049/cje.2019.10.004 doi: 10.1049/cje.2019.10.004
[31]	Project Webpage, Available from: https://github.com/vietnguyen91/Super-mario-bros-A3C-pytorch.
[32]	N. Xie, S. Li, H. Ren, Q. Zhai, Abstractive summarization improved by wordnet-based extractive sentences, in CCF International Conference on Natural Language Processing and Chinese Computing, Springer, Cham, (2018), 404–415. https://doi.org/10.1007/978-3-319-99495-6_34
[33]	K. Yao, L. Zhang, T. Luo, Y. Wu, Deep reinforcement learning for extractive document summarization, Neurocomputing, 284 (2018), 52–62. https://doi.org/10.1016/j.neucom.2018.01.020 doi: 10.1016/j.neucom.2018.01.020
[34]	J. Tong, Z. Wang, X. Rui, A multi-model-based deep learning framework for short text multiclass classification with the imbalanced and extremely small data set, Comput. Math. Appl., 113 (2022), 34–44. https://doi.org/10.1016/j.camwa.2022.03.005 doi: 10.1016/j.camwa.2022.03.005
[35]	M. Liu, Z. Cai, J. Chen, Adaptive two-layer ReLU neural network: I. Best least-squares approximation, Comput. Math. Appl., 113 (2021), 34–44. https://doi.org/10.1016/j.camwa.2022.03.005 doi: 10.1016/j.camwa.2022.03.005
[36]	A. Maniatopoulos, N. Mitianoudis, Learnable Leaky ReLU (LeLeLU): An alternative accuracy-optimized activation function, Information, 12 (2021), 513. https://doi.org/10.3390/info12120513 doi: 10.3390/info12120513
[37]	B. H. Nayef, S. N. H. S. Abdullah, R. Sulaiman, Z. A. A. Alyasseri, Applications, Optimized leaky ReLU for handwritten Arabic character recognition using convolution neural networks, Multimedia Tools Appl., 81 (2022), 2065–2094. https://doi.org/10.1007/s11042-021-11593-6 doi: 10.1007/s11042-021-11593-6
[38]	S. K. Karn, N. Liu, H. Schuetze, O. J. Farri, Differentiable multi-agent actor-critic for multi-step radiology report summarization, preprint, arXiv: 2203.08257. https://doi.org/10.48550/arXiv.2203.08257
[39]	Y. Guo, D. Z. Tang, W. Tang, S. Q. Yang, Q. C. Tang, Y. Feng, et al., Agricultural price prediction based on combined forecasting model under spatial-temporal influencing factors, Sustainability, 14 (2022). https://doi.org/10.3390/su141710483. doi: 10.3390/su141710483

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)