Loading [MathJax]/jax/output/SVG/jax.js
Research article

Feature group tabular transformer: a novel approach to traffic crash modeling and causality analysis


  • Reliable and interpretable traffic crash modeling is essential for understanding causality and improving road safety. This study introduced a novel approach to predicting crash types by utilizing a comprehensive dataset fused from multiple sources, including weather data, crash reports, high-resolution traffic information, pavement geometry, and facility characteristics. An essential part of our proposed approach was a feature group tabular transformer (FGTT) model, which organizes disparate data into meaningful feature groups, represented as tokens. These group-based tokens serve as rich semantic components, enabling effective identification of collision patterns and interpretation of causal mechanisms. The FGTT model was compared with widely used tree ensemble models, including random forest, XGBoost, and CatBoost, demonstrating better predictive performance. Furthermore, the attention heatmaps from the FGTT model revealed key influential factor interactions, providing fresh insights into the underlying causality of distinct crash types.

    Citation: Oscar Lares, Hao Zhen, Jidong J. Yang. Feature group tabular transformer: a novel approach to traffic crash modeling and causality analysis[J]. Applied Computing and Intelligence, 2025, 5(1): 29-56. doi: 10.3934/aci.2025003

    Related Papers:

    [1] Yang Pan, Jinhua Yang, Lei Zhu, Lina Yao, Bo Zhang . Aerial images object detection method based on cross-scale multi-feature fusion. Mathematical Biosciences and Engineering, 2023, 20(9): 16148-16168. doi: 10.3934/mbe.2023721
    [2] Vishal Pandey, Khushboo Anand, Anmol Kalra, Anmol Gupta, Partha Pratim Roy, Byung-Gyu Kim . Enhancing object detection in aerial images. Mathematical Biosciences and Engineering, 2022, 19(8): 7920-7932. doi: 10.3934/mbe.2022370
    [3] Auwalu Saleh Mubarak, Zubaida Said Ameen, Fadi Al-Turjman . Effect of Gaussian filtered images on Mask RCNN in detection and segmentation of potholes in smart cities. Mathematical Biosciences and Engineering, 2023, 20(1): 283-295. doi: 10.3934/mbe.2023013
    [4] Xin Shu, Xin Cheng, Shubin Xu, Yunfang Chen, Tinghuai Ma, Wei Zhang . How to construct low-altitude aerial image datasets for deep learning. Mathematical Biosciences and Engineering, 2021, 18(2): 986-999. doi: 10.3934/mbe.2021053
    [5] Qihan Feng, Xinzheng Xu, Zhixiao Wang . Deep learning-based small object detection: A survey. Mathematical Biosciences and Engineering, 2023, 20(4): 6551-6590. doi: 10.3934/mbe.2023282
    [6] Yantao Song, Wenjie Zhang, Yue Zhang . A novel lightweight deep learning approach for simultaneous optic cup and optic disc segmentation in glaucoma detection. Mathematical Biosciences and Engineering, 2024, 21(4): 5092-5117. doi: 10.3934/mbe.2024225
    [7] Hongxia Ni, Minzhen Wang, Liying Zhao . An improved Faster R-CNN for defect recognition of key components of transmission line. Mathematical Biosciences and Engineering, 2021, 18(4): 4679-4695. doi: 10.3934/mbe.2021237
    [8] Zhigao Zeng, Cheng Huang, Wenqiu Zhu, Zhiqiang Wen, Xinpan Yuan . Flower image classification based on an improved lightweight neural network with multi-scale feature fusion and attention mechanism. Mathematical Biosciences and Engineering, 2023, 20(8): 13900-13920. doi: 10.3934/mbe.2023619
    [9] Zhiqin Zhu, Shaowen Wang, Shuangshuang Gu, Yuanyuan Li, Jiahan Li, Linhong Shuai, Guanqiu Qi . Driver distraction detection based on lightweight networks and tiny object detection. Mathematical Biosciences and Engineering, 2023, 20(10): 18248-18266. doi: 10.3934/mbe.2023811
    [10] Xiaotang Liu, Zheng Xing, Huanai Liu, Hongxing Peng, Huiming Xu, Jingqi Yuan, Zhiyu Gou . Combination of UAV and Raspberry Pi 4B: Airspace detection of red imported fire ant nests using an improved YOLOv4 model. Mathematical Biosciences and Engineering, 2022, 19(12): 13582-13606. doi: 10.3934/mbe.2022634
  • Reliable and interpretable traffic crash modeling is essential for understanding causality and improving road safety. This study introduced a novel approach to predicting crash types by utilizing a comprehensive dataset fused from multiple sources, including weather data, crash reports, high-resolution traffic information, pavement geometry, and facility characteristics. An essential part of our proposed approach was a feature group tabular transformer (FGTT) model, which organizes disparate data into meaningful feature groups, represented as tokens. These group-based tokens serve as rich semantic components, enabling effective identification of collision patterns and interpretation of causal mechanisms. The FGTT model was compared with widely used tree ensemble models, including random forest, XGBoost, and CatBoost, demonstrating better predictive performance. Furthermore, the attention heatmaps from the FGTT model revealed key influential factor interactions, providing fresh insights into the underlying causality of distinct crash types.



    Every graph taken into consideration here is a simple graph Γ=(V,E), where |V|=ν and |E|=m. The count of edges connected to a vertex vV, denoted by dΓ(v) is the degree of that vertex. The degree of an edge e=uv is defined by dΓ(e)=dΓ(u)+dΓ(v)2. The parameters Δ (resp. δ) and Δ(δ) represent the maximum (resp. minimum) vertex and edge degree of a graph, respectively. If ΔΓ=δΓ=ν1 then the graph Γ is said to be a complete graph denoted by Kν. The graph Γ is called bipartite if V(Γ) is partitioned into two sets say, M and N (partite sets), such that every edge in Γ has one endpoint in M and the other in N. If every vertex of M is adjacent to every vertex of N with |M|=r and |N|=s, then the graph is called the complete bipartite graph, denoted as Kr,s. The graph K1,ν1 denoted by Sν is a star graph, and the graph Kr,r is called the equi-bipartite graph. The graph ¯Γ represents the complement of a graph Γ, which is defined on the same vertex set as Γ such that if two vertices are adjacent in ¯Γ, then they are not adjacent in Γ. For more graph-theoretic terminologies, we refer to the book by Harary [19]. For any real number x, the floor function is the greatest integer less than or equal to x, denoted as x.

    The first degree-based molecular descriptor, the Zagreb index, was developed by Gutman and Trinajstić [9]. It first emerged in the topological formula for conjugated molecules regarding their total π-electron energy. The first Zagreb index is defined as:

    M1(Γ)=vV(Γ)dΓ(v)2=e=uvE(Γ)(dΓ(u)+dΓ(v)). (1.1)

    The Zagreb indices were reformulated in 2004 by Miliˊceviˊc et al. [28] in terms of edge degree, where the edge degree is given by dΓ(e)=dΓ(u)+dΓ(v)2 for e=uvE(Γ). Thus, the first reformulated Zagreb index is given by

    EM1(Γ)=eE(Γ)dΓ(e)2. (1.2)

    The reader is referred to [9] regarding applications of Zagreb indices.

    The sum of the absolute values of eigenvalues of the adjacency matrix of Γ gives us the energy E(Γ) of a graph Γ. This quantity is introduced in [10]. Suppose λ1λ2λν are the eigenvalues of the adjacency matrix A(Γ), then the energy of the graph Γ is given by

    E(Γ)=νi=1|λi|. (1.3)

    Other graph energies were also introduced and studied. Indulal et al. [20] presented some results on the distance energy of graphs. Gutman and Zhou [8] investigated the Laplcaian energy of graphs and derived some extremal results from it.

    The extended adjacency matrix of graph Γ was proposed by Yang et al. [36] in 1994, denoted by Aex(Γ). Its (i,j)-entry is defined to be equal to 12(djdi+didj) if vivj and 0 otherwise. Since Aex is a real symmetric matrix of order ν, all its eigenvalues are real, which are denoted as η1η2ην. Yang et al. [36] also investigated the extended graph energy by summing the absolute values of the eigenvalues of the Aex-matrix, defined as

    Eex(Γ)=νi=1|ηi|. (1.4)

    For recent studies on graph energy, we refer to [5,6,23,24,33,35].

    For a given graph Γ, we define the following terminologies:

    Definition 1.1. Let V={v1,v2,,vν} be the vertex set and {w1,w2,,wν} be the vertex-weights, then the vertex weight for viV is defined as w(vi)=dΓ(vi). The range for a vertex-weight of viV is 0w(vi)ΔΓ (here w(vi)=0 if Γ is disconnected).

    Definition 1.2. Let EΓ={e1,e2,,em} be the edge set, then the edge-weight of ei=uivi is defined by w(ei)=dΓ(ui)+dΓ(vi)2. The range for an edge-weight of eE is 0w(e)M1(Γ)2m (where w(e)=0 if and only if Γ=K2.

    Definition 1.3. The weighted degree of a vertex vV(Γ) is defined as

    dwΓ(v)=e=uvw(e).

    It is very clear that w(e)=dΓ(e) for every eE.

    Observe that

    νi=1dwΓ(vi)=2mi=1w(ei)=2uvE[dΓ(u)+dΓ(v)2]=2[M12m].

    Motivated by the extended adjacency matrix of graph Γ, we introduce a new edge-weighted adjacency matrix of graph Γ denoted by Aw(Γ). It is defined in such a way that, for any vi that is adjacent to vj, its (i,j)-entry equals dΓ(vi)+dΓ(vj)2; otherwise, it equals to 0. In fact, Aw(Γ) is a real symmetric matrix of order ν. Hence, all its eigenvalues are real and can be arranged as λw1λw2λwν, where the largest eigenvalue λw1 is called the spectral radius of Aw(Γ). The edge-weighted graph energy of Γ is given by

    Ew=Ew(Γ)=νi=1|λwi|. (1.5)

    Following the types of adjacencies in [34], the type of adjacency employed by the edge-weighted matrix is the vertex-based adjacency.

    This subsection presents some basic properties of edge-weighted eigenvalues of graphs.

    (1) νi=1λwi=0.

    (2) νi=1(λwi)2=2mi=1[w(ei)]2=2mi=1dΓ(ei)2=2EM1(Γ).

    (3) 0ijλwiλwj=mi=1[w(ei)]2=mi=1dΓ(ei)2=EM1(Γ).

    Moreover, observe that,

    (1) νi=1.(λwi)2=2EM1(Γ).

    (2) 0ijλwiλwj=EM1(Γ).

    The following example delivers the edge-weighted energy of standard graph families.

    Example 1.4. (1) The edge-weighted energy Ew(Kν) of Kν is 4(ν1)(ν2).

    (2) The edge-weighted energy Ew(Kr,s) of Kr,s is 2(r+s2)rs.

    (3) The edge-weighted energy Ew(Sν) of Sν is 2(ν2)(ν1).

    (4) The edge-weighted energy Ew(Kr,r) of Kr,r is 4r(r1).

    Proof. The edge-weighted energy Ew(Kr,s) of Kr,s is 2(r+s2)rs. Replacing s by r in Ew(Kr,s) of Kr,s, we get Ew(Kr,r) of Kr,r as 2(r+r2)r2=4r(r1).

    In subsequent sections, we need the following already established results:

    Lemma 2.1. [35] Let C=(cij) and D=(dij) be real symmetric non-negative matrices of order ν. If CD, i.e., cijdij for all i,j, then λ1(C)λ1(D), whereas λ1 is the largest eigenvalue.

    Lemma 2.2. [35] Let Γ be a connected graph of order ν with m edges. Then

    λ1(Γ)2mν+1 (2.1)

    with equality if and only if ΓK1,ν1 or ΓKν.

    Here we deliver the well-known Cauchy–Schwartz inequality.

    Lemma 2.3. (Cauchy–Schwartz inequality)[3] Let ri and si, 1iν be any real numbers, then

    (νi=1risi)2(νi=1r2i)(νi=1s2i). (2.2)

    The Ozeki inequality is frequently used in the spectral analysis of graphs.

    Lemma 2.4. (Ozeki inequality)[30] If ri and si (1iν) are non-negative real numbers, then

    νi=1r2iνi=1s2i[νi=1risi]2ν24(P1P2p1p2)2 (2.3)

    where P1=max1iν{ri}, P2=max1iν{si}, p1=min1iν{ri}, p2=min1iν{si}.

    The following inequality has been retrieved from Dragomir [7].

    Lemma 2.5. [7] Let pi, qi, ri and si be sequences of real numbers, and mi, νi are non-negative for i=1,2,,ν. Then the following inequality is valid:

    νi=1mip2iνi=1νiq2i+νi=1mir2iνi=1mis2i2νi=1mipiriνi=1νiqisi. (2.4)

    Jog and Gurjar [23] used the following inequality while studying bounds on the distance energy of graphs:

    Lemma 2.6. Let ri, 1iν be any real numbers, then

    (νi=1|ri|)2(νi=1|ri|2). (2.5)

    The following inequality was shown by Pólya and Szegó in their book [31].

    Lemma 2.7. [31] Suppose ri and si, 1iν are positive real numbers, then

    νi=1r2iνi=1s2i14(P1P2p1p2+p1p2P1P2)2(νi=1risi)2 (2.6)

    where P1=max1iν{ri}, P2=max1iν{si}, p1=min1iν{ri}, p2=min1iν{si}.

    The following classical inequality was proven by Biernacki et al. [2].

    Lemma 2.8. [2] Suppose ri and si, 1iν are positive real numbers, then

    |ννi=1risiνi=1riνi=1si|α(ν)(Rr)(Ss) (2.7)

    where r, s, R, and S are real constants such that for each i, 1iν, rriR, and ssiS. Further, α(ν)=νν2(11νν2).

    Diaz and Metcalf [4] delivered a proof of the following inequality:

    Lemma 2.9. [4] Let ri and si, 1iν are nonnegative real numbers, then

    νi=1s2i+pPνi=1r2i(p+P)(νi=1risi) (2.8)

    where p and P are real constants, so that for each i, 1iν, holds, prisiPri.

    Next, we prove the following inequality on the edge-connected eigenvalues:

    Lemma 2.10. Let Γ be a connected graph of order ν2. Then λw1>λw2.

    Proof. Let us assume, for the sake of contradiction, that λw1=λw2. Since Γ is connected, Aw(Γ) is an irreducible non-negative ν×ν matrix. By the Perron–Frobenius theorem, the eigenvector x corresponding to λw1 has all components positive. Let y be an eigenvector corresponding to λw2. Since λw1=λw2, any linear combination of x and y would be an eigenvector corresponding to λw1. This implies that it would be possible to construct an eigenvector with some zero components, which contradicts the fact that all components of x are positive. Hence, we must have λw1>λw2.

    Next, we deliver a characterization for an ν-vertex satisfying |λw1|=|λw2|==|λwν|.

    Proposition 2.11. Let Γ be a graph of order ν. Then |λw1|=|λw2|==|λwν| if and only if Γ¯Kν or Γν2K2.

    Proof. First, assume that |λw1|=|λw2|==|λwν|. Let S be the number of isolated vertices in Γ. If S1, then λw1=λw2==λwν=0, hence Γ¯Kν or Γν2K2. Otherwise, if the maximum degree Δ2, then Γ contains a connected component H with at least 3 vertices. If H=Kν, ν3, then by Lemma 2.10, |λw1|=2(ν1)(ν2) and |λw2|=2(ν2), clearly |λw1|>|λw2|, a contradiction. Otherwise, if H is not a complete graph, then by Lemma 2.10, λw1>λw2, a contradiction.

    Conversely, one can easily check that |λw1|=|λw2|==|λwν| holds for ¯Kν and ν2K2.

    This section delivers various upper/lower extremal values for Ew(Γ) and λw1 for an (m,ν)-graph with ν-vertices and m-edges.

    A sharp upper bound on λw1 (i.e., edge-weighted spectral radius) is being proven in the following result.

    Theorem 3.1. Let Γ be an (m,ν)-graph possessing the maximum degree Δ. Then

    λw12(Δ1)1ν+2m (3.1)

    with λw1=2(Δ1)1ν+2m ΓKν, ν2.

    Proof. Since Aw(Γ)2(Δ1)A(Γ) and λw1 be its spectral radius. Employing Lemma 2.1, one has

    λw12λ1(Δ1).

    By Lemma 2.2, one obtains

    λw12(Δ1)1ν+2m.

    Also, λw1=2(Δ1)1ν+2m in (3.1) ΓKν, ν2.

    The next theorem delivers an upper extremal value considering the first–formulated Zagreb EM1 index of Γ.

    Theorem 3.2. If Γ is an ν-vertex graph having λw1 as its spectral radius (the largest eigenvalue), then

    λw12(1+ν)EM1(Γ)ν. (3.2)

    Proof. Since νk=1λwk=0 it can be rewritten as νk=2λwk=λw1. Further, (νk=1(λwk)2)=2EM1(Γ), (νk=2(λwk)2)=(νk=1(λwk)2(λw1)2)=(2EM1(Γ)(λw1)2) and (νk=21)=(ν1).

    Put rk=1 and sk=λwk in Lemma 2.3 and we obtain

    (νk=2(λwk))2(ν1)νk=2(λwk)2(λw1)2(1+ν)(2EM1(Γ)(λw1)2)(λw1)2(1+ν)2EM1(Γ)(ν1)(λw1)2(λw1)2(1+ν)2EM1(Γ)ν(λw1)2+(λw1)2ν(λw1)2(1+ν)2EM1(Γ)λw12(1+ν)EM1(Γ)ν.

    Hence, we have furnished the proof.

    Some lower and upper extremal values on Ew i.e., edge-weighted energy of Γ.

    Theorem 3.3. For a connected ν-vertex graph Γ, Ew satisfies

    2EM1(Γ)Ew(Γ)2νEM1(Γ). (3.3)

    Proof. For the upper bound, consider Lemma 2.3, i.e.,

    (νk=1rksk)2(νk=1r2k)(νk=1s2k)

    put rk=1 and sk=|λwk|2 in Lemma 2.3, we obtain

    (νk=1|λwk|)2(νk=112)(νk=1(|λwk|)2)

    since (νk=1|λwk|)=Ew(Γ), (νk=112)=1 and (νk=1(|λwk|)2)=2EM1(Γ), we have

    (Ew(Γ))2ν2EM1(Γ)Ew(Γ)2νEM1(Γ).

    Similarly, for a lower bound, consider Lemma 2.6, i.e.,

    (νk=1|rk|)2(νk=1|rk|2)

    put rk=|λwk| in Lemma 2.6, we obtan

    (Ew(Γ))22EM1(Γ)Ew(Γ)2EM1(Γ).

    Hence, by combining the upper bound and the lower bound, we obtain the required result.

    In terms of the EM1 index, our next result delivers another lower extremal value on Ew.

    Theorem 3.4. Any (m,ν)-graph Γ satisfies

    Ew(Γ)2νEM1(Γ)ν24(|λw1||λwν|)2 (3.4)

    where |λwν| (resp. |λw1|) are minimum (resp. maximum) values of |λwk|.

    Proof. Let |λw1||λw2||λwν| be the eigenvalues of Aw(Γ). By putting rk=1, sk=|λwk| P1=1, P2=|λw1|, p1=1 and p2=|λwν| in Lemma 2.4, one gets

    νk=1(1)2νk=1(|λwk|2)[νk=1|λwk|]2ν24(|λw1||λwν|)2

    since (νk=1(λwk)2)=2EM1(Γ) we have

    2νEM1(Γ)(Ew(Γ))2ν24(|λw1||λwν|)22νEM1(Γ)ν24(|λw1||λwν|)2(Ew(Γ))2Ew(Γ)2νEM1(Γ)ν24(|λw1||λwν|)2.

    Hence, the proof has been furnished.

    The following theorem further refines the bound in Theorem 3.4.

    Theorem 3.5. For an (ν,m)-graph Γ, assume λw1λw2λwν are eigenvalues of Aw(Γ). This implies that,

    Ew(Γ)2νEM1(Γ)α(ν)(|λw1||λwν|)2 (3.5)

    where α(ν)=νν2(11νν2).

    Proof. Let |λw1||λw2||λwν| be the eigenvalues of Aw(Γ). By putting rk=|λwk|=sk, R=|λw1|=S, and r=|λwν|=s in Lemma 2.8, one obtains

    |ννk=1|λwk|2(νk=1|λwk|)2|α(ν)(|λw1||λwν|)2|2νEM1(Γ)(Ew(Γ))2|α(ν)(|λw1||λwν|)22νEM1(Γ)α(ν)(|λw1||λwν|)2(Ew(Γ))2Ew(Γ)2νEM1(Γ)α(ν)(|λw1||λwν|)2.

    Hence the proof has been furnished.

    For a non-zero eigenvalue of a graph, the following theorem delivers yet another lower bound on the edge-weighted energy Ew of graphs.

    Theorem 3.6. If the eigenvalues of Aw(Γ) are non-zero, then

    Ew(Γ)2|λw1||λwν|2EM1(Γ)|λw1|+|λwν|, (3.6)

    where |λw1| and |λwν| are the maximum and minimum of |λwk|.

    Proof. Let |λw1||λw2||λwν| be the eigenvalues of Aw(Γ). By putting rk=|λk| and sk=1 in Lemma 2.7, we obtain

    νk=1|λwk|2νk=11214(|λw1||λwν|+|λwν||λw1|)2(νk=1|λwk|)22νEM1(Γ)14((|λw1|+|λw2|)2|λw1||λwν|)(Ew(Γ))22νEM1(Γ)4|λw1||λwν|(|λw1|+|λwν|)2(Ew(Γ))2Ew(Γ)2|λw1||λwν|2νEM1(Γ)|λw1|+|λwν|.

    Hence the proof is completed.

    The following two results deliver lower bounds on Ew in terms of EM1, the smallest and largest eigenvalues of graphs.

    Theorem 3.7. Assuming λw1λw2λwν to be the eigenvalues of Aw(Γ), where Γ is an (ν,m)-graph. Then

    Ew(Γ)ν+2EM1(Γ)(λwνλw1)λwν+λw1. (3.7)

    Proof. Let λw1λw2λwν be the eigenvalues of Aw(Γ). By putting rk=|λwk|, sk=1, p=λwν and P=λw1, in Lemma 2.9, we obtain

    νk=112+λwνλw1νk=1|λwk|2(λwν+λw1)(νk=1|λwk|)ν+2EM1(Γ)(λwνλw1)(λwν+λw1)Ew(Γ)Ew(Γ)ν+2EM1(Γ)(λwνλw1)λwν+λw1.

    Hence, the proof has been completed.

    Theorem 3.8. Assuming λw1λw2λwν to be the eigenvalues of Aw(Γ), where Γ is an (ν,m)-graph, then

    Ew(Γ)2EM1(Γ)+νλw1λwνλw1+λwν. (3.8)

    Proof. Let λw1λw2λwν be the eigenvalues of Aw(Γ). By putting rk=|λwk|, sk=1, p=λwν and P=λw1, in Lemma 2.9, we obtain

    νk=1|λwk|2+λwνλw1νk=112(λwν+λw1)(νk=1|λwk|)2EM1(Γ)+ν(λwνλw1)(λwν+λw1)Ew(Γ)Ew(Γ)2EM1(Γ)+νλw1λwνλw1+λwν.

    Hence the proof has been furnished.

    Next, a sharp upper extremal value on Ew is proven.

    Theorem 3.9. If Γ is a non-empty graph of order ν. Then

    Ew(Γ)2(EM1(Γ))2+ν22. (3.9)

    Proof. Let λw1λw2λwν be the eigenvalues of Aw(Γ). Substituting pk=|λwk|=qk and rk=sk=mk=νk=1 in Lemma 2.5, we obtain

    νk=11|λwk|2νk=11|λwk|2+νk=1112νk=11122νk=11|λwk|1νk=11|λwk|12EM1(Γ)2EM1(Γ)+νν2(Ew(Γ))24(EM1(Γ))2+ν22(Ew(Γ))24(EM1(Γ))2+ν22Ew(Γ)Ew(Γ)2(EM1(Γ))2+ν22.

    Until now, many researchers have studied the predictive potential of molecular descriptors (mainly degree-, distance-, and eigenvalue-based) for estimating the π-electron energy (Eπ) of benzenoid hydrocarbons (BHs) and also for estimating the enthalpy of formation ΔHof and boiling point Bp of BHs. For instance, Hayat et al. [16] (resp. Hayat and coauthors [14,27]) investigated the efficiency of degree-dependent (resp. eigenvalues-dependent) graphical descriptors for estimating Eπ of BHs. Similar studies were conducted by Hayat et al. [15] (Hayat and Liu [12]) for distance-related and temperature-based graphical descriptors. For predicting physicochemical properties such as Bp and ΔHof of BHs, comparative studies for distance-dependent, temperature-related, degree-related, and eigenvalue-related descriptors were conducted in [13,17,18,26], respectively. For more studies on QSPR, we refer to [1,11,29,32].

    In this section, we calculate energy and edge-weighted energy for molecular graphs of 22 benzenoid hydrocarbons which are listed in Table 1. The adjacency matrix is defined, and eigenvalues, energy, and edge-weighted energy are calculated using the Python programming language. The correlation and regression models are obtained for the physicochemical properties of 22 BHs, for which the data is taken from [25] (refer to Table 2), and the predictive ability is tested using energy E(Γ) and edge-weighted energy Ew(Γ). In Table 3, the values of 11 molecular descriptors for 22 BHs are listed, which are from [25]. Note that we consider carbon–carbon structures as chemical graphs. One can consider other construction, of chemical graph based on molecular alignment [22].

    Table 1.  Edge-weighted energy Ew(Γ) and graph energy E(Γ) of lower 22 BHs.
    Compound Ew(Γ) E(Γ)
    Naphthalene 33.92994 13.684
    Anthracene 51.763186 19.3136
    Phenanthrene 51.86194 19.488
    Pyrene 64.405992 22.505
    Naphthacene 69.568754 27.3178
    Triphenylene 71.69452 25.276
    Tetraphene 87.365348 25.104
    Benzo[c]phenanthrene 69.71874 13.252
    Chrysene 69.74482 25.192
    Perylene 82.208618 28.2453
    Benzo[e]pyrene 82.37146 28.3360
    Benzo[a]pyrene 82.242794 28.2219
    Benzo[ghi]perylene 94.95264 31.4250
    Anthanthrene 94.986946 31.2528
    Picene 87.64762 30.942
    Dibenz[a, j]anthracene 87.69062 30.879
    Dibenz[a, h]anthracene 86.646292 30.880
    Dibenzo[a, l]pyrene 100.135362 34.030
    Dibenzo[a, i]pyrene 100.121532 34.018
    Dibenzo[a, h]pyrene 100.048094 33.926
    Dibenzo[a, e]pyrene 99.165458 34.604
    Coronene 107.64814 34.568

     | Show Table
    DownLoad: CSV
    Table 2.  Experimental data on certain physicochemical characteristics of BHs [25].
    Compound Boiling point (BP)(C) Entropy(S) (Cal/mol.K) Acentric Factor (ω) log P Retention Index (RI) Enthalpy ΔHof (kJ/mol)
    Naphthalene 218.000 79.38 0.302 3.30 200.00 150.6
    Anthracene 340.050 92.43 0.402 4.45 301.69 218.3
    Phenanthrene 338.000 93.79 0.394 4.46 300.00 209.1
    Pyrene 404.000 96.06 0.410 4.88 351.22 230.5
    Naphthacene 440.000 105.47 0.460 5.76 408.30 286.1
    Triphenylene 429.000 104.66 0.460 5.49 400.00 258.5
    Tetraphene 425.000 108.22 0.460 5.76 398.50 276.9
    Benzo[c]phenanthrene 448.000 113.61 - 5.70 391.12 280.5
    Chrysene 431.000 106.83 0.460 5.81 400.00 267.7
    Perylene 497.000 109.10 0.490 6.25 456.22 279.9
    Benzo[e]pyrene 493.000 110.46 - 6.44 450.73 289.1
    Benzo[a]pyrene 496.000 111.85 - 6.13 453.44 279.9
    Benzo[ghi]perylene 542.000 114.10 - 6.63 501.32 301.3
    Anthanthrene 547.000 114.10 - 7.04 503.89 310.5
    Picene 519.000 119.87 0.540 7.11 500.00 326.3
    Dibenz[a, j]anthracene 531.000 119.87 - 6.54 489.80 335.5
    Dibenz[a, h]anthracene 536.000 119.87 - 6.75 495.45 335.5
    Dibenzo[a, l]pyrene 595.000 131.69 - 7.71 553.00 351.2
    Dibenzo[a, i]pyrene 594.000 123.50 - 7.28 556.47 347.7
    Dibenzo[a, h]pyrene 596.000 123.50 - 7.28 559.90 347.7
    Dibenzo[a, e]pyrene 592.000 124.89 - 7.28 551.53 338.5
    Coronene 590.000 116.36 0.540 7.64 549.67 322.7

     | Show Table
    DownLoad: CSV
    Table 3.  Molecular graphical descriptors of the lower 22 BHs [25].
    Compound M1 M2 F R ABC SCI
    Naphthalene 50.00 57.00 118.00 4.96632 7.73773 5.19710
    Anthracene 76.00 90.00 188.00 6.93265 11.23282 7.39420
    e Phenanthrene 76.00 91.00 188.00 6.94948 11.19238 7.40802
    Pyrene 94.00 117.00 242.00 7.93265 13.23282 8.61895
    Naphthacene 102.00 123.00 258.00 8.89897 14.72792 9.59130
    Triphenylene 102.00 126.00 258.00 8.94948 14.60660 9.63277
    Tetraphene 102.00 124.00 258.00 8.91581 14.68748 9.60512
    Benzo[c]phenanthrene 102.00 125.00 258.00 8.93265 14.64704 9.61895
    Chrysene 102.00 125.00 258.00 8.93265 14.64704 9.61895
    Perylene 120.00 152.00 312.00 9.93265 16.64704 10.84369
    Benzo[e]pyrene 120.00 152.00 312.00 9.93265 16.64704 10.84369
    Benzo[a]pyrene 120.00 151.00 312.00 9.91581 16.68748 10.82987
    Benzo[ghi]perylene 138.00 178.00 366.00 10.91581 18.68748 12.05461
    Anthanthrene 138.00 177.00 366.00 10.89897 18.72792 12.04079
    Picene 128.00 159.00 328.00 10.91581 18.10169 11.82987
    Dibenz[a, j]anthracene 128.00 158.00 328.00 10.89897 18.14213 11.81605
    Dibenz[a, h]anthracene 128.00 158.00 328.00 10.89897 18.14213 11.81605
    Dibenzo[a, l]pyrene 146.00 186.00 382.00 11.91581 20.10169 13.05461
    Dibenzo[a, i]pyrene 146.00 185.00 382.00 11.89897 20.14213 13.04079
    Dibenzo[a, h]pyrene 146.00 185.00 382.00 11.89897 20.14213 13.04079
    Dibenzo[a, e]pyrene 146.00 186.00 382.00 11.91581 20.10169 13.05461
    Coronene 156.00 204.00 420.00 11.89897 20.72792 13.26554
    Compound GA HA SDD ReZM RR -
    Naphthalene 10.91918 4.93333 22.66666 270.00000 24.79795
    Anthracene 15.83836 6.86666 33.33333 444.00000 37.59591
    Phenanthrene 15.87877 6.90000 33.00000 454.00000 37.69693
    Pyrene 18.83836 7.86666 39.33333 606.00000 46.59591
    Naphthacene 20.75755 8.80000 44.00000 618.00000 50.39387
    Triphenylene 20.87877 8.90000 43.00000 648.00000 50.69693
    Tetraphene 20.79795 8.83333 43.66666 628.00000 50.49489
    Benzo[c]phenanthrene 20.83836 8.86666 43.33333 638.00000 50.59591
    Chrysene 20.83836 8.86666 43.33333 638.00000 50.59591
    Perylene 23.83836 9.86666 49.33333 800.00000 59.59591
    Benzo[e]pyrene 23.83836 9.86666 49.33333 800.00000 59.59591
    Benzo[a]pyrene 23.79795 9.83333 49.66666 790.00000 59.49489
    Benzo[ghi]perylene 26.79795 10.83333 55.66666 952.00000 68.49489
    Anthanthrene 26.75755 10.80000 56.00000 942.00000 68.39387
    Picene 25.79795 10.83333 53.66666 822.00000 63.49489
    Dibenz[a, j]anthracene 25.75755 10.80000 54.00000 812.00000 63.39387
    Dibenz[a, h]anthracene 25.75755 10.80000 54.00000 812.00000 63.39387
    Dibenzo[a, l]pyrene 28.79795 11.93333 59.66666 984.00000 72.49489
    Dibenzo[a, i]pyrene 28.75755 11.80000 60.00000 974.00000 72.39389
    Dibenzo[a, h]pyrene 28.75755 11.80000 60.00000 974.00000 72.39389
    Dibenzo[a, e]pyrene 28.79795 11.83333 59.66666 984.00000 72.49489
    Coronene 29.75755 11.80000 62.00000 1104.00000 77.39387

     | Show Table
    DownLoad: CSV

    This subsection records all the data sets that we employ for our structure-property models.

    The intercorrelation between the physicochemical properties of polycyclic aromatic hydrocarbons, such as Kovats retention index (RI), acentric factor (ω), octanol-water partition coefficient (logP), boiling point (BP), enthalpy of formation (ΔHf) and entropy (S), with graph energy E(Γ) and edge-weighted energy Ew(Γ), is analysed in Table 4. Also, the intercorrelation between 11 molecular descriptors such as atom bond connectivity index (ABC), forgotten index (F), 1st and 2nd Zagreb invariants (M1 and M2), sum division degree index (SDD), reciprocal Randiˊc index (RR), classical Randić index (R), redefined Zagreb index (ReZM) with graph energy E(Γ) and edge-weighted energy Ew(Γ) is listed in Table 5. We observe that the 11 molecular descriptors are highly intercorrelated with edge-weighted energy Ew(Γ) with r>0.97, which is highlighted in Table 5.

    Table 4.  Correlation coefficient r between graph energy E(Γ), edge-weighted energy Ew(Γ), and physicochemical properties.
    Energy log P ω RI BP S ΔHf
    Ew(Γ) 0.969 0.937 0.974 0.969 0.913 0.930
    E(Γ) 0.900 0.977 0.921 0.900 0.811 0.863

     | Show Table
    DownLoad: CSV
    Table 5.  Correlation coefficient r between graph energy E(Γ), edge-weighted energy Ew(Γ), and molecular descriptors.
    Degree based Molecular Descriptors E(Γ) Ew(Γ)
    M1 0.914 0.980
    M2 0.909 0.977
    ABC 0.914 0.977
    ReZM 0.913 0.980
    R 0.911 0.974
    F 0.911 0.978
    HA 0.916 0.979
    SCI 0.914 0.979
    RR 0.915 0.979
    SDD 0.903 0.972
    GA 0.911 0.973

     | Show Table
    DownLoad: CSV

    The value of r for Ew(Γ) ranges from 0.972 to 0.980.

    The quadratic regression models for physico-chemical properties (PPs) such as Kovats retention index (RI), acentric factor (ω), octanol–water partition coefficient (logP), boiling point (BP), enthalpy of formation (ΔHf), and entropy (S) are derived with respect to graph energy E(Γ) and edge-weighted energy Ew(Γ). The symbols ν, r, F, and SE are used to represent population, correlation coefficient, F-values, and the standard error of the estimate, respectively. Note that, in general, quadratic models have very bad predictive power, even if they have good estimating power. The reader is referred to [21] for diversity in detailed regression analysis.

    The quadratic regression model is defined as

    PP=a(E(Γ))2+b(E(Γ))+c.

    The quadratic regression of PP with E(Γ) is as follows:

    BP=(0.489)(E(Γ))2+(10.219)(E(Γ))+(375.526).
    ν=22r=0.9230F=54.872CSE=39.940.
    S=(0.074)(E(Γ))2+(2.101)(E(Γ))+(109.805).
    ν=22r=0.8491F=24.597CSE=6.838.
    ω=(0.000)(E(Γ))2+(0.020)(E(Γ))+(0.067).
    ν=22r=0.9823F=111.040CSE=0.014.
    logP=(0.006)(E(Γ))2+(0.123)(E(Γ))+(4.986).
    ν=22r=0.9241F=55.578CSE=0.460.
    RI=(0.462)(E(Γ))2+(8.919)(E(Γ))+(323.656).
    ν=22r=0.9428F=75.994CSE=33.759.
    ΔHf=(0.236)(E(Γ))2+(4.631)(E(Γ))+(229.191).
    ν=22r=0.8831F=33.780CSE=25.526.

    The quadratic regression of PP with Ew(Γ) is as follows:

    BP=(0.013)(Ew(Γ))2+(7.026)(Ew(Γ))+(3.434).
    ν=22r=0.9705F=153.726CSE=25.082.
    S=(0.004)(Ew(Γ))2+(1.118)(Ew(Γ))+(44.689).
    ν=22r=0.9208F=52.889CSE=5.055.
    ω=(2.494)(Ew(Γ))2+(0.007)(Ew(Γ))+(0.110).
    ν=22r=0.9544F=41.147CSE=0.023.
    logP=(6.907)(Ew(Γ))2+(0.070)(Ew(Γ))+(1.030).
    ν=22r=0.9695F=149.803CSE=0.294.
    RI=(0.007)(Ew(Γ))2+(5.987)(Ew(Γ))+(6.350).
    ν=22r=0.9741F=175.069CSE=22.976.
    ΔHf=(0.015)(Ew(Γ))2+(4.853)(Ew(Γ))+(2.607).
    ν=22r=0.9386F=70.070CSE=18.826.

    The following analysis can be made from the quadratic regression models:

    ● The correlation coefficient r for quadratic regression models gives high predictability for physicochemical properties with respect to graph energy E(Γ) and edge-weighted energy Ew(Γ).

    ● The quadratic regression models for E(Γ) give high intercorrelation with a correlation value of r=0.9823 for the acentric factor.

    ● The degree-2 polynomial regression model for E(Γ) gives appreciable intercorrelation with a correlation value of r=0.9230 for the boiling point, r=0.9241 for the log P, r=0.9428 for the retention index.

    ● The quadratic regression model for E(Γ) is weakly intercorrelation with correlation value r=0.8491 for the entropy, and r=0.8831 for the enthalpy.

    ● The quadratic regression models for Ew(Γ) give high intercorrelation with a correlation value of r=0.9705 for the boiling point and r=0.9741 for retention index.

    ● The quadratic regression model for Ew(Γ) gives appreciable intercorrelation with a correlation value of r=0.9208 for entropy, r=0.9544 for acentfac, r=0.9695 for log P, and r=0.9386 for the enthalpy.

    ● From all the 12 quadratic regression models, it has been observed that the significance F is 0.000.

    The scattered curve diagram of E(Γ) with physiochemical properties are depicted in Figures 16.

    Figure 1.  The quadratic regression model for boiling point with E(Γ).
    Figure 2.  The quadratic regression model for entropy with E(Γ).
    Figure 3.  The quadratic regression model for acentric factor with E(Γ).
    Figure 4.  The quadratic regression model for log P with E(Γ).
    Figure 5.  The quadratic regression model for retention index with E(Γ).
    Figure 6.  The quadratic regression model for enthalpy with E(Γ).

    The scattered curve diagram of Ew(Γ) with physicochemical characteristics are depicted in Figures 712.

    Figure 7.  The quadratic regression model for boiling point with Ew(Γ).
    Figure 8.  The quadratic regression model for entropy with Ew(Γ).
    Figure 9.  The quadratic regression model for acentric factor with Ew(Γ).
    Figure 10.  The quadratic regression model for log P with Ew(Γ).
    Figure 11.  The quadratic regression model for retention index with Ew(Γ).
    Figure 12.  The quadratic regression model for enthalpy with Ew(Γ).

    This paper puts forward the edge-weighted adjacency matrix Aw(Γ) of a graphical structure Γ. The energy Ew(Γ) as well as the spectral radius λw1 of the Aw(Γ) have been studied, and lower and upper extremes are derived for λw1 and Ew(Γ) in terms of other graphical parameters. Further, we calculated the graph energy and edge-weighted energy of 22 BHs by drawing their molecular graphs to check the predictive potential of the physicochemical characteristics of BHs. Polynomials of degree-2 regression models were generated for Kovats retention index (RI), acentric factor (ω), octanol-water partition coefficient (logP), boiling point (BP), enthalpy of formation (ΔHf), and entropy (S) using theses two graph energies. We also found correlation coefficients of the physicochemical properties and molecular descriptors of BHa corresponding to the two graph energies E(Γ) and Ew(Γ).

    All authors contributed equally to this paper. All authors have read and approved the final version of the manuscript for publication.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    S. Hayat is supported by UBD Faculty Research Grants under Grant Number UBD/RSCH/1.4/FICBF(b)/2022/053 and the National Natural Science Foundation of China (Grant No. 622260-101). A. Khan was sponsored by the National Natural Science Foundation of China grant Nos. 622260-101 and 12250410247, and also by the Ministry of Science and Technology of China, grant No. WGXZ2023054L. Mohammed J. F. Alenazi extends his appreciation to Researcher Supporting Project number (RSPD2024R582), King Saud University, Riyadh, Saudi Arabia.

    The authors declare that they have no known competing financial interests.



    [1] National Highway Traffic Safety Administration, Early estimate of motor vehicle traffic fatalities in 2023, U.S. Department of Transportation, 2023. Available from: https://crashstats.nhtsa.dot.gov/Api/Public/ViewPublication/813561.
    [2] A Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. Gomez, et al., Attention is all you need, Proceedings of the 31st International Conference on Neural Information Processing Systems, 2017, 6000–6010.
    [3] D. Kim, S. Washington, J. Oh, Modeling crash types: new insights into the effects of covariates on crashes at rural intersections, J. Transp. Eng., 132 (2006), 282–292. https://doi.org/10.1061/(ASCE)0733-947X(2006)132:4(282) doi: 10.1061/(ASCE)0733-947X(2006)132:4(282)
    [4] K. Assi, Traffic crash severity prediction—a synergy by hybrid principal component analysis and machine learning models, Int. J. Environ. Res. Public Health, 17 (2020), 7598. https://doi.org/10.3390/ijerph17207598 doi: 10.3390/ijerph17207598
    [5] G. Kadilar, Effect of driver, roadway, collision, and vehicle characteristics on crash severity: a conditional logistic regression approach, Int. J. Inj. Control Sa., 23 (2016), 135–144. https://doi.org/10.1080/17457300.2014.942323 doi: 10.1080/17457300.2014.942323
    [6] Q. Zeng, W. Hao, J. Lee, F. Chen, Investigating the impacts of real-time weather conditions on freeway crash severity: a bayesian spatial analysis, Int. J. Environ. Res. Public Health, 17 (2020), 2768. https://doi.org/10.3390/ijerph17082768 doi: 10.3390/ijerph17082768
    [7] R. Spicer, A. Vahabaghaie, D. Murakhovsky, G. Bahouth, B. Drayer, S. Lawrence, Effectiveness of advanced driver assistance systems in preventing system-relevant crashes, SAE Int. J. Adv. Curr. Prac. in Mobility, 3 (2021), 1697–1701. https://doi.org/10.4271/2021-01-0869 doi: 10.4271/2021-01-0869
    [8] M. Ahmed, M. Abdel-Aty, The viability of using automatic vehicle identification data for real-time crash prediction, IEEE Trans. Intell. Transp., 13 (2011), 459–468. https://doi.org/10.1109/TITS.2011.2171052 doi: 10.1109/TITS.2011.2171052
    [9] A. Høye, I. Hesjevoll, Traffic volume and crashes and how crash and road characteristics affect their relationship–-a meta-analysis, Accident Anal. Prev., 145 (2020), 105668. https://doi.org/10.1016/j.aap.2020.105668 doi: 10.1016/j.aap.2020.105668
    [10] C. Xu, K. Ozbay, H. Liu, K. Xie, D. Yang, Exploring the impact of truck traffic on road segment-based severe crash proportion using extensive weigh-in-motion data, Safety Sci., 166 (2023), 106261. https://doi.org/10.1016/j.ssci.2023.106261 doi: 10.1016/j.ssci.2023.106261
    [11] N. Dutta, M. Fontaine, Improving freeway segment crash prediction models by including disaggregate speed data from different sources, Accident Anal. Prev., 132 (2019), 105253. https://doi.org/10.1016/j.aap.2019.07.029 doi: 10.1016/j.aap.2019.07.029
    [12] R. Avelar, K. Dixon, S. Ashraf, A comparative analysis on performance of severe crash prediction methods, Transport. Res. Rec., 2672 (2018), 109–119. https://doi.org/10.1177/0361198118794052 doi: 10.1177/0361198118794052
    [13] M. Bedard, G. Guyatt, M. Stones, J. Hirdes, The independent contribution of driver, crash, and vehicle characteristics to driver fatalities, Accident Anal. Prev., 34 (2002), 717–727. https://doi.org/10.1016/S0001-4575(01)00072-0 doi: 10.1016/S0001-4575(01)00072-0
    [14] A. Iranitalab, A. Khattak, Comparison of four statistical and machine learning methods for crash severity prediction, Accident Anal. Prev., 108 (2017), 27–36. https://doi.org/10.1016/j.aap.2017.08.008 doi: 10.1016/j.aap.2017.08.008
    [15] B. Naik, L. Tung, S. Zhao, A. Khattak, Weather impacts on single-vehicle truck crash injury severity, J. Safety Res., 58 (2016), 57–65. https://doi.org/10.1016/j.jsr.2016.06.005 doi: 10.1016/j.jsr.2016.06.005
    [16] K. Santos, J. Dias, C. Amado, A literature review of machine learning algorithms for crash injury severity prediction, J. Safety Res., 80 (2022), 254–269. https://doi.org/10.1016/j.jsr.2021.12.007 doi: 10.1016/j.jsr.2021.12.007
    [17] C. Morris, J. Yang, Understanding multi-vehicle collision patterns on freeways—a machine learning approach, Infrastructures, 5 (2020), 62. https://doi.org/10.3390/infrastructures5080062 doi: 10.3390/infrastructures5080062
    [18] C. Morris, J. Yang, Effectiveness of resampling methods in coping with imbalanced crash data: crash type analysis and predictive modeling, Accident Anal. Prev., 159 (2021), 106240. https://doi.org/10.1016/j.aap.2021.106240 doi: 10.1016/j.aap.2021.106240
    [19] A. Géron, Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow, Sebastopol: O'Reilly Media, Inc., 2022.
    [20] C. Dong, C. Shao, J. Li, Z. Xiong, An improved deep learning model for traffic crash prediction, J. Adv. Transport., 2018 (2018), 3869106. https://doi.org/10.1155/2018/3869106 doi: 10.1155/2018/3869106
    [21] K. Sattar, F. Chikh Oughali, K. Assi, N. Ratrout, A. Jamal, S. Masiur Rahman, Transparent deep machine learning framework for predicting traffic crash severity, Neural Comput. Applic., 35 (2023), 1535–1547. https://doi.org/10.1007/s00521-022-07769-2 doi: 10.1007/s00521-022-07769-2
    [22] H. Li, Y. Li, A novel explanatory tabular neural network to predicting traffic incident duration using traffic safety big data, Mathematics, 11 (2023), 2915. https://doi.org/10.3390/math11132915 doi: 10.3390/math11132915
    [23] X. Wang, M. Feng, Freeway single and multi-vehicle crash safety analysis: influencing factors and hotspots, Accident Anal. Prev., 132 (2019), 105268. https://doi.org/10.1016/j.aap.2019.105268 doi: 10.1016/j.aap.2019.105268
    [24] S. Geedipally, D. Lord, Investigating the effect of modeling single-vehicle and multi-vehicle crashes separately on confidence intervals of poisson–-gamma models, Accident Anal. Prev., 42 (2010), 1273–1282. https://doi.org/10.1016/j.aap.2010.02.004 doi: 10.1016/j.aap.2010.02.004
    [25] Weather underground, Wundermap weather information, The Weather Company, LLC., 2023. Available from: https://www.wunderground.com/wundermap?lat=33.751&lon=-84.39.
    [26] H. Khanum, A. Garg, M. Faheem, Accident severity prediction modeling for road safety using random forest algorithm: an analysis of indian highways, F1000Res., 12 (2023), 494. https://doi.org/10.12688/f1000research.133594.2 doi: 10.12688/f1000research.133594.2
    [27] T. Chen, C. Guestrin, Xgboost: a scalable tree boosting system, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016,785–794. https://doi.org/10.1145/2939672.2939785 doi: 10.1145/2939672.2939785
    [28] Y. Yang, K. Wang, Z. Yuan, D. Liu, Predicting freeway traffic crash severity using xgboost-bayesian network model with consideration of features interaction, J. Adv. Transport., 2022 (2022), 4257865. https://doi.org/10.1155/2022/4257865 doi: 10.1155/2022/4257865
    [29] L. Prokhorenkova, G. Gusev, A. Vorobev, A. Dorogush, A. Gulin, Catboost: unbiased boosting with categorical features, Proceedings of the 32nd International Conference on Neural Information Processing Systems, 2018, 6639–6649.
    [30] A. Hasan, M. Jalayer, S. Das, M. Kabir, Application of machine learning models and shap to examine crashes involving young drivers in new jersey, International Journal of Transportation Science and Technology, 14 (2024), 156–170. https://doi.org/10.1016/j.ijtst.2023.04.005 doi: 10.1016/j.ijtst.2023.04.005
    [31] J. Li, X. Wang, X. Yang, Q. Zhang, H. Pan, Analyzing freeway safety influencing factors using the catboost model and interpretable machine-learning framework, shap, Transport. Res. Rec., 2678 (2024), 563–574. https://doi.org/10.1177/03611981231208903 doi: 10.1177/03611981231208903
    [32] S. Lundberg, G. Erion, S. Lee, Consistent individualized feature attribution for tree ensembles, arXiv: 1802.03888. https://doi.org/10.48550/arXiv.1802.03888
    [33] R. Turner, D. Eriksson, M. McCourt, J. Kiili, E. Laaksonen, Z. Xu, I. Guyon, Bayesian optimization is superior to random search for machine learning hyperparameter tuning: analysis of the black-box optimization challenge 2020, Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2021, 3–26.
    [34] A. Victoria, G. Maragatham, Automatic tuning of hyperparameters using bayesian optimization, Evol. Syst., 12 (2021), 217–223. https://doi.org/10.1007/s12530-020-09345-2 doi: 10.1007/s12530-020-09345-2
    [35] T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: a next-generation hyperparameter optimization framework, Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019, 2623–2631. https://doi.org/10.1145/3292500.3330701 doi: 10.1145/3292500.3330701
    [36] T. Lin, P. Goyal, R. Girshick, K. He, P. Dollár, Focal loss for dense object detection. Proceedings of IEEE International Conference on Computer Vision (ICCV), 2017, 2980–2988. https://doi.org/10.1109/ICCV.2017.324 doi: 10.1109/ICCV.2017.324
    [37] J. Tian, P. Tsai, K. Zhang, X. Cai, H. Xiao, K. Yu, et al., Synergetic focal loss for imbalanced classification in federated xgboost, IEEE Transactions on Artificial Intelligence, 5 (2023), 647–660. https://doi.org/10.1109/TAI.2023.3254519 doi: 10.1109/TAI.2023.3254519
    [38] X. Liu, J. Lu, X. Chen, Y. Fong, X. Ma, F. Zhang, Attention based spatio-temporal graph convolutional network with focal loss for crash risk evaluation on urban road traffic network based on multi-source risks, Accident Anal. Prev., 192 (2023), 107262. https://doi.org/10.1016/j.aap.2023.107262 doi: 10.1016/j.aap.2023.107262
    [39] R. Yu, Y. Wang, Z. Zou, L. Wang, Convolutional neural networks with refined loss functions for the real-time crash risk analysis, Transport. Res. C-Emer., 119 (2020), 102740. https://doi.org/10.1016/j.trc.2020.102740 doi: 10.1016/j.trc.2020.102740
    [40] M. Hernandez, G. Epelde, A. Alberdi, R. Cilla, D. Rankin, Synthetic data generation for tabular health records: a systematic review, Neurocomputing, 493 (2022), 28–45. https://doi.org/10.1016/j.neucom.2022.04.053 doi: 10.1016/j.neucom.2022.04.053
    [41] Y. Jo, C. Oh, S. Kim, Estimation of heavy vehicle-involved rear-end crash potential using wim data, Accident Anal. Prev., 128 (2019), 103–113. https://doi.org/10.1016/j.aap.2019.04.005 doi: 10.1016/j.aap.2019.04.005
  • This article has been cited by:

    1. Zilin Xia, Jinan Gu, Wenbo Wang, Zedong Huang, Research on a lightweight electronic component detection method based on knowledge distillation, 2023, 20, 1551-0018, 20971, 10.3934/mbe.2023928
    2. Hanyi Shi, Ningzhi Wang, Xinyao Xu, Yue Qian, Lingbin Zeng, Yi Zhu, HeMoDU: High-Efficiency Multi-Object Detection Algorithm for Unmanned Aerial Vehicles on Urban Roads, 2024, 24, 1424-8220, 4045, 10.3390/s24134045
    3. Bao Liu, Wenqiang Jiang, RS-YOLOx: target feature enhancement and bounding box auxiliary regression based object detection approach for remote sensing, 2024, 18, 1931-3195, 10.1117/1.JRS.18.016514
  • Reader Comments
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(483) PDF downloads(30) Cited by(0)

Figures and Tables

Figures(7)  /  Tables(7)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog