Research article Special Issues

Forecasting regional short-term freight volume using QPSO-LSTM algorithm from the perspective of the importance of spatial information


  • It is of great significance to accurately and efficiently predict expressway freight volume to improving the supervision level of the transportation industry and reflect the performance of transportation. Using expressway toll system records to predict regional freight volume plays an important role in the development of expressway freight organization work; especially, the short-term (hour, daily or monthly) freight volume is directly related to the compilation of regional transportation plans. Artificial neural networks have been widely used in forecasting in various fields because of their unique structural characteristics and strong learning ability, among which the long short-term memory (LSTM) network is suitable for processing and predicting series with time interval attributes such as expressway freight volume data. Considering the factors affecting regional freight volume, the data set was reconstructed from the perspective of spatial importance; we then use a quantum particle swarm optimization (QPSO) algorithm to tune parameters for a conventional LSTM model. In order to verify the efficiency and practicability, we first selected the expressway toll collection system data of Jilin Province from January 2018 to June 2021, and then used database and statistical knowledge to construct the LSTM data set. In the end, we used a QPSO-LSTM algorithm to predict the freight volume at the future times (hour, daily or monthly). Compared with the conventional LSTM model without tuning, the results of four randomly selected grids naming Changchun City, Jilin city, Siping City and Nong'an County show that the QPSO-LSTM network model based on spatial importance has a better effect.

    Citation: Liying Zhao, Ningbo Cao, Hui Yang. Forecasting regional short-term freight volume using QPSO-LSTM algorithm from the perspective of the importance of spatial information[J]. Mathematical Biosciences and Engineering, 2023, 20(2): 2609-2627. doi: 10.3934/mbe.2023122

    Related Papers:

    [1] Xin Jing, Jungang Luo, Shangyao Zhang, Na Wei . Runoff forecasting model based on variational mode decomposition and artificial neural networks. Mathematical Biosciences and Engineering, 2022, 19(2): 1633-1648. doi: 10.3934/mbe.2022076
    [2] Faisal Mehmood Butt, Lal Hussain, Anzar Mahmood, Kashif Javed Lone . Artificial Intelligence based accurately load forecasting system to forecast short and medium-term load demands. Mathematical Biosciences and Engineering, 2021, 18(1): 400-425. doi: 10.3934/mbe.2021022
    [3] Keruo Jiang, Zhen Huang, Xinyan Zhou, Chudong Tong, Minjie Zhu, Heshan Wang . Deep belief improved bidirectional LSTM for multivariate time series forecasting. Mathematical Biosciences and Engineering, 2023, 20(9): 16596-16627. doi: 10.3934/mbe.2023739
    [4] Peng Lu, Ao Sun, Mingyu Xu, Zhenhua Wang, Zongsheng Zheng, Yating Xie, Wenjuan Wang . A time series image prediction method combining a CNN and LSTM and its application in typhoon track prediction. Mathematical Biosciences and Engineering, 2022, 19(12): 12260-12278. doi: 10.3934/mbe.2022571
    [5] Xiaotong Ji, Dan Liu, Ping Xiong . Multi-model fusion short-term power load forecasting based on improved WOA optimization. Mathematical Biosciences and Engineering, 2022, 19(12): 13399-13420. doi: 10.3934/mbe.2022627
    [6] Guanghua Fu, Qingjuan Wei, Yongsheng Yang . Bearing fault diagnosis with parallel CNN and LSTM. Mathematical Biosciences and Engineering, 2024, 21(2): 2385-2406. doi: 10.3934/mbe.2024105
    [7] Lihe Liang, Jinying Cui, Juanjuan Zhao, Yan Qiang, Qianqian Yang . Ultra-short-term forecasting model of power load based on fusion of power spectral density and Morlet wavelet. Mathematical Biosciences and Engineering, 2024, 21(2): 3391-3421. doi: 10.3934/mbe.2024150
    [8] Haoyu Wang, Xihe Qiu, Jinghan Yang, Qiong Li, Xiaoyu Tan, Jingjing Huang . Neural-SEIR: A flexible data-driven framework for precise prediction of epidemic disease. Mathematical Biosciences and Engineering, 2023, 20(9): 16807-16823. doi: 10.3934/mbe.2023749
    [9] Lanjun Liu, Han Wu, Junwu Wang, Tingyou Yang . Research on the evaluation of the resilience of subway station projects to waterlogging disasters based on the projection pursuit model. Mathematical Biosciences and Engineering, 2020, 17(6): 7302-7331. doi: 10.3934/mbe.2020374
    [10] Zhijun Yang, Wenjie Huang, Hongwei Ding, Zheng Guan, Zongshan Wang . Performance analysis of a two-level polling control system based on LSTM and attention mechanism for wireless sensor networks. Mathematical Biosciences and Engineering, 2023, 20(11): 20155-20187. doi: 10.3934/mbe.2023893
  • It is of great significance to accurately and efficiently predict expressway freight volume to improving the supervision level of the transportation industry and reflect the performance of transportation. Using expressway toll system records to predict regional freight volume plays an important role in the development of expressway freight organization work; especially, the short-term (hour, daily or monthly) freight volume is directly related to the compilation of regional transportation plans. Artificial neural networks have been widely used in forecasting in various fields because of their unique structural characteristics and strong learning ability, among which the long short-term memory (LSTM) network is suitable for processing and predicting series with time interval attributes such as expressway freight volume data. Considering the factors affecting regional freight volume, the data set was reconstructed from the perspective of spatial importance; we then use a quantum particle swarm optimization (QPSO) algorithm to tune parameters for a conventional LSTM model. In order to verify the efficiency and practicability, we first selected the expressway toll collection system data of Jilin Province from January 2018 to June 2021, and then used database and statistical knowledge to construct the LSTM data set. In the end, we used a QPSO-LSTM algorithm to predict the freight volume at the future times (hour, daily or monthly). Compared with the conventional LSTM model without tuning, the results of four randomly selected grids naming Changchun City, Jilin city, Siping City and Nong'an County show that the QPSO-LSTM network model based on spatial importance has a better effect.



    As data gradually increase from various industries around the world, its status as an important production element and strategic production resource is increasingly emerging. As a flourishing technology, big data plays a significant role in improving the management level and increasing data utilization in the process of integrating into highway management. By using the toll collection system of an expressway network, the integration of information can be effectively realized. Based on this, it is more important to use effective methods to deal with the problem of freight volume forecasting, and to improve the forecasting accuracy with scientific and reliable of data analysis. Therefore, it is necessary to establish a targeted prediction model to provide a certain reference for future economic development planning, to guide the future planning and investment of the transportation network, and to ensure the application efficiency of the toll collection system of the expressway network.

    The accuracy of existing artificial intelligence prediction methods is usually higher than that of traditional prediction methods although they have low requirements when selecting raw data, the reason is that they can analyze the optimal logical relationship between data by using the ideas of induction, deduction and analogy. For example, Zhang et al. proposed an EMD-LSTM model to grasp water quality in the drainage network; as a result, the model has achieved the highest R2 values and the lowest root mean square error values in the prediction process [1]. Sigakova et al. used a regression tree and other machine learning methods to study the close relationship between various influencing factors, which improved the freight volume forecast [2]; Nuzzolo and Comi joined the existing logistics infrastructure of a city into the influencing factors; and then used the city freight volume of Rome to verify the proposed model; the results showed that the accuracy when predicting freight-delivery-vehicle flow by using generalized least squares and maximum likelihood estimation is higher than that by traditional methods [3]; Ran constructed a backpropagation (BP) neural network prediction model with the help of Nanning road freight volume data over the years. Compared with the trend extrapolation, exponential smoothing, gray prediction and exponential regression method, their model was able to fit the original data more accurately and have a smaller relative error [4]. Wang and Shao not only constructed a road freight volume prediction model based on a support vector machine by using historical data of industries and economic indicators of Beijing such as import and export trade volume and so on, but they also analyzed their impacts on the results. Apart from the results showing that the proposed model has high effectiveness and feasibility, it can be extended to practical applications [5]. Based on this, Li et al. also used variables such as population number, vehicle ownership and various economic indicators to establish a generalized regression neural network. After simplifying the adjustable parameters, the proposed model was found to have high efficiency in the forecast of highway freight volume in Nanjing [6]; Wang compared and analyzed a variety of traditional forecasting methods with the proposed BP neural network freight volume forecasting model by using economic indicators; the results showed that the proposed model has better applicability [7]. In a word, the above prediction models mainly focus on the impact of various macro factors on freight volume, but they cannot solve the short-term prediction problem of freight volume. Still and all, they also provide a new idea for establishing intelligent prediction models.

    In recent years, the use of big data has become more and more popular in the forecast of freight volume [8,9,10]. For example, Liang et al. proposed a hybrid kernel function support vector machine model after performing a grey correlation analysis on various influencing factors [11]. Considering the characteristics of freight data in different periods, Cheng et al. established a multivariate long short-term memory (LSTM) prediction model based on monthly and daily freight volume data, then, they used the freight volume data from 2010 to 2017 on the Guangzhou Railway to verify the proposed model [12]. Since many scholars have begun to focus on road freight transportation, machine learning algorithms are beginning to enter the research field. Different from the traditional road freight volume statistics method using the sampling survey [13], the calculation of regional short-term freight volume generally relies on real-time toll station receipts, which overcomes the shortcomings of incomplete data [14]. With the help of a highway network toll collection system [15,16], after analyzing the temporal and spatial variation of highway traffic flow [17,18], some researchers began to use the deep learning model to predict the highway freight volume [19,20], Wang and Chen even used a graph convolutional network to predict the origin-destination (OD) freight volume matrix according to the toll station data [21]. Inspired by this, the above models have also laid a foundation for predicting freight volume from the perspective of spatio-temporal information importance.

    Expressway freight volume has strong regional and seasonal fluctuation characteristics. It has become an urgent problem in the logistics field to explore how to build data sets from the perspective of time and space, establish short-term prediction models by using relevant factors, and improve the accuracy of freight volume prediction. The LSTM network utilizes the sensitivity of orders and is especially suitable for processing and predicting time series with nonlinear properties. Using a quantum particle swarm optimization (QPSO) algorithm to tune the LSTM parameters can reduce redundancy and save simulation time. Through the improvement of LSTM, the model can capture patterns that may be ignored by LSTM. In addition to the prediction of highway freight volume, as a common LSTM variant, the QPSO-LSTM algorithm proposed in this paper can be applied for the prediction of continuous variables such as short-term traffic flow, future temperature and stock trends. With the help of the Jilin expressway online toll information system, we studied the application of a QPSO-LSTM network model based on spatial information importance in expressway freight volume prediction, which can be used as a supplement to the conventional freight volume prediction model.

    PSO is a kind of bionic algorithm, which realizes random optimization by simulating the behavior of organisms. It is designed by imitating the process of bird groups looking for food. Particles can compare the individual optimal value and the global optimal value of the group according to the state in flight with the fitness value. After obtaining the results, the optimal solution can be searched in the solution space of the optimal particles. The disadvantage of PSO is that the speed and position of particles determine the current motion state of particles. In the process of particle evolution, the feasible solution of particles is a limited and shrinked region, and the particle speed is fixed. Hence, once the particle motion trajectory is fixed, it cannot cover the whole solution space, which leads to the phenomenon of local optimal values. In the QPSO algorithm, the quantum uncertainty theory in quantum mechanics theory is introduced. In quantum space, the velocity and position of particles cannot be determined at the same time, so the trajectory of particles is uncertain and can appear randomly in all solution spaces, which solves the problem of the particle trajectory of PSO algorithm being fixed and easy to fall into local optimal solutions [22,23,24,25,26]. In QPSO, the wave function is used to represent the motion state of particles. Since space and time are independent of each other in quantum space, particles corresponding to wave functions are also random. Besides, QPSO has the advantages of fewer parameters, a simple structure and fast convergence.

    For the QPSO, the mbest parameter is introduced to represent the average value of the historical best position, pbest of the particle swarm. The particle update steps of the QPSO algorithm are as follows.

    Step 1: Calculate mbest, i.e.,

    mbest=1MMi=1pbest_i (1)

    where M represents the size of the particle swarm; pbest_i represents the i th pbest in the current iteration.

    Step 2: Update particle position u:

    Pi=Φpbest_i+(1Φ)gbest (2)

    where gbest represents the current global optimal particle; Pi is used to update the position of the i th particle.

    The particle position update formula is

    xi=Pi+α|Mbestxi|ln(1μ) (3)

    where xi represents the position of the i th particle; α is the innovation parameter; Φ and μ are uniformly distributed values on (0, 1), and the probability of taking both positive and negative is 0.5. It can be seen that there is only one innovative parameter setting (α) in the QPSO algorithm, and α is generally not greater than 1.

    The LSTM network is a deep recurrent neural network proposed by Hochreiter and Schmidhuber [27]. It selectively affects the state of each moment in the neural network by relying on the "gate" structure, and it mainly includes a forget gate, input gate and output gate. Their roles, in turn, are to decide when to forget the previous state, add a new state and output the final state, respectively. The network structure of the LSTM network is shown in Figure 1.

    Figure 1.  Classical LSTM architecture block diagram.

    As can be seen in Figure 1, each black line represents the transmission direction of the entire vector, ie from the output of one node to the input of other nodes. The rounded boxes represent computational operations; the boxes in the middle represent the learned neural network layer; joined lines represent vector connections; separated lines represent content being copied and then distributed to different locations. The core idea of LSTM is not only that the last state can be stored through the connection of the state of cells (A in the figure), but also that useful states can be stored through network learning. Regarding the iterative process, the update formulas for the forget gate, input gate and output gate are as follows:

    ForgetGate:ft=σ(Wf[ht1,xt]+bf) (4)
    InputGate:it=σ(Wi[ht1,xt]+bi) Ct=tanh(Wc[ht1,xt]+bc)Ct=ftCt1+it Ct (5)
    OutputGate:ot=σ(Wo[ht1,xt]+bo)ht=otanh(Ct) (6)

    where, Wf, Wi and Wo represent the corresponding weights of the forget gate, input gate and output gate respectively; bf, bi and bo represent the bias vectors of the forget gate, input gate and output gate; Ct represents the new cell state vector; Ct1 represents the cell state vector at the previous moment; σ represents the sigmoid function.

    When using the QPSO algorithm for parameter tuning, particle initialization is transformed into a series of LSTM parameters, and its fitness is the MAE or MAPE value of the LSTM model when using initialization parameters, i.e.,

    Fitness(QPSO)=MAE=1NN1| yiyi|or  Fitness(QPSO)=MAPE=1NN1| yiyiyi| (7)

    where  yi represents the predicted value, yi represents the true value, and N represents the number of training sets.

    When forecasting freight volume, in addition to the selection of models, the role of data sets should not be underestimated. This is because there are abnormal, repeated, blank values in the original records of the expressway toll collection system. Since the information contained in the records constitutes the most original data, it is necessary to statistically analyze on them and extract effective information which can improve the prediction accuracy of the model. This method extracts the information that affects the freight volume from the perspective of spatial importance, and completes the reconstruction of the data set. Before the reconstruction of the data set, the main preparations are to convert the freight traffic records into freight traffic information of each spatial region, and then preprocess the input records to obtain the freight traffic information between each pair of regions. After all that work, the spatial influencing factors analysis for the regional freight traffic can be conducted further.

    Definition 1: Divide the statistical time interval T=[Ta,Tb] into a series of time series {T1,T2,...,TM}, and each time series is called a time slot. When the freight volume is counted, the time slot can be a year, month, day, or hour. According to the size of the time slot, freight volume forecasts can be divided into short-term and long-term forecasts.

    Definition 2: Divide a spatial region, such as an administrative region, city, province or even a country, into N non-overlapping grids G={g1,g2,...,gN}. Each grid is called a grid, and its size is determined by the area of the region and the maximum number of grids N. Figure 2 depicts a region consistings of 16 grids. The data point Mi of each grid gi in a given time slot Ti is not only affected by the previous time slot T1,T2,...Ti1, but also by the data Mj,j=1,2,...N,ji of other grids. For the freight volume, this occurrence within the grid is closely related to the cargo attraction of other regions.

    Figure 2.  Region-based grid diagram.

    Definition 3: For a grid gi, when there is a cargo transportation record Mij with other grids gj gj,j=1,2,...N,ji, gj is called a neighborhood of the grid gi.

    Ωit={gj|Mij>0,MijMt} (8)

    The regional outline of Jilin Province is depicted in Figure 3 which shows its longitude and latitude. It includes highway toll stations, county-level administrative centers, county boundaries, provincial boundaries and its main highway lines in each administrative region.

    Figure 3.  Regional outline and distribution map of toll stations.

    The expressway toll collection system includes truck traffic records of 188 provincial toll stations and 24 provincial border toll stations. Each truck traffic record exists in the form of toll records which includes a series of valid variables, such as a vehicle entry time, entry toll station, driving lane, exit time, exit toll station, vehicle type, load, ETC charge or not, vehicle license plate, etc. The provincial border toll station mainly records the information of vehicles entering and leaving the provincial boundary, such as the time of entering and leaving the provincial boundary, the serial number of the toll station entering and leaving the provincial boundary, the license plate, etc. The description of the field attributes is shown in Table 1.

    Table 1.  Field property description table.
    Serial number Field name Field type Specific instructions
    In the province
    1 RUKOURIQI VARCHAR2 Time to enter the highway
    2 RUZHANKOUBIANHAO VARCHAR2 Starting toll booth
    3 CHUKOUCHEDAO VARCHAR2 Driving lane on the highway
    4 CHUKOURIQI VARCHAR2 Time off highway
    5 CHUZHANKOUBIANHAO VARCHAR2 Terminate toll booths
    6 ZUIZHONGCHEXING VARCHAR2 Types of trucks, including six categories, namely Model I ~ Model VI
    7 ZHOUZONGZHONG NUMBER Load capacity of the moving truck
    8 SHOUFEILEIXING NUMBER Is it an automatic charge? 1 denotes yes, and 0 denotes no
    9 SHIBIICHEPAI VARCHAR2 Vehicle license plate
    Door frame
    1 JIAOYISHIJIAN VARCHAR2 Time in or out of the provincial border
    2 MENJIA VARCHAR2 Door racks entering or leaving provincial borders
    3 CHEPAI VARCHAR2 Vehicle license plate

     | Show Table
    DownLoad: CSV

    When organizing the input data of the model, the interception time interval for each pair of grids is set to 1 day, that is, the number of cargo exchanges in one day. The reason why one day is selected as the statistical standard is that among the truck trips involved, the medium and long trips occupy the vast majority. Taking December 1, 2020 as an example, if the cargo transportation volume from Dongfeng County to Changchun City is 486.2 tons, it means that the total cargo is 486.2 tons from all of the toll stations in Dongfeng County to all of the toll stations in Changchun City.

    The specific process and steps of data preprocessing are shown in Figure 4.

    Figure 4.  Data preprocessing flow chart.

    Step 1: Perform data cleaning on the expressway operation data and gantry data, mainly including the processing of missing values and outliers. According to the administrative regionalization of Jilin Province, the whole region of Jilin City is divided into a series of grids Gi,i=1,2,...,N, where each grid is an administrative region including all of the toll stations located within it; in the end a toll -station- grid map is established.

    Figure 3 shows the location map of all toll stations (blue triangle) in Jilin Province. The freight volume of each grid is the sum of the freight volume records of all toll stations located in it. In this study, each county-level administrative region is regarded as a grid, and Jilin Province includes a total of 48 grids (41 county-level administrative regions and seven provincial toll stations). The entry and exit toll stations in each record are mapped to each grid; then, the records are organized into records of cargo exchange between grids. Days are used as the time slot for statistics, that is, each piece of data records the cargo exchange volume of a pair of grids in one day.

    Step 2: Filter out all types of trucks from the original data set which includes six types, namely, Vehicle type I ~ Vehicle type VI.

    Step 3: Calculate the average dead-weight of the six truck types.

    Step 4: Correlate the provincial record and gantry record according to the license plate information and time information to form a complete data record.

    Step 5: Take one grid gi as the origin point O, and the other grids gj,j=1,2,...,N as the destination point D. Then take the exit time as the statistical standard; and calculate the total freight volume of different truck types for each day.

    Step 6: Repeat Step 5 for all grids gi,i=1,2,...,N.

    Step 7: Encode the input data into a two-dimensional matrix format, in which the columns are arranged in grid order, and the rows are arranged in a time series. Each row records the gi occurrence of cargo exchange between one grid with other grids in a time slot, and it satisfies the conditions of independent and identical distribution, finally the OD data record format is set as shown in Table 2.

    Table 2.  OD data record format.
    Grid gi Time slot g1 g2 gi gN1 gN1 Occurrence
    Changchun City 2020-11-1 M11 M12 M1i _ M1(N-1) M1N M1
    Changchun City 2020-11-2 M21 M22 M2i _ M2(N-1) M2N _ M2
    Jilin City 2020-11-3 N31 N32 N3i N3(N- 1) N3N N3
    Jilin City 2021-11-30 NM1 NM2 NMi N4(N- 1) N3N NM

     | Show Table
    DownLoad: CSV

    Taking Changchun City as an example, Table 2 shows the cargo volume of within Changchun City and Changchun City to the other cities (such as Dehui, Yushu, Nong'an, Jilin, Jiaohe, Shulan, Panshi, Yongji, Siping, Gongzhuling, Shuangliao, Lishu, Yitong Manchu. Autonomous County, Liaoyuan, Dongfeng, Dongliao, Tonghua City, Tonghua County, Huinan, Liuhe, Meihekou, Baishan, Fusong, Jingyu, Songyuan, Qianguo Mongolian Autonomous County, Changling, Gan'an, Fuyu, Baicheng, Da'an, Zhenlai, Taonan, Yanji, Tumen, Dunhua, Hunchun, Longjing, Wangqing, Antu), as well as the Changchun borders, Siping borders, Liaoyuan borders, Tonghua borders, Songyuan borders, Baicheng borders and Yanji borders. The cargo volume sum of all grids is the occurrence of Changchun City.

    To address the fact that not all grids have goods exchange between them, screening the preprocessed data and appropriately selecting the spatial domain embedding dimension are the basis for regional freight volume forecasting. A QPSO-LSTM network model as developed by using this dataset, and it can predict the amount of freight in one grid in the future. According to the classification and regression tree (CART) algorithm, the importance of 46 spatial variables in the statistical time period is calculated respectively, and then the dimension selection is carried out according to the calculation results.

    The CART algorithm is a typical nonparametric binary tree classifier, which is used to select the spatial embedding dimension of the input data. The CART algorithm first calculates the Gini index between the freight volume MiN and the occurrence volume Mt between different grids, and then it ranks the importance of Mti to Mt.The input spatial embedding dimension is selected by setting an importance threshold. In order to analyze the impact of each grid on the future freight volume, the importance of the predictor variables is calculated by using Eq (9).

    I(ai)=Tt=1MtMCARTΔGini(z(ai,t)) (9)

    where Gini(z(ai,t)) represents the Gini index of the j th attribute of the t th internal node; MCART represents the total freight volume; Mt represents the cargo volume of the internal node at time t.

    Combined with the mentioned Sections 3.1-3.3, the overall flow of the spatial importance-based QPSO-LSTM algorithm proposed in this paper is as follows.

    1) The data of the expressway toll system is preprocessed first. Then, the appropriate time interval is selected; and the freight volume value between each pair of grids is obtained by statistical analysis.

    2) After obtaining the freight volume information between grids, the OD transaction records of cargo with other grids are obtained for each grid.

    3) For the OD transaction records in Step 2, the algorithm in Section 3.3 is used to calculate the importance of other grids to the starting grid. This is followed by, sorting the importance, setting the importance threshold, and delete other grid data whose importance is less than this threshold.

    4) For the reconstructed dataset obtained by Step 3, use the QPSO algorithm to obtain the optimal tuning parameters for the LSTM model.

    5) After selecting the length of the input space dimension, the optimal tuning parameters are input into the LSTM model for freight volume forecasting.

    6) Using the QPSO-LSTM model to complete the prediction, compare the error prediction results with the conventional LSTM model. The overall prediction effect of the QPSO-LSTM model is finished.

    Based on the preliminary analysis of expressway freight volume, it can be understood that the difference in spatial factors has obvious influence on the freight volume. If these influencing factors are included in the prediction model features, the robustness of the algorithm will be increased and the prediction accuracy will be improved accordingly. The algorithm in Section 3.3 was used to calculate the importance of other grids to the initial grid. In order to represent fairness, four grids were randomly selected as representatives; of the spatial importance of Hunchun City, Panshi City, Siping City and Tumen City, respectively, as shown in Figure 5 below.

    Figure 5.  Spatial importance analysis results.

    Through the analysis of the spatial dimension, it can be found that each grid is differently affected by the freight volume of other grids. For example, Hunchun City is mainly affected by Tumen City, Yanji City, Jiaohe City, Jilin City, Antu City, Changchun City, Dunhua City, the Changchun Boundary and Tonghua. The impact ratios were 47.40, 19.11, 10.06, 9.44, 4.64, 2.22, 1.69, 1.51, and 1.45%, respectively. But, Panshi City is mainly influenced by Panshi City itself, Huinan County, Nong'an County, Dongfeng County, the Tonghua Boundary, Jilin City, Yongji County and Changchun City. The impact ratios were 33.96, 22.95, 13.22, 5.52, 4.13, 3.36, 3.33 and 2.58%, respectively. Siping City has relatively few spatial influence variables, such as Gongzhuling City, Lishu County, Changchun City and Siping City. Alternatively, Tumen City is greatly influenced by Antu City, Jilin City, Yanji City, Hunchun City and Changchun City. Their importance factors were calculated to be 39.10, 29.33, 12.65, 7.47 and 33.88%, 25.67, 20.26, 9.25 and 2.88% respectively. The freight volume relationship between different grids is closely related to the freight volume exchange between the grids. Therefore, it is essential to obtain the dynamic freight volume matrix between different grids to improve the LSTM model and predict the freight volume in future.

    On the premise of taking into account the modeling accuracy level and efficiency, 10 grids whose importance is greater than 0.001 were selected to represent the length of the historical freight volume, that is, the grid embedding dimension of the QPSO-LSTM model was 10.

    In order to fully verify the effectiveness and performance advantages of the LSTM model with on QPSO parameter tuning in short-term freight volume prediction, the QPSO algorithm simulation parameters were set as shown in Table 3. The experimental platform was a CPU Intel (R) core i5 and 12G RAM, Windows 10 (64 bit) and anaconda3 (64 bit). The QPSO parameter tuning simulation results for Changchun City, Jilin City, Siping City and Nong'an County are shown in Table 4. In the experiment, the original data were all from the records of the Jilin expressway toll system from January 2018 to June 2021, with a total of 62, 510, 817 records.

    Table 3.  QPSO algorithm simulation parameters.
    Simulation parameters Physical meaning
    particle_num=10 Number of particles
    particle_dim=4 Particle length
    α=0.6 Innovation parameters
    iter_num=50 QPSO total number of iterations
    lstm_iter_num=30 Number of iterations of the L STM
    max_value=[6,30,30,20] Maximum value of particle dimension
    min_value=[2,1,1,1] Minimum particle dimension

     | Show Table
    DownLoad: CSV
    Table 4.  Q PSO algorithm tuning results.
    Grid Parameter tuning results
    Changchun City [2,1,1,1]
    Jilin City [3,1,1,14]
    Siping [4,20,3,9]
    Nong'an County [2,4,1,1]

     | Show Table
    DownLoad: CSV

    The results of parameter tuning includes four parameters. The first parameter N is the QPSO particle number, which means N-fold cross-validation for the LSTM model. The second parameter is the number of LSTM unit cells in the first layer, the third parameter is the number of LSTM unit cells in the second layer; and the fourth parameter is the number of batches per iteration. It can be seen in Table 4 that, when predicting the freight volume of Changchun City, the LSTM model for the freight volume was conducted with two-fold cross-validation and the model included two layers. The first layer included a cell unit, and the second layer included a cell unit. When the number of batches was 1, the prediction effect was the best.

    When building the LSTM network with the parameters in Table 4, the prediction results for Changchun City, Jilin City, Siping City and Nong'an County were as shown in Table 5 and Figure 6.

    Table 5.  Comparison of predicted and tested values of different grids.
    Grid Date Test
    value
    Predicted value Relative
    error
    Grid Date Test
    value
    Predicted value Relative
    error
    Changchun City 2020/05/25 67932.37 76846.38 13.12% Siping
    City
    2020/05/28 4891.07 6674.27 36.46%
    2020/08/31 89661.49 101951.57 13.71% 2020/09/02 6421.41 6686.08 4.12%
    2020/10/18 106065.97 107793.83 1.63% 2020/10/20 4071.51 4263.05 4.70%
    2020/11/08 104721.32 108409.19 3.52% 2020/11/15 3807.22 4177.38 9.72%
    2021/06/19 118038.25 120866.19 2.40% 2021/06/19 4661.90 4880.50 4.69%
    Jilin
    City
    2020/05/27 49021.53 58935.426 20.22% Nong'an County 2020/05/24 5964.2 7284.87 22.14%
    2020/08/28 103014.34 109854.09 6.64% 2020/09/10 7093.22 7517.61 5.98%
    2020/10/19 119091.15 126344.29 6.09% 2020/10/25 9762.56 10975.1 12.42%
    2020/11/15 95035.565 109285.85 14.99% 2021/01/09 8376.34 8566.44 2.27%
    2021/06/15 97514.864 99282.15 1.81% 2021/06/14 14779.9 14396.8 -2.59%
    MAE 0.00367 0.00578 MAE 0.00381 0.0018
    MAPE 16.55851 14.10185 MAPE 15.3475 17.00631

     | Show Table
    DownLoad: CSV
    Figure 6.  Comparison of QPSO-LSTM predicted and test values of for different grids.

    For the sake of fairness, five dates were randomly chosen for illustration in Table 5, for example, we will discuss Changchun City on 2020/05/25, 2020/08/31, 2020/10/18, 2020/11/08 and 2021/06/19. The relative errors between the actual values of the test set and the predicted values obtained by using the model were 13.12, 13.71, 1.63, 3.52 and 2.40% respectively. The relative errors of Jilin City on 2020/5/27, 2020/8/28, 2020/10/19, 2020/11/15 and 2021/6/15 are 20.22, 6.64, 6.09, 14.99 and 1.81%, respectively. The maximum and minimum errors of different grids were within the acceptable range. According to the comparison between the actual value and the predicted value within the whole-time range of the test set, the LSTM model optimized by QPSO algorithm can accurately predict the changing trend of shipment volume from 2020 to 2021 for different grids. In February, when the freight volume fell sharply, for example, for Changchun 2021/2/14, the actual freight volume was 9827.703 tons, and the predicted value was 10744.6536 tons, with a relative error of only 9.33%. The relative error for Jilin City, Siping City and Nong'an County on 2021/2/3 were -4.21, 8.93 and 9.88% respectively. This also shows that QPSO-LSTM model can still achieve a good prediction effect when the freight volume decreases sharply.

    Figure 7.  MAE and MAPE values for the QPSO-LSTM model on different grids.

    For all datasets, the training set and test set were divided as 0.64:0.36. For the four grids (Changchun, Jilin, Siping and Nong'an County), the iteration diagram of MAE and MAPE was as shown in Figure 10. It can be seen in Figure 10 that whether it is the MAE or MAPE, the four grids could converge within 10 rounds. The order of MAE was Jilin City > Siping City > Changchun City > Nong'an County, and the order of MAPE was Nong'an County > Changchun City > Siping City > Jilin City.

    Using the same dataset, another set of experiments was designed to evaluate a benchmark for comparative analysis, that is, the LSTM model that does not use the QPSO algorithm for parameter optimization. The experimental parameters were set randomly except that the proportion of the training set and the test set were the same; and the LSTM had the double-layer structure. In order to facilitate the comparison of models, the same four grids (Changchun City, Jilin City, Siping City and Nong'an County) were again selected for simulation. The simulation results for the four grids using the conventional LSTM model are shown in Figure 8, and the comparison results with the MAE and MAPE for the QPSO-LSTM model are shown in Figure 9.

    Figure 8.  Comparison of predicted and test values for conventional LSTM model on different grids.
    Figure 9.  MAE and MAPE comparison of QPSO -LSTM and conventional LSTM.

    It can be seen in Figures 6 and 8 that although the general changing trend of freight volume can also be predicted by using the conventional LSTM model, its prediction effect was worse than that of the QPSO-LSTM model for Changchun, Jilin, Siping and Nong'an. The error of the former was 40% higher than that of the latter, on average. The prediction value of the conventional LSTM model was relatively stable, but it could not cope with the sharp change in freight volume. For example, the relative errors from 2021/02/01 in Changchun, 2021/02/27 in Jilin, 2021/01/25 in Siping and 2021/02/24 in Nong'an County were 53.34, 134.56, 12.56 and 33.28% respectively. Therefore, when using the conventional LSTM model, although the internal correlation between the freight volume and spatial factors was also used, the influence of the length selection of the freight volume historical time information was ignored. Because the length of the historical time cannot be determined, the information was redundant and the prediction effect was poor.

    Similarly, it can be seen in Figure 9 that the MAE and MAPE of the QPSO-LSTM model were lower than those of the conventional LSTM model. The MAE values of the four grids obtained by using the QPSO-LSTM model were 0.00367, 0.00578, 0.00381 and 0.0018, respectively. The MAE values of the four grids obtained by using the conventional LSTM model were 0.00558, 0.01029, 0.008 and 3.132e-4. The errors increase by 52.04, 78.03, 109.97 and 82.60%, respectively. The MAPE also decrease from 61.07905, 57.71179, 71.85626 and 51.63141 to 16.55851, 14.10185, 15.3475 and 17.00631, respectively, and the errors increased by 268.87, 309.25, 368.20 and 203.60% respectively. This also shows that the prediction effect of the QPSO-LSTM model is better than that of the conventional LSTM model, which is consistent with the conclusions in Figure 6 and Figure 8.

    To sum up, the QPSO-LSTM model based on spatial information importance has the highest accuracy and stability for short-term freight volume prediction.

    Accurate freight volume prediction is necessary to reflect the contribution, quality and structure of transportation development. Aiming to solve the problem of highway freight volume prediction, we proposed an LSTM model whose parameters were tuned by a QPSO algorithm. In order to improve the calculation efficiency of the model, it uses a CART algorithm to obtain the importance factors of spatial information; and then screen the input information after exceeding the preset threshold; finally, the QPSO -LSTM model input data set is reconstructed, which can ensure the accuracy and reduce the complexity of the model at the same time.

    In order to prove the effectiveness of the QPSO-LSTM model, two groups of experiments were designed. The first group of experiments proved the necessity of QPSO-LSTM from the perspective of spatial information importance. The second group of experiments compared the simulation effects of QPSO-LSTM with those of the conventional LSTM model, which still has a two-layer structure but the parameters are set randomly. The results showed that QPSO-LSTM not only well reflects the changing trend and law of highway freight volume, but it can also improve the prediction accuracy. The two groups of experiments were verified by taking the highway freight volume records of Jilin Province in China as an example, which shows that the proposed model can be widely used as a practical short-term highway freight volume prediction method. The next step will continue to carry out relevant research on dataset construction methods and prediction models.

    This work was supported by the Young Teachers Research Project for Xi'an University of Technology: [Grant Number 256081921]; Special nature project of Shaanxi Provincial Department of Education [Grant Number 21JK0786]; Scientific research plan project of Shaanxi Provincial Department of Education [Grant Number 2021R025]; Scientific research plan project of Shaanxi Provincial Department of Education [Grant Number 21JZ005]; and Natural Science Basic Research Program of Shanxi: [Grant Number 2021JQ-279].

    The authors declare that there is no conflicts of interest.



    [1] Y. Zhang, C. Li, Y. Jiang, L. Sun, R. Zhao, K. Yan, et al., Accurate prediction of water quality in urban drainage network with integrated EMD-LSTM model, J. Cleaner Prod., 354 (2022), 131724. https://doi.org/10.1016/j.jclepro.2022.131724 doi: 10.1016/j.jclepro.2022.131724
    [2] K. Sigakova, G. Mbiydzenyuy, J. Holmgren, Impacts of traffic conditions on the performance of road freight transport, in 2015 IEEE 18th International Conference on Intelligent Transportation Systems, (2015), 2947-2952. https://doi.org/10.1109/ITSC.2015.472
    [3] A. Nuzzolo, A. Comi, Urban freight demand forecasting: a mixed quantity/delivery/vehicle-based model, Transp. Res. Part E Logist. Transp. Rev., 65 (2014), 84-98. https://doi.org/10.1016/j.tre.2013.12.014 doi: 10.1016/j.tre.2013.12.014
    [4] R. An, Prediction of road freight volume in Nanning based on BP neural network (in Chinese), Transp. Res., 2015 (2015), 58-64. https://doi.org/10.16503/j.cnki.2095-9931.2015.02.011 doi: 10.16503/j.cnki.2095-9931.2015.02.011
    [5] Y. Wang, C. Shao, Research on the forecasting of road freight volume based on support vector machine (in Chinese), Logist. Technol., 29 (2010), 4-9. http://doi.org/10.3969/j.issn.1005-152X.2010.h3.044 doi: 10.3969/j.issn.1005-152X.2010.h3.044
    [6] J. Li, K. Wang, H. Wang, Highway freight prediction method based on GRNN (in Chinese), Comput. Commun., 25 (2007), 131-133. http://doi.org/10.3963/j.issn.1674-4861.2007.03.033 doi: 10.3963/j.issn.1674-4861.2007.03.033
    [7] D. Wang, Forecast method of road freight traffic based on BP neural network (in Chinese), J. Beihua Univ. (Nat. Sci.), 15 (2014), 417-420. Available from: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CJFD&dbname=CJFD2014&filename=ZLYY201403036&uniplatform=NZKPT&v=fPAJcuTQDy4V5PCCHHX_A565Z7sCEoixis0rCMwT5lh-ZV3yc-50dedeW6PgvF23.
    [8] H. Zhang, W. Qin, Application analysis of traffic big data in intelligent expressway (in Chinese), West. China Commun. Sci. Technol., 2019 (2019). http://doi.org/10.13282/j.cnki.wccst.2019.09.051 doi: 10.13282/j.cnki.wccst.2019.09.051
    [9] M. Li, S. Dai, J. Wang, Z. Shen, The analysis of urban spatial development pattern in Beijing based on the big data of government (in Chinese), Geomatics World, 23 (2016), 20-26. Available from: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CJFD&dbname=CJFDLAST2016&filename=CHRK201603004&uniplatform=NZKPT&v=09YbnU2_ffAfxC6Gb3BGR2oqvb0aXkvzXemDGuIJ5LBVczIqI8hg6ORlLgnR3uCG.
    [10] X. Long, F. Gao, Expressway meteorological identification method based on deep learning (in Chinese), China ITS J., 2021 (2021). http://dx.doi.org/10.13439/j.cnki.itsc.2021.05.015 doi: 10.13439/j.cnki.itsc.2021.05.015
    [11] N. Liang, L. Geng, Z. Zhang, Y. Liang, A prediction method of railway freight volumes using GRA and SVM-mixed (in Chinese), J. Transp. Syst. Eng. Inf. Technol., 16 (2016), 94-99. http://dx.doi.org/10.16097/j.cnki.1009-6744.2016.06.015 doi: 10.16097/j.cnki.1009-6744.2016.06.015
    [12] Z. Cheng, X. Zhang, Y. Liang, Railway freight volume prediction based on LSTM network (in Chinese), J. China Railway Soc., 42 (2020), 15-21. http://dx.doi.org/10.3969/j.issn.1001-8360.2020.11.003 doi: 10.3969/j.issn.1001-8360.2020.11.003
    [13] X. Cui, The Method of Survey and Satistics of Transportation Burden by Highway Traffic, M.A thesis, Jilin University, 2007.
    [14] L. Zhang, Study on the Highway Traffic Volume Prediction Based on Toll Station Big Data, M.A thesis, Chang'an University, 2016.
    [15] C. Ma, Application Research Based on ETC Operational Data in National Network, M.A thesis, Beijing University of Posts and Telecommunications, 2018.
    [16] X. Zhou, Research on the application of network charging data analysis platform based on big data (in Chinese), Digital Commun. World, 2016 (2016), 161. http://dx.doi.org/10.3969/J.ISSN.1672-7274.2019.06.129 doi: 10.3969/J.ISSN.1672-7274.2019.06.129
    [17] J. Wang, Short-term Traffic Flow Prediction Based on Deep Learning, M.A thesis, Southwest University, 2019.
    [18] Q. Chen, Z. Zheng, X. Chen, J. Ren, A deep learning-based expressway exit flow prediction method, China ITS J., 1 (2018), 50-56. http://dx.doi.org/10.13439/j.cnki.itsc.2018.S1.066 doi: 10.13439/j.cnki.itsc.2018.S1.066
    [19] D. Chang, Research on Volume of Road Cargo Transportation Forecasting Method Based on Deep Learning, M.A thesis, Chang'an University, 2018.
    [20] Q. Wang, Study on the Statistical Analysis of Freight Volume Based on Freeway Toll Data, M.A thesis, Chang'an University, 2017.
    [21] Y. Wang, H. Yin, H. Chen, T. Wo, J. Xu, K. Zheng, Origin-destination matrix prediction via graph convolution: a new perspective of passenger demand modeling, in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (2019), 1227-1235. https://doi.org/10.1145/3292500.3330877
    [22] J. Sun, Particle Swarm Optimization with Particles Having Quantum Behavior, M.A thesis, Jiangnan University, 2009.
    [23] W. Fang, J. Sun, Z. Xie, W. Xu, Convergence analysis of quantum-behaved particle swarm optimization algorithm and study on its control parameter, Acta Phys. Sin., 59 (2010), 3686-3694. https://doi.org/10.7498/aps.59.3686 doi: 10.7498/aps.59.3686
    [24] S. Yang, M. Wang, L. Jiao, A quantum particle swarm optimization, in Proceedings of the 2004 Congress on Evolutionary Computation, (2004), 320-324. https://doi.org/10.1109/CEC.2004.1330874
    [25] S. M. Mikki, A. A. Kishk, Quantum particle swarm optimization for electromagnetics, IEEE Trans. Antennas Propag., 54 (2006), 2764-2775. https://doi.org/10.1109/TAP.2006.882165 doi: 10.1109/TAP.2006.882165
    [26] S. L. Sabat, L. dos Santos Coelho, A. Abraham, MESFET DC model parameter extraction using quantum particle swarm optimization, Microelectron. Reliab., 49 (2009), 660-666. https://doi.org/10.1016/j.microrel.2009.03.005 doi: 10.1016/j.microrel.2009.03.005
    [27] S. Hochreiter, J. Schmidhuber, Long short-term memory, Neural Comput., 9 (1997), 1735-1780. https://doi.org/10.1162/neco.1997.9.8.1735 doi: 10.1162/neco.1997.9.8.1735
  • This article has been cited by:

    1. Benxiang Lin, Chao Wei, Fuyong Feng, A Vehicle Velocity Prediction Method with Kinematic Segment Recognition, 2024, 14, 2076-3417, 5030, 10.3390/app14125030
    2. Anh Tuan Nguyen, Duy Hoang Pham, Bee Lan Oo, Yonghan Ahn, Benson T. H. Lim, Predicting air quality index using attention hybrid deep learning and quantum-inspired particle swarm optimization, 2024, 11, 2196-1115, 10.1186/s40537-024-00926-5
    3. Hongbo Xu, Jundong Zhang, Jiale Liu, Yang Cao, Ao Ma, Optimization of Ship Permanent Magnet Synchronous Motor ADRC Based on Improved QPSO, 2025, 15, 2076-3417, 1608, 10.3390/app15031608
    4. Zhuangqun Song, Peng Zhao, Xueji Wu, Rong Yang, Xueshan Gao, An Active Control Method for a Lower Limb Rehabilitation Robot with Human Motion Intention Recognition, 2025, 25, 1424-8220, 713, 10.3390/s25030713
  • Reader Comments
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(2026) PDF downloads(119) Cited by(4)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog