CDBC: A novel data enhancement method based on improved between-class learning for darknet detection

Binjie Song; Yufei Chang; Minxi Liao; Yuanhang Wang; Jixiang Chen; Nianwang Wang; Binjie Song; Yufei Chang; Minxi Liao; Yuanhang Wang; Jixiang Chen; Nianwang Wang

doi:10.3934/mbe.2023670

Mathematical Biosciences and Engineering

2023, Volume 20, Issue 8: 14959-14977. doi: 10.3934/mbe.2023670

Previous Article Next Article

Research article Special Issues

CDBC: A novel data enhancement method based on improved between-class learning for darknet detection

1.
Academy of A&AD, Zhengzhou 450000, China
2.
South China University of Technology, Guangzhou 511400, China
3.
State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450001, China

Academic Editor: Junxin Chen

Received: 16 March 2023 Revised: 20 June 2023 Accepted: 04 July 2023 Published: 12 July 2023

With the development of the Internet, people have paid more attention to privacy protection, and privacy protection technology is widely used. However, it also breeds the darknet, which has become a tool that criminals can exploit, especially in the fields of economic crime and military intelligence. The darknet detection is becoming increasingly important; however, the darknet traffic is seriously unbalanced. The detection is difficult and the accuracy of the detection methods needs to be improved. To overcome these problems, we first propose a novel learning method. The method is the Chebyshev distance based Between-class learning (CDBC), which can learn the spatial distribution of the darknet dataset, and generate "gap data". The gap data can be adopted to optimize the distribution boundaries of the dataset. Second, a novel darknet traffic detection method is proposed. We test the proposed method on the ISCXTor 2016 dataset and the CIC-Darknet 2020 dataset, and the results show that CDBC can help more than 10 existing methods improve accuracy, even up to 99.99%. Compared with other sampling methods, CDBC can also help the classifiers achieve higher recall.

Keywords:

Citation: Binjie Song, Yufei Chang, Minxi Liao, Yuanhang Wang, Jixiang Chen, Nianwang Wang. CDBC: A novel data enhancement method based on improved between-class learning for darknet detection[J]. Mathematical Biosciences and Engineering, 2023, 20(8): 14959-14977. doi: 10.3934/mbe.2023670

Related Papers:

[1]	Haitao Huang, Min Tian, Jie Zhou, Xiang Liu . Reliable task allocation for soil moisture wireless sensor networks using differential evolution adaptive elite butterfly optimization algorithm. Mathematical Biosciences and Engineering, 2023, 20(8): 14675-14698. doi: 10.3934/mbe.2023656
[2]	Shuming Sun, Yijun Chen, Ligang Dong . An optimization method for wireless sensor networks coverage based on genetic algorithm and reinforced whale algorithm. Mathematical Biosciences and Engineering, 2024, 21(2): 2787-2812. doi: 10.3934/mbe.2024124
[3]	Hao Yuan, Qiang Chen, Hongbing Li, Die Zeng, Tianwen Wu, Yuning Wang, Wei Zhang . Improved beluga whale optimization algorithm based cluster routing in wireless sensor networks. Mathematical Biosciences and Engineering, 2024, 21(3): 4587-4625. doi: 10.3934/mbe.2024202
[4]	Xiang Liu, Min Tian, Jie Zhou, Jinyan Liang . An efficient coverage method for SEMWSNs based on adaptive chaotic Gaussian variant snake optimization algorithm. Mathematical Biosciences and Engineering, 2023, 20(2): 3191-3215. doi: 10.3934/mbe.2023150
[5]	Zhonghua Lu, Min Tian, Jie Zhou, Xiang Liu . Enhancing sensor duty cycle in environmental wireless sensor networks using Quantum Evolutionary Golden Jackal Optimization Algorithm. Mathematical Biosciences and Engineering, 2023, 20(7): 12298-12319. doi: 10.3934/mbe.2023547
[6]	Noman Zahid, Ali Hassan Sodhro, Usman Rauf Kamboh, Ahmed Alkhayyat, Lei Wang . AI-driven adaptive reliable and sustainable approach for internet of things enabled healthcare system. Mathematical Biosciences and Engineering, 2022, 19(4): 3953-3971. doi: 10.3934/mbe.2022182
[7]	Guohao Sun, Sen Yang, Shouming Zhang, Yixing Liu . A hybrid butterfly algorithm in the optimal economic operation of microgrids. Mathematical Biosciences and Engineering, 2024, 21(1): 1738-1764. doi: 10.3934/mbe.2024075
[8]	Tingting Yang, Yi He . Design of intelligent robots for tourism management service based on green computing. Mathematical Biosciences and Engineering, 2023, 20(3): 4798-4815. doi: 10.3934/mbe.2023222
[9]	Zhiyang Ju, Hui Zhang, Ying Tan, Xiang Chen . Coverage control of mobile sensor networks with directional sensing. Mathematical Biosciences and Engineering, 2022, 19(3): 2913-2934. doi: 10.3934/mbe.2022134
[10]	Zhenao Yu, Peng Duan, Leilei Meng, Yuyan Han, Fan Ye . Multi-objective path planning for mobile robot with an improved artificial bee colony algorithm. Mathematical Biosciences and Engineering, 2023, 20(2): 2501-2529. doi: 10.3934/mbe.2023117

Abstract

1. Introduction

Wireless sensor networks (WSN) are widely used in the automation, production, transport, healthcare, and agricultural fields ^[1,2,3]. As one core technology in the area of the Internet, wireless sensor networks are composed of multiple sensors in different ways to complete signal collection, signal transmission, data processing, and other operations ^[4,5,6]. WSN coverage problem is one of the basic problems in sensor networks ^[7,8]. Coverage capability is a criterion for evaluating the monitoring capability of WSN to a monitoring area, that is, how to deploy nodes to reach the maximum networks coverage, so as to improve the stability and effectiveness of information transmission.

Much work has been done to improve the maximum coverage of WSN. Yang et al. ^[9] improve the traditional centroid location algorithm, design a weighting strategy according to the reference anchor node and raise the location accuracy through multiple weights. From the view of repairing coverage gaps, based on graph theory and geometric calculation, Lu et al. ^[10] propose a Voronoi polygon strategy to repair local gaps for higher coverage. Based on game theory, Hajjej et al. ^[11] propose a new topology control method using reinforcement learning (RL) to repair coverage holes in a distributed way, so as to repair coverage vulnerabilities more effectively.

In recent years, an increasing number of scholars have consistently tried to apply intelligent algorithms to solve the WSN coverage problem ^[12] and have obtained specific results ^[13]. Kannan et al. ^[14] introduce the simulated annealing (SA) into the positioning research of wireless sensor networks. The algorithm has strong global optimization ability and robustness by jumping out of local optimal solution with a certain probability. However, the simulated annealing relies excessively on the parameters setting, which leads to the plight that better solution and search time cannot have both Kulkarni et al. ^[15] introduce particle swarm optimization (PSO) to improve node deployment and data fusion tasks. There is still a disadvantage that it is easy to fall into local optimum. Thus, the classical evolutionary computation cannot meet the needs of the problem. ZainEldin et al. ^[16] propose a novel improved dynamic deployment technique based-on genetic algorithm (IDDT-GA) to solve this problem. Similarly, aiming at how to minimize the number of nodes and maximize the coverage at the same time, Rebai et al. ^[17] construct an integer programming model and a special genetic algorithm (GA) is proposed for this model. Shivalingegowda et al. ^[18] introduce a new algorithm called hybrid gravitational search algorithm with social ski-driver (GSA-SSD). It proves that the hybrid algorithm model has certain advantages in the application of wireless sensor networks. Binh et al. ^[19] optimize several metaheuristic algorithms for WSN coverage in restricted areas with obstacles. Raj Priyadarshini et al. ^[20] extend the problem to underwater environment and proposed an energy prediction algorithm based on Markov chain Monte Carlo (MCMC) process for underwater acoustic WSN to reduce coverage vulnerabilities. Khalaf et al. ^[21] introduce a coverage optimization method for WSN based on the bee algorithm (BA) and compare with the genetic algorithm (GA), the results show BA can spend less time and use less resources to achieve higher coverage. Wang et al. ^[22] propose a Virtual Force Algorithm-Lévy-Embedded Grey Wolf Optimization (VFLGWO) to solve the WSN coverage problem. The results under different nodes and different monitoring area are discussed respectively in this paper. Feng et al. ^[23] combine K-means algorithm with artificial fish swarm algorithm, so that a WSN clustering method is proposed, which has better performance compared with traditional methods.

Arora et al. ^[24] propose a node location scheme using a natural heuristic meta-heuristic algorithm called butterfly optimization algorithm (BOA), which is simulated on sensor networks of different scales and compared to the performance of PSO and firefly algorithm (FA). The results show that BOA can obtain more accurate and stable node positioning. However, I think this work is still not perfect, because the butterfly algorithm has the disadvantages of low optimization accuracy and easy to fall into local optimization. Therefore, a series of work has down to improve the butterfly optimization algorithm and make BOA better with hybrid strategies. Arora et al. ^[24] combine the artificial bee colony algorithm (ABC) with the butterfly optimization algorithm and add the Lévy flights strategy to the butterfly optimization algorithm, so that a BOA/ABC algorithm is proposed. When the search phase of BOA is completed, the ABC algorithm start to explore in the search space. To some extent, it speeds up the convergence speed and widens the search space. The core idea of hybrid strategy still lies in accelerating convergence speed and jumping out of local optimization. Utama et al. ^[25] propose a new hybrid butterfly optimization algorithm for solving green vehicle routing problem (G-VRP). This algorithm is discretized by LRV method and applied to solve discrete optimization problems. By combining the tabu search (TS) algorithm and local search swap and flip strategies, the algorithm gradually approaches the global optimum by continuously flipping and exchanging nodes. Although it can solve the G-VRP well, it still has high time complexity and needs a lot of computing time than BOA. To overcome the problems faced by classical evolutionary computation. Our contributions lie in two aspects:

1) We make a new attempt to integrate five different improvement strategies with BOA, which help to accelerate the convergence speed and improve the optimization accuracy while ensuring the calculation time. We introduce Kent chaotic map in the initialization stage, improve the local search and mutation strategy of butterflies, construct a new inertia weight factor, and propose a simple simulated annealing process based on Standard normal distribution disturbance and Metropolis criterion, which gives full play to the advantages of meta heuristic algorithm.

2) The time complexity, convergence and asymptotic of the algorithm are analyzed in this paper. Through simulation test, the results show the hybrid strategies perform well.

2. WSN coverage optimization model

In this paper, we use Boolean perception model to research the problem. There is a set of $N$ sensor nodes $\mathrm{S} = \{{s}_{i}, \mathrm{i} = \mathrm{1, 2}, 3, \cdots n\}$ in a 2D monitoring area of size $\mathrm{L}\times W$ . Each node takes $\left({x}_{i}, {y}_{i}\right)$ as the center of a circle and takes R as the radius. Suppose the monitoring area which is to be covered is discretized into $\mathrm{L}\times W$ pixels, if the target pixel is within the sensor node coverage radius, it is successfully covered. The position of pixel ${p}_{j}$ can be expressed as $\left({x}_{j}, {y}_{j}\right)$ , so the distance can be expressed as

$\mathrm{d}\left({s}_{i}, {p}_{j}\right) = \sqrt{{\left({x}_{i}-\text{ }{x}_{j}\right)}^{2}+{\left({y}_{i}-\text{ }{y}_{j}\right)}^{2}}$

(2.1)

when the pixel ${p}_{j}$ is within the radius of the nodes, ${p}_{j}$ is covered successfully. So, we can get the probability which the pixel ${p}_{j}$ is covered successfully, as shown in Eq (2.2).

$p({s}_{i}, {p}_{j}) = \left\{\begin{array}{c}1 &d({s}_{i}, {p}_{j})\le R\\ 0 &d({s}_{i}, {p}_{j}) > R\end{array}\right.$

(2.2)

In particular, the target point only needs to be covered by any node in $S = {\{s}_{i}, i = \mathrm{1, 2}, 3, \cdots n\}$ , then the information of this pixel can be detected. The joint perception probability of target point ${p}_{j}$ is

$p(S, {p}_{j}) = 1-{\prod }_{i = 1}^{n}[1-p({s}_{i}, {p}_{j}\left)\right]$

(2.3)

In Eq (2.3), if ${p}_{j}$ is out of the range of any node ${s}_{i}$ in $S$ , then ${\prod }_{i = 1}^{n}[1-p({s}_{i}, {p}_{j}\left)\right] = 1$ . So $p(S, {p}_{j}) = 0$ , that is, this pixel is out of WSN coverage range. If ${p}_{j}$ is in the range of any node ${s}_{i}$ in $S$ , then $p({s}_{i}, {p}_{j}) = 1, {\prod }_{i = 1}^{n}\left[1-p\left({s}_{i}, {p}_{j}\right)\right] = 0,$ so $P(S, {p}_{j}) = 1$ , that is, the pixel ${p}_{j}$ is perceived by $S$ .

We use coverage rate to express the monitoring ability of sensor nodes. The formula of coverage rate is the ratio of sensor nodes coverage monitoring area ${A}_{c}$ to the whole area $A$ , which can be expressed by Eq (2.4).

$f = \frac{{A}_{c}}{A} = \frac{\sum _{j = 1}^{L\times W}p(S, {p}_{j})}{L\times W}$

(2.4)

We take Eq (2.4) as the objective function. We give a simple example to illustrate the problem vividly, as shown in . Assume that the nodes deployment scheme is shown as the figure, the shaded areas represent covered areas and white areas represent uncovered areas. Only one pixel is not covered so the coverage area of sensor nodes ${A}_{c}$ = 35. The number of sensor nodes is 8 and the monitoring area $A = 6 \times 6 = 36$ . So the coverage rate is

$coverage = \frac{{A}_{c}}{A} = \frac{35}{36} = 0.972 .$

(2.5)

Figure 1. An example of WSN cover model.

DownLoad: Full-Size Img PowerPoint

3. Butterfly optimization algorithm

The butterfly optimization algorithm (BOA) ^[26] based on the butterfly foraging behavior, was first proposed by Professor Arora et al. in 2018. In nature, butterflies use multiple sensory organs to find food, such as vision, touch, etc. The most important one is the smell. In BOA, each butterfly can emit a certain concentration of fragrance and smell the fragrance nearby. The butterfly moves towards the most fragrant direction. This phase is known as the global search phase. Suppose one butterfly cannot smell the fragrance from other butterflies. It will move randomly. This phase is known as the local search phase. For the above behaviors, we make a range of assumptions as follows:

a) All the butterflies can disseminate a particular concentration of fragrance.

b) Each butterfly moves only at random or towards the direction of the highest concentration.

c) The intensity of the stimulus is only affected by the objective function.

d) The butterfly controls the local search and the global search by switching the probability $p$ .

The fragrance concentration is closely related to the population's fitness, and related to the following three factors: stimulus intensity $I$ , perceived form $c$ , and power index $\alpha$ . The relationship among them can be expressed as

$f = c{I}^{\alpha }$

(3.1)

In the population iteration process, for butterflies moving through search space, the first thing to do is to calculate the fitness of all butterflies and then calculate the fragrance that the group produced at that location through Eq (3.2).

${{x}_{i}}^{t+1} = {{x}_{i}}^{t}+({r}^{2}\times {g}^{*}-{{x}_{i}}^{t})\times {f}_{i}$

(3.2)

where ${x}_{i}^{t+1}$ is the location ${x}_{i}$ for $i$ th butterfly in iteration $t+1$ , ${x}_{i}^{t}$ is the location ${x}_{i}$ for $i$ th butterfly in iteration $t$ .Here ${g}^{*}$ represents the current best location among all the locations in the current stage. The fragrance of $i$ th butterfly is represented by ${f}_{i}$ . $r$ , a random number in [0, 1]. The local search phase can be expressed as Eq (3.3).

${{x}_{i}}^{t+1} = {{x}_{i}}^{t}+({r}^{2}\times {{x}_{j}}^{t}-{{x}_{k}}^{t})\times {f}_{i}$

(3.3)

where ${x}_{j}^{t}$ and ${x}_{k}^{t}$ are the random butterflies in the solution space from the current population. $\text{r}$ , is a random number in [0, 1]. Based on the above description, the process of BOA is as follows:

Step 1: Set each parameter and initialize the population position.

Step 2: Calculate fitness function and find the best population.

Step 3: If $rand$ > $p$ , , do global search using Eq (3.2).

Step 4: If $rand$ < $p$ do local search using Eq (3.3).

Step 5: Judge whether butterfly exceeds the search boundary and do boundary treatment.

Step 6: Calculate the fitness of the new population in Step 3 or Step 4, and if it is better than the current solution, update the global optimal solution and the global optimal fitness.

Step 7: If the algorithm meets the maximum iteration, jump out of the program, else jump to Step3.

4. Hybrid butterfly optimization algorithm (H-BOA)

4.1. Kent chaotic map initialization mechanism

Like other classical evolutionary computation algorithms, BOA randomly initializes population, which may cause population unequally distribution at the initial stage. It will enormously affect the speed of searching. Chaotic map is one of the methods to solve this problem. In this paper, we choose Kent chaotic map, which can improve the search speed and increase population diversity.

Chaos phenomenon is one of the main manifestations of complex dynamics of non-linear dynamic mapping, which has great randomness, non-periodicity and other characteristics. So, it is widely used by scholars in the fields of information security, communication encryption and control theory ^[27,28,29]. In the field of evolutionary computation, chaotic mapping is often used to replace random number generator to initialize population ^[30,31,32].

Logistic chaotic map ^[33] is a commonly used chaotic map model, which is famous for its simple structure. However, it has a natural disadvantage, that is, it depends too much on the setting of parameters and initial values ^[34]. Because the chaotic interval is limited and the output chaotic sequence values are not uniformly distributed, the effect is still not ideal when the model is in a full mapping state. Kent chaotic map ^[35] is designed by

${x}_{k+1} = \left\{\begin{array}{c}\frac{{x}_{k}}{\alpha }&0\le x\le \alpha \\ \frac{1-{x}_{k}}{1-\alpha }&\alpha < x\le 1\end{array}\right.$

(4.1)

which combines the advantages of simple structure and uniform distribution of sequence values.

As shown in Figures 2 and 3, when iterating 200 times, there is still a part of the value aggregation extremes under the Logistic chaotic map model compared with the Kent map. To show the advantages of Kent map more clearly, we iterate 1000 times to get the value distribution of the Logistic map and the Kent map as shown in Figure 4, where the X-axis represents the range of values generated, the Y-axis represents the type of chaotic model, and the Z-axis represents the frequency of sequence values generated by the chaotic model in the relevant range.

Figure 2. Logistic chaotic map.

DownLoad: Full-Size Img PowerPoint

Figure 3. Kent chaotic map.

DownLoad: Full-Size Img PowerPoint

Figure 4. Frequency distribution of Kent map and Logistic map.

DownLoad: Full-Size Img PowerPoint

It is clear from the graph that the Kent chaotic map has better randomness and chaotic characteristics than Logistic chaotic map. So, it can utilize as much information as possible in the solution space.

4.2. A new inertial weight strategy

BOA is similar to other heuristic swarm intelligence algorithms, facing the balance of the capacities between the global and the local search. A host of scholars use the inertial weight strategy, including linearity weight and nonlinearity weight to solve this problem. Both linearity and nonlinearity weight strategies have been widely used in the improvement of intelligent algorithms.

$Sigmoid \ output = \frac{1}{1+{e}^{-x}}$

(4.2)

We consider a better balance between linearity and nonlinearity of Sigmoid function ^[36], mathematical description of Sigmoid function is shown as Eq (4.2).

We design a new inertial weight based on the Sigmoid function. Inspired by the Paper ^[37], we add a self-adaptive property to make population search more flexible. The new function is shown as follows

$w = \frac{1}{(1+\mu {e}^{\frac{10t}{M}-5}{)}^{2}}$

(4.3)

where $\mu \in$ [0, 1], $t$ is the current iteration number, M is the maximum iteration number. At the beginning of the iteration, the weight is relatively large. The weight is slowly reduced as the number of iterations increases, which is expected to search in the most extensive possible range. In the middle of the iteration, the weight reduces at a relatively rapid rate to ensure a faster search speed. In the later iteration, the weight slowly decreases so that the local search ability increases, and it gradually approaches the global optimal solution within the solution space. The weight evolution curve depicts as Figure 5.

Figure 5. Graph of weight change with iteration number.

DownLoad: Full-Size Img PowerPoint

Equation (4.4) which is modified from Eq (3.2) described as follows:

${x}_{i}^{t+1} = w\times {x}_{i}^{t}+({r}^{2}\times {g}^{*}-{x}_{i}^{t})\times {f}_{i} .$

(4.4)

4.3. Elite-fusion and elite-oriented local mutation strategy

In the local search phase of BOA, when a butterfly cannot feel the fragrance of the best butterfly, it can only move towards the butterfly nearby. At this time, the blindness of population limits the search range to some extent, which would lead individuals to fall into the local optimum. In order to enhance the ability of local search, in this paper, we draw on the idea of genetic algorithm (GA) ^[38] and differential evolution (DE) ^[39], propose elite-fusion mutation and elite-oriented mutation strategies, which enlarge the population on the one hand and reduce the possibility of algorithm falling into the local optimum at the end of the iteration on the other hand. Equation (4.5), which is modified from Eq (3.3) described as follows:

${x}_{i}^{t+1} = \beta \times [{x}_{i}^{t}+({r}^{2}\times {x}_{j}^{t}-{x}_{k}^{t}\left)\right]+(1-\beta )\times {g}^{*}$

(4.5)

${x}_{i}^{t+1} = {g}^{*}+\theta \times ({x}_{j}^{t}-{x}_{k}^{t})+\gamma \times ({x}_{m}^{t}-{x}_{n}^{t}) .$

(4.6)

where ${x}_{i}^{t}$ is the position ${x}_{i}$ for $t$ th butterfly in iteration number, ${x}_{i}^{t+1}$ is the position ${x}_{i}$ for $t+1$ th butterfly in iteration number, ${x}_{j}^{t}, {x}_{k}^{t}, {x}_{m}^{t}$ and ${x}_{n}^{t}$ are the random butterflies in the solution space from the current population. θ, β, γ is a number in [0, 1]. Equation (4.6) retains part of the elite solution and merges it with the population generated from the local search phase. They compose a new solution through a certain weight, thus enriching the population diversity.

4.4. Disturbance with standard normal distribution and simulated annealing

In the search process, H-BOA relies on the information of elite butterflies, so the direction of elite butterflies, that is, the direction with the strongest fragrance, is very important. Normal distribution disturbance is a way of random disturbance term, which is usually used in optimization problem ^[40,41]. In nature, normal distribution is one of the objective laws of nature. Introducing positive distribution to disturb the population is in line with the objective law of swarm intelligence algorithm. Therefore, adding a normal distribution disturbance term to the new location individuals generated in the search stage will not destroy the search law and expand the search space to a certain extent.

Simulated annealing (SA) is a heuristic algorithm proposed by Professor Metropolis in 1953. It is based on the principle of solid annealing ^[42]. The most commonly used is the Metropolis criterion, which says that the algorithm dynamically generates a probability to determine whether to accept an inferior solution at each iteration.so it will help the algorithm jump out of the local optimum effectively. This shows excellent global optimization.

If the complete simulated annealing algorithm is combined with BOA, it will increase the time complexity of the algorithm and reduce the convergence speed of the algorithm. Therefore, we adopt the core idea of Metropolis criterion to help the algorithm jump out of the local optimal solution while ensuring the convergence speed. In this paper, the initial solution of the simulated annealing process is the population after the search stage. We add the population subject to the disturbance of standard normal distribution to replace the process of generating random solutions in the simulated annealing process, as well as combining it with the simulated annealing process. The new solution is explored around the optimal individual through Metropolis criterion, so the population can make full use of the characteristic information reflected by the best butterfly and guide the whole group to evolve towards the optimal solution. At the same time, it has the ability to jump out of the local optimum and great convergence speed. It can constantly modify and refine the evolution direction.

Assume that ${T}_{t}$ is the current temperature, ${T}_{0}$ is the initial temperature, the maximum number of iterations is $M$ , and the current iteration is $t$ . We parallel it to the search phase. The flow chart of this stage is shown in Figure 6.

Figure 6. Flow chart of normal distribution disturbance and Metropolis criterion.

DownLoad: Full-Size Img PowerPoint

4.5. The Pseudocode of H-BOA

Based on the above description, the Pseudocode of H-BOA can be described as follows:

Algorithm 1 A Hybrid-Strategy-Improved Butterfly Optimization Algorithm

Input: Population size

$N$ , switch probability

$p$ ,

${p}_{c}$ , sensor modality

$c$ , power exponent

$\alpha$ , maxi-mum iterations

$M$ and parameters

$\beta , \gamma , \theta , \mu , {T}_{0}$
Output: Best solution

${X}^{*}$
1 Initialize the butterfly population F with Kent chaotic map
2 for

$t\leftarrow$ 1 to M do
3 for i

$\leftarrow$ 1 to N do
4      Calculate fitness of F
5    end for
6    Find the best solution as

${X}^{best}$
7 for i

$\leftarrow$ 1 to N do
8 Generate a random number

$r$ from [0, 1]
9 if

$r\le p$ then
10        Do global search using Eq (4.4)
11      else
12        If

$r > {p}_{c}$ then
13          Do elite-fusion local mutation strategy using Eq (4.5)
14        else
15          Do elite-oriented local mutation strategy using Eq (4.6)
16        end if
17      end if
18    end for
19    Perform disturbance with Standard Normal Distribution
20    Perform simulated annealing process according to Metropolis criterion
21    Update the best solution
22 end for

| Show Table

DownLoad: CSV

4.6. Algorithm complexity analysis

Time complexity of an algorithm is one of the criteria for evaluating the performance of an algorithm. In BOA, assume that the number of populations is $N$ , the objective function is f(x), and the dimension is n. According to the analysis of BOA, the time complexity of BOA is O(n + f(n)).

For H-BOA, in the initial population phase, assume the time for initializing each parameter is ${t}_{1}$ , the time for initializing the population position using Kent chaotic map is ${t}_{2}$ , the time for calculating fitness according to the population position is $f\left(n\right)$ , and the time to save the current optimal solution is ${t}_{3}$ , so the total time complexity in this phase is

$O\left(N\right(n{t}_{2}+{t}_{3}+f\left(n\right)+{t}_{1}) = O(n+f\left(n\right)) .$

When entering the iteration, assume the time for updating new inertia weight is ${t}_{4}$ . The time to calculate the individual fragrance concentration is ${t}_{5}$ .

1) Global search phase: Assume that the time used to update each dimension of the butterfly position is ${t}_{6}$ according to Eq (4.4), the time to calculate the fitness of the new population is $f\left(n\right)$ , the time to compare the fitness of the latest and old butterfly is ${t}_{7}$ . Then the total time complexity of the global search phase is

$O\left(N\right({t}_{4}+{t}_{5}+n{t}_{6}+{t}_{7}+f\left(n\right)) = O(n+f\left(n\right)) .$

2) Local search phase: The population uses two mutation methods to perform local wandering. The time to generate a random number p c is ${t}_{8}$ , the time used to update each dimension of the butterfly is ${t}_{9}$ and ${t}_{10}$ according to Eqs (4.5) and (4.6), the time to calculate new population fitness is f(n). The time for comparing and replacing the fitness of the old and new population is the same as that of the global search phase, which is ${t}_{7}$ . Then the total time complexity of this stage is

$O\left(N\right({t}_{4}+{t}_{7}+n({t}_{8}+{t}_{9}+{t}_{10})+f\left(n\right)\left)\right) = O(n+f(n\left)\right) .$

Standard normal disturbance and simulated annealing stage: Assume that the time to generate a normal distribution of random numbers is ${t}_{12}$ , the time used to calculate each dimension of the butterfly after disturbance is ${t}_{13}$ , the fitness of the new population is calculated as $f\left(n\right)$ and the time complexity of the simulated annealing stage is $O\left(Nn\right)$ , then the total time complexity of this stage is

$O\left(N\right({t}_{12}+{t}_{13})+f(n\left)\right)+O\left(Nn\right) = O(n+f(n\left)\right) .$

Record the optimal solution stage: Assume that the comparison time of each butterfly's fitness with the current optimal solution is ${t}_{14}$ and the time to replace the new solution in the simulated annealing phase is ${t}_{15}$ .so the total time complexity of this stage is

$O\left(N\right({t}_{14}+{t}_{15}\left)\right) = O\left(n\right) .$

Therefore, the total time complexity of H-BOA is $O(n+f(n\left)\right)$ , which does not increase the extra time complexity compared with BOA.

The space complexity is mainly affected by dimensionality and population size, so the space complexity of the two algorithms is $O\left(Nn\right)$ .

4.7. Asymptotic analysis and convergence analysis of the algorithm

In this section, we analyze the asymptotic and convergence of H-BOA. In addition, the definitions and theorems are provided.

Definition 1. The objective function $f:{R}^{n}\to R$ is a continuous function on the non-empty feasible region $S$ and there is an optimal solution set $\Omega = \left\{{x}^{*}\right|{x}^{*}\in S, f\left({x}^{*}\right) < \delta \}$ in $S$ , δ is an acceptable fitness.

Theorem 1. H-BOA has progressive fitness. Assume that the optimal butterfly position found in the $t$ th is ${x}_{t}^{*}$ , the non-negative random process generated by H-BOA is $\{{d}_{t}\mid = f({x}_{t}^{*})-f({x}^{*}), 1\le t\le T\}$ . When $f\left({x}_{t}^{*}\right) > f\left({x}^{*}\right)$ , there is a normal number $\tau$ , which makes $E\left({d}_{t+1}\right)\le E\left({d}_{t}\right)-\tau$ . Then we can say the optimization algorithm has progressive fitness ^[43].

Proof. According to the analysis in Section 4.4, the new solution generated by normal distribution disturbance and simulated annealing process is not necessarily closer to the global optimal solution, but the previous optimal position will be recorded during optimization, that is, the original better position will not be covered. Therefore, the distance between the new solution and the global optimal solution will not increase after annealing, that is, the direction of population evolution is monotonous.

Because the core purpose of H-BOA is to find the most fragrant butterfly so that $P\left(f\right({x}_{t+1}^{*})-f({x}_{t}^{*}) > 0) = 0$ . The butterfly population updates individual positions in three ways and all of them have random factors. So $P\left(f\right({x}_{t+1}^{*}) = f({x}_{t}^{*}\left)\right)\ne 1$ . From the above derivations we can get

$P\left(f\right({x}_{t+1}^{*})-f({x}_{t}^{*}) < 0) > 0 .$

Let $E\left[f\left({x}_{t+1}^{*}\right)-f\left({x}_{t}^{*}\right)\right] = -{\tau }_{t+1}, where \ {\tau }_{t+1} > 0.$ So

$E\left({d}_{t+1}-{d}_{t}\right) = E\left[\left(f\left({x}_{t+1}^{*}\right)-f\left({x}^{*}\right)\right)-\left(f\left({x}_{t}^{*}\right)-f\left({x}^{*}\right)\right)\right] \\ = E\left(f\right({x}_{t+1}^{\mathrm{*}})-f({x}_{t}^{\mathrm{*}})$

$= \mathrm{ }-{\tau }_{t+1} .$

That is $E\left({d}_{t+1}\right) = E\left({d}_{t}\right)-{\tau }_{t+1}$ . Finally, let $\tau = min\{{\tau }_{1}, {\tau }_{2}, · · · , {\tau }_{T}\}$ , so we can get

$E\left({d}_{t+1}\right)\le E\left({d}_{t}\right)-\tau .$

So, the algorithm has progressive fitness.

In the next step, we will analyze the convergence of H-BOA and the definitions and theorems are provided ^[44,45].

Definition 2. There is a set of random sequences ${\xi }_{t}(t = 1, 2, · · · )$ in the probability space, if there is a random variable $\xi$ when any $\epsilon > 0$ is satisfied, it makes

$\underset{t\to \mathrm{\infty }}{lim}P\{\left|{\xi }_{t}-\xi \right| < \epsilon \} = 1$

or the following equation is qualified, namely

$P\{\bigcup _{n = 1}^{\mathrm{\infty }}\bigcap _{k\ge n}^{\mathrm{\infty }}\left[\right|{\xi }_{t}-\xi |\ge \epsilon ]\} = 0 .$

Then the random sequence $\left\{{\epsilon }_{t}\right\}$ is said to converge to Random variable ε with probability 1.

Lemma 1: (Borel-Cantelli Lemmas): $Let \ {A}_{1}, {A}_{2}, · · ·F$

1) If $\sum _{\mathit{n} = 1}^{\mathrm{\infty }}\text{P}\left({A}_{n}\right) < \infty$ , then $\text{P}\left(lim{sup}_{n}{A}_{n}\right) = 0$ .

2) $If\sum _{\mathit{n} = 1}^{\bf{\infty }}\text{P}\left({A}_{n}\right) = \mathrm{\infty }$ and $\left\{{A}_{n}\right\}$ are independent, then $\text{P}\left(lim{sup}_{n}{A}_{n}\right) = 1$ .

Theorem 2. Let $\left\{X\right(t\left)\right\}$ be the position sequence of H-BOA. $\left\{{X}_{g}\right(t\left)\right\}\in X\left(t\right)$ be the best solution sequence. Assume that $\Delta E$ generated in the simulated annealing stage satisfies $\Delta E\sim N(\mathrm{\mu }, {\sigma }^{2})$ If Definition 2 is satisfied, namely

$P\left\{\underset{t\to \mathrm{\infty }}{lim}f\right({x}_{g}\left(t\right)) = {X}^{*}\} = 1 .$

We can call that H-BOA converges to the global optimum with probability 1.

Proof. Let $p\left(k\right) = \prod _{t = 1}^{k}P\left\{\right|f\left({X}_{g}\right(t\left)\right)-{X}^{*}|\ge \epsilon \}$ where $\epsilon$ > 0. Here ${X}^{*}$ is the global optimal solution. Suppose ${X}^{*} = min\left\{f\right(x), x\in \mathrm{\Omega }\}$ , then

$p\left(k\right) = \prod\limits _{t = 1}^{k}P\{\left|f\left({X}_{g}\left(t\right)\right)-{X}^{*}\right|\ge \epsilon \} \\ = \prod_{t = 1}^{k}P\{f\left({X}_{g}\left(t\right)\right)-{X}^{*}\ge \epsilon \}$

$= \prod _{t = 1}^{k}P\left\{\mathrm{\Delta }f\right({X}_{g}\left(t\right))\ge \epsilon +{X}^{*}-f({X}_{g}(t-1)\left)\right\} .$

So, we can get an equation from $\mathrm{\Delta }f\left({X}_{g}\right(t\left)\right)\sim N({\mu }_{1}, {\sigma }_{1}^{2})$ shown as

$P\left(k\right) = \prod _{t = 1}^{k}{\int }_{\epsilon +{X}^{*}-f\left({X}_{g}\right(t-1\left)\right)}^{\mathrm{\infty }}\frac{1}{\sqrt{2\pi }{\sigma }_{1}}{e}^{\frac{-{x}^{2}}{2\pi {\sigma }_{1}^{2}}}\mathrm{d}t ,$

then let

$b = max\left\{{\int }_{\epsilon +{X}^{*}-f\left({X}_{g}\right(t-1\left)\right)}^{\mathrm{\infty }}\frac{1}{\sqrt{2\pi }{\sigma }_{1}}{e}^{\frac{-{x}^{2}}{2\pi {\sigma }_{1}^{2}}}\mathrm{d}t\right\} .$

According to the relevant knowledge of the infinite series, we can get

$\sum _{k = 1}^{\mathrm{\infty }}p\left(k\right)\le \sum _{k = 1}^{\mathrm{\infty }}{b}^{k} = \frac{1}{1-b} < \mathrm{\infty } \text{,}$

From Lemma 1 we can get

$P\{\bigcup _{n = 1}^{\mathrm{\infty }}\bigcap _{k\ge n}^{\mathrm{\infty }}|f\left({X}_{g}\right(t\left)\right)-{X}^{*}|\ge \epsilon \} = 0 .$

Finally, it can be seen that $f\left({X}_{g}\right(t)$ converges to ${X}^{*}$ with probability 1 from Definition 2.

5. Hybrid butterfly optimization algorithm (H-BOA)

5.1. Simulation experiment environment

The experimental environment is Windows10, 64-bit operating system, CPU is Intel Core i510400H, main frequency is 3.2 GHz, memory is 8 G, and the algorithm is based on MATLAB2020b.

5.2. International benchmark function test

To verify the performance of H-BOA, we select ten international benchmark functions shown in , where ${f}_{1}, {f}_{2}, {f}_{3}, {f}_{4}, {f}_{5}, {f}_{6}$ are unimodal functions, ${f}_{7, }{f}_{8}, {f}_{9}, {f}_{10}$ are multimodal functions. We compared H-BOA with particle swarm optimization (PSO), BOA, a new inertia weight proposed in Section 4.2 improved butterfly optimization algorithm (IBOA) and butterfly optimization algorithm of Cauchy mutation and adaptive weight optimization (CWBOA) ^[46]. To keep the fairness and objectivity, the initial population size is all uniformly set to 30, the number of iterations is set to 500, and the part of parameters are the same as those in CWBOA. To reduce the randomness, we conducted 30 experiments and took the average and standard. The specific algorithm parameter settings are shown in Table 1. The data of CWBOA is directly derived from the paper ^[46].

Table 1. Caption of the table.

Algorithm	parameter settings
H-BOA	$p={p}_{c}= 0.8, \beta = 0.3, \theta =\gamma = 0.1, \mu = 1, {T}_{0}= 1000$ $p=0.8, I=0.01, \alpha =0.1$ $p=0.8, I=0.01, \alpha =0.1$ $p=0.8, I=0.01, \alpha =0.1$ ${c}_{1}={c}_{2}= 2, Vmax= 1, Vmin= -1$
IBOA
BOA CWBOA PSO

| Show Table

DownLoad: CSV

Table 2. International text function.

Function	dim	range	best value
${f}_{1}=\sum\limits _{i=1}^{n}{x}_{i}^{2}$	30	(-100,100)	0
${f}_{2}=max(\left\|{x}_{i}\right\|, 1 < i < n)$	30	(-100,100)	0
${f}_{3}=\sum\limits _{i=1}^{n}\left\|{x}_{i}\right\|+\prod\limits _{i=1}^{n}\left\|{x}_{i}\right\|$	30	(-10, 10)	0
${f}_{4}=\sum\limits _{i=1}^{n}{\left(\sum\limits _{j=1}^{n}{x}_{j}\right)}^{2}$	30	(-100,100)	0
${f}_{5}=\sum\limits _{i=1}^{n}i{x}^{4}+random\left(\mathrm{0, 1}\right)$	30	(-1.28, 1.28)	0
${f}_{6}=$ 0.26 $\left({x}_{1}^{2}+{x}_{2}^{2}\right)-0.48{x}_{1}{x}_{2}$	30	(-10, 10)	0
${f}_{7}=\sum\limits _{i=1}^{n}[{x}_{i}^{2}-10\mathit{cos}\left(2\pi {x}_{i}\right)+10]$	30	(-5.12, 5.12)	0
${f}_{8}=\sum\limits _{i=1}^{d}{x}_{i}^{2}+{\left(\sum\limits _{i=1}^{d}0.5i{x}_{i}\right)}^{2}+{\left(\sum\limits _{i=1}^{d}0.5i{x}_{i}\right)}^{4}$	30	(-5, 10)	0
${f}_{9}=\frac{1}{4000}\sum\limits _{i=1}^{d}{x}_{i}^{2}-\prod\limits _{i=1}^{d}\mathit{cos}\left(\frac{{x}_{i}}{\sqrt{i}}\right)+1$	30	(-600,600)	0
${f}_{10}=-20\mathit{exp}\left(-0.2\times \sqrt{\frac{1}{n}\sum\limits _{i=1}^{n}{x}_{i}^{2}}\right)-\mathit{exp}\left(\frac{1}{n}\sum\limits _{i=1}^{n}\mathit{cos}\left(2\pi {x}_{i}\right)\right)+20+e$	30	(-32, 32)	0

| Show Table

DownLoad: CSV

5.3. Experimental results and analysis

To measure the convergence and accuracy of H-BOA, Table 3 shows the optimization results of the five algorithms on the test functions. "-" represents the test function is not included in the original paper. Figure 7 shows the iterative convergence curves of the four algorithms where the X-axis represents the number of iterations and the Y-axis represents the fitness.

Table 3. Function optimization results.

Function		HBOA	IBOA	CWBOA	BOA	PSO
${f}_{1}$	Mean	0	1.36E-11	0	5.70E-11	8.81E+02
${f}_{1}$	Std	0	5.90E-13	0	6.59E-12	1.72E+02
${f}_{2}$	Mean	0	1.17E-11	3.29E-134	5.53E-11	1.08E+03
${f}_{2}$	Std	0	8.53E-13	1.80E-133	5.30E-12	6.54E+02
${f}_{3}$	Mean	0	5.99E-09	3.86E-134	1.94E-08	5.49E+00
${f}_{3}$	Std	0	4.70E-10	1.52E-133	2.52E-09	8.36E-01
${f}_{4}$	Mean	0	2.91E-09	-	1.75E-08	1.05E+01
${f}_{4}$	Std	0	1.69E-09	-	2.59E-09	2.87E+00
${f}_{5}$	Mean	8.71E-05	1.15E-03	1.57E-04	1.63E-03	2.47E-01
${f}_{5}$	Std	7.24E-05	2.89E-04	1.35E-04	7.81E-04	1.40E-01
${f}_{6}$	Mean	0	9.79E-13	0	1.78E-12	5.43E-84
${f}_{6}$	Std	0	3.18E-13	0	2.50E-13	9.22E-84
${f}_{7}$	Mean	0	2.27E-14	0	1.30E+02	1.04E+02
${f}_{7}$	Std	0	3.11E-14	0	7.49E+01	1.62E+01
${f}_{8}$	Mean	0	1.12E-11	0	5.39E-11	1.25E+04
${f}_{8}$	Std	0	1.27E-12	0	7.27E-12	9.39E+03
${f}_{9}$	Mean	0	8.04E-12	0	3.28E-11	9.14E+00
${f}_{9}$	Std	0	6.77E-13	0	1.14E-11	4.32E+00
${f}_{10}$	Mean	8.88E-16	5.72E-09	8.88E-16	1.93E-08	1.02E+01
${f}_{10}$	Std	0	7.95E-11	0	1.12E-09	8.39E-01

| Show Table

DownLoad: CSV

Figure 7. Convergence curve for four algorithms.

DownLoad: Full-Size Img PowerPoint

It can be seen from that H-BOA has found the optimal function value 0 except ${f}_{5}$ and ${f}_{10}$ and the optimization effect reaches 100%. For function ${f}_{5}$ , the average accuracy of H-BOA is two orders of magnitude better than BOA, and one order of magnitude better than CWBOA. As for the function ${f}_{10}$ , the average accuracy of H-BOA is seven orders of magnitude better than that of BOA, which is the same as the result of CWBOA and the variance is 0. The algorithm has good stability and robustness. In addition, as can be seen from the results that the performance of IBOA is generally better than that of BOA, which shows the new inertia weight strategy is feasible.

As can be seen from Figure 7, when solving different functions, the curve of the H-BOA is smoother, the convergence speed is faster, the optimization accuracy is higher and the curve inflection point appears firstly, which indicating that H-BOA has a stronger ability to overcome the local optimum. Further prove the effectiveness of hybrid strategies.

5.4. WSN coverage optimization simulation experiment

The simulation experiment uses MATLAB, and the environment settings are the same with international function test. We choose H-BOA, BOA, the single-strategy improved butterfly optimization algorithm (IBOA) and the sparrow search algorithm (SSA) for comparison. The algorithm parameters are shown as , where the maximum number of iterations is $M = 100$ , and the population size is $N = 30.$

Table 4. Algorithm parameter setting.

Algorithm	parameter settings
H-BOA	$p={p}_{c}= 0.8, \beta = 0.3, \theta =\gamma = 0.1, \mu = 1, {T}_{0}= 1000$
IBOA	$p=0.8, I=0.01, \alpha =0.1$
BOA SSA	$p=0.8, I=0.01, \alpha =0.1$ $ST= 0.6, PD= 0.7, SD= 0.2$

| Show Table

DownLoad: CSV

To show the effectiveness of H-BOA, in this paper, we tried to do multiple experiments with different numbers of nodes. The experimental parameter settings are shown in Table 5. Figure 8 shows the simulation coverage figure. Figure 9 describes the maximum coverage percent that various algorithm can achieve and Table 6 shows the specific maximum coverage value. Figure 10 shows the evolution curve of the maximum coverage of WSN with the number of population iterations when the number of nodes is 25, as can be seen from the figure that H-BOA has more uniform distribution and less gaps. This fully shows higher convergence and better robustness. In particular, for some wireless sensor node deployment tasks that require significantly higher coverage, the optimization strategy of the HBOA algorithm can achieve more excellent optimization with fewer nodes in less time.

Table 5. Parameter setting.

Parameter settings	value
Region Area	100m × 100m
Pixels	100 × 100
Nodes Perceived Radius	N = 10, 15, 20, 25, 30 R = 15m

| Show Table

DownLoad: CSV

Figure 8. WSN nodes coverage graph for four algorithms.

DownLoad: Full-Size Img PowerPoint

Figure 9. Comparison of different node coverage.

DownLoad: Full-Size Img PowerPoint

Table 6. Comparison of maximum coverage rate with different number of nodes.

Algorithm	Number of nodes		Maximum coverage (%)
Algorithm	N = 10	N = 15	N = 20	N = 25	N = 30
SSA	62.14	78.51	83.42	92.60	94.50
H-BOA	64.61	80.05	91.21	95.78	98.34
BOA	56.82	71.81	82.47	90.91	94.13
IBOA	56.68	74.42	83.07	91.60	94.37

| Show Table

DownLoad: CSV

Figure 10. Evolution curve.

DownLoad: Full-Size Img PowerPoint

In summary, H-BOA has better performance in WSN node coverage application and can improve energy effectiveness and real-time schedule.

6. Conclusions

Based on BOA, we propose a hybrid-strategy-improved butterfly optimization algorithm (HBOA). First, H-BOA uses Kent chaotic map to initialize the population to keep a more balanced search space. Next, we introduce a new inertial weight modified from the Sigmoid function to increase the fixability of global search and local search. Then the introduction of elite-fusion and elite-oriented strategy aims to improve population diversity. Finally, we adopt the disturbance based on standard normal distribution to prevent the algorithm from falling into precociousness and use simulated annealing process to ensure the quality of the solution. The above improvement points observably improve the performance of the algorithm. We selected the international test functions for performance testing and compared them with BOA, a single-strategy-improved butterfly optimization algorithm (IBOA) and particle swarm optimization (PSO). The results show that H-BOA is better than other algorithms, together with higher accuracy and robustness. We apply H-BOA in WSN coverage research, which proves that this algorithm can effectively solve this type of problem and broaden the application field. However, the algorithm still has some shortcomings. Our next work is how to optimize the strategy to make the algorithm more expressive and propose more effective hybrid heuristic algorithms for different and more complex problems.

Acknowledgments

All sources of funding of the study must be disclosed.

Conflict of interest

The authors declare there is no conflict of interest.

References

[1]	A. Montieri, D. Ciuonzo, G. Aceto, A. Pescapé, Anonymity services tor, i2p, jondonym: classifying in the dark (web), IEEE Trans. Dependable Secure Comput., 17 (2018), 662−675. https://doi.org/10.1109/TDSC.2018.2804394 doi: 10.1109/TDSC.2018.2804394
[2]	Y. Gao, J. Lin, J. Xie, Z. Ning, A real-time defect detection method for digital signal processing of industrial inspection applications, IEEE Trans. Ind. Inf., 17 (2021), 3450−3459. https://doi.org/10.1109/TII.2020.3013277 doi: 10.1109/TII.2020.3013277
[3]	W. Wang, N. Kumar, J. Chen, Z. Gong, X. Kong, W. Wei, et al., Realizing the potential of the internet of things for smart tourism with 5G and AI, IEEE Network, 34 (2020), 295−301. https://doi.org/10.1109/MNET.011.2000250 doi: 10.1109/MNET.011.2000250
[4]	R. Dingledine, N. Mathewson, P. Syverson, Tor: The second-generation onion router, in 13th USENIX Security Symposium, 2004 (2004), 303−320. https://doi.org/10.1016/0016-0032(45)90142-6
[5]	A. Cuzzocrea, F. Martinelli, F. Mercaldo, G. Vercelli, Tor traffic analysis and detection via machine learning techniques, in 2017 IEEE International Conference on Big Data, 2017 (2017), 4474−4480. https://doi.org/10.1109/BigData.2017.8258487
[6]	R. Jansen, M. Juarez, R. Galvez, T. Elahi, C. Diaz, Inside job: Applying traffic analysis to measure tor from within, Network Distributed Syst. Security, 2018 (2018). http://dx.doi.org/10.14722/ndss.2018.23261 doi: 10.14722/ndss.2018.23261
[7]	H. Yin, Y. He, I2P anonymous traffic detection and identification, in 2019 5th International Conference on Advanced Computing & Communication Systems, 2019 (2019), 157−162. https://doi.org/10.1109/ICACCS.2019.8728517
[8]	I. Clarke, O. Sandberg, B. Wiley, Freenet: A distributed anonymous information storage and retrieval system, Des. Privacy Enhancing Technol., 2001 (2001), 46−66. https://doi.org/10.1007/3-540-44702-4_4 doi: 10.1007/3-540-44702-4_4
[9]	S. Lee, S. H. Shin, B. H. Roh, Classification of freenet traffic flow based on machine learning, J. Commun., 13 (2018), 654−660. https://doi.org/10.12720/jcm.13.11.654-660 doi: 10.12720/jcm.13.11.654-660
[10]	S. Wang, Y. Gao, J. Shi, X. Wang, C. Zhao, Z. Yin, Look deep into the new deep network: A measurement study on the ZeroNet, in Computational Science-ICCS 2020, (2020), 595−608. https://doi.org/10.1007/978-3-030-50371-0_44
[11]	M. Wang, X. Wang, J. Shi, Q. Tan, Y. Gao, M. Chen, et al., Who are in the darknet measurement and analysis of darknet person attributes, in 2018 IEEE Third International Conference on Data Science in Cyberspace, 2018 (2018), 948−955. https://doi.org/10.1109/DSC.2018.00151
[12]	C. Fachkha, M. Debbabi, Darknet as a source of cyber intelligence: Survey, taxonomy, and characterization, IEEE Commun. Surv. Tutorials, 18 (2015), 1197−1227. https://doi.org/10.1109/COMST.2015.2497690 doi: 10.1109/COMST.2015.2497690
[13]	G. Draper-Gil, A. H. Lashkari, M. S. I. Mamun, A. A. Ghorbani, Characterization of encrypted and VPN traffic using time-related features, in Proceedings of the 2nd International Conference on Information Systems Security and Privacy, 1 (2016), 407−414. https://doi.org/10.5220/0005740704070414
[14]	Y. Hu, F. Zou, L. Li, P. Yi, Traffic classification of user behaviors in tor, i2p, zeronet, freenet, in 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications, (2020), 418–424. https://doi.org/10.1109/TrustCom50675.2020.00064
[15]	R. Rawat, V. Mahor, S. Chirgaiya, R. N. Shaw, A. Ghosh, Analysis of darknet traffic for criminal activities detection using TF-IDF and light gradient boosted machine learning algorithm, Innovations Electr. Electron. Eng., 2021 (2021), 671−681. https://doi.org/10.1007/978-981-16-0749-3_53 doi: 10.1007/978-981-16-0749-3_53
[16]	Q. A. Al-Haija, M. Krichen, W. A. Elhaija, Machine-learning-based darknet traffic detection system for IoT applications, Electronics, 11 (2022), 556. https://doi.org/10.3390/electronics11040556 doi: 10.3390/electronics11040556
[17]	A. H. Lashkari, G. Kaur, A. Rahali, DIDarknet: A contemporary approach to detect and characterize the darknet traffic using deep image learning, in 2020 the 10th International Conference on Communication and Network Security, (2020), 1−13. https://doi.org/10.1145/3442520.3442521
[18]	C. Liu, L. He, G. Xiong, Z. Cao, Z. Li, FS-Net: A flow sequence network for encrypted traffic classification, in IEEE INFOCOM 2019-IEEE Conference On Computer Communications, (2019), 1171−1179. https://doi.org/10.1109/INFOCOM.2019.8737507
[19]	M. Lotfollahi, M. J. Siavoshani, R. S. H. Zade, M. Saberian, Deep packet: A novel approach for encrypted traffic classification using deep learning, Soft Comput., 24 (2020), 1999−2012. https://doi.org/10.1007/s00500-019-04030-2 doi: 10.1007/s00500-019-04030-2
[20]	X. Wang, S. Chen, J. Su, App-Net: A hybrid neural network for encrypted mobile traffic classification, in IEEE INFOCOM 2020-IEEE Conference on Computer Communications Workshops, (2020), 424−429. https://doi.org/10.1109/INFOCOMWKSHPS50562.2020.9162891
[21]	M. B. Sarwar, M. K. Hanif, R. Talib, M. Younas, M. U. Sarwar, DarkDetect: Darknet traffic detection and categorization using modified convolution-long short-term memory, IEEE Access, 9 (2021), 113705−113713. https://doi.org/10.1109/ACCESS.2021.3105000 doi: 10.1109/ACCESS.2021.3105000
[22]	W. Cai, L. Xie, W. Yang, Y. Li, Y. Gao, T. Wang, DFTNet: Dual-path feature transfer network for weakly supervised medical image segmentation, IEEE/ACM Trans. Comput. Biol. Bioinf., 2022 (2022), 1−12. https://doi.org/10.1109/TCBB.2022.3198284 doi: 10.1109/TCBB.2022.3198284
[23]	X. Xie, Y. Li, Y. Gao, C. Wu, P. Gao, B. Song, et al., Weakly supervised object localization with soft guidance and channel erasing for auto labelling in autonomous driving systems, ISA Trans., 132 (2023), 39−51. https://doi.org/10.1016/j.isatra.2022.08.003 doi: 10.1016/j.isatra.2022.08.003
[24]	W. Wang, J. Chen, J. Wang, J. Chen, J. Liu, Z. Gong, Trust-enhanced collaborative filtering for personalized point of interests recommendation, IEEE Trans. Industrial Inf., 16 (2020), 6124−6132. https://doi.org/10.1109/TII.2019.2958696 doi: 10.1109/TII.2019.2958696
[25]	Y. Tokozume, Y. Ushiku, T. Harada, Between-class learning for image classification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018 (2018), 5486−5494, arXiv.1711.10284
[26]	Y. Gao, J. Chen, H. Miao, B. Song, Y. Lu, W. Pan, Self-learning spatial distribution-based intrusion detection for industrial cyber-physical systems, IEEE Trans. Comput. Social Syst., 9 (2022), 1693−1702. https://doi.org/10.1109/TCSS.2021.3135586 doi: 10.1109/TCSS.2021.3135586
[27]	A. H. Lashkari, G. Draper-Gil, M. S. I. Mamun, A. A. Ghorbani, Characterization of tor traffic using time based features, in Proceedings of the 3rd International Conference on Information Systems Security and Privacy, 2017 (2017), 253−262. https://doi.org/10.5220/0006105602530262
[28]	F. R. Torres, J. A. Carrasco-Ochoa, J. F. Martínez-Trinidad, SMOTE-D a deterministic version of SMOTE, in Mexican Conference on Pattern Recognition, 9703 (2016), 177−188. https://doi.org/10.1007/978-3-319-39393-3_18
[29]	H. Lee, J. Kim, S. Kim, Gaussian-based SMOTE algorithm for solving skewed class distributions, Int. J. Fuzzy Logic Intell. Syst., 17 (2017), 229−234. https://doi.org/10.5391/IJFIS.2017.17.4.229 doi: 10.5391/IJFIS.2017.17.4.229

This article has been cited by:

1.	Jicheng Yao, Xiaonan Luo, Fang Li, Yizhou Feng, Jundi Dou, Lixiang Dai, Songhua Xu, Ruiai Chen, 2022, Butterfly intelligent optimization algorithm based on Good Point Set and adaptive weight factor, 978-1-6654-5478-0, 194, 10.1109/ICDH57206.2022.00037
2.	Maosong Peng, Xiuxi Wei, Huajuan Huang, A chaotic adaptive butterfly optimization algorithm, 2023, 1864-5909, 10.1007/s12065-023-00832-4
3.	Qing He, Zhouxin Lan, Damin Zhang, Liu Yang, Shihang Luo, Improved Marine Predator Algorithm for Wireless Sensor Network Coverage Optimization Problem, 2022, 14, 2071-1050, 9944, 10.3390/su14169944
4.	Wen Long, Jianjun Jiao, Tiebin Wu, Ming Xu, Shaohong Cai, A balanced butterfly optimization algorithm for numerical optimization and feature selection, 2022, 26, 1432-7643, 11505, 10.1007/s00500-022-07389-x
5.	Khaoula Zaimen, Mohamed-El-Amine Brahmia, Laurent Moalic, Abdelhafid Abouaissa, Lhassane Idoumghar, A Survey of Artificial Intelligence Based WSNs Deployment Techniques and Related Objectives Modeling, 2022, 10, 2169-3536, 113294, 10.1109/ACCESS.2022.3217200
6.	Zhenzhen Geng, 2023, Chapter 78, 978-981-19-9372-5, 693, 10.1007/978-981-19-9373-2_78
7.	Yan Liu, 2024, Refining Machine Learning for Employee Turnover Classification: A CAP-BOA-Based Probability Neural Network Approach, 979-8-3503-8098-9, 741, 10.1109/EEBDA60612.2024.10486002
8.	Junqi Geng, Haihua Wang, Jie Su, Xiaoyu Zheng, Xianming Sun, Xu Wu, Yue Zhang, 2023, Coverage optimization of wireless sensor networks with improved golden jackal optimization, 979-8-3503-1449-6, 1, 10.1109/ICECAI58670.2023.10176640
9.	J. David Sukeerthi Kumar, M. V. Subramanyam, A. P. Siva Kumar, Hybrid Sand Cat Swarm Optimization Algorithm-based reliable coverage optimization strategy for heterogeneous wireless sensor networks, 2024, 2511-2104, 10.1007/s41870-024-02163-8
10.	V. Saravanan, Indhumathi G, Ramya Palaniappan, Narayanasamy P, M. Hema Kumar, K. Sreekanth, Navaneethan S, A novel approach to node coverage enhancement in wireless sensor networks using walrus optimization algorithm, 2024, 24, 25901230, 103143, 10.1016/j.rineng.2024.103143
11.	Xuejun Chen, Ying Wang, Haitao Zhang, Jianzhou Wang, A novel hybrid forecasting model with feature selection and deep learning for wind speed research, 2024, 43, 0277-6693, 1682, 10.1002/for.3098
12.	Ateq Alsaadi, Fazal Dayan, Nauman Ahmed, Dumitru Baleanu, Muhammad Rafiq, Ali Raza, A novel method for the dynamics of worms in wireless sensor networks with fuzzy partition, 2023, 13, 2158-3226, 10.1063/5.0165342
13.	Chengwang Lin, Hoiman Cheng, Parameter Optimization and Solution Performance Analysis of Multi-Modal Butterfly Optimization Algorithm, 2024, 12, 2169-3536, 143163, 10.1109/ACCESS.2024.3470845
14.	Haixu Niu, Yonghai Li, Chunyu Zhang, Tianfei Chen, Lijun Sun, Muhammad Irsyad Abdullah, Multi-Strategy Bald Eagle Search Algorithm Embedded Orthogonal Learning for Wireless Sensor Network (WSN) Coverage Optimization, 2024, 24, 1424-8220, 6794, 10.3390/s24216794
15.	Zhixiong Liu, Wei Zhou, Energy-Efficient Algorithms for Path Coverage in Sensor Networks, 2023, 23, 1424-8220, 5026, 10.3390/s23115026
16.	Mostafa Basirnezhad, Mahboobeh Houshmand, Seyyed Abed Hosseini, Mehrdad Jalali, 2023, Optimizing Coverage in Wireless Sensor Networks Using the Cheetah Meta-Heuristic Algorithm, 979-8-3503-6941-0, 1, 10.1109/IoT60973.2023.10365363
17.	Ojonukpe S. Egwuche, Abhilash Singh, Absalom E. Ezugwu, Japie Greeff, Micheal O. Olusanya, Laith Abualigah, Machine learning for coverage optimization in wireless sensor networks: a comprehensive review, 2023, 0254-5330, 10.1007/s10479-023-05657-z
18.	Aiguo Li, Jie Yang, 2022, Multi-objective optimization algorithm for indoor positioning sensor deployment based on wireless network, 9781450397520, 110, 10.1145/3586102.3586138
19.	Aiguo Li, Yunfei Jia, A Base Station Deployment Algorithm for Wireless Positioning Considering Dynamic Obstacles, 2025, 82, 1546-2226, 4573, 10.32604/cmc.2025.059184

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Mathematical Biosciences and Engineering

3.9

Metrics

Article views(1686) PDF downloads(45) Cited by(1)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Mathematical Biosciences and Engineering

CDBC: A novel data enhancement method based on improved between-class learning for darknet detection

Related Papers:

Abstract

1. Introduction

2. WSN coverage optimization model

3. Butterfly optimization algorithm

4. Hybrid butterfly optimization algorithm (H-BOA)

4.1. Kent chaotic map initialization mechanism

4.2. A new inertial weight strategy

4.3. Elite-fusion and elite-oriented local mutation strategy

4.4. Disturbance with standard normal distribution and simulated annealing

4.5. The Pseudocode of H-BOA

4.6. Algorithm complexity analysis

4.7. Asymptotic analysis and convergence analysis of the algorithm

5. Hybrid butterfly optimization algorithm (H-BOA)

5.1. Simulation experiment environment

5.2. International benchmark function test

5.3. Experimental results and analysis

5.4. WSN coverage optimization simulation experiment

6. Conclusions

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

Abstract

1. Introduction

2. WSN coverage optimization model

3. Butterfly optimization algorithm

4. Hybrid butterfly optimization algorithm (H-BOA)

4.1. Kent chaotic map initialization mechanism

4.2. A new inertial weight strategy

4.3. Elite-fusion and elite-oriented local mutation strategy

4.4. Disturbance with standard normal distribution and simulated annealing

4.5. The Pseudocode of H-BOA

4.6. Algorithm complexity analysis

4.7. Asymptotic analysis and convergence analysis of the algorithm

5. Hybrid butterfly optimization algorithm (H-BOA)

5.1. Simulation experiment environment

5.2. International benchmark function test

5.3. Experimental results and analysis

5.4. WSN coverage optimization simulation experiment

6. Conclusions

Acknowledgments

Conflict of interest

References