Loading [MathJax]/jax/output/SVG/jax.js
Research article Special Issues

Image steganalysis feature selection based on the improved Fisher criterion

  • In order to improve the detection accuracy of hidden message in images, steganalysis features are selected as inputs for steganalysers. However, the existing Fisher criterion ignores the contribution of steganalysis feature components in dispersion to classification, which causes the useful feature components to be deleted, and decreases the detection accuracy of steganalysis features. By analyzing the separability of steganalysis feature components, we introduce the sigmoid function into Fisher's criterion and propose an improved Fisher criterion (I-Fisher criterion), which can make up for the traditional Fisher criterion in separability measurement of steganalysis feature components. To optimize the steganalysis feature and reduce its dimension, we employ the improved Fisher criterion as the heuristic function of the decision rough set α-positive region reduction, and propose the feature selection method based on the improved Fisher. Experimental results show that the proposed method can reduce the dimension and memory of the GFR high-dimensional feature and the CC-PEV lowdimensional feature while maintaining or improving the detection accuracy.

    Citation: Yuanyuan Ma, Jinwei Wang, Xiangyang Luo, Zhenyu Li, Chunfang Yang, Jun Chen. Image steganalysis feature selection based on the improved Fisher criterion[J]. Mathematical Biosciences and Engineering, 2020, 17(2): 1355-1371. doi: 10.3934/mbe.2020068

    Related Papers:

    [1] Tao Zhang, Hao Zhang, Ran Wang, Yunda Wu . A new JPEG image steganalysis technique combining rich model features and convolutional neural networks. Mathematical Biosciences and Engineering, 2019, 16(5): 4069-4081. doi: 10.3934/mbe.2019201
    [2] Lingyun Xiang, Guoqing Guo, Jingming Yu, Victor S. Sheng, Peng Yang . A convolutional neural network-based linguistic steganalysis for synonym substitution steganography. Mathematical Biosciences and Engineering, 2020, 17(2): 1041-1058. doi: 10.3934/mbe.2020055
    [3] Tinghua Wang, Huiying Zhou, Hanming Liu . Multi-label feature selection based on HSIC and sparrow search algorithm. Mathematical Biosciences and Engineering, 2023, 20(8): 14201-14221. doi: 10.3934/mbe.2023635
    [4] Chunfang Yang, Fenlin Liu, Shuangkui Ge, Jicang Lu, Junwei Huang . Locating secret messages based on quantitative steganalysis. Mathematical Biosciences and Engineering, 2019, 16(5): 4908-4922. doi: 10.3934/mbe.2019247
    [5] Ming Meng, Luyang Dai, Qingshan She, Yuliang Ma, Wanzeng Kong . Crossing time windows optimization based on mutual information for hybrid BCI. Mathematical Biosciences and Engineering, 2021, 18(6): 7919-7935. doi: 10.3934/mbe.2021392
    [6] Jiaxi Lu, Yingwei Guo, Mingming Wang, Yu Luo, Xueqiang Zeng, Xiaoqiang Miao, Asim Zaman, Huihui Yang, Anbo Cao, Yan Kang . Determining acute ischemic stroke onset time using machine learning and radiomics features of infarct lesions and whole brain. Mathematical Biosciences and Engineering, 2024, 21(1): 34-48. doi: 10.3934/mbe.2024002
    [7] Yingjian Yang, Wei Li, Yan Kang, Yingwei Guo, Kai Yang, Qiang Li, Yang Liu, Chaoran Yang, Rongchang Chen, Huai Chen, Xian Li, Lei Cheng . A novel lung radiomics feature for characterizing resting heart rate and COPD stage evolution based on radiomics feature combination strategy. Mathematical Biosciences and Engineering, 2022, 19(4): 4145-4165. doi: 10.3934/mbe.2022191
    [8] Pavithra Latha Kumaresan, Subbulakshmi Pasupathi, Sindhia Lingaswamy, Sreesharmila Thangaswamy, Vimal Shunmuganathan, Danilo Pelusi . Fruit-Fly optimization based feature integration in image retrieval. Mathematical Biosciences and Engineering, 2021, 18(5): 6178-6197. doi: 10.3934/mbe.2021309
    [9] Yingjian Yang, Wei Li, Yingwei Guo, Nanrong Zeng, Shicong Wang, Ziran Chen, Yang Liu, Huai Chen, Wenxin Duan, Xian Li, Wei Zhao, Rongchang Chen, Yan Kang . Lung radiomics features for characterizing and classifying COPD stage based on feature combination strategy and multi-layer perceptron classifier. Mathematical Biosciences and Engineering, 2022, 19(8): 7826-7855. doi: 10.3934/mbe.2022366
    [10] Yugui Jiang, Qifang Luo, Yuanfei Wei, Laith Abualigah, Yongquan Zhou . An efficient binary Gradient-based optimizer for feature selection. Mathematical Biosciences and Engineering, 2021, 18(4): 3813-3854. doi: 10.3934/mbe.2021192
  • In order to improve the detection accuracy of hidden message in images, steganalysis features are selected as inputs for steganalysers. However, the existing Fisher criterion ignores the contribution of steganalysis feature components in dispersion to classification, which causes the useful feature components to be deleted, and decreases the detection accuracy of steganalysis features. By analyzing the separability of steganalysis feature components, we introduce the sigmoid function into Fisher's criterion and propose an improved Fisher criterion (I-Fisher criterion), which can make up for the traditional Fisher criterion in separability measurement of steganalysis feature components. To optimize the steganalysis feature and reduce its dimension, we employ the improved Fisher criterion as the heuristic function of the decision rough set α-positive region reduction, and propose the feature selection method based on the improved Fisher. Experimental results show that the proposed method can reduce the dimension and memory of the GFR high-dimensional feature and the CC-PEV lowdimensional feature while maintaining or improving the detection accuracy.


    Steganalysis aims to detect hidden information in image, audio, video, text, 3D object and other multimedia cover [1,2]. It is designed to counteract against to steganography which is a technology that hides messages in multimedia cover [3,4,5,6]. After more than 20 years of development, the research on steganalysis has made significant progress [7,8,9,10,11]. The key point of steganalysis is to identify statistical differences between features extracted from cover-signals and stego-signals. However, traditional steganalysis methods no longer work when the message is hidden by image adaptive steganography methods, such as those from [12]. Existing steganalysis methods require ever larger feature space, enormous memory and computation power, which limits the practical application of steganslysis. Both traditional steganalysis features and current steganalysis features require to be reduced. How to reduce the dimension of the steganalysis feature has become an urgent problem to be solved.

    Recently, steganalysis feature reduction methods have been proposed by researchers. The typical methods are those steganalysis feature reduction based on Genetic or Particle Swarm Optimization algorithm and the integrated classifier [13,14]. These methods can reduce the steganalysis feature dimension, while the time complexity is high. Other feature selection methods use the mutual information to select complementary features [15,16]. These methods can reduce the dimensionality of the steganalysis features, but at the same time it is also reduce the detection accuracy for the stego images. Principal Component Analysis (PCA) is also used by several methods[17,18] in order to reduce the feature dimensions while aiming to maintain the accuracy of the steganalyser. These methods can reduce feature dimension and maintain the detection accuracy of linear features on stegos. However, a large number of experimental results show that detection accuracy for the nonlinear steganalysis features based on PCA can be cut down. In previous work, we have conducted a series of related studies on feature selection for steganalysis [19,20]. A feature selection method for image steganalysis based on decision rough set α-positive region reduction is proposed to reduce feature dimension [21]. This method can significantly reduce the feature dimension while maintaining or improving the detection accuracy. Nevertheless, the steganalysis detection accuracy using the selected feature set by this method can be improved. The feature selection method based on Fisher criterion can not only reduce feature dimension efficiently while maintaining detection accuracy, but also greatly improve detection efficiency [22]. However, the feature dimensions are still high.

    For the problems above, analyzing the principle of the steganalytic feature separability measurement, this paper improves the Fisher criterion (i.e. I-Fisher criterion). The I-Fisher criterion is applied to the decision rough set α-positive domain reduction, defining a steganalysis feature selection method, which is expected to further improve the detection accuracy while reducing the feature dimension.

    The rest of this manuscript is organized as follows. Section 2 briefly introduces the Fisher criterion. Section 3 describes the improved Fisher criterion. In section 4 describes the proposed feature selection method based on an improved Fisher criterion. Section 5 gives the experimental results. Finally, the final section summarizes the full text and looks forward to the next step.

    The Fisher criterion is a measurement criterion to the contribution of a feature component to classify two classes. In steganalysis feature selection, the Fisher criterion is usually used to measure the separability (i.e. the contribution to classify stego images) of a feature component. The measurement results of the feature components based on the Fisher criterion are usually used as the basis for selection. The steganalysis feature selection based on the Fisher criterion can reduce the feature dimension. The Fisher criterion is defined as:

    Denote the cover image class as XC and the stego image class as XS. Feature spaces FC and FS are extracted from XC and XS, respectively. N is the element number of FC and FS. Then, the Fisher criterion is defined as:

    Fscore(fi)=[mC(fi)mS(fi)]2d2C(fi)+d2S(fi),(1iN) (2.1)

    where mC(fi) denotes the mean of the ith feature component of the cover image class, mS(fi) denotes the mean of the ith feature component of the stego image class, dC(fi) denotes the standard deviation of the ith feature component of the cover image class, dS(fi) denotes the standard deviation of the ith feature component of the stego image class.

    The larger the Fscore(fi) is the better the separability and the greater the contribution of the feature component to the stego images. For this reason, the Fscore values can provide a basis for reduction in feature selection. The feature selection based on the Fisher criterion has achieved good results in steganalysis [22].

    In this section, we analyze the principle of the steganalysis feature component separability measurement at first. And based on these, we propose an improved Fisher criterion to measure the separability of a feature component. Then we give the corresponding measurement algorithm of the improved Fisher criterion.

    In pattern recognition, the inner-inter class distance is the most intuitive basis of feature selection. It is assumed that if the distance between two classes (brief as inter-class distance) is larger and the distance within a class (brief as inner-class distance) is smaller, the two classes can be more accurately separated. Similarly, the greater the inter-class distance between the cover image class and stego image class is, and the smaller the inner-class distance within a class is, the better the two image classes can be separated in the feature space. In addition, the dispersion degree is also a concept of distance, which can be used to measure the inconsistent distribution between the two classes of samples in the feature space. A larger value of dispersion degree can also be the basis for separate the two classes of samples from each other in the feature space. Similarly, the larger value of the dispersion between the cover image class and the stego image class is, the greater the inconsistent distribution between the two classes of samples in the feature space is, then the better the ability to separate. The relationships between the separability and the eight different combinations of these distances are shown in the following Table 1.

    Table 1.  Relationship between distances combinations and separability.
    IDInterclass distanceInnerclass distanceDispersionSeparability
    alargelargelargeFormal
    blargelargesmallFormal
    clargesmalllargeGood
    dlargesmallsmallGood
    esmalllargelargeFormal
    fsmalllargesmallBad
    gsmallsmalllargeFormal
    hsmallsmallsmallBad

     | Show Table
    DownLoad: CSV

    To illustrate this further, a specific schematic diagram is shown as Figure 1:

    Figure 1.  Effects of inter distance and dispersion on stego image classification.

    In this figure, the red dot is the center of the cover image class, and the green dot is the center of the stego image class. The straight line between the red dots and the green dots indicates the inter-class distance between the cover image class and the stego image class. A blue triangle and a yellow square represent a cover image and a stego image, respectively. From Figure 1, it is not difficult to find that the cover images and the stego images are not easily separated when the inter-class distance and dispersion are all small, such as Figure 1(f) and (h). And on the premise of "larger inter-class distance", if the inner-class distance is smaller, the cover image and the stego image can be more easily separated, such as Figure 1(c) and (d). After comprehensive analysis, it can be seen that the feature components with "large inter-class distance, small inner-class distance" or "large dispersion" can better separate the cover image class from the stego image class. Therefore, to measure more comprehensively and accurately, we need to construct a separability measurement criterion that can measure "inner-inter class distance" and "dispersion" of steganalysis feature components. However, the inter distance of steganalysis feature component between cover image class and stego image class plays a more important role in distinguishing the stego image class from the cover image class than dispersion. As shown in Figure 1(a) and (c), as long as the inter distance of the steganalysis feature component is sufficiently large between the cover image class and the stego image class, the cover image class and the stego image class can be separated by the steganalysis feature component. That is to say, when the inter distance of the steganalysis feature component between the cover image class and the stego image class is large enough, the dispersion's influence of the feature components between the cover image class and the stego image class on image classification is not obvious. All of these tell us that when measuring the separability of a feature component, we should focus on the principle of "inner-inter class distance" while taking into account the principle of "dispersion".

    Fisher criterion can measure the separability of steganalysis feature component, however it still has some shortcomings. This section improves the Fisher criterion to measure the separability of steganalysis feature component more comprehensively and provide more accurate basis for selection. In the following, we present an algorithm to calculate the separability of the steganalysis feature component based on the I-Fisher criterion.

    In the Fisher criterion, [mC(fi)mS(fi)]2 represents the inter-class distance of the feature component fi between the cover image class and the stego image class. d2C(fi)+d2S(fi) represents the inner-class distance of the feature component fi in the cover image class and the stego image class. According to the "inner-class agammaegation, inter-class dispersion" of the pattern recognition principle, it is known that the steganalysis feature component with large inter distance has a greater contribution to classify the cover image class and the stego image class. Therefore, the Fisher criterion is used to measure the separability of the feature component between the cover image class and the stego image class. The larger the Fscore(fi) value, the better the separability of the feature component.

    However, there is a problem when the Fisher criterion measure the separability of the feature component. When mC(fi)=mS(fi), Fscore(fi)=0. It means that the separability of feature components is zero as long as the distance between cover image class and the stego image class is zero. However, this is not the case. According to the principle of pattern recognition, when the inner class agammaegation difference between the cover image class and the stego image class (i.e. dispersion) is significant, the feature component will also contribute to classify the stego image class and the cover image class.

    From the above subsection, when measuring the separability of a feature component, we should focus on the principle of "inner-class agammaegation, inter-class dispersion" while taking into account the principle of "dispersion".

    In general, the dispersion of a feature component can be represented by the standard deviation difference between the two classes. In addition, dC(fi)dS(fi) and dS(fi)dC(fi) can represent the dispersion of the feature component between the cover image class and stego image class, consider new equation

    g(fi)=12×(dC(fi)dS(fi)+dS(fi)dC(fi))=d2C(fi)+d2S(fi)2dC(fi)dS(fi) (3.1)

    To prevent the value of g(fi) from being too large, we use the function of sigmoid to balance the the value of g(fi). We define the measurement equation of dispersion. That is

    E(fi)=11+eg(fi) (3.2)

    We consider adding the function E(fi)=11+eg(fi) into the Fisher criterion to improve the Fisher criterion. In addition, since the principle of "inter-class dispersion and inner-class agammaegation" embodied in Fisher criterion plays a relatively important role in the separability measurement of steganalysis features, we give an improved Fisher criterion by taking into account the principle of "dispersion" and increasing the proportion of Fisher criterion (i.e. I-Fisher criterion).

    IFscore(fi)=4×Fscore(fi)+E(fi)=4×[mC(fi)mS(fi)]2d2C(fi)+d2s(fi)+11+e(d2C(fi)+d2s(fi)2dC(fi)dS(fi)) (3.3)

    where larger value of IFscore(fi), the better the separability of feature component fi, the greater the contribution of feature component fi to classify the stego image.

    This section introduces the application of the I-Fisher criterion in separability measurement. Let fi be a steganalysis feature component (1iN), where N is the element number of a steganalysis feature. The IFscore(fi) value is calculated based on the I-Fisher criterion. The calculation process is shown as Figure 2:

    Figure 2.  Calculation process of I-Fisher criterion.

    As shown in Figure 2, an operator is put into an gray box, where ()2 represents the square value of , ()() represents the radio of to , and ×4 reflects the importance of Fisher's criterion. The data which involved in the operation is put into a blue box. mC(fi) and mS(fi) represent the means of feature component fi in the cover image class and the stego image class, respectively. dC(fi) and dS(fi) represent the standard deviations of fi in the cover image and the stego image classes, respectively. Through the calculation procedure of this figure, the value of IFscore(fi) which is the separability of steganalysis feature component fi is obtained.

    Algorithms 1 is an algorithm to measure the separability of steganalysis feature component based on I-Fisher criterion. The specific algorithm is shown in Algorithm 1.

    According to the I-Fisher criterion, we can know that the larger the IFscore(fi) value, the better the separability of the feature component. Comparing the IFscore(fi) values of all the feature components in the feature space, we choose the larger feature components and remove the small feature components to reduce the feature dimension and maintain the detection accuracy of detecting stego images.

    In view of advantage of the I-Fisher criterion in separability measurement, the I-Fisher criterion proposed in this paper is applied as a heuristic function in the decision rough set α-positive region reduction (brief as IF-based method).

    This decision rough set α-positive region reduction can removes those feature components which do not fulfill the attribute independence requirement and positive region non-reduced. The decision rough set α-positive region reduction briefly describes as following.

    Let T be a decision table, where object set U={x1,x2,,x2m}, H and Q are the conditional attribute set and decision attribute set on U, respectively, X is the subset of U, α[0,1], if the attribute subset BH satisfies the following two conditions:

    1) Positive region non-reduced, POSαB(Q)POSαH(Q);

    2) Attribute independence, for any fiB, POSαB{fi}(Q)<POSαB(Q);

    Algorithm 1:Measurement algorithm based on I-Fisher criterion
    Input:Steganalysis feature component fi;
    Output:The IFscore values of fi;
    Step 1:Calculate the mean values of steganalysis feature component fi. According to the values of fi, calculate the averages of fi in the cover image class and the stego image class, which are defined as mC(fi) and mS(fi), where M is the element number of the cover image class and the stego image class, vcj,i and vsj,i represent the normalized values of fi in the jth cover image and stego image, respectively:
    mC(fi)=1MMj=1vcj,i;
    mS(fi)=1MMj=1vsj,i;
    Step 2:Calculate the standard deviations of steganalytic feature component fi. According to the values of fi and step1, calculate the standard deviations of fi in the cover image class and the stego image class, which are defined as dC(fi) and dS(fi):
    dC(fi)=1MMj=1(vcj,imC(fi))2;
    dS(fi)=1MMj=1(vsj,imS(fi))2;
    Step 3:Calculate the Fscore(fi) value of steganalytic feature component fi. According to the results of Step3 and Step4, calculate the Fscore(fi):
    Fscore(fi)=[mC(fi)mS(fi)]2d2C(fi)+d2S(fi);
    Step 4:Calculate the E(fi) value of the steganalytic feature components. According to the Step2, calculate the E(fi):
    E(fi)=11+eg(fi);
    Step 5:Calculate the IFscore value of the steganalytic feature components fi. According to the results of Step3 and Step4, calculate the IFscore(fi):
    IFscore(fi)=4×Fscore(fi)+E(fi).

    then the attribute subset B is a decision rough set α-positive region reduction of the attribute set H, where POSαH(Q)=xU/QRαH(X), RαH(X)={xU|P(X|[x]H)α}, the threshold α is a preset detection accuracy of objects. RαH(X) is a α-lower approximation set of X. [x]H represents the equivalence class of x under attribute set H. P(X|[x]H) represents the conditional probability, P(X|[x]H)=X[x]H[x]H. represents the element number in set .

    According to the decision rough set α-positive region reduction to reduce the steganalysis feature, we can obtain several decision rough set α-positive region reduction subsets. Then it selects the best result from those decision rough set α-positive region reduction subsets.

    The main steps of the IF-based method is shown as follows.

    Step 1: Constructing a feature matrix and the corresponding decision table. Construct a feature matrix based on the features extracted from the cover image class and the stego image class. Then, add a decision attribute as last column of the matrix to construct a decision table T. The decision attribute value of a cover image is "0", and the decision attribute value of a stego image is "1".

    Step 2: Measuring separability. Calculate the IFscore value of feature components in matrix based on IFisher criterion. The measurement method based on IFisher criterion is the emphasis of this paper. See Section 3.3 for details.

    Step 3: Reducing feature. First, delete irrelevant feature components. Set IFscoremin as the lower limit of IFscore value. If IFscore(fi)<IFscoremin, then the feature component fi is an irrelevant feature component. The irrelevant feature components are removed, and the rest feature component are add into the candidate feature component set H which is a multiset. Second, calculate the division step λ, λ=IFscoremaxIFscoreminm, where m is the number of expected feature subsets. Third, according to the IFscore value, sort the corresponding feature components of H in descending order. Fourth, according to the value of λ, divide the sorted steganalysis feature components into m subsets, i.e. H={h1,h2,,hm}, where hi={fi1,fi2,,fit} and t is the element number of feature subsets hi. Fourth, initialize B=. For each additional feature subset hi, determine whether the reduction set meets with the positive region non-reduction principle. If yes, continue to test whether the feature component in this feature subset hi meets with the attribute independence principle, else remove this subset, and continue to add the next subset. Then for the subset hi, test whether meets with the attribute independence principle. If yes, obtain this candidate subset B, else remove components in this subset. And output the reduction subsets B which meets with the positive region non-reduction principle and the attribute independence principle.

    Step 4: Selecting feature subset. Detect the training images based on reduction feature subsets, and calculate the detection accuracy. Select a feature subset B with high detection accuracy and low dimension from reduction subsets, which obtained in Step3, as the final reduction result. And output the subset B and the corresponding column number η.

    The detailed process is shown in [21]. In order to facilitate the reader's understanding, we give a diagram of the IF-based method. The diagram of the IF-based method is given in Figure 3.

    Figure 3.  The diagram for the IF-based method.

    In this section, after introducing the experimental settings, we analyze the experimental results of steganalysis feature selection method based on I-Fisher criterion.

    The images used in the experiments are from the BOSSbase-1.01 database* containing 10000 grayscale images, size 512 × 512. First, all the grayscale PGM images are converted to JPEG grayscale images with quality factor 95. Second, the stego images with the payloads of 1.0, 0.25, 0.5, 0.8, 1.0 bpac (bits per nonzero AC DCT coefficient) are generated by the SI-UNIWARD [12] steganographic algorithm, which has good anti-detection performance. Then one group of cover images and five groups of stego images are considered. The steganalysis features are extracted from all the cover and stego images, using the GFR method [23] (17000-D) and CC-PEV method [24] (548-D). A steganalysis feature database which includes 120000 features are obtained.

    * P. Bas, T. Filler, T. Pevny, available: http://agents.fel.cvut.cz/stegodata/

    Table 2 shows the image sets for experiment.

    Table 2.  Experimental image sets.
    Image source BOSSbase 1.01 database
    Image size 512× 2
    Image color Grayscale
    Extraction algorithm GFR [23], CC-PEV[24]
    Steganography method SI-UNIWARD
    Number of original 10000
    Number of stego images 10000×5
    Image format JPEG
    Embedding ratios 0.1,0.25,0.5,0.8,1.0
    Training images 5000 pairs
    Testing images 5000 pairs
    Total features 10000× = 120000

     | Show Table
    DownLoad: CSV

    In this paper we consider the Ensamble Classifier for steganalysis. The detection error PE represents the sums of false negatives (missed detections) and false positives (false alarms). ¯PE represents the average of the detection error PE. Average detection accuracy ¯PA=1¯PE is used to evaluate the performance. 5000 pairs of images are used as the training set. 5000 pairs of cover and stego images are used as the testing set. The number of expected feature subsets is considered as 100.

    The dimension of the GFR steganalysis feature proposed in [23] is 17000, including five sub-features, which capture the changes of image statistical feature from 5 different perspectives. We firstly consider the GFR features to conduct steganalysis. We then measure the IFscore values of every feature component based on the I-Fisher criterion. According to IFscore values and decision table, we use decision rough set α-positive region reduction to reduce the GFR feature. Finally, the average detection accuracy of the selected feature is evaluated for image steganalysis.

    In the plots of Figure 4, we show the average detection accuracy of the original GFR steganalysis feature and after being reduced using the IF-based method when the bit embedded payloads in the images are 0.1, 0.25, 0.5, 0.8, 1.0. The horizontal axis represents the feature dimension number and the vertical-axis represents the average detection accuracy. "" indicates the average detection accuracy of different dimensions after reduction, The red "" indicates that average detection accuracy of the original GFR (17000-D) before reduction. The green "" indicates the highest average detection accuracy and its feature dimension. As shown in Figure 4 (c), when the feature number is reduced to 11956 with payload 0.5, the detection accuracy is 0.6593, which is about 0.21% higher than that of the original. As shown in Figure 4 (d), when the feature number is reduced to 10325 with payload 0.8, the detection accuracy is 0.9156, which is about 0.26% higher than that of the original. From Figure 4, we can see that even if the feature dimension drops to lower than one fourth, it still achieves good classification effect. It can be observed that the proposed method significantly reduced the dimensionality of the steganalysis feature, while also improving the stego-image detection accuracy.

    Figure 4.  Detection accuracy of stego-images using the original GFR feature and after the feature selection based on the IF-based method. For the best results indicated on the plots we specify the number of features and the detection accuracy achieved as well those corresponding to the original feature.

    The memory cost of 10000 images' GFR features are compared before and after reduction when the payloads are 0.1, 0.25, 0.5, 0.8, 1.0. When the detection accuracy is closed to the original, the memory costs are shown as follows.

    The storage space of an original GFR steganalysis feature is close to 0.5 GB for 10000 images. In order to evaluate the memory requirement for the reduced steganalysis feature, in Table 3 we provide the memory requirements when the reduced feature achieves the same stego image detection results as the case of the original GFR steganalysis feature. From Table 3, it can be seen that the storage space of the GFR steganalysis feature after reducing the feature size under different payloads is significantly reduced as well. Such as: when the payload is 0.1, the original GFR feature requires 0.4610 G memory, the selected feature based on IF-based method needs 0.2831G memory, which saves 0.1779 G, i.e. 38.59%, of the memory cost. When the payload is 1.0, the original GFR feature requires 0.4621 G memory, the selected feature based on IF-based method needs 0.1121 G memory, which saves 0.3500 G, i.e. 75.74%, of the memory cost. In summary, the features which are selected by the IF-based method can save a lot of storage space.

    Table 3.  Comparison of memory cost before and after feature reduction.
    PayloadsGFR memory cost (GB)Selected featureSaving memory
    (GB)
    Saving ratio
    Feature number Memory cost(GB)
    0.10.461099450.28310.177938.59%
    0.250.4610126880.32390.137129.74%
    0.50.4611118140.29680.164335.63%
    0.80.461573370.20500.256555.62%
    1.00.462139870.11210.350075.74%

     | Show Table
    DownLoad: CSV

    The dimension of the CC-PEV steganalysis feature proposed in [24] is 548. As shown in [24] that is to construct a new multi-class JPEG steganalyzer with markedly improved performance. We firstly consider the CC-PEV features to conduct steganalysis. We then measure the IFscore values of every feature component based on the I-Fisher criterion. According to IFscore values and decision table, we use decision rough set α-positive region reduction to reduce the CC-PEV feature. Finally, the average detection accuracy of the selected feature is evaluated for image steganalysis.

    In the plots of Figure 5, we show the average detection accuracy of the original CC-PEV steganalysis feature and after being reduced using the IF-based method when the bit embedded payloads in the images are 0.1, 0.25, 0.5, 0.8, 1.0. The horizontal axis represents the feature dimension number and the vertical-axis represents the average detection accuracy. "" indicates the average detection accuracy of different dimensions after reduction. The red "" indicates that average detection accuracy of the original CC-PEV (548-D) before reduction. The green "" indicates the highest average detection accuracy and its feature dimension. As shown in Figure 5(b), when the feature number is reduced to 193 with payload 0.25, the detection accuracy is 0.52528, which is about 0.408% higher than that of the original. As shown in Figure 5(d), when the feature number is reduced to 504 with payload 0.8, the detection accuracy is 0.65245, which is about 0.125% higher than that of the original. From Figure 5, it can be observed that the proposed method reduced the dimensionality of the CC-PEV feature, while also improving the stego-image detection accuracy.

    Figure 5.  Detection accuracy of stego-images using the original CC-PEV feature and after the feature selection based on the IF-based method. For the best results indicated on the plots we specify the number of features and the detection accuracy achieved as well those corresponding to the original feature.

    Since the original dimension and occupied memory space of the CC-PEV steganalysis feature are small, the memory space that can be saved after the selection is also small. Therefore, the memory space comparison experiment of the CC-PEV steganalysis feature is no longer performed.

    The Steganalysis-α method [21] is a general steganalytic feature selection method based on decision rough set α-positive region reduction. This method further removes the redundant steganalysis feature components based on the Attribute independence principle of decision rough set α-positive region reduction, which can further reduce the feature dimension. In addition, this method removes the conflicting feature components based on the Positive domain non-reduced principle of the decision rough set α-positive region reduction, which can maintain or even improve the detection accuracy. The main steps of Steganalysis-α method is shown as follows: First, measure the ASM value of each feature component in steganalysis feature based on the ASM criterion. Second, reorder these feature components in descending order according to ASM values. Third, reduce the reordered feature components based on decision rough α-positive region reduction. Then select the high detection accuracy and low dimension from all the decision rough α-positive region reduction subsets as the final reduction result. Finally, detect stego images based on the selected feature components.

    Figure 6 is a double-axis histogram. The x-axis represents the payload, the left y-axis represents the detection accuracy, and the right y-axis represents the feature number. In the plots of Figure 6, we show the average detection accuracy of selected GFR steganalysis feature by the Steganalysis-α method and the IF-based method when the bit embedded payloads in the images are 0.1, 0.25, 0.5, 0.8, 1.0. The blue histogram represents the detection accuracy of the selected steganalysis feature by Steganalysis-α method. The green histogram represents the detection accuracy of the selected steganalysis feature by IF-based method. The yellow histogram represents the number of the selected steganalysis feature by Steganalysis-α method. The red histogram represents the number of the selected steganalysis feature by IF-based method.

    Figure 6.  Comparison with Steganalysis-α Method (GFR).

    It can be seen from Figure 6 that both the Steganalysis-α method and the IF-based method can reduce the GFR image steganalysis feature. In Figure 6, when payload is 0.5, the feature is reduced to 13205-D based on Steganalysis-α method. It removes 3795-D feature components. The average detection accuracy of reduced features is 0.6579. For features reduced by IFisher-based method, the number is 1249-D less than that of the Steganalysis-α method, and the detection accuracy is 0.14% higher than that of the Steganalysis-α method. When payload is 1.0, the feature is reduced to 11092-D based on Steganalysis-α method. The average detection accuracy of reduced features is 0.9697. For features reduced by IFisher-based method, the number is 1687-D less than that of the Steganalysis-α method, and the detection accuracy is 0.03% higher than that of the Steganalysis-α method.

    From Figure 6, it can be observed that the proposed I-Fisher criterion selects a smaller steganalysis feature than the Steganalysis-α method especially when embedding higher bit payloads, while also providing slightly better detection results.

    In the plots of Figure 7, we show the average detection accuracy of selected CC-PEV feature by the Steganalysis-α method and the IF-based method when the bit embedded payloads in the images are 0.1, 0.25, 0.5, 0.8, 1.0. The blue histogram represents the detection accuracy of the selected steganalysis feature by Steganalysis-α method. The green histogram represents the detection accuracy of the selected steganalysis feature by IF-based method. The yellow histogram represents the number of the selected steganalysis feature by Steganalysis-α method. The red histogram represents the number of the selected steganalysis feature by IF-based method.

    Figure 7.  Comparison with Steganalysis-α Method (CC-PEV).

    It can be seen from Figure 7 that both the Steganalysis-α method and the IF-based method can reduce the CC-PEV image steganalysis feature. In Figure 7, when payload is 0.5, the feature is reduced to 510-D based on Steganalysis-α method. It removes 38-D feature components. The average detection accuracy of reduced features is 0.52257. For features reduced by IFisher-based method, the number is 5-D less than that of the Steganalysis-α method, and the detection accuracy is 1.88% higher than that of the Steganalysis-α method.

    From Figure 7, it can be observed that the detection accuracy of the proposed method is higher than that of the Steganalysis-α method.

    In order to further reduce the dimension of steganalysis feature and improve the efficiency of steganalysis, this paper proposes a feature selection method based on I-Fisher criterion. First, adding the dispersion, this paper improves the traditional Fisher criterion. The I-Fisher criterion is able to measure separability more accurately than the traditional Fisher criterion. Then, we apply the I-Fisher criterion into the decision rough set α-positive region reduction during feature selection to select the steganalysis feature. Finally, a series of feature selection experimental results show that the proposed method can improve the detection accuracy of the steganalysis algorithm based on the selected features, while reducing the dimension and memory cost. In the future research work, we will continue to study how to evaluate the contribution of various steganalysis features.

    This work was supported in part by the National Natural Science Foundation of China (No. U1804263, U1636219, 61872448, 61602508, 61772549, and U1736214), the National Key R & D Program of China (No. 2016YFB0801303, 2016QY01W0105), the Science and Technology Innovation Talent Project of Henan Province (No. 2018JR0018).



    [1] J. Wang, T. Li, X. Luo, et al., Identifying computer generated images based on quaternion central moments in color quaternion wavelet domain, IEEE Trans. Circuits Syst. Video Technol., 29 (2019), 2775-2785.
    [2] Z. Zhou, Y. Mu and Q. M. J. Wu, Coverless image steganography using partial-duplicate image retrieval, Soft Comput., 23 (2019), 4927-4938.
    [3] J. Fridrich, Feature-based steganalysis for JPEG images and its implications for future design of steganographic schemes, International Workshop on Information Hiding, Springer, Berlin, Heidelberg, 2004, 67-81. Available from: https://linkspringer.xilesou.top/chapter/10.1007/978-3-540-30114-16.
    [4] Y. Zhang, D. Ye, J. Gan, et al., An image steganography algorithm based on quantization index modulation resisting scaling attacksand statistical detection, Comput. Mater. Continua, 56 (2018), 151-167.
    [5] Z. Qu, T. Zhu, J. Wang, et al., A Novel Quantum Stegonagraphy Based on Brown States, Comput. Mater. Continua, 56 (2018), 47-59.
    [6] C. Qin, X. Chen, X. Luo, et. al., Perceptual image hashing via dual-cross pattern encoding and salient structure detection, Inf. Sci., 423 (2018), 284-302.
    [7] A. Cheddad, J. Condell, K. Curran, et. al., Digital image steganography: Survey and analysis of current methods, Signal Process., 90 (2010), 727-752.
    [8] Y. Zhang, C. Qin, W. Zhang, et.al., On the fault-tolerant performance for a class of robust image steganography, Signal Process., 146 (2018), 99-111.
    [9] Y. Kang, F. Liu, C. Yang, et al., Color Image Steganalysis Based on Residuals of Channel Differences, Comput. Mater. Continua, 59 (2019), 315-329.
    [10] Y. Zhang, X. Luo, Y. Guo, et al., Multiple Robustness Enhancements for Image Adaptive Steganography in Lossy Channels, IEEE Trans. Circuits Syst. Video Technol., 2019 (2019).
    [11] Y. Yeung, W. Lu, Y. Xue, et al., Secure Binary Image Steganography with Distortion Measurement Based on Prediction, IEEE Trans. Circuits Syst. Video Technol., 2019 (2019).
    [12] V. Holub, J. Fridrich and T. Denemark, Universal distortion function for steganography in an arbitrary domain, EURASIP J. Inf. Secur., 1 (2014), 1-13.
    [13] V. Sachnev and H. J. Kim, Binary coded genetic algorithm with ensemble classifier for feature selection in JPEG steganalysis, 2014 IEEE Ninth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP), 2014, 21-24. Available from: https://ieeexploreieee.xilesou.top/abstract/document/6827700.
    [14] R. R. Chhikara, P. Sharma and L. Singh, A hybrid feature selection approach based on improved PSO and filter approaches for image steganalysis, Int. J. Mach. Learn. Cybern., 7 (2016), 1195-1206.
    [15] J. Kodovsky and J. Fridrich, Steganalysis of JPEG images using rich models, Proceedings Volume 8303, Media Watermarking, Security, and Forensics, California, 2012, 23-25. Available from: https://www.spiedigitallibrary.org/conference-proceedings-of-spie/8303/83030A/Steganalysis-ofJPEG-images-using-rich-models/10.1117/12.907495.short.
    [16] Steganalysis in high dimensions: Fusing classifiers built on random subspaces, Media Watermarking, Security, and Forensics III. International Society for Optics and Photonics, California, 2011, 23-26. Available from: https://www.spiedigitallibrary.org/conferenceproceedings-of-spie/7880/78800L/Steganalysis-in-high-dimensions-fusing-classifiers-built-onrandom/10.1117/12.872279.short?SSO=1.
    [17] J. Qin, X. Sun, X. Xiang, et al., Principal feature selection and fusion method for image steganalysis, J. Electron. Imaging, 18 (2009), 033009.
    [18] E. Oja, Principal components. and minor components. and linear neural networks, Neural Networks, 5 (1992), 927-935.
    [19] X. Luo, F. Liu, S. Lian, et al., On the typical statistic features for image blind steganalysis, IEEE J. Sel. Area. Comm., 29 (2011), 1404-1422.
    [20] Z. Li and A. G. Bors, Selection of robust features for the cover source mismatch problem in 3D steganalysis, 2016 23rd International Conference on Pattern Recognition (ICPR), IEEE, 2016 4256-4261. Available from: https://ieeexploreieee.xilesou.top/abstract/document/7900302.
    [21] Y. Ma, X. Luo, X. Li, et al., Selection of rich model steganalysis features based on decision rough set α-positive region reduction, IEEE Trans. Circuits Syst. Video Technol., 29 (2019), 336-350.
    [22] J. Lu, F. Liu and X. Luo, Selection of image features for steganalysis based on the Fisher criterion, Digit. Invest., 11, (2014), 57-66.
    [23] X. Song, F. Liu, C. Yang, et al., Steganalysis of adaptive JPEG steganography using 2D Gabor filters, Proceedings of the 3rd ACM Workshop on Information Hiding and Multimedia Security, 2015, 15-23. Available from: https://dlacm.xilesou.top/citation.cfm?id=2756608.
    [24] T. Pevny and J. Fridrich, Merging Markov and DCT features for multiclass JPEG steganalysis, Electronic Imaging, Security, Steganography, and Watermarking of Multimedia Contents IX, International Society for Optics and Photonics, 2007. Available from: https://www.spiedigitallibrary.org/conference-proceedings-of-spie/6505/650503/MergingMarkov-and-DCT-features-for-multi-class-JPEG-steganalysis/10.1117/12.696774.short?SSO=1.
  • This article has been cited by:

    1. Caijie Qin, Wenxing Hu, Xinsheng Wang, Xibo Ma, Application of Artificial Intelligence in Diagnosis of Craniopharyngioma, 2022, 12, 1664-2295, 10.3389/fneur.2021.752119
    2. Yuanyuan Ma, Xinquan Yu, Xiangyang Luo, Dong Liu, Yi Zhang, Adaptive feature selection for image steganalysis based on classification metrics, 2023, 644, 00200255, 118973, 10.1016/j.ins.2023.118973
    3. Xiuyun Zhai, Mingtong Chen, A machine learning-based nano-photocatalyst module for accelerating the design of Bi2WO6/MIL-53(Al) nanocomposites with enhanced photocatalytic activity, 2023, 5, 2516-0230, 4065, 10.1039/D3NA00122A
    4. Xinquan Yu, Yuanyuan Ma, Yi Zhang, Xiaolong Li, Yao Zhao, Fast dominant feature selection with compensation for efficient image steganalysis, 2024, 220, 01651684, 109475, 10.1016/j.sigpro.2024.109475
  • Reader Comments
  • © 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(4538) PDF downloads(359) Cited by(4)

Figures and Tables

Figures(7)  /  Tables(3)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog