1.
Introduction
The artificial neural network (ANN) is a computing framework that reflects the capability of the human brain to process information. The ANN's complex structure is due to the extensive interconnectivity of its neurons. By incorporating external input, the artificial neural network is trained to successfully execute a given task by utilizing all available neurons. Hopfield and Tank introduced the discrete Hopfield neural network (DHNN) in 1985 as one of the original subsets of ANN utilized in the solving of optimization issues. The DHNN has features such as fault tolerance, content addressable memory (CAM) and dynamical energy. The minimal energy of DHNN shows that the retrieved final neuron state is a global solution. Therefore, the outcome of the final energy function in DHNN is significantly influenced by the synaptic weight value gained by the network. Therefore, incorporating a symbolic rule is necessary to guarantee that the network consistently converges toward optimal solution. To address this problem, [1] is the primary work that incorporates satisfiability (SAT) logical rules into ANN. He introduced a basic principle of conducting logic programming within a DHNN through comparing the cost function and the Lyapunov energy function. Consequently, the connection between the logic (synaptic weight) is successfully computed by using the Abdullah method. This approach surpassed traditional learning methods like Hebbian learning [2]. Due to this success, [3] introduced Horn satisfiability (HORNSAT), a novel SAT concept in DHNN, incorporating a relaxation process during the retrieval phase. The result shows that HORNSAT can converge toward the absolute minimum energy effectively, and it is found that as the network becomes complex, the dynamic relaxation rate outperforms the constant relaxation rate. However, the implications of various logical rules within DHNNs remain unclear due to the constraints imposed by the HORNSAT structure. Therefore, a unique logical framework is needed, which allows more freedom with each clause not being limited to just one positive literal.
The authors of [4,5] extended the research to include other forms of structured logical rules, namely two satisfiability (2SAT) and three satisfiability (3SAT), respectively. The distinction between these two logical structures is that 2SAT contains two literals in each clause, whereas every clause within the 3SAT logic rule specifically includes three literals. The proposed 2SAT in DHNN demonstrates a high global minima ratio within a reasonable computational time. Even as the storage capacity of DHNN with 3SAT increases exponentially, the investigation into 3SAT has yielded an ideal value for the global minima ratio. Taking an alternative approach, [6] presented the integration of systematic 2SAT into the radial basis function neural network. This integration involves determining the center and width parameter values, which in turn enables the generation of output weights aligned with the 2SAT logic. Additionally, [7] proposed different evolutionary algorithms in the training phase of radial basis function neural network corresponding to 2SAT logic. Based on the result, the evolutionary programming algorithm is effective during the training phase and achieves an optimal output weight. Although systematic logical structures have been successfully integrated into DHNN, they suffer from a lack of diversity in terms of the logical structure. This leads to an overfitting of synaptic weights due to the similarity between the clauses. As a solution, the scope of the SAT domain in DHNN has expanded to the non-systematic logical structures. Non-systematic logical structures are distinguished by having a specific number of literals in each clause. This unique representation enables the incorporation of first-order logic into the current systematic logical structure. The types of logical rule have been expanded to the non-systematic logical structures. Initially, [8] proposed random k satisfiability by combining first-order and second-order logic for k = 1, 2. Random 2SAT(RAN2SAT) offers the flexibility of having a dynamic number of literals within the clause. However, experimental results showed that first-order clauses introduced more logical inconsistencies than second-order clauses. As a result, the DHNN was unable to finish the learning phase effectively due to an increased number of first-order clauses. This ultimately led to a less effective retrieval phase. Therefore, there are various perspectives that have been explored in the development of non-systematic logical structure. In contrast, [9] has proposed Y-type random 2SAT (YRAN2SAT), a flexible hybrid logical structure that combines systematic and non-systematic structures. It offers random enumeration of first-order, second-order or both types of clauses. The study implemented five possible pathways of YRAN2SAT that show an improved solution capacity. Despite the successful flexibility of YRAN2SAT in DHNN, the non-systematic structure of YRAN2SAT has limited capacity to retrieve diversified solution. [10] introduced weighted random k satisfiability (r2SAT) for k = 1, 2 as a new class of non-systematic logic with weighted ratios of negative literals. They also integrated a logical phase for generating the non-systematic structure with the designated quantity of negative literals. As a result, r2SAT performs well in producing diverse neuron states and global minima solutions. 
Additionally, [11] proposed an alternative approach where the logic phase of r2SAT is modified by integrating a binary artificial bee colony algorithm to control the distribution of negative literals. This approach has increased the global minima ratio. As a consequence of this discovery, using a dynamic allocation of negative literals will aid in generating a higher number of global minimum solutions with diverse final neuron states. However, the importance of having negative literal in the logical structure remains unclear and requires more analysis.
Furthermore, the existing logical structure for the above logical rule is fully utilized c possible combination of clauses (c = 2k), where k is the order of the clause. For example, the first order clause will have two possible combinations of clauses, while the second order clause will have four possible combinations of clauses and the third order clause will have eight possible combinations of clauses, which will be randomly generated. As stated earlier, r2SAT regulates the proportion of negative literals within a clause. However, there is still a chance that all possible combinations will be generated. Thus, instead of evaluating the significance of including negative literals in the logical framework, it is equally crucial to assess the efficiency achieved by eliminating clauses with only positive literals (for the second order clause). Another interesting study in logic satisfiability is concerned about the quality of the final neuron state that is retrieved by the DHNN. Activation function is a crucial parameter within a DHNN. Activation functions are used to transform the weighted sum of inputs into a neuron's output signal. Therefore, the activation function plays a crucial role in retrieving the final neuron state within DHNN. An optimal activation function can provide greater variations and enhanced diversity in the final neuron states. For many years, a lot of activation functions have been explored in the neural network. [12] has conducted a survey that presents the developments of activation functions in neural networks. This survey has examined the performance of 18 activation functions across different types of networks. This survey mentioned that the hyperbolic tangent activation function is suffering from the gradient vanish problem. In another review, [13] conducted a survey into trainable activation functions within the neural network field. In this paper, a comprehensive survey of trainable activation functions has been presented. Based on the review, the neural networks utilizing trainable activation functions can reduce computational complexity as they involve fewer parameters to manage. The traditional activation function used in neural networks is hyperbolic tangent activation function. [14] has explored the two-dimensional parameter space of the system by analyzing hyperbolic tangent and piecewise-linear activation functions. While [15] uses the classical hyperbolic tangent activation function with a proposed simple Rectified Linear Unit (ReLU) activation function to develop circuit's implementation of the Hopfield neural network, the traditional activation function used in logic satisfiability DHNN is hyperbolic tangent activation function (HTAF). [16] has proposed HTAF and the Elliot symmetric activation function in doing 3SAT in DHNN. In this work, various activation functions have been employed as dynamic post-optimization techniques to convert the activation level of a unit (neuron) into an output signal. In terms of global solutions, global hamming distance and central processing unit or computational time, the HTAF is noted to exhibit superior performance compared to the Elliot symmetric activation function and the McCulloch-Pitts function. Despite the acceptable performance of the HTAF, there remains questions about the interpretability of error analysis and the quality of final neuron states in the retrieval phase of DHNN, indicating a need for further analysis. 
Hence, through the application of an optimal activation function, the efficiency of the updating rule in DHNN can be enhanced. This results in a greater variety and a higher degree of diversity in the final neuron states.
Therefore, the present study addresses these gaps by introducing a new type of non-systematic random k satisfiability, for k = 1, 2 that utilizes first and second order logic with a specific condition. The suggested logic will eliminate positive literals from the second-order clauses while not imposing any constraints on the first-order clauses. This highlights the significance of negative literals in the logical structure, and further examination can be conducted on synaptic weight management for this proposed logic. On the other hand, by implementing an optimal activation function, the update rule in the DHNN can be made more effective, providing a broader variation and increased diversity in the final states of the neurons. [10] represents the nearest effort in recognizing the potential impact of negative literals within a logic satisfiability DHNN, while [16] is the most relevant study that analyzes various activation functions in a logic satisfiability DHNN. Thus, the contribution of this paper as follows:
1) To formulate a new non-systematic logical rule, namely, conditional random k satisfiability, where k = 1, 2 and uses first order and second order logic without including both positive literal in the second order clauses. The proposed conditional RAN2SAT will demonstrate the role of negative literal in the second order clause.
2) To incorporate the proposed conditional RAN2SAT into DHNNs by reducing the logical inconsistency of the logical rule corresponding to the zero-cost function. The behavior of the DHNNs will be influenced by the cost function derived in line with the proposed logic.
3) To introduce a novel non-monotonic Smish activation function intended to improve the efficacy of the updating rule within DHNNs, which is expected to contribute to greater variety and enhance diversity in the final neuron states. The performance of the Smish activation function will be evaluated with different types of activation functions during the retrieval phase of DHNNs.
4) To investigate the performance of conditional RAN2SAT in DHNNs. In addition, the capability of non-monotonic Smish activation function with different logical rules will be analyzed. Various performance metrics, such as learning error, testing error, energy profile and similarity metric, are presented to validate the performance of the proposed conditional random k satisfiability, which utilizes the Smish activation function.
The article is structured as follows: Section two discusses the motivation behind the research. Followed by this, section three introduces the basic formulation of conditional random k satisfiability for k = 1, 2, which is also abbreviated as conditional random k satisfiability (CRAN2SAT). Section four elaborates the detailed presentation of the proposed model, CRAN2SAT, incorporating with DHNN. Furthermore, section five provides a concise discussion of the proposed activation function used during the retrieval phase of the DHNN model. Section six introduces the experimental configurations and the metrics used to evaluate performance in this study. Section seven explores the results and related discussions about the performance of the proposed DHNN-CRAN2SAT logic. This also illustrates the influence of activation functions on optimizing the final neuron state of the DHNN model, based on the selected metrics. The findings of this study are summarized in section eight.
2.
Motivation
2.1. The exploration of satisfiability representation in DHNN
The current development of satisfiability representation is significant in the field of computer science and mathematics where it offers an alternative method of representing the information of any datasets. Therefore, the purpose of satisfiability representation integrated with DHNN is to enable the user to interpret the output from the network. In simpler terms, satisfiability representation can be understood as a logical guideline that depicts the output generated by the network. Even though there are various types of satisfiability representations integrated with DHNN, there is still an empty room for exploration regarding the structure, behavior and their potential application. The logical rule that has been proposed by other researchers [8,9,10,17,18,19] shows that each of the logical structures has their own characteristic, whereby all the logical rule has been successfully embedded in DHNN. Thus, this motivates this study to explore the satisfiability representation in other perspectives, because different satisfiability representation will demonstrate a different performance analysis. Consequently, various types of logical rules can help practitioners or engineers to choose the best logic that suits their problem. Satisfiability representation finds application in various domains including quantum chemistry [20], classification methods [21], chaos computing [22] and in logic mining methodologies [23]. Therefore, by exploring other types of logical rule in DHNN, it is expected to be a guidance for the outsider researcher when choosing the best logic according to their problem or datasets.
2.2. The effect of negative literal in second order clause
The logical combination of SAT is highly influenced by the order of clauses in non-systematic SAT [8,9,10]. For example, k SAT generates c possible combinations of clauses (c = 2k), where k represents the clause order. A new class of non-systematic logic called weighted random k satisfiability was introduced by Zamri et al. [10]. It incorporates weighted ratios of negative literals for different values of k = 1, 2. They also included a logic phase to generate the desired number of negative literals in the non-systematic structure. Based on the successful performance of r2SAT in generating diverse neuron states and global minima solutions, it is evident that a dynamic distribution of negative literals contributes to producing diversified final neuron states. However, the significance and rationale behind using these negative literals within the logical rule remains unclear. Therefore, this study aims to analyze the impact of restricting the possible combinations in solving logic satisfiability using DHNN. We hypothesize that imposing this restriction will result in higher global solutions compared to the existing non-systematic logical rules. Thus, this study aims to investigate the impact of including at least one negative literal in each of the second-order clauses while no restrictions will be imposed on first-order clauses. By exploring this aspect, we can gain a better understanding of the importance of negative literals within the logical rule. Further exploration in this area could provide valuable insights for developing more efficient logical structures within logic satisfiability in DHNN.
2.3. Inadequate metrics for evaluating the effectiveness of activation functions in DHNN
The selection of the activation function plays a crucial role in neural networks as it has a significant impact on the network's ability to learn complex patterns and make accurate predictions [24]. Over the years, numerous activation functions have been investigated in the field of neural networks. Dubey et al. [12] conducted a survey that presents advancements in activation functions within neural networks. In DHNN logic satisfiability, an activation function acts as a mathematical operation that determines a final neuron state based on its initial input. Obtaining an optimal retrieved final neuron state is significant since the quality of the final neuron state reflects the nature of the logical rule in DHNN. The study conducted by Mansor and Sathasivam [16] demonstrates the successful performance of the Hyperbolic tangent activation function in DHNN. However, there is limited information on how errors are interpreted and how the final neuron state influences the overall performance in DHNN. This highlights a potential research opportunity to enhance the updating rule in DHNN by implementing an optimal activation function. Such an enhancement could lead to increased diversity and variations in the final neuron states, ultimately improving both effectiveness and efficiency of this neural network model in retrieving diversified final neuron states. According to [17], expanding the diversity of the final neuron state improves the ability of DHNN to identify more global solutions in various solution spaces. This urges the study to propose an optimal activation function to enhance the updating rule in the DHNN, with the aim of guaranteeing that the DHNN retrieves a more varied and diversified range of final neuron states. Therefore, this study aims to emphasize the importance of utilizing an optimal activation function to improve the effectiveness of updating rule in DHNN.
3.
CRAN2SAT representation
CRAN2SAT is a logical representation that consists of first order and second order logic with a specific condition. The proposed logic will exclude both positive literals in the second-order clauses and does not impose any restrictions on the first-order clauses. It's worth noting that that the symbol c within CRAN2SAT is used as a versatile representation of "conditional", while also serving the purpose of distinguishing it from the conventional RAN2SAT with the newly introduced logical rule. The basic components of proposed CRAN2SAT are as follows:
(a) A set of m second order clauses represent as C(2)∗1,C(2)∗2,...,C(2)∗m, where C(2)∗m=(Bi∗∨Di∗),m∈Z+.
(b) A set of n first order clauses represent as C(1)1,C(1)2,...,C(1)n, where C(1)n=(Ai), n∈Z+.
The C(2)∗m and C(1)n represents the second and first order clauses, where a set of literals can be either positive or negative literal such that
Bi∗∈{Bi∗,¬Bi∗}, Di∗∈{Di∗,¬Di∗} and Ai∈{Ai,¬Ai}.
Notably, i denotes the number of independent literals within the clauses. It's worth noting that the primary distinction between CRAN2SAT and RAN2SAT lies within the components (a). The logical structure of RAN2SAT [8], will be fully utilized c possible combination of clauses (c = 2k), where k is the order of the clause. The first order clause, C(1)i will have two possible combinations of clauses as in Eq (1). While the second order clause, C(2)i will have four possible combinations of clauses as in Eq (2). However, the logical structure of CRAN2SAT will exclude both positive literals in the second-order clauses. In other words there are only three possible combinations of second order clauses. Due to this condition, the second order clause of CRAN2SAT is structured based on C(2)∗i as in Eq (3)
All the clauses are connected by logical AND (∧) and literals within each clause are connected by logical OR (∨). Therefore, by using the components in (a)–(d), the general formulation of CRAN2SAT or PCRAN2SAT and the definition of the clause in PCRAN2SAT is in Eq (5), where m represents the total count of clauses with two literals, while the variable n represents the total count of clauses with one literal
From Table 1, there will be only 3 possible combinations for the second order clause (refer Eq (3)), which means the combination of the second order clauses can be appear repeatedly as the number of the second order clause m increases (maybe the clauses will appear consistently same). The main criteria that makes the proposed CRAN2SAT differ from r2SAT is that even though r2SAT controls the proportion of the negative literal in the clause, there is still a possibility of all the combinations in Eq (2) being generated. The bipolar value that holds value of one and negative one represents TRUE and FALSE, respectively. If all the clauses in PCRAN2SAT are satisfied, the logical rule of PCRAN2SAT=1. In other words, the clauses in PCRAN2SAT are not satisfied and the logical rule of PCRAN2SAT=−1. In this paper, a random distribution of C(1)i and C(2)∗i will be embedded in DHNN. According to Eqs (4) and (5), the possible structure of PCRAN2SAT can be represented as in Table 1.
4.
CRAN2SAT in DHNN
The DHNN is a subset of ANN. It comprisesNinterconnected neurons with no hidden layers. One notable feature of DHNN is its CAM, which allows it to store patterns related to the problem. DHNN contains a limited number of neurons in bipolar form, denoted as Si = {–1, 1} for i∈N, which are interpreted as true and false respectively. The general formulation of updating the neuron state with predefined threshold value is as follows:
Where Si and Sj are states of the ith and jthneuron, respectively, Wij is the synaptic weight between ith and jthneuron, with θ as the predefine threshold value that is set to θ=0. This is to ensure the energy of DHNN is monotonically decreases [3]. There are two characteristics for the synaptic weight in DHNN. Initially, in the DHNN there is an absence of self-connection among the neurons suggesting that the diagonal of synaptic weight is zero, where Wii=Wjj=0. Second, the synaptic weight within DHNN consistently exhibits symmetry, denoted by Wij=Wji. In this research paper, the logic of PCRAN2SAT is incorporated into DHNN by associating each variable of PCRAN2SAT with the corresponding neuron state.
The process of logic satisfiability in DHNN encompasses two parts, generally referred to as the learning phase and the retrieval phase. During the learning phase, the primary goal is to reduce the inconsistency of PCRAN2SAT,which corresponds to the minimized cost function, EPCRAN2SATdefined as:
where NC and m+n refer to the number of clauses and number of variables in PCRAN2SAT, respectively. The inconsistency of PCRAN2SAT is defined by taking the negation of PCRAN2SAT, denoted as Qij, which is defined as
SX denotes the neuron state in the clause with X and ¬X as a positive and negative literal respectively. X consists of arbitrary literals of {Ai,Bi,Ci}. In the general equation of the logical structure PCRAN2SAT represented by Eqs (4) and (5), the conjunction(∨) signifies multiplication while the disjunction (∧) represents addition in the cost function. The minimized cost function is identified by the utmost number of clauses that have been satisfied. The value of EPCRAN2SAT has a proportional relationship with the number of clauses in PCRAN2SAT that are not satisfied. When the number of unsatisfied clauses increase, it will result in a corresponding increase in the value of EPCRAN2SAT. Based on Eq (7), if EPCRAN2SAT=0, this indicates that the logical inconsistency of PCRAN2SAT has been minimized. In other words, EPCRAN2SAT=0 shows that all the clauses in PCRAN2SAT are satisfied.
According to Abdullah's method [1], the value of synaptic weight Wij is determined by comparing the coefficient of the cost function, EPCRAN2SAT with the Lyapunov energy function denoted as HPCRAN2SAT. The Lyapunov energy function is defined as
The Lyapunov energy function in DHNN is formulated to describe the energy value of the network. When this energy reaches its global minimum, the network achieves a stable state [25]. The DHNN model aims to ensure that solutions move towards the global minimum energy, which corresponds to the stable state of the neurons. However, it is worth mentioning that finding the global minimum energy depends on the efficiency of both learning and retrieval phases in DHNN. If these phases are inefficient, DHNN may end up trapped at a local minimum energy or suboptimal solution. Then, all the value of Wij will be stored in a matrix form as a CAM. During the retrieval phase, the final neuron state generated by the DHNN is assessed to determine whether the solution achieved is global or local. During this phase, the retrieved neuron state is asynchronously updated by applying the Wij values stored in the CAM. Consequently, through the utilization of the stored Wij in CAM, the DHNN's local field is computed in the subsequent manner, where local field is denoted as hi,
Once the value of hi is obtained, the final neuron state is retrieved by squashing the value of hi by using an activation function. The final neuron state Si(t) in DHNN will be updated by
where g(hi) is obtained according to the types of the activation function. Currently, the HTAF is used to transform the value of hi, to be either one or negative one. However, this paper proposes a new non-monotonic Smish activation function to enhance the efficiency of the updating rule within DHNN. Additionally, the DHNN-CRAN2SAT with proposed Smish activation function will be compared with other types of activation functions, namely, McCulloch-Pitts, piecewise linear activation function, Elliot activation function, HTAF and Swish activation function. A detailed explanation regarding the proposed Smish activation functions is discussed in section five.
Finally, the quality of the retrieved neuron state is assessed using the equation provided below:
where Tol = 0.001 is a predetermined tolerance value [3]. The HPCRAN2SAT is the final energy associated with the final neuron state Si(t), and HminPCRAN2SAT are two absolute minimum energies computed by
where m and n represent the number of second and first order clauses existing in PCRAN2SAT,respectively. The final energy HPCRAN2SAT is computed by using Eq (9). According to [25], the Lyapunov energy function plays a crucial role in determining the level of convergence within the network. The energy value obtained can be categorized as either the global minimum or a local minimum.
Hence, if Eq (12) is fulfilled, the resulting neuron state will be regarded as a global minimum energy. Otherwise, the retrieved neuron state will be confined to a local minimum energy.
5.
Proposed non-monotonic Smish activation function in DHNN
This section will introduce the non-monotonic Smish activation function in solving logic satisfiability in DHNN. This, its characteristics, and properties will be discussed further. The activation function works as a transfer function to transform the output into the bipolar form. The final neuron state that is retrieved by DHNN will demonstrate the nature of the model. The purpose of introducing the non-monotonic Smish activation function in DHNN is to enhance the effectiveness of DHNN in retrieving a diversified final neuron state. Notably, this is the first attempt of applying the Smish activation function in solving logic satisfiability in DHNN. Smish activation function is a new type of nonlinear activation function and was proposed by [26]. Figure 1 illustrates the figure for the Smish activation function and its derivative.
The Smish activation function is expressed as follows
with
The Smish activation function is a nonlinear activation function that exhibits a graph that is not linear. The output of the Smish activation function ranges between [−0.25,∞). In terms of its output range, the Smish activation function has a lower bound but no upper bound. In Figure 1, it can be observed that the Smish activation function is continuous without any discontinuity in its output range. The Smish activation function is considered as a learning based adaptive activation function because it contains learnable parameters value α and β. If compared to the other types of activation functions, there is no doubt that the Smish activation function is more complex in terms of the function itself. According to [26], the sigmoid(βhi) in Eq (14) is used to reduce the range of values local field, hi. Furthermore, the logarithmic operation is incorporated to achieve a smoothly transitioned curve and a consistent trend. In the perspective of solving logic satisfiability in DHNN, Smish then multiplies its tanh operation by the value of the local field, hi simultaneously. Therefore, this demonstrates the ability to regularize negative output. Due to these properties, the Smish activation function can maintain partial sparsity and a regularization effect for negative inputs. As a result, the positive values will serve as a straightforward linear representation. Since the Smish activation function is reported as a smooth and non-monotonic function, it will help to improve the ability of the network convergences. This makes the Smish activation function compatible with the Lyapunov energy function in DHNN, as the Lyapunov energy function plays a pivotal role in monitoring the convergence of DHNN [27].
According to [28], an ideal activation function is anticipated to possess certain characteristics. The function should exhibit continuity and differentiability across all points, with the derivative remaining unsaturated to achieve optimal performance. The issue of vanishing gradients can occur when using a saturated activation function, which can impact inputs that are highly positive or highly negative. According to [12], the issue of a vanishing gradient occurs when the gradient of an objective function with respect to a parameter approaches zero, which results in minimal parameter updates during network training. Thus, the vanishing gradient problem will cause the output value to have a tendency to move toward zero. As a result, extremely small derivatives have the potential to interrupt the learning process. Therefore, to suit with the expected features, the Smish activation function is a non-zero derivative at zero (refer to Figure 1). This eventually will help to overcome the problem of vanishing gradients because it is partially saturated toward, hi→(−∞). The properties of continuously differentiable are preferable in good activation functions. The undefined gradient at the midpoint of a non-continuously differentiable activation function can negatively impact training performance [29]. Maintaining a differentiable property helps avoid singularities [30] and impacts the convergence speed of the network [31].
6.
Experimental setup
This section outlines the experimentation process that has been developed to assess the efficacy of the proposed model. It includes an explanation of the simulation platform, what is the existing and benchmark of the activation function, how parameters and activation functions are assigned, the performance metrics that will be utilized and the baseline methods.
6.1. Simulation platform
The proposed model DHNN-CRAN2SAT with Smish activation function is implemented in DEV C++, version 5.11 with a specification of Intel Core i7 processor with 8 GB RAM in the Windows 10 operating system. These simulations were running out by using the same device to avoid biases. The simulated datasets are generated by randomly assigning bipolar representations {–1, 1} to the neuron state. In this study, a simulated dataset will be used to analyze the performance of different approaches. First, the proposed DHNN-CRAN2SAT is compared with the existing non-systematic logical rule (DHNN-CRAN2SAT, DHNN-r2SAT and DHNN-YRAN2SAT). Then, DHNN-CRAN2SAT with a proposed non-monotonic Smish activation function is compared with other types of activation functions, such as McCulloch-Pitts, Piecewise Linear activation function, Elliot symmetric activation function, HTAF and Swish activation function. Finally, the capabilities of Smish activation functions will be conducted within the existing non-systematic logic in DHNN. Figure 2 explains each configuration of DHNN-CRAN2SAT.
6.2. Existing and benchmark activation functions
The final neuron state Si(t) in DHNN will be updated by using Eq (11), where g(hi) is obtained according to the types of the activation function. In this study, there are five benchmark activation functions that will be compared with the proposed Smish activation function. Currently, the HTAF is used to transform the value of the local field hi to bipolar form of {–1, 1} [19,32,33]. The choice of the benchmark activation function is chosen based on two criterions. The first criterion is based on the existing activation function that has been applied in solving logic satisfiability in DHNN, such as McCulloch-Pitts [1], Elliot symmetric and hyperbolic tangent [16]. The second criteria considers the activation function that has never been applied in solving logic satisfiability in DHNN, such as Piecewise linear and Swish activation function. The second criterion is specifically chosen based on the characteristic of the activation function. The Swish activation function has the same characteristics and non-monotonic activation function as Smish, while the piecewise linear function is composed of multiple linear segments, each defined over a specific interval. Activation functions play an important role in the neural network, where it is a mathematical function that determines the final neuron state for the given input neuron. The quality of a final neuron state that is retrieved by DHNN will demonstrate the nature of the model. The existing activation function that has been previously applied in DHNN has several issues. The details of the existing and benchmark activation are discussed in the rest of this section.
6.2.1. McCulloch-Pitts
The McCulloch-Pitts function is the conventional platform that has been used in logic satisfiability in DHNN. This function was popularized by [1] and it is reported that the solution will trap in the local minimum of the energy [34]. The output of the McCulloch-Pitts does not have limitations (unbounded) because the McCulloch-Pitts function is fully based on the value of local field, hi. Therefore, the updating neuron state will be directly transformed by using Eq (11)
6.2.2. Elliot symmetric activation function
The Elliot symmetric activation function is a computationally efficient approximation of the hyperbolic tangent, as it maps the output to a range between (–1, 1) [35]. The Elliot symmetric activation function is expressed below:
The Elliot symmetric activation function has been applied in doing logic programming in DHNN for 3SAT clauses [16]. However, the hyperbolic tangent activation function demonstrates superior performance when compared to the Elliot symmetric activation function. In addition, it is also used for extreme learning machines for industrial drying process regressions [34], and the result shows that Elliot obtained the best performances in most cases.
6.2.3. HTAF
HTAF produces values in the range of (–1, 1) and it is symmetric about the origin. The formulation of HTAF is given as follows
The properties of HTAF are
Furthermore, tanh(hi)→1 as hi→∞ and tanh(hi)→−1 as hi→−∞ and it is strictly increasing on as R. Due to tanh(hi)∈(−1,1), this indicates the output range of HTAF is between (–1, 1). A general problem with the HTAF is that this function is saturated and suffers from the vanishing gradient problem because of its bounded limits [36]. Based on the above properties, if the value of the local field hi becomes larger, it will approximate the tangent function to be one while small values of the local field hi will approximate the tangent function to be negative one. Moreover, the functions are only sensitive to changes around the midpoint of their input, such as when the input is zero. Once saturated, it becomes challenging for the learning algorithm to continue to adapt the weights to improve the performance of the model [37]. During the retrieval phase, the final neuron state is updated by using Eq (11). Based on this equation, the final neuron state will be transformed to one if g(hi)⩾0. Due to this reason, the tangent function will contribute to the low diversified final neuron states. This occurs since the tangent function is only responsive to changes around the midpoint of a zero input. Beyond this point, it typically approaches either one or negative one.
6.2.4. Piecewise linear activation function
A piecewise linear function is a type of function that comprises a limited number of linear segments, each characterized over an equally sized interval. This is an example of a piecewise linear activation function:
The key feature of piecewise linear function is the lack of curvature within each interval defined by its breakpoint, which results in a constant first-order derivative. The piecewise linear function has a lack in terms of the transition points, which might introduce non-smoothness in the function. Therefore, it will affect the lack of differentiability at the transition points [28].
6.2.5. Swish activation function
The Swish activation function proposed by [38] is obtained by multiplying the input and sigmoid function. The hybrid activation function is defined as follows
Swish is a smooth, non-monotonic activation function that lacks and doesn't have any upper bound. It is bounded below and unbounded above. Swish is partially saturated toward, hi→(−∞) and, thus, Swish does not suffer vanishing gradient problems. The distinguishing characteristic of the Swish function from other activation functions is its non-monotonic properties. The Swish activation function with β = 1 was introduced by [39]. Based on the parameter value of β, the shape of the Swish activation function is adjusted between the linear and ReLU functions. The smaller and higher values of β lead toward the linear and ReLU functions, respectively [12]. As mentioned in the previous section, a good activation function should exhibit continuity and differentiability across its entire range, with the additional condition that its derivative remains unsaturated. The vanishing gradient problem will cause the output value to have a tendency to move toward zero [12]. Out of the five-benchmark activation function, only Swish does not have a vanishing gradient problem. Figure 3 illustrates the graph of the benchmark activation function and its derivative.
Table 2 summarizes properties of all activation functions and Table 3 lists all the functions for the benchmark of activation functions.
6.3. Parameter and activation function setup in DHNN
The evaluation of the proposed DHNN-CRAN2SAT model will be conducted based on its performance during the learning and retrieval phases. The exhaustive search (ES) technique that is employed during the learning phase is to validate clause satisfaction. The characteristic of ES is based on a "trial and error" method. Therefore, the learning will be terminated upon achieving the target of clause satisfaction or upon reaching the maximum count of learning iterations denoted as NH. Followed by the implementation of the proposed non-monotonic Smish activation function due to access of the quality of the retrieved neuron state during both the learning and retrieval phases, the initialization of neuron states is carried out through random generalization. This approach guarantees an equal and unbiased representation of the generated solutions. The parameter value involved in the Swish and Smish activation function is chosen based on the current research by [26,38], respectively. The parameters employed for the simulation were listed in Tables 4 and 5.
6.4. Performance metrics
This section will explain how the performance of the DHNN-CRAN2SAT model is evaluated with the other logical rule. The performance of the DHNN-CRAN2SAT model will be evaluated by using error analysis (learning phase and retrieval phase) and similarity analysis. Since we also proposed DHNN-CRAN2SAT with non-monotonic Smish activation function, the capability of the activation function is evaluated based on the testing error analysis and similarity analysis. Therefore, the purpose of the performance metrics is listed below:
(a) Learning phase: To minimize the cost function in Eq (7), by searching for a satisfying interpretation of DHNN-CRAN2SAT.
(b) Retrieval phase: To access the quality of the retrieved neuron states (determine the solutions are global minima solutions or local minima solutions); involves the computation of Eq (12).
(c) Similarity analysis: To examine the variation in the final neuron states generated by the DHNN model as compared to benchmark neuron states.
6.4.1. Learning phase
Throughout the learning process, the objective is to examine the efficiency of DHNN-CRAN2SAT in minimizing the cost function, as indicated in Eq (7). In the learning phase, the root mean square error (RMSE) and mean absolute percentage error (MAPE) are utilized to measure the capability of DHNN-CRAN2SAT in getting satisfied interpretations. This metric has been commonly employed in numerous studies, such as in [40,41]. Therefore, to evaluate the efficacy of DHNN-CRAN2SAT throughout the learning process, Eqs (21) and (22) are introduced as follows
where fNC represents the maximum fitness and fi signifies the current fitness. The best fitness is related to the number of clauses that are satisfied. Note that NC represents the count of clauses within the logical rule. Moreover, the DHNN-CRAN2SAT is considered optimal when the errors in Eqs (21) and (22) approach a zero value.
6.4.2. Retrieval phase
In the retrieval phase, the assessment of the solution's quality generated by DHNN-CRAN2SAT relies on the synaptic weights generated throughout the learning phase. Inspired by [18], we utilize RMSE and MAPE to evaluate the accuracy of the retrieved neuron state. A zero value for testing errors signifies that the retrieval phases are optimal:
Furthermore, the energy error analysis is employed to access the final neuron state that converges toward the minimum energy of DHNN-CRAN2SAT. Once the local field value hi is obtained according to Eq (10), the final neuron state is obtained by applying an activation function to squash the hi value, as depicted in Eq (11).
In this study, a new non-monotonic Smish activation function has been introduced to enhance efficiency of the updating rule within DHNN. Therefore, once the retrieved final neuron state is converted to the bipolar form of {–1, 1}, the final energy is computed by using Eq (9). The assessment of energy error is evaluated by using RMSE and MAPE as described in Eqs (25) and (26), such as in the work [40,42].
Additionally, the global minima ratio Zm will demonstrate the capability of our model work with DHNN [3]. Throughout the evaluation stage, the final state of the neuron will be analyzed to determine whether it converges toward a global or local solution by using Eq (12). If the equation is satisfied, the solution is considered as a global solution. Otherwise, it will be considered as a local solution. The global minima ratio is obtained by using the following formula:
Notably, the retrieval phase is evaluated differently based on the performance of DHNN-CRAN2SAT and DHNN-CRAN2SAT with Smish activation function.
6.4.3. Similarity analysis
The quality of the retrieved neuron states generated by DHNN-CRAN2SAT can be analyzed by using similarity analysis and total neuron variation. This similarity analysis was inspired by [27], in which the similarity of the neuron state was assessed by comparing the retrieved final neuron state with the benchmark neuron state of the DHNN-CRAN2SAT model. The comparison will be evaluated based on the individual neuron state. The benchmark neuron state, which is also known as an ideal neuron state, is defined as
Here, A and ¬A denote positive and negative literals, respectively, within every clause of the DHNN-CRAN2SAT model, Smaxi is the benchmark neuron state of PCRAN2SAT and Sirepresents the obtained final neuron state through the DHNN process. Note that the similarity index only evaluates the final neuron state that fulfills Eq (12), signifying that the final neuron state must attain global minimum solution. The formulation of the Jaccard similarity index is as follows
Table 6 describes the parameter value involved in computing the value of the Jaccard similarity index and the list of symbols used in the retrieval phase summarized in Table 7.
The range of Jaccard similarity index is between (0, 1), which indicates the lowest value of Jaccard similarity index and shows high dissimilarity between the Si and Smaxi. Furthermore, the total neuron variation is formulated as in Eqs (30) and (31)
Therefore, ω signifies the total string of the final neuron state generated by DHNN, which aligns with the global minimum solutions. Meanwhile, Fi acts as a scoring mechanism to calculate the count of distinct neuron states (Si≠Smaxi) throughout the retrieval phase.
6.5. Comparative analysis and baseline methods
This research will carry out two assessments in relation to other foundational techniques, with a focus on experimenting in the areas of:
(a) The investigation on the proposed logic CRAN2SAT in DHNN will be compared with the non-systematic logic employing first order and second order logic. Therefore, in terms of evaluating the performance of the proposed logic, CRAN2SAT will be compared with all non-systematic logical rules that contain first order and second order logic. Notably, this study does not compare with S-type random k satisfiability by [17], because in their study, the value of synaptic weight is not computed by using the Abdullah method. The characteristics of the existing non-systematic logic are outlined as follows:
(ⅰ) RAN2SAT proposed by [8]. This is the first non-systematic logical rule that merges 1SAT and 2SAT logic, enabling a flexible and dynamic number of literals per clause. However, experimental results showed that first-order clauses introduced more logical inconsistencies than second-order clauses.
(ⅱ) r2SAT introduced by [10] is a non-systematic logic with weighted ratios of negative literals. In addition, they also encompass a logic phase that creates a logical structure in line with the preferred count of negative literals. As a result, r2SAT performs well in producing diverse neuron states and global minima solutions. In this study, the logical structure of r2SAT has been set to have at least 50% of negative literal due to its efficacy within the logic phase of r2SAT.
(ⅲ) YRAN2SAT was introduced by [9] and possesses a flexible hybrid logical structure that is mixed together both systematic and non-systematic structures. It offers random enumeration of first-order, second-order or both types of clauses. The study has implemented five possible pathways of YRAN2SAT that show an improved solution capacity.
The main difference between the proposed CRAN2SAT with the existing logic is that the existing logic fully utilizes all the possible combinations of 2SAT clauses as in Eq (2). However, the proposed logic CRAN2SAT will exclude both positive literals in the second-order clauses as in Eq (3). It's worth noting that the quantity of neurons falls within a certain range, 3⩽NN⩽45, applied for all baseline models.
(b) The evaluation of DHHNN-CRAN2SAT with the proposed non-monotonic Smish activation function will be compared with another five-activation function, namely, McCulloch-Pitts, piecewise linear activation function, Elliot symmetric activation function, hyperbolic tangent activation function (HTAF) and Swish activation function. Note that the piecewise linear and Swish activation functions are the first attempts that apply in solving logic satisfiability in DHNN.
(ⅰ) The McCulloch-Pitts function is the conventional platform that has been used in logic satisfiability in DHNN by [1], but the solution will trap in the local minimum of the energy [43]. The output of the McCulloch-Pitts does not have limitations because the McCulloch-Pitts function is fully based on the value of local field hi.
(ⅱ) The Elliot symmetric and hyperbolic tangent activation functions have been incorporated in the execution of logic satisfiability in DHNN for 3SAT clauses [16]. Subsequently, the hyperbolic tangent activation function demonstrated superior efficacy in comparison to the Elliot symmetric activation function.
7.
Results and discussion
The result and discussion will be separated into three parts. Initially, we will examine the effectiveness of the DHNN-CRAN2SAT model during both the learning and retrieval phases. In the first part of discussion, the final neuron state that was retrieved by the DHNN-CRAN2SAT model was examined by using HTAF. In the second part, this paper aimed to evaluate the impact of various activation functions during the retrieval phase of DHNN. For this reason, the DHNN-CRAN2SAT will embed the proposed non-monotonic Smish activation function. Thus, the DHNN-CRAN2SAT with Smish activation function will be compared with various activation functions, namely, McCulloch-Pitts, piecewise linear activation function, Elliot symmetric activation function, HTAF and Swish activation function. Finally, the full capability of different activation functions will be investigated throughout existing SAT logic in DHNN.
In Section 7.1, the capability of DHNN-CRAN2SAT in minimizing the cost function and generating an optimal synaptic weight will be examined. The effectiveness of DHNN-CRAN2SAT in minimizing logical inconsistency was assessed through the application of performance metrics, namely, the RMSE and MAPE. Then, the behavior of DHNN-CRAN2SAT according to the value of synaptic weight leading to a global minima solution or local minima solution will be investigated. Moreover, the importance of the presence of negative literals in second-order clauses will be discussed. Meanwhile in Section 7.2, the retrieval phase aimed to demonstrate the impact of different activation functions in optimizing the final neuron state of DHNN. In order to evaluate this, the quality of the retrieved neuron states was analyzed using metrics such as RMSE, MAPE, and the ratio of the global minimum solutions.
Consequently, the quality of the global solutions retrieved by DHNN was measured through the similarity index and total neuron variation. Each of these sections concluded the most effective approaches, which were then used for comparative analysis with the current non-systematic logic in DHNN, Section 7.3. Concisely, as the number of neurons (NN) increase, the learning error in terms of RMSELearning and MAPELearning will increase. Figures 4 and 5 demonstrate the error of RMSELearning and MAPELearning for different state of the art SAT, respectively.
Notably, all logical structure in the analysis contains the first order clause and second order clause, 1SAT and 2SAT, respectively. In this simulation, the logical structure of CRAN2SAT, RAN2SAT and r2SAT is emphasized based on the ratio of 1:1 for first order clause and second order clause. Additionally, the logical structure of r2SAT has been set to have at least 50% of negative literal [10]. Despite of having the same ratio for first order clause and second order clause, Figure 4 shows that the error of RMSELearning for r2SAT is lower compared to the RAN2SAT. The logical structure of RAN2SAT differs from r2SAT since r2SAT's structure is regulated by a predetermined number of negative literals. Thus, by comparing RMSELearning of RAN2SAT and r2SAT, we can infer that by having a higher proportion of negative literals in the logical structure, it will enhances the probability of obtaining a satisfied interpretation. This finding is fruitful for this research to address the significance of having negative literals in the logical structure.
As for the logical structure of YRAN2SAT, it has the capability to become both systematic and non-systematic logical rules. Therefore, the error of RMSELearning for YRAN2SAT fluctuates because of the flexibility of the model that can be expressed as both systematic and non-systematic logical rules. From Figure 4, at NN = 33 and NN = 42, the error of RMSELearning for YRAN2SAT will be at the lowest value because the structure of the YRAN2SAT is represented as systematic logic of 2SAT. However, at NN = 24 and NN = 39, the error of RMSELearning for YRAN2SAT is higher because the logical rules are represented as systematic logic of 1SAT. One possible explanation for this outcome is that the DHNN encounters difficulty in completing its learning phase as the number of first order clauses increases. This can be attributed to a higher likelihood of obtaining unsatisfied clauses during this phase. Additionally, this situation will contribute to the randomly generated synaptic weight that can lead to a suboptimal retrieval phase. Supported by [42], as the quantity of first-order clauses enlarges, the DHNN encounters difficulty in completing the learning phase, subsequently leading to a suboptimal retrieval phase.
Based on result from the simulation, the proposed model CRAN2SAT has the smallest learning error of RMSELearning and MAPELearning, compared to the other non-systematic SAT. The proposed CRAN2SAT model restricts the logical structure by excluding both positive literals in second-order clauses without imposing any restrictions on the first-order clauses. By incorporating this characteristic, our model will have a second order clause with at least one negative literal in each clause. Thus, the proposed CRAN2SAT model will have more negative literal compared to the r2SAT model. This is because of the percentage of negative literal for r2SAT is only 50% of the NN. In this simulation, the choice of r=0.5 is made due to its effectiveness in the logic phase of the r2SAT [10]. Since the proposed model CRAN2SAT has the smallest learning error of RMSELearning and MAPELearning, results in Figures 4 and 5 have successfully highlighted that having more negative literal in the second order clause will have a higher probability of getting satisfied interpretations. Therefore, the proposed CRAN2SAT model shows that the variety representation of non-systematic logical rule should emphasized on more negative literals of the second order clauses.
Even though CRAN2SAT demonstrates favorable performance, the incorporation of ES during the learning phase can impact the ability of DHNN-CRAN2SAT to achieve an ideal learning phase. Thus, it will lead to inconsistent learning interpretations. During this simulation, as NN⩾18, the error of the logical structure will achieve maximum error, MAPELearning = 100. In accordance with the discovery made by [41], it is possible to enhance the learning phase by integrating a learning algorithm into DHNN. This approach could aid in achieving an optimal satisfied interpretation for CRAN2SAT, consequently reducing the occurrence of logical inconsistencies. Regardless of comparing the proposed CRAN2SAT model with the existing non-systematic logical rule, we also make a comparison based on these different cases. This comparison is made to support our proposed CRAN2SAT model; thus, our model incorporating the characteristic of excluding both positive literals in second-order clauses will result in an optimal learning phase. The CRAN2SAT model is compared with the logical structure that is comprise based on Cases 1–4 where:
Case 1: The logical structure excluding both negative literals in second-order clauses, which is denoted as (¬A∨¬B).
Case 2: The logical structure that comprises only positive literal in second-order clauses, which is denoted as (A∨B).
Case 3: The logical structure that comprises only negative literal in second-order clauses, which is denoted as (¬A∨¬B).
Case 4: The logical structure that comprises at most one positive literal per clause, which is also known as HORN2SAT.
Results in Figure 6 illustrate the error of MAPE by comparing the logical structure based on CRAN2SAT, Cases 1–4. Based on this result, the logical structure of Case 3 and proposed method CRAN2SAT obtain the lowest MAPE. Thus, this result supports our proposed logical structure, which excludes both positive literals in second-order clauses. This is due to the result obtained by Figure 6, which has appropriately highlighted the significance of having more negative literal in second order clauses and how it can lead to a high probability of getting satisfied interpretation. It is also preserved that logical structure that comprises only positive literal in second order (Case 2) generates high learning error. Despite having at most one positive literal per clause, the logical structure of Case 4 is still unable to minimize the logical inconsistencies. Hence, it is significant to say that negative literals play a crucial role in achieving satisfied clauses. Therefore, throughout this analysis, the proportion of negative literal in the second order clause is significant because it can help in getting higher satisfied interpretation.
7.1. Retrieval phase
In this section, we investigate the behavior of logic satisfiability in DHNN according to the value of the synaptic weight, which leads to either a global minimum or a local minimum solution. In the learning process, an optimal synaptic weight is achieved that allows DHNN-CRAN2SAT to reduce logical inconsistencies. This section is divided into four segments: analysis of testing errors, energy analysis, assessment of the final neuron state quality (total neuron variation and similarity analysis) and analysis of synaptic weights.
7.1.1. Testing error analysis
Upon completion of clause satisfaction checking (cost function minimization) during the learning phase of DHNN-CRAN2SAT, the synaptic weights are generated using the Abdullah method [1]. A cost function that attains a value of zero leads to the retrieval of optimal synaptic weights during the testing phase, yielding a solution at the global minimum. Based on the results obtained during the learning process, the proposed CRAN2SAT model is able to generate optimal synaptic weights for 3⩽NN⩽9, leading to zero error for RMSETest and MAPETest. During this interval, DHNN-CRAN2SAT is in an optimal retrieval phase and demonstrates a strong capability in producing global minimum solutions. However, as NN increases, the testing errors in terms of RMSETest and MAPETest increase.
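As a point of reference, the two error metrics can be computed as in the minimal sketch below; it assumes the standard RMSE and MAPE definitions applied to target versus retrieved quantities, with the exact quantities compared defined by the paper's testing-error equations, and the function names are illustrative.

```python
import math

def rmse(targets, outputs):
    """Root-mean-square error between target and retrieved values."""
    n = len(targets)
    return math.sqrt(sum((t - o) ** 2 for t, o in zip(targets, outputs)) / n)

def mape(targets, outputs):
    """Mean absolute percentage error (in %); assumes non-zero targets."""
    n = len(targets)
    return (100.0 / n) * sum(abs((t - o) / t) for t, o in zip(targets, outputs))
```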
Figures 7 and 8 show the RMSETest and MAPETest results for the different state-of-the-art SAT models, respectively. The result in the retrieval phase is interrelated with the minimization of logical inconsistencies during the learning phase. Thus, referring to Figures 7 and 8, DHNN-CRAN2SAT obtained the lowest error in the retrieval phase because it obtained the lowest error during the learning phase. Meanwhile, the testing error for DHNN-YRAN2SAT is not stable due to the varying structure of the YRAN2SAT logic. At NN = 24, YRAN2SAT has the highest error because its structure contains more 1SAT than 2SAT clauses, whereas its smallest error occurs at NN = 33, where the structure contains more 2SAT than 1SAT clauses.
Conversely, DHNN-CRAN2SAT is in a suboptimal retrieval phase for 30⩽NN⩽45, and during this interval the proposed model attains its highest testing error. Despite producing local minima solutions in that interval, DHNN-CRAN2SAT still obtained the lowest testing error in terms of RMSETest and MAPETest compared with the other logical structures. As indicated by [18], the first-order clauses in the logical structure have the potential to disrupt the retrieval of correct synaptic weights, resulting in increased testing errors. Notably, our research employed ES during the learning phase, introducing a "trial and error" approach that influences the capability to minimize the cost function. If ES is unsuccessful in obtaining an optimal synaptic weight, the testing phase is affected, potentially leading to the emergence of local minima solutions.
7.1.2. Energy analysis
In this segment, we explore the energy profile and examine whether the solutions generated by DHNN-CRAN2SAT are global or local. The energy profile is important to indicate the convergence of the proposed model. The energy differences remained constant at zero because DHNN-CRAN2SAT is able to achieve minimum energy in the testing phase. As NN increases, the convergence of the network slows. The energy profile is lower at the smallest values of NN due to the high probability of obtaining satisfied interpretations, which leads to the minimization of energy. The energy function in DHNN acts as an indicator of whether the solutions produced by DHNN-CRAN2SAT are optimal. Results in Figures 9 and 10 show the RMSE and MAPE energy for the different state-of-the-art SAT models, respectively. From Figures 9 and 10, the proposed DHNN-CRAN2SAT obtained the lowest difference in energy, which indicates that the logic is able to achieve the lowest energy profile compared with the other logical rules. This result also demonstrates that the number of global minimum energy solutions decreases as NN increases, as in Figure 9. Accordingly, any DHNN-CRAN2SAT run that is unable to resolve the logical inconsistencies is considered a local solution. During the interval 3⩽NN⩽9, DHNN-CRAN2SAT is able to retrieve the maximum number of global solutions because of the ability of the model to minimize the cost function.
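As a reminder of how the energy profile is obtained, the Lyapunov energy of a second-order DHNN is typically computed as sketched below; the 1/2 normalization and the restriction to first- and second-order weights follow the standard formulation and are assumptions here rather than a reproduction of the paper's Eq (12).

```python
import numpy as np

def lyapunov_energy(W2, W1, S):
    """Standard second-order DHNN Lyapunov energy (form assumed):
    E = -1/2 * sum_ij W2_ij * S_i * S_j - sum_i W1_i * S_i."""
    return float(-0.5 * S @ W2 @ S - W1 @ S)
```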
The result of the global solution is represented as the global minimum ratio (Zm). [10] stated that if a model achieves the highest Zm, the proposed SAT has been successfully integrated into the DHNN. Therefore, the interval 3⩽NN⩽9 is the best range to represent our model. During this interval, based on Figure 11, DHNN-CRAN2SAT achieves maximum fitness of clause satisfaction. This contributes to an optimal synaptic weight and, thus, to a global minimum solution in the retrieval phase. Even though Zm gradually decreases for all the logical structures, DHNN-CRAN2SAT obtains the highest total value of Zm compared with r2SAT, YRAN2SAT and RAN2SAT. Notably, Figures 9 and 10 illustrate that the structure of YRAN2SAT yields a wavering energy profile and Zm. This reveals that the minimum energy of the various logics depends on the mixture of SAT clauses present.
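As a rough illustration, Zm can be estimated as the fraction of retrieval runs whose final energy lies within a small tolerance of the absolute minimum energy; the tolerance value and normalization below are assumptions, with the exact definition given by the paper's equations.

```python
def global_minimum_ratio(final_energies, min_energy, tol=1e-3):
    """Fraction of retrieval runs counted as global solutions, i.e., runs whose
    final Lyapunov energy is within `tol` of the absolute minimum energy."""
    hits = sum(1 for energy in final_energies if abs(energy - min_energy) <= tol)
    return hits / len(final_energies)

# Example: 4 of 5 runs reach the minimum energy, so Zm = 0.8.
print(global_minimum_ratio([-4.0, -4.0, -3.5, -4.0, -4.0], min_energy=-4.0))
```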
According to [27], the Lyapunov energy function is an important factor in assessing the convergence of DHNN. Based on the results in Figures 9 and 10, it is observed that a lower probability of obtaining satisfied interpretations during the learning phase contributes to a higher energy profile. Due to this fact, the proposed DHNN-CRAN2SAT successfully shows the relationship between the learning phase and the testing phase of DHNN. The capability of obtaining a quality final neuron state depends on the Lyapunov energy function. The quality of the final neuron state in the testing phase is crucial because the retrieved final neuron state demonstrates the nature of the model. According to [10], having a wider variety of final neuron states can be beneficial when undertaking real-world classification or forecasting tasks. For this purpose, a new activation function will be implemented in the retrieval phase to enhance the effectiveness of the updating rule within DHNN, which can provide more variation and increased diversity in the final neuron states. The analysis based on different activation functions is discussed in Section 7.2.
7.1.3. Similarity analysis
Within this segment, the assessment of the retrieved final neuron states is performed through the total neuron variation (TV) and the Jaccard similarity index (JSI). It is important to highlight that the TV and JSI metrics are computed only for final neuron states that attain the global minimum energy. A higher TV suggests that the model is able to examine different solutions across varied areas of the search space. The JSI is utilized to assess the variety of final neuron states generated by DHNN-CRAN2SAT by contrasting them with the benchmark neuron states.
A smaller JSI value indicates that the obtained final neuron states deviate strongly from the benchmark states. Figure 12 displays the TV for the different state-of-the-art SAT models. In Figure 12, it can be observed that RAN2SAT, r2SAT and YRAN2SAT achieve their highest TV at NN = 12, whereas CRAN2SAT extends this to achieve its highest TV at NN = 15. It is also notable that for NN⩾30, the TV gradually decreases to zero. As for YRAN2SAT, at NN = 33 it still manages to obtain a higher TV than the other logical rules because its structure contains more 2SAT than 1SAT clauses.
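The two quality metrics can be sketched as follows; the bipolar-state convention, the reading of TV as the count of distinct global-minimum states and the treatment of +1 entries as set membership for the JSI are illustrative assumptions, with the precise definitions given by the paper's equations.

```python
def jaccard_similarity(state, benchmark):
    """Jaccard index between two bipolar states, treating +1 entries as set
    membership: |A intersect B| / |A union B| (convention assumed)."""
    intersection = sum(1 for s, b in zip(state, benchmark) if s == 1 and b == 1)
    union = sum(1 for s, b in zip(state, benchmark) if s == 1 or b == 1)
    return intersection / union if union else 0.0

def total_neuron_variation(global_states):
    """Number of distinct final neuron states among those attaining the global
    minimum energy (one common reading of TV; assumed here)."""
    return len({tuple(state) for state in global_states})

# Example: two distinct global states, one matching the benchmark exactly.
benchmark = [1, 1, -1, 1]
states = [[1, 1, -1, 1], [1, -1, -1, -1]]
print(total_neuron_variation(states))                                  # 2
print([round(jaccard_similarity(s, benchmark), 2) for s in states])    # [1.0, 0.33]
```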
Since our study incorporated ES during the learning phase, as NN increases, the "trial and error" approach consequently impacts the minimization of the cost function. If ES is unable to attain an optimal synaptic weight, the testing phase is affected and the solution may become trapped in local minima. This, in turn, impacts the evaluation of the retrieved final neuron state, because TV and JSI measure only the final neuron states that achieve the global minimum energy. For this reason, as NN increases, the probability of a global minimum solution decreases and, thus, the TV also decreases.
The performance of the logical rule in terms of JSI is represented in Figure 13.
Based on the results, CRAN2SAT shows the lowest JSI, while YRAN2SAT shows a fluctuating JSI. The structure of CRAN2SAT, which excludes second-order clauses with both literals positive, successfully demonstrates the significance of negative literals in the second-order clauses. Notably, the structure of CRAN2SAT does not impose any restriction on the first-order clauses. In this study, the number of negative literals for r2SAT is set to 50% of NN. Comparing r2SAT with RAN2SAT, the two show a similar JSI pattern for 3⩽NN⩽18, implying a low dissimilarity between their final neuron states and the benchmark states. However, the JSI of the CRAN2SAT model is relatively small compared with the other logical structures, indicating that the difference between its final neuron states and the benchmark states is relatively large. From the above discussion, we have provided a clear account of cost minimization, synaptic weights, the energy profile and neuron variation, which together describe the overall behavior of DHNN-CRAN2SAT. It can be concluded that the proposed logical rule, CRAN2SAT, is successfully implemented in DHNN, as indicated by its lower learning error and diversified final neuron states.
The authors' desire to optimize the retrieved final neuron states during the testing phase was a stepping stone for the next stage of this study. A new activation function is implemented in the retrieval phase to improve the effectiveness of the updating rule in DHNN, providing more variation and increased diversity in the final neuron states. Greater variation in the final neuron states is very effective for logic mining, which can be applied in numerous fields. Hence, this research introduces a novel non-monotonic Smish activation function designed to introduce greater variation and enrich the diversity of the final neuron states. The discussion of the different activation functions is covered in Section 7.2.
7.1.4. Synaptic weight analysis
This section discusses the management of synaptic weights in relation to the proposed CRAN2SAT model. The values of the synaptic weights, such as WAB, WA and WB, are obtained using the Abdullah method [1]. The proposed CRAN2SAT model restricts the logical structure by excluding second-order clauses with both literals positive, without imposing any restriction on the first-order clauses. By incorporating this characteristic, every second-order clause in our model contains at least one negative literal. Therefore, under this criterion, there are three possible combinations of 2SAT clauses and two possible combinations of 1SAT clauses, whereas the existing non-systematic logic for k = 1, 2, such as RAN2SAT, has four possible combinations of 2SAT clauses and two possible combinations of 1SAT clauses. Examining the synaptic weights of CRAN2SAT, there is no synaptic weight corresponding to a second-order clause with both literals positive, (A∨B), because the CRAN2SAT model excludes such clauses. [44] states that the Hopfield neural network is restricted to a symmetric connection network, which also limits the storage capacity of the DHNN. Since we have removed the (A∨B) clause from our proposed model, the storage complexity of the CAM is reduced. Furthermore, the individual contribution of the synaptic weight for the clause with both literals positive, (A∨B), causes the DHNN to retrieve repetitive final neuron states. As mentioned in the previous section, the final neuron state is set to one if g(hi)⩾0, and the DHNN retrieves a final neuron state based on the value of the local field hi. For this reason, the individual contribution of the synaptic weight for (A∨B) clauses yields a high probability of retrieving repetitive final neuron states, resulting in low diversification among the final neuron states. This is supported by the performance of the proposed CRAN2SAT logic. The CRAN2SAT model, which constrains the logical structure by excluding both positive literals in second-order clauses, has effectively demonstrated the significance of incorporating negative literals in generating a diverse range of final neuron states and achieving the highest global solution (refer to Figures 11–13).
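For illustration, the retrieval dynamics described above can be sketched as follows; the array shapes, the asynchronous sweep order and the thresholding of g(hi) at zero follow the usual DHNN formulation and are assumptions rather than a reproduction of the paper's implementation.

```python
import numpy as np

def local_field(W2, W1, S, i):
    """Local field h_i from second-order weights W2 (symmetric, zero diagonal)
    and first-order weights W1, in the usual DHNN form (assumed here)."""
    return float(W2[i] @ S + W1[i])

def retrieve(W2, W1, S, activation, sweeps=10):
    """Asynchronous retrieval: neuron i becomes +1 when g(h_i) >= 0 and -1
    otherwise, matching the thresholding rule described in the text."""
    S = S.astype(float).copy()
    for _ in range(sweeps):
        for i in np.random.permutation(len(S)):
            S[i] = 1.0 if activation(local_field(W2, W1, S, i)) >= 0 else -1.0
    return S
```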
7.2. Different activation functions for DHNN-CRAN2SAT
This study has proposed a non-monotonic Smish activation function for DHNN-CRAN2SAT. In this section, the retrieval phase of DHNN-CRAN2SAT with the Smish activation function is compared with other types of activation functions, namely, McCulloch-Pitts, the piecewise linear activation function, the Elliot symmetric activation function, HTAF and the Swish activation function. As mentioned in [45], the analysis during the retrieval phase plays an important role because DHNN often tends to produce repetitive states instead of generating new final neuron states. According to [9], the initial neuron state generated during initialization is always the same, which is only one type of neuron state; thus, the final neuron state converges to only one type of solution. To overcome this issue, we propose a non-monotonic Smish activation function to optimize the final neuron states retrieved by DHNN. Despite the good performance achieved with the hyperbolic tangent activation function in [16], the interpretability of the error analysis and neuron variation during the retrieval phase of logic satisfiability in DHNN remains unknown. Therefore, this paper continues that work by analyzing six types of activation functions and comparing their efficiency in retrieving a final neuron state. Additionally, the full capability of the non-monotonic Smish activation function is investigated across existing SAT logics in DHNN, such as RAN2SAT, r2SAT and YRAN2SAT.
7.2.1. Analysis on DHNN-CRAN2SAT with proposed non-monotonic Smish activation
In this section, the effectiveness of the proposed non-monotonic Smish activation function in DHNN-CRAN2SAT is compared with that of the other types of activation functions. Results in Figures 14 and 15 show the RMSE and MAPE errors for the six types of activation functions, respectively. Figures 14 and 15 show that during the interval 3⩽NN⩽9, the Smish activation function is able to generate an optimal synaptic weight, leading to zero error for RMSETest and MAPETest.
During this interval, DHNN-CRAN2SAT with the Smish activation function is in an optimal retrieval phase, performing well in generating global minima solutions. Figures 14 and 15 show that not only the Smish activation function but also the other five activation functions retrieve an optimal synaptic weight during 3⩽NN⩽9 and remain in an optimal retrieval phase. This result is due to the capability of DHNN to minimize the logical inconsistency during the learning phase. However, as NN increases, the testing errors in terms of RMSETest and MAPETest increase for all types of activation functions. Based on the results illustrated in Figures 14 and 15, the proposed non-monotonic Smish activation function in DHNN-CRAN2SAT obtained the lowest error compared with the other activation functions. Lower error values during the retrieval phase signify the effectiveness of the Smish activation function in identifying global minimum solutions over local minima solutions. The convergence property of the DHNN model can be validated by the existence of a high proportion of global minima solutions. Therefore, to assess the retrieved final neuron state, the behavior of the Lyapunov energy function associated with the activation function utilized in the DHNN is examined. The convergence of the final neuron state can be observed by calculating the differences in Eq (12).
According to a study conducted by [8], the Lyapunov energy function consistently exhibits a monotonically decreasing trend, meaning that the energy derived from its logic always decreases over time. Given these properties, the non-monotonic activation function is believed to be the most compatible with the Lyapunov energy function. This is verified by the results in Table 7, where the non-monotonic Smish activation function contributes more global solutions than the other types of activation functions. It shows that the non-monotonic Smish activation function can prevent the network from getting trapped in local minima during the retrieval phase. Thus, the Smish activation function helps DHNN produce the highest value of global solutions, the highest total neuron variation and more diversified final neuron states. The non-monotonic characteristic of the Smish activation function ensures stable negative training, thereby improving its expressive performance; this stability of negative training, in turn, influences the DHNN in retrieving diversified final neuron states. Moreover, the non-monotonic nature of Smish over the domain (−∞,+∞) can enhance the network's ability for learning and gradient transformations [26]. If an activation function is not continuously differentiable, the gradient is undefined at the midpoint of the activation function, which affects the training performance [29]. Since the Smish activation function has a non-zero derivative at zero, it helps the learning process, because very small derivatives tend to stall learning. Notably, the parameter values α=1 and β=1 in the Smish activation function help maintain the gradient of the non-monotonic function, which prevents the obtained solution from fluctuating. Moreover, the characteristics of the Smish activation function enable neural networks to incorporate negative representations and achieve diverse final neuron states. The Smish activation function is a smooth non-monotonic function with a lower bound but no upper bound. This can be validated by the results in Table 8, where the Smish activation function provides more variation in the final neuron states.
The other five activation functions also have their own behavior and properties. HTAF saturates when the modulus of the input tends toward infinity, and its gradients decrease rapidly; as a consequence, this permits the training of only less complex networks [46]. Despite desirable properties such as centering its outputs around zero, HTAF can suffer from saturation and vanishing gradient issues, especially as the input values become very large or very small, causing the output to reach its limits of +1 or –1. During the retrieval phase of DHNN, a zero output results in a contribution of one to the final neuron state. Due to this issue, HTAF contributes to low diversification of the final neuron states. The McCulloch-Pitts function is the conventional platform used in logic programming in DHNN. This function was popularized by [1], and it is reported that the solution tends to become trapped in a local minimum of the energy [43]. In addition, the most common and simplest activation function is the piecewise linear activation function [14]. The function consists of linear segments over different intervals of the input range; the transition points can introduce non-smoothness, which results in a lack of differentiability at those points [26]. The Swish activation function multiplies the input by its corresponding sigmoid function and is also non-monotonic, with an output range of (−∞,+∞). Based on the parameter value of β, the shape of the Swish activation function is adjusted between the linear and ReLU functions: smaller and larger values of β lead toward the linear and ReLU functions, respectively [12]. The same applies to the Elliot symmetric activation function [47], which has characteristics similar to the sigmoid function; it saturates for high and low inputs, leading to the vanishing gradient problem [12].
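For reference, the six activation functions compared here can be written as in the sketch below. The closed forms of Smish, Swish, HTAF, the Elliot symmetric function and the McCulloch-Pitts threshold follow their standard definitions; the saturation bounds of the piecewise linear function and the parameter choices α=β=1 are assumptions consistent with the text.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def smish(x, alpha=1.0, beta=1.0):
    # Smish(x) = alpha * x * tanh(ln(1 + sigmoid(beta * x))); alpha = beta = 1
    # as stated in the text, keeping the function smooth and non-monotonic.
    return alpha * x * np.tanh(np.log1p(sigmoid(beta * x)))

def swish(x, beta=1.0):
    # Swish(x) = x * sigmoid(beta * x); beta interpolates between linear and ReLU.
    return x * sigmoid(beta * x)

def htaf(x):
    # Hyperbolic tangent activation function, saturating at -1 and +1.
    return np.tanh(x)

def elliot_symmetric(x):
    # Elliot symmetric activation: x / (1 + |x|), a cheap sigmoid-like curve.
    return x / (1.0 + np.abs(x))

def piecewise_linear(x):
    # Saturating linear segment on [-1, 1]; the bounds are an assumption.
    return np.clip(x, -1.0, 1.0)

def mcculloch_pitts(x):
    # Hard threshold used in the conventional DHNN update rule.
    return np.where(x >= 0, 1.0, -1.0)
```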
An appropriate activation function enables the network to reach a stable state of convergence and prevents it from oscillating between states. The quality of the final neuron state in the retrieval phase is crucial because the retrieved final neuron state demonstrates the nature of the model. Thus, this study claims that the activation function most compatible with the Lyapunov energy function is the Smish activation function. The non-monotonic Smish activation function allows a wider range of possible solutions, while the Lyapunov energy function accommodates this diversity by assessing the stability of the network. This is supported by [3], which states that the Lyapunov energy reaches its minimum point (equilibrium state) if the DHNN remains stable without oscillation.
By analyzing the result of Zm, the Smish activation function consistently outperformed the other five activation functions in obtaining a higher Zm. During the interval 3⩽NN⩽9, all the activation functions in DHNN were able to retrieve the maximum number of global solutions, because in this interval there is a high probability of obtaining satisfied interpretations during the learning phase, which leads to an optimal synaptic weight. Therefore, during this interval, the retrieved final neuron state corresponds to a global solution. The improvement ratio of Zm for the Smish activation function in comparison with the other activation functions was documented and analyzed, as shown in Table 8. The positive ratio of improvement for the Smish activation function is consistently higher across all the types of activation functions, indicating that the Smish activation function outperforms the others in terms of Zm. Based on the results in Table 8, the ratio of improvement in Zm is measured for 3⩽NN⩽33, because for NN⩾36 the DHNN is unable to retrieve a global solution due to the difficulty of minimizing the logical inconsistency during the learning phase.
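One plausible way to compute the tabulated improvement ratio is sketched below; the simple relative-percentage form is an assumption, since the exact formula is defined elsewhere in the paper.

```python
def improvement_ratio(metric_smish, metric_other):
    """Percentage improvement of the Smish result over another activation
    function's result; this relative form is an assumption."""
    return 100.0 * (metric_smish - metric_other) / metric_other

# Example: Zm of 0.60 for Smish versus 0.48 for another activation function
# corresponds to a 25% improvement.
print(improvement_ratio(0.60, 0.48))
```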
Thus, in obtaining a higher value of Zm, the Smish activation function wins eight of the neuron distributions within the range 3⩽NN⩽33. As shown in Table 8, the Smish activation function attained the highest value of Zm for numbers of neurons 12⩽NN⩽33 compared with the other types of activation functions. As mentioned in the previous section, Zm is an important metric for evaluating the performance of DHNN. Based on this result, it is shown that the Smish activation function can improve the effectiveness of the updating rule in DHNN, and incorporating the Smish activation function into the proposed DHNN-CRAN2SAT model can notably boost its performance, particularly in generating Zm.
The value of TV and the ratio of improvement between the Smish activation function and the other types of activation functions implemented in DHNN-CRAN2SAT were recorded and compared, as in Table 9. Due to the low capability of retrieving a global solution for NN⩾36, for a fair comparison the ratio of improvement in TV is measured for 3⩽NN⩽33. In this study, the Smish activation function achieved its maximum TV at NN = 15, with TV = 920, outperforming the other types of activation functions. Furthermore, the Smish activation function also has the highest values of TV in one-to-one comparisons with each type of activation function. Although the Smish activation function cannot compete with the other activation functions in producing a higher TV at small NN, and shows no ratio of improvement over HTAF for 3⩽NN⩽6, it nevertheless produces fruitful results by consistently winning nine out of the eleven neuron distributions. Based on this result, implementing the Smish activation function in DHNN allows the network to retrieve diversified final neuron states even as the number of neurons increases. By referring to the plot of the Smish activation function, it can be seen that the Smish activation function has a lower bound with no upper bound. This property enables the network to benefit from negative representations, which is compatible with solving problems in the retrieval phase of DHNN. In the DHNN model, it is crucial to introduce diversity into the final neuron states, because the DHNN tends to produce repetitive states instead of generating new ones. Moreover, having a greater range of distinct final neuron states provides an advantage in practical classification or prediction tasks [10]. As demonstrated in Table 9, this research recognizes that the Smish activation function can contribute a wider range of variation and boost diversity in the final neuron states.
7.3. Comparison of six different types of activation function with existing SAT in DHNN
To support the effectiveness of the non-monotonic Smish activation function, the proposed activation function is embedded in different types of non-systematic logical structures, such as RAN2SAT, r2SAT and YRAN2SAT. The performance of these logical structures is evaluated with the different types of activation functions, and the results are reported in terms of RMSETest, RMSEEnergy, Zm and TV, as shown in Figures 16–18. Based on the results in Figures 16–18, the non-monotonic Smish activation function is compatible not only with DHNN-CRAN2SAT but also with the other logical structures. Based on the results in Figures 16a–c and 17a–c, during the interval 3⩽NN⩽9, the logical structures of RAN2SAT and r2SAT are able to generate an optimal synaptic weight, which leads to zero error for RMSETest and RMSEEnergy. However, as NN increases, there is difficulty in minimizing the logical inconsistency during the learning phase, leading to suboptimal synaptic weights. For this reason, the value of Zm starts to decrease for NN⩾9. In this context, the Smish activation function shows the ability to generate a higher Zm compared with the other types of activation functions. Although YRAN2SAT offers flexibility in the logical structure, this does not limit the capability of the Smish activation function to provide good results in terms of RMSETest, RMSEEnergy and Zm. The variability in the outcome observed in Figure 18 is attributed to the adaptability of both the number of first-order clauses and second-order clauses. Therefore, the Smish activation function can prevent the network from getting trapped in local minima during the retrieval phase, which helps in producing the highest number of global solutions with the lowest testing error.
Based on the results in Figures 16d and 17d, there is a zero value of TV at certain NN due to the inability of DHNN to retrieve a global solution, which stems from the difficulty of minimizing the logical inconsistency during the learning process. Therefore, by analyzing the value of TV for each logical structure (refer to Figures 16d and 17d), it is shown that the implementation of the Smish activation function in DHNN has improved the capability of generating diversified solutions. The highest values of TV obtained by RAN2SAT, r2SAT and YRAN2SAT are TV = 496, 481 and 463, respectively. Referring to Figure 17d, at NN = 24 the value of TV = 0, because at NN = 24 the structure of YRAN2SAT had a higher number of first-order clauses than second-order clauses. For this reason, YRAN2SAT was unable to complete the learning phase due to the high percentage of unsatisfied clauses, which leads to a local solution. The highest results of Zm and TV highlight the effectiveness of the Smish activation function in DHNN and show that it has the capability to improve the effectiveness of updating the final neuron state.
The other activation functions suffer from saturation for high and low inputs, which leads to the vanishing gradient problem [12]. Due to this effect, the DHNN tends to retrieve repetitive final neuron states. Additionally, the diminishing gradient affects the convergence speed of the network [13] and the training performance [29]. Therefore, based on the above analysis, it is validated that the proposed non-monotonic Smish activation function has the capability to improve the effectiveness of the updating rule in DHNN, providing more variation and increased diversity in the final neuron states. Obtaining more variation in the final neuron states is beneficial in logic mining, which can be applied in numerous fields. Thus, this is another fruitful finding, because the implementation of the non-monotonic Smish activation function in DHNN has successfully provided more global solutions with the highest diversification in the final neuron states.
8.
Conclusions
This paper presented a novel non-systematic logical structure, CRAN2SAT, that uses first-order and second-order logic without including second-order clauses in which both literals are positive. The proposed CRAN2SAT logic was implemented in the DHNN by minimizing the cost function, and the optimal synaptic weights were obtained by comparing the cost function with the Lyapunov energy function. The performance of CRAN2SAT was compared with different logical rules (RAN2SAT, r2SAT and YRAN2SAT). The proposed CRAN2SAT logic demonstrated excellent performance in the learning and retrieval phases compared with the other state-of-the-art logical rules. This result indicates that a relatively high proportion of negative literals in the second-order clauses can yield superior performance in producing global solutions with diversified final neuron states. Furthermore, the experimental simulations with different types of activation functions (McCulloch-Pitts, piecewise linear, Elliot symmetric, hyperbolic tangent and Swish) show that DHNN-CRAN2SAT with the proposed non-monotonic Smish activation function obtained the highest total neuron variation and provided more diversity in the final neuron states. The efficacy of the non-monotonic Smish activation function was further analyzed across different non-systematic logical structures; based on the results, the Smish activation function achieves good performance for each of the logical structures, highlighting its capacity to retrieve diversified final neuron states within the DHNN framework. It is worth mentioning that this is the first attempt to implement the Smish activation function in the DHNN. The proposed activation function can be integrated into various existing works, as demonstrated by previous studies [48,49,50,51]. In solving real-world classification datasets, it is important to consider diverse solutions to improve a model's ability to adapt to various data structures. Having a wider range of final neuron states can be beneficial for real-world classification or forecasting tasks, as mentioned by [10]. Therefore, the proposed CRAN2SAT logical rule with the Smish activation function in DHNN successfully achieves the desired final neuron states in terms of high diversification between the neuron states. For future work, the learning phase can be optimized from various perspectives; for example, the whale optimization algorithm of [52] and the black hole algorithm of [53] can be added to the learning phase to obtain optimal synaptic weights. In addition, the proposed CRAN2SAT can also be applied to real-world datasets, as proposed by [33,54,55].
Use of AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgments
The authors are deeply thankful to the editor and the reviewers for their valuable suggestions to improve the quality of this manuscript. All the authors gratefully acknowledge the financial support of the Research University Grant (RUI) (1001/PMATHS/8011131) by Universiti Sains Malaysia.
Conflict of interest
All authors declare no conflicts of interest in this paper.