
A weighted online regularization for a fully nonparametric model with heteroscedasticity

  • In this paper, combining B-spline functions and Tikhonov regularization, we propose an online identification approach for reconstructing a smooth function and its derivative from scattered data with heteroscedasticity. Our methodology offers the unique advantage of enabling real-time updates from new input data, eliminating the reliance on historical information. First, to address the challenges of heteroscedasticity and computational cost, we employ weight coefficients along with a judiciously chosen set of knots for interpolation. Second, a reasonable approach is provided to select the weight coefficients and the regularization parameter in the objective functional. Finally, we substantiate the efficacy of our approach through a numerical example and demonstrate its applicability in solving inverse problems. It is worth mentioning that the algorithm not only ensures computational efficiency, but also trades data volume for data accuracy.

    Citation: Lei Hu. A weighted online regularization for a fully nonparametric model with heteroscedasticity[J]. AIMS Mathematics, 2023, 8(11): 26991-27008. doi: 10.3934/math.20231381




    The underlying model of this article can be formulated quite simply: real-valued variables x and y fulfill the heteroscedastic model

    $$ y = f(x) + \sigma(x)\varepsilon, \quad (1.1) $$

    where $f(x)$ belongs to the Sobolev space $W^{2,2}(0,1)$, the variance function $\sigma(x)$ is positive, taking values in $(0,+\infty)$, and the random error term $\varepsilon$ is independent of $x$ and satisfies $E(\varepsilon)=0$ and $\mathrm{Var}(\varepsilon)=1$.

    Such a problem arises widely in fields such as Computerized Tomography (CT) and the inverse problem of option pricing (IPOP) [1,2,3]. Moreover, it is a classical ill-posed problem. There are plenty of regularization methods for this ill-posed problem in one or higher dimensions [4,5]. Researchers [6,7,8,9] have proposed statistical methods to solve it. Zhang [10] employs a relatively small number of knots for interpolation to reduce the computational cost. These classical approaches assume equal variance of the error term and require a large amount of computation when N is large. However, the variance σ(x) in (1.1) is typically unequal, a phenomenon known as heteroscedasticity [11,12].

    Heteroscedasticity, an econometric term, refers to the situation where the variances of the perturbations σ(x) in the model are not all equal [13]. Heteroscedasticity leads to large errors in the model. The principal methods for reducing the error it causes fall into three categories. The most common method is to take the logarithm of the data [14]; however, not all data are suited to a logarithmic transformation. The second method is robust standard error regression, currently the most popular and effective treatment [15,16]. Its main idea is to modify the objective function of classical least squares regression, which is very sensitive to outliers. The robust method is only applicable under heteroscedasticity with independent observations. Another method is FGLS regression [17], which assigns smaller weights to points with larger residuals to alleviate the heteroscedasticity problem; in essence, it is still a least squares method. However, its convergence is slow, that is, the bias can be large for finite samples.

    B-spline functions are crucial elements in various fields, especially in computer graphics, computer-aided design (CAD), and numerical analysis[18,19,20]. They play a significant role in curve and surface representation, interpolation, approximation, and modeling[21,22,23]. B-spline functions offer a versatile mathematical framework for representing complex shapes and data, enabling efficient and accurate solutions in various industries and scientific disciplines. Their ability to balance smoothness, flexibility, and local control makes them a cornerstone of computational modeling and design processes. Among them, cubic splines are the most representative. Cubic B-spline functions strike a balance between simplicity and expressiveness. They are more flexible than linear or quadratic B-splines, allowing for smoother curves and surfaces, while still being relatively straightforward to manipulate.

    In this paper, based on B-spline functions and FGLS regression, we propose a weighted online regularization method which applies weight coefficients and a relatively small number of knots for interpolation to reduce the effect of heteroscedasticity and the computational cost, and which trades the amount of data for the accuracy of the reconstruction. Moreover, the algorithm has a small memory footprint and can process data online, which means that when new data arrive, there is no need to re-process the data already handled.

    The paper is organized as follows. In Section 2, we introduce the problem to be studied and propose an online reconstruction algorithm. Error estimation is carried out in Section 3. Section 4 provides a principle for the optimal selection of the weight coefficients and the regularization parameter in the objective functional. The performance of the proposed algorithm is illustrated in Section 5. In Section 6, we present two applications of our reconstruction algorithm to inverse problems. Section 7 summarizes the main results.

    In this section, we consider the following problem: for a large positive integer $N$, reconstruct the function $f(x)$ and its derivative $f'(x)$ in the model (1.1) from the observation data $\{(x_i,y_i)\}_{i=1}^{N}$ and the function $\sigma(x)$.

    First, we define the Tikhonov objective functional as follows:

    $$ J(g)=\frac{1}{N}\sum_{i=1}^{N}w_i\big(g(x_i)-y_i\big)^2+\alpha\,\|g''(x)\|_{L^2(0,1)}^2, \quad (2.1) $$

    where $g(x)\in W^{2,2}(0,1)$, $\{w_i\}_{i=1}^{N}$ are weight coefficients and $\alpha$ is a regularization parameter. The minimization problem is formulated as follows:

    $$ f_N=\arg\min_{g}J(g)=\arg\min_{g}\left(\frac{1}{N}\sum_{i=1}^{N}w_i\big(g(x_i)-y_i\big)^2+\alpha\,\|g''(x)\|_{L^2(0,1)}^2\right). \quad (2.2) $$

    $f_N$ is defined as the approximate solution of $f(x)$ in (1.1).

    In this paper, we reconstruct not only the function $f(x)$ but also its derivative $f'(x)$. This is a classical ill-posed problem. It is computationally costly to solve with local regression methods such as kernel regression and local polynomial regression; it is usually solved by regularization methods and spline functions. Moreover, B-spline basis functions have compact support, which makes it possible to speed up the calculations. Thus, we choose B-spline functions.

    Then we seek a suitable $g(x)$ in the finite-dimensional space of cubic B-spline functions instead of the infinite-dimensional space. Inspired by the method in [25], we select equidistant points, rather than the sample points, as interpolation knots.

    Let $M$ be a positive integer and the mesh size $d=1/M$. Equidistant knots $\{p_j\}_{j=0}^{M}$ are defined as

    $$ p_j=jd,\quad j=0,1,\ldots,M. $$

    The vector space of all cubic B-spline functions with knots $\{p_j\}_{j=0}^{M}$ is called $V_m$. Assuming the function $g(x)$ belongs to $V_m$, $g(x)$ can be written as

    $$ g(x)=\sum_{j=-1}^{M+1}\lambda_j\phi_j(x), \quad (2.3) $$

    where $\{\lambda_j\}_{j=-1}^{M+1}$ are constants and $\phi_j(x)=\phi\!\left(\frac{x-p_j}{d}\right)$, with $p_j=jd$ also for $j=-1$ and $j=M+1$. Here $\phi(x)$ is the standard cubic B-spline function defined in [24].
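    For readers who want to experiment, the following Python sketch evaluates one common normalization of the standard cubic B-spline on $[-2,2]$ and the basis row $H_x$ used below. The exact normalization is fixed by [24], so this particular form, and the function names, are assumptions of the sketch.

```python
import numpy as np

def cubic_bspline(t, deriv=0):
    """A common normalization of the standard cubic B-spline, supported on
    [-2, 2]; deriv selects the 0th, 1st or 2nd derivative."""
    s = abs(t)
    if s >= 2:
        return 0.0
    if deriv == 0:
        return ((2 - s)**3 - 4 * (1 - s)**3) / 6 if s < 1 else (2 - s)**3 / 6
    if deriv == 1:
        v = (-(2 - s)**2 + 4 * (1 - s)**2) / 2 if s < 1 else -(2 - s)**2 / 2
        return v if t >= 0 else -v   # phi is even, so phi' is odd
    if deriv == 2:
        return 3 * s - 2 if s < 1 else 2 - s
    raise ValueError("deriv must be 0, 1 or 2")

def basis_row(x, M, deriv=0):
    """Row vector H_x = (phi_{-1}(x), ..., phi_{M+1}(x)) for knots p_j = j/M.
    Since phi_j(x) = phi((x - p_j)/d), each derivative contributes a factor 1/d."""
    d = 1.0 / M
    return np.array([cubic_bspline(x / d - j, deriv)
                     for j in range(-1, M + 2)]) / d**deriv
```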

    Denote the weight coefficient matrix $W=\mathrm{diag}(w_1,w_2,\ldots,w_N)$, the noisy sample $y=(y_1,y_2,\ldots,y_N)^T$, the coefficient vector $\lambda=(\lambda_{-1},\lambda_0,\ldots,\lambda_{M+1})^T$ and the row vector $H_x=(\phi_{-1}(x),\phi_0(x),\ldots,\phi_{M+1}(x))$. Through Eq (2.3) and the definitions above, Eq (2.2) can be rewritten as

    $$ \lambda_N=\arg\min_{\lambda}J_1(\lambda):=\arg\min_{\lambda}\left(\frac{1}{N}(WH\lambda-Wy)^T(WH\lambda-Wy)+\alpha\lambda^TP\lambda\right), \quad (2.4) $$

    where $H:=(H_{x_1},H_{x_2},\ldots,H_{x_N})^T$ and $P\in\mathbb{R}^{(M+3)\times(M+3)}$ is given by

    $$ P=(p_{ij})_{i,j=-1}^{M+1},\quad p_{ij}=\int_0^1\phi_i''(x)\phi_j''(x)\,dx. $$

    When $\lambda_N$ is known, the approximate solution is $f_N(x)=H_x\lambda_N$.

    Theorem 1. Suppose $N>2$ and the observation points $\{x_i\}_{i=1}^{N}$ are not all identical. Then the minimization problem (2.4) has a unique minimizer $\lambda_N$ which satisfies:

    $$ \left(\frac{1}{N}H^TW^TWH+\alpha P\right)\lambda_N=\frac{1}{N}H^TW^TWy. \quad (2.5) $$

    Proof. Since $J_1(\lambda)$ in (2.4) is a quadratic form in $\lambda$, the derivative of $J_1(\lambda)$ vanishes only at $\lambda=\lambda_N$, that is,

    $$ \left(\frac{1}{N}H^TW^TWH+\alpha P\right)\lambda_N=\frac{1}{N}H^TW^TWy. $$

    Moreover, $P$ is positive definite and $H^TW^TWH$ is positive semidefinite. Thus, $\lambda_N$ is unique.

    Based on the theorem above, we propose an online Tikhonov regularization algorithm, Algorithm 1. In the "Require" line, we input the observation data $\{(x_i,y_i)\}_{i=1}^{N}$ and the necessary parameters: the number of knots $M$, the weight coefficients $W$ in (4.6) and the regularization parameter $\alpha$ in Table 1. The algorithm proceeds from Line 1 to Line 8. In Lines 1 and 2, we initialize the matrix $A_0=0\in\mathbb{R}^{(M+3)\times(M+3)}$ and the vector $b_0=0\in\mathbb{R}^{M+3}$, and generate the matrix $P\in\mathbb{R}^{(M+3)\times(M+3)}$. Lines 3–6 update $A$ and $b$ in (2.5) according to the newly entered data. By solving the linear system in Line 7, we obtain $\lambda_N$, and the approximate solution satisfies $f_N(x)=H_x\lambda_N$.

    Algorithm 1 The online Tikhonov regularization.
    Require: The number of knots $M$, mesh size $d=1/M$, the number of samples $N$, the observation data $\{(x_i,y_i)\}_{i=1}^{N}$, weight coefficients $W$ and regularization parameter $\alpha$.
    Ensure: The approximate solution $f_N\in V_m$.
    1: Initialize $A_0=0\in\mathbb{R}^{(M+3)\times(M+3)}$ and $b_0=0\in\mathbb{R}^{M+3}$;
    2: Generate the matrix $P\in\mathbb{R}^{(M+3)\times(M+3)}$:
            $P=(p_{ij})_{i,j=-1}^{M+1},\quad p_{ij}=\int_0^1\phi_i''(x)\phi_j''(x)\,dx$;
    3: for $i=1,2,\ldots,N$ do
    4:  Generate the row vector $H_{x_i}=(\phi_{-1}(x_i),\phi_0(x_i),\ldots,\phi_{M+1}(x_i))$;
    5:  Update $A_i$ by $A_i\leftarrow\frac{i-1}{i}A_{i-1}+\frac{w_i^2}{i}H_{x_i}^TH_{x_i}$;
    6:  Update $b_i$ by $b_i\leftarrow\frac{i-1}{i}b_{i-1}+\frac{w_i^2}{i}H_{x_i}^Ty_i$;
    7: Solve the linear system $(\alpha P+A_N)\lambda_N=b_N$ for $\lambda_N$;
    8: return $f_N(x)=H_x\lambda_N$.

    Table 1.  Parameter selection.

    | Parameter | $M$ | $\alpha$ | $w_i$ |
    | --- | --- | --- | --- |
    | Value | $N^{1/5}$ | $\sigma^2N^{-4/5}$ | $1/(k+\sigma(x_i))^2$ |


    Remark 1. When a new data point is input, one just runs the algorithm from Line 4 to Line 6. The computational complexity of each of Lines 4, 5 and 6 is $O(1)$, and the computational complexity of Line 7 is $O(M)$. Moreover, the total data storage of Algorithm 1 is $O(M)$.
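    For concreteness, the following minimal Python sketch implements Lines 1–8, building on the basis functions sketched in this section. Assembling $P$ by trapezoidal quadrature, rather than by exact integration of the piecewise polynomials, is a shortcut of this sketch, not part of the algorithm.

```python
class OnlineTikhonov:
    """A minimal sketch of Algorithm 1, building on basis_row above."""

    def __init__(self, M, alpha, n_quad=4001):
        self.M, self.alpha, self.i = M, alpha, 0
        self.A = np.zeros((M + 3, M + 3))   # Line 1: A_0 = 0
        self.b = np.zeros(M + 3)            # Line 1: b_0 = 0
        # Line 2: p_ij = int_0^1 phi_i''(x) phi_j''(x) dx, via the trapezoid rule
        xs = np.linspace(0.0, 1.0, n_quad)
        D2 = np.array([basis_row(t, M, deriv=2) for t in xs])
        qw = np.full(n_quad, 1.0 / (n_quad - 1)); qw[[0, -1]] *= 0.5
        self.P = D2.T @ (qw[:, None] * D2)

    def update(self, x, y, w):
        """Lines 4-6 for one observation; w is the diagonal entry of W at x
        (e.g. 1/(k + sigma(x))), so the running averages use w**2."""
        self.i += 1
        H = basis_row(x, self.M)
        # A_i = (i-1)/i * A_{i-1} + (w_i^2/i) H^T H, and likewise for b_i
        self.A += (w**2 * np.outer(H, H) - self.A) / self.i
        self.b += (w**2 * y * H - self.b) / self.i

    def coefficients(self):
        # Line 7: solve (alpha P + A_N) lambda_N = b_N
        return np.linalg.solve(self.alpha * self.P + self.A, self.b)
```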

    The weight coefficients $W$ are introduced to reduce the effect of heteroscedasticity. Their selection is thoroughly discussed in Section 4.

    Note that $N$ continues to increase with the influx of new data; when $N$ becomes so large that $M\ll N^{1/5}$, we need to increase $M$ to $M\sim N^{1/5}$ and restart the algorithm.

    In this section, we analyze the reconstruction error of the function $f(x)$. Some assumptions are necessary before the proofs. Suppose that the matrix $P$ is positive definite. Without loss of generality, we set the domain of the variable $x$ to $[0,1]$. The weight coefficient matrix $W$ is

    $$ W=\mathrm{diag}(w_1,w_2,\ldots,w_N). $$

    Let $w_{\min}$ and $w_{\max}$ be the minimum and maximum values of the parameters $\{w_j\}_{j=1}^{N}$, respectively. Then, for the observation data $\{(x_i,y_i)\}_{i=1}^{N}$, $y_j$ can be expressed as

    $$ y_j=f(x_j)+\sigma(x_j)\varepsilon_j,\quad j=1,2,\ldots,N, $$

    where the $\varepsilon_j$ are independent with expectation 0 and variance 1.

    Define the error function as

    $$ e_N(x)=f_N(x)-f(x). $$

    Next, consider the influence of the regularized functional on the function itself and on the random noise.

    When the data are divided into a deterministic part and a random part, the reconstruction result can likewise be divided into two parts,

    $$ f_N=f_{abs}+f_{noise}, $$

    where the deterministic part $f_{abs}$ satisfies

    $$ f_{abs}=\arg\min_{g\in V_m}J(g;\alpha,y-\varepsilon). $$

    Here $\varepsilon$ is the column vector of the observation noise,

    $$ \varepsilon=(\sigma_1\varepsilon_1,\sigma_2\varepsilon_2,\ldots,\sigma_N\varepsilon_N)^T\in\mathbb{R}^N,\quad \sigma_j:=\sigma(x_j), $$

    and we let $\sigma^2=\max_{x_j}\sigma^2(x_j)$. The random part $f_{noise}$ satisfies

    $$ f_{noise}=\arg\min_{g\in V_m}J(g;\alpha,\varepsilon). $$

    Then, the error function can be rewritten as

    $$ e_N(x)=(f_{abs}(x)-f(x))+f_{noise}(x):=e_{abs}(x)+f_{noise}(x). $$

    Two important lemmas are introduced before the error estimation.

    First, an indicator function is defined to describe the distribution of the observation points.

    Definition 1. Divide the interval $[0,1]$ into $M$ disjoint subintervals $I_i$, $1\le i\le M$,

    $$ I_1=[0,d],\quad I_i=((i-1)d,\,id],\ 2\le i\le M, $$

    where $d=1/M$ is the grid spacing. Let $N_i$ be the number of observation data in each subinterval $I_i$, $1\le i\le M$. Define the indicator function $\rho_N(x)$ as

    $$ \rho_N(x)=\begin{cases}\rho_{N,i}=N_i(Nd)^{-1}, & x\in I_i,\\ 0, & x\notin[0,1].\end{cases} \quad (3.1) $$

    Lemma 1. ([10], Lemma 3.3) Assume that the indicator function $\rho_N(x)$ has an upper bound $\rho_u$ on the interval $[0,1]$,

    $$ \sup_{x\in[0,1]}\rho_N(x)\le\rho_u. $$

    Then, for a continuously differentiable function $f(x)$, $x\in[0,1]$, the mean square sum of its values at all observation points satisfies:

    $$ \frac{1}{N}\sum_{j=1}^{N}f^2(x_j)\le 2\rho_u\left(\|f\|_{L^2(0,1)}^2+d^2\|f'\|_{L^2(0,1)}^2\right). $$

    Next, the error estimate of cubic spline interpolation is introduced.

    Lemma 2. ([24], Theorem 1.56) Assume the objective function $f(x)\in H^2(a,b)$, $x_0=a$, $x_M=b$, with grid spacing $d$. Let $s_f$ be a cubic spline interpolant of $f$ with natural or fixed boundary conditions. Then the interpolation error can be estimated as

    $$ \|s_f-f\|_{L^2(a,b)}\le\frac{d^2}{4}\|s_f''-f''\|_{L^2(a,b)}\le\frac{d^2}{4}\|f''\|_{L^2(a,b)}, $$
    $$ \|s_f'-f'\|_{L^2(a,b)}\le\frac{d}{\sqrt{2}}\|s_f''-f''\|_{L^2(a,b)}\le\frac{d}{\sqrt{2}}\|f''\|_{L^2(a,b)}. $$

    In addition, if $s_f$ has natural boundary conditions, then the relation between the second derivatives of $f$ and $s_f$ is

    $$ \|s_f''\|_{L^2(a,b)}^2+\|s_f''-f''\|_{L^2(a,b)}^2=\|f''\|_{L^2(a,b)}^2. $$

    Now, the error estimate of the deterministic part is analyzed.

    Theorem 2. Assume that $\rho_N$ is an indicator function on the interval $[0,1]$ with upper bound $\rho_u$, and $f(x)\in H^2(0,1)$. Then, the mean square sum of $e_{abs}(x)$ at the observation points is estimated as follows:

    $$ \|e_{abs}\|_{L^2(0,1)}^2=\frac{1}{N}\sum_{j=1}^{N}\big(f_{abs}(x_j)-f(x_j)\big)^2\le\frac{9}{8}\frac{w_{\max}}{w_{\min}}\rho_u d^4\|f''\|_{L^2(0,1)}^2+\frac{\alpha}{w_{\min}}\|f''\|_{L^2(0,1)}^2. $$

    Proof. Consider the deterministic part. Let

    $$ E_N=\frac{1}{N}\sum_{j=1}^{N}\big(s_f(x_j)-f(x_j)\big)^2, $$

    where $s_f(x)$ is a cubic spline interpolant of $f$ satisfying natural or fixed boundary conditions on the interval $[0,1]$. By Lemmas 1 and 2, we have

    $$ E_N\le 2\rho_u\left(\|f-s_f\|_{L^2(0,1)}^2+d^2\|f'-s_f'\|_{L^2(0,1)}^2\right)\le 2\rho_u\left(\frac{d^4}{16}\|f''\|_{L^2(0,1)}^2+\frac{d^4}{2}\|f''\|_{L^2(0,1)}^2\right)\le\frac{9}{8}\rho_u d^4\|f''\|_{L^2(0,1)}^2. \quad (3.2) $$

    Moreover,

    $$ \frac{1}{N}\sum_{j=1}^{N}\big(f_{abs}(x_j)-f(x_j)\big)^2\le\frac{1}{w_{\min}}\left(\frac{1}{N}\sum_{j=1}^{N}w_j\big(f_{abs}(x_j)-f(x_j)\big)^2+\alpha\|f_{abs}''\|_{L^2(0,1)}^2\right). \quad (3.3) $$

    Since $f_{abs}$ is the minimizer of the weighted regularized functional $J(\cdot;\alpha,y-\varepsilon)$,

    $$ \frac{1}{N}\sum_{j=1}^{N}w_j\big(f_{abs}(x_j)-f(x_j)\big)^2+\alpha\|f_{abs}''\|_{L^2(0,1)}^2\le\frac{1}{N}\sum_{j=1}^{N}w_j\big(s_f(x_j)-f(x_j)\big)^2+\alpha\|s_f''\|_{L^2(0,1)}^2. \quad (3.4) $$

    Then,

    $$ \frac{1}{N}\sum_{j=1}^{N}w_j\big(f_{abs}(x_j)-f(x_j)\big)^2+\alpha\|f_{abs}''\|_{L^2(0,1)}^2\le w_{\max}\left(E_N+\frac{\alpha}{w_{\max}}\|s_f''\|_{L^2(0,1)}^2\right). \quad (3.5) $$

    From Eqs (3.3)–(3.5), we have

    $$ \frac{1}{N}\sum_{j=1}^{N}\big(f_{abs}(x_j)-f(x_j)\big)^2\le\frac{w_{\max}}{w_{\min}}E_N+\frac{\alpha}{w_{\min}}\|s_f''\|_{L^2(0,1)}^2. \quad (3.6) $$

    On the other hand, by Lemma 2,

    $$ \|s_f''\|_{L^2(0,1)}^2\le\|f''\|_{L^2(0,1)}^2. $$

    Combining (3.2) and (3.6),

    $$ \frac{1}{N}\sum_{j=1}^{N}\big(f_{abs}(x_j)-f(x_j)\big)^2\le\frac{9}{8}\frac{w_{\max}}{w_{\min}}\rho_u d^4\|f''\|_{L^2(0,1)}^2+\frac{\alpha}{w_{\min}}\|f''\|_{L^2(0,1)}^2. \quad (3.7) $$

    Remark 2. According to (3.7), in order to control the error effectively, the regularization parameter $\alpha$ should be chosen of the same order as $d^4$ to control the interpolation error.

    Next, consider the error estimate of the random part. The reconstruction result of the random part is

    $$ f_{noise}=\arg\min_{g\in V_m}J(g;\alpha,\varepsilon). $$

    By the conclusion of Section 2, we have

    $$ f_{noise}=H\lambda_\varepsilon, $$

    where $\lambda_\varepsilon$ satisfies

    $$ \lambda_\varepsilon=\frac{1}{N}\left(\frac{1}{N}H^TW^TWH+\alpha P\right)^{-1}H^TW^TW\varepsilon. $$

    Theorem 3. Under the above assumptions, the mean square sum of the random part $f_{noise}$ at the observation points satisfies, with probability $1-\beta$,

    $$ \|f_{noise}\|_{L^2(0,1)}^2=\frac{1}{N}\sum_{j=1}^{N}f_{noise}^2(x_j)\le\frac{4\sigma^2M}{w_{\min}\beta N}. $$

    Proof. The proof is divided into three steps.

    Step 1: Rewrite $\lambda_\varepsilon$ as

    $$ \lambda_\varepsilon=(\tilde H^T\tilde H+\alpha NP)^{-1}\tilde H^T\tilde\varepsilon,\quad \tilde H:=WH,\ \tilde\varepsilon:=W\varepsilon, $$
    $$ \phantom{\lambda_\varepsilon}=P^{-1}\tilde H^T(\tilde HP^{-1}\tilde H^T+\alpha NI)^{-1}\tilde\varepsilon=P^{-1}\tilde H^T(S+\alpha NI)^{-1}\tilde\varepsilon,\quad S:=\tilde HP^{-1}\tilde H^T, \quad (3.8) $$

    where $I$ is the $N\times N$ identity matrix.

    Step 2: Eigendecomposition of $S$. Since

    $$ S:=\tilde HP^{-1}\tilde H^T $$

    is obviously positive semidefinite, $S$ can be written as

    $$ S=UTU^T, $$

    where $U$ is an orthogonal matrix and $T$ is a diagonal matrix composed of the eigenvalues of $S$. Since

    $$ \mathrm{rank}(S)\le\mathrm{rank}(P^{-1})=M+3, $$

    $T$ can be written as

    $$ T=\mathrm{diag}(t_1,t_2,\ldots,t_{M+3},0,\ldots,0), $$

    where $\{t_j\}_{j=1}^{M+3}$ is a monotonically decreasing sequence,

    $$ t_1\ge t_2\ge\cdots\ge t_{M+3}\ge 0. $$

    Step 3: Obtain the estimate in the form of a confidence interval.

    The mean square sum of $f_{noise}$ at the observation points satisfies

    $$ \frac{1}{N}\sum_{j=1}^{N}f_{noise}^2(x_j)=\frac{1}{N}\|H\lambda_\varepsilon\|_2^2. $$

    Moreover,

    $$ E\|H\lambda_\varepsilon\|_2^2=E\left[\tilde\varepsilon^T(S+\alpha NI)^{-1}\tilde HP^{-1}H^THP^{-1}\tilde H^T(S+\alpha NI)^{-1}\tilde\varepsilon\right] $$
    $$ =E\left[\mathrm{tr}\left((W^TW)^{-1}S^2(S+\alpha NI)^{-2}\tilde\varepsilon\tilde\varepsilon^T\right)\right]=\mathrm{tr}\left((W^TW)^{-1}S^2(S+\alpha NI)^{-2}E[\tilde\varepsilon\tilde\varepsilon^T]\right) $$
    $$ \le\sigma^2\,\mathrm{tr}\left((W^TW)^{-1}S^2(S+\alpha NI)^{-2}\right)\le\frac{\sigma^2}{w_{\min}}\sum_{j=1}^{N}\frac{t_j^2}{(t_j+\alpha N)^2}\le\frac{\sigma^2}{w_{\min}}(M+3). \quad (3.9) $$

    Combined with the Markov inequality, which states that for a non-negative random variable $X$ and any given $a>0$,

    $$ P(X\ge a)\le\frac{E[|X|]}{a}, $$

    the mean square sum of the noise satisfies, with probability $1-\beta$:

    $$ \frac{1}{N}\sum_{j=1}^{N}f_{noise}^2(x_j)\le\frac{E\|H\lambda_\varepsilon\|_2^2}{\beta N}\le\frac{\sigma^2(M+3)}{w_{\min}\beta N}. $$

    Finally, we estimate the error $\|e_N\|_{L^2(0,1)}^2$. Theorems 2 and 3 analyze the errors of the deterministic and random parts of the reconstruction. Together they give the mean square error of $f(x)$ with probability $1-\beta$.

    Theorem 4. Assume that $\rho_N$ is an indicator function on the interval $[0,1]$ with upper bound $\rho_u$, and let $f(x)\in H^2(0,1)$. Then, for any $\beta\in(0,1)$, the error $e_N$ satisfies, with probability $1-\beta$:

    $$ \|e_N\|_{L^2(0,1)}^2\le\frac{9}{8}\frac{w_{\max}}{w_{\min}}\rho_u d^4\|f''\|_{L^2(0,1)}^2+\frac{\alpha}{w_{\min}}\|f''\|_{L^2(0,1)}^2+\frac{\sigma^2(M+3)}{w_{\min}\beta N}. \quad (3.10) $$

    Proof. The conclusion follows directly from Theorems 2 and 3.

    From inequality (3.10), the estimation error is determined by the model parameters. Thus, selecting appropriate model parameters is a very important problem. In this section, we determine the regularization parameter $\alpha$, the number of knots $M$, and the weight coefficients $W$ in Algorithm 1.

    First, we consider the regularization parameter $\alpha$ and the number of knots $M$. By Theorem 4, the three terms

    $$ \frac{9}{8}\frac{w_{\max}}{w_{\min}}\rho_u d^4\|f''\|_{L^2(0,1)}^2,\quad \frac{\alpha}{w_{\min}}\|f''\|_{L^2(0,1)}^2\quad\text{and}\quad \frac{\sigma^2(M+3)}{w_{\min}\beta N} $$

    must be of the same order of magnitude; otherwise, the error $\|e_N\|_{L^2(0,1)}^2$ becomes oversized or over-fitting occurs. Assume that the values of $\rho_u$ and $\|f''\|_{L^2(0,1)}^2$ do not affect the order of magnitude. Thus, we have

    $$ w_{\max}d^4\sim\alpha\sim\sigma^2M/N, $$

    where $d=1/M$. Substituting $d=1/M$ into $w_{\max}d^4\sim\sigma^2M/N$ gives $M^5\sim Nw_{\max}/\sigma^2$; hence the regularization parameter $\alpha$ and the number of knots $M$ satisfy:

    $$ M\sim\left(\frac{Nw_{\max}}{\sigma^2}\right)^{1/5},\quad \alpha\sim w_{\max}^{1/5}\left(\frac{\sigma^2}{N}\right)^{4/5}. \quad (4.1) $$

    Next, we consider the optimal choice of the weight coefficients $W$. For weighted linear regression, researchers have provided a method to choose the weight coefficients. Based on that method and the linear form of the reconstruction results, we provide Theorem 5 to choose the weight coefficients.

    Let $e_N=f_N-f$ be the error function of the proposed regularization algorithm. From the error function $e_N$ and Eq (2.5), the error comes from two sources: the interpolation error caused by the preselected interpolation knots and the random error caused by the observation noise. The error $e_N$ cannot easily be reduced, but we can select suitable weight coefficients $W$ to reduce its variance. Denote by $f^*(x)$ the best approximate solution of $f(x)$ in the space $V_m$:

    $$ f^*=\arg\min_{g\in V_m}\left(\frac{1}{N}\sum_{i=1}^{N}\big(g(x_i)-f(x_i)\big)^2+\alpha\|g''(x)\|_{L^2(0,1)}^2\right), \quad (4.2) $$

    where the error $e^*=f^*-f$ comes only from the interpolation error. Similarly to the derivation of Theorem 1, $f^*$ can be written as $f^*(x)=H_x\lambda_N^*$, where $\lambda_N^*$ is given by

    $$ \left(\alpha P+\frac{1}{N}H^TH\right)\lambda_N^*=\frac{1}{N}H^T\mathbf{f}, \quad (4.3) $$

    where $\mathbf{f}:=(f(x_1),f(x_2),\ldots,f(x_N))^T$.

    Theorem 5. Assume the regularization parameter $\alpha\ll 1$, $M$ is large enough, and the variance function satisfies $\sigma:[0,1]\to(0,+\infty)$. If $W^*$ is given by

    $$ W^*=\mathrm{diag}\left(\frac{1}{\sigma(x_1)},\frac{1}{\sigma(x_2)},\ldots,\frac{1}{\sigma(x_N)}\right), $$

    then $\lambda_N(W^*)$ is approximately the minimum-variance unbiased estimator of $\lambda_N^*$, that is, $\lambda_N^*\approx E[\lambda_N(W^*)]$ and, for any $W\in\mathbb{R}_+^{N\times N}$,

    $$ \mathrm{Var}(\lambda_N(W^*))\approx\min_W\mathrm{Var}(\lambda_N(W)). $$

    Proof. $\mathbf{f}$ can be written as

    $$ \mathbf{f}=H\lambda_N^*+\varepsilon_1,\quad \varepsilon_1\in\mathbb{R}^N, \quad (4.4) $$

    where $\varepsilon_1$ is the interpolation error, which tends to zero when $M$ is large enough. From the equation above and Eqs (2.5) and (1.1), we have

    $$ E[\lambda_N(W)]=\left(\alpha P+\frac{1}{N}H^TW^TWH\right)^{-1}\frac{1}{N}H^TW^TW(H\lambda_N^*+\varepsilon_1) $$
    $$ \approx\left(\alpha P+\frac{1}{N}H^TW^TWH\right)^{-1}\frac{1}{N}H^TW^TWH\lambda_N^*\quad(\varepsilon_1\to 0) $$
    $$ =\left(\alpha N(H^TW^TWH)^{-1}P+I_{(M+3)\times(M+3)}\right)^{-1}\lambda_N^*\ \to\ \lambda_N^*\quad(\alpha\to 0). \quad (4.5) $$

    Denote, as in Section 3, $\varepsilon=(\sigma(x_1)\varepsilon_1,\sigma(x_2)\varepsilon_2,\ldots,\sigma(x_N)\varepsilon_N)^T$, the diagonal matrix $D:=W^TW$ and the diagonal matrix $V:=\mathrm{diag}(\sigma^2(x_1),\sigma^2(x_2),\ldots,\sigma^2(x_N))$. Through the definitions above and Eqs (2.5), (1.1) and (4.5), for any $W\in\mathbb{R}_+^{N\times N}$, if $\varepsilon_1\to 0$ and $\alpha\to 0$, we have

    $$ \mathrm{Var}[\lambda_N(W)]=E\big[(\lambda_N(W)-\lambda_N^*)(\lambda_N(W)-\lambda_N^*)^T\big] $$
    $$ \approx E\left[\left(\left(\alpha P+\frac{1}{N}H^TDH\right)^{-1}\frac{1}{N}H^TD\varepsilon\right)\left(\left(\alpha P+\frac{1}{N}H^TDH\right)^{-1}\frac{1}{N}H^TD\varepsilon\right)^T\right] $$
    $$ \approx(H^TDH)^{-1}H^TD\,E[\varepsilon\varepsilon^T]\,DH(H^TDH)^{-1}=(H^TDH)^{-1}H^TDVDH(H^TDH)^{-1}\succeq(H^TV^{-1}H)^{-1}. $$

    Moreover, when $W=W^*$ (so that $D=V^{-1}$),

    $$ \mathrm{Var}[\lambda_N(W^*)]\approx(H^TV^{-1}H)^{-1}=\min_W\mathrm{Var}(\lambda_N(W)). $$

    By Theorem 5, when $\alpha\to 0$ and $M$ is large enough, we can choose $W^*$ to reduce the variance of the estimator $\lambda_N(W)$. However, the values $\{1/\sigma(x_i)\}_{i=1}^{N}$ in $W^*$ must not blow up: when some weight coefficients $1/\sigma(x_i)$ are too large, the reconstruction result is dominated by the data corresponding to these large weights. Thus, we suggest choosing the weight coefficients $W$ as follows:

    $$ W_{suit}=\mathrm{diag}\left(\frac{1}{k+\sigma(x_1)},\frac{1}{k+\sigma(x_2)},\ldots,\frac{1}{k+\sigma(x_N)}\right), \quad (4.6) $$

    where $k$ is a small positive constant such as 0.1 or 0.01, so that $w_i$ satisfies:

    $$ w_i=1/(k+\sigma(x_i))^2,\quad i=1,2,\ldots,N. \quad (4.7) $$

    Combining Eqs (4.1) and (4.7), we obtain the parameter selection in Table 1.
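    The rules of Table 1 can be packaged in a small Python helper. Rounding $M$ up and capping it below at 4 are ad hoc choices of this sketch, not prescribed by the paper:

```python
def choose_params(x, sigma, k=0.1):
    """Parameter choices following Table 1: M ~ N^{1/5}, alpha ~ sigma^2 N^{-4/5},
    and W entries 1/(k + sigma(x_i)) as in (4.6), so that w_i = 1/(k + sigma(x_i))^2."""
    x = np.asarray(x)
    N = x.size
    s = sigma(x)                          # sigma evaluated at the sample points
    sigma2 = np.max(s**2)                 # sigma^2 = max_j sigma^2(x_j)
    M = max(4, int(np.ceil(N**0.2)))      # M ~ N^{1/5}
    alpha = sigma2 * N**(-0.8)            # alpha ~ sigma^2 N^{-4/5}
    W_diag = 1.0 / (k + s)                # diagonal of W_suit in (4.6)
    return M, alpha, W_diag
```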

    In this section, we provide an illustrative example to show the feasibility and advantages of the weighted regularization method. Assume that the heteroscedastic model satisfies

    $$ y=f(x)+\sigma(x)\varepsilon, $$

    where $f(x):=40(x-0.5)^2$ and $\sigma(x):=3\sin(150\pi x)+3.5$. We consider the following problem: for a large positive integer $N=300$, given the observation data $\{(x_i,y_i)\}_{i=1}^{N}$ and the function $\sigma(x)$, reconstruct the function $f(x)$ and its derivative $f'(x)$. We select the number of knots $M=10$, mesh size $d=1/M=0.1$, regularization parameter $\alpha=10^{-4}$ and weight coefficients $W$ satisfying (4.6) with $k=0.1$.
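    Assuming the sketches from Sections 2 and 4, this experiment can be reproduced along the following lines (drawing the $x_i$ uniformly at random is an assumption of the sketch; the paper only fixes the sample size):

```python
rng = np.random.default_rng(0)

f_true = lambda x: 40 * (x - 0.5)**2
sigma  = lambda x: 3 * np.sin(150 * np.pi * x) + 3.5

N, M, alpha, k = 300, 10, 1e-4, 0.1
x = rng.uniform(0.0, 1.0, N)                       # scattered observation points
y = f_true(x) + sigma(x) * rng.standard_normal(N)  # model (1.1)

solver = OnlineTikhonov(M, alpha)
for xi, yi in zip(x, y):                           # one online pass over the data
    solver.update(xi, yi, 1.0 / (k + sigma(xi)))   # W entry from (4.6)
lam = solver.coefficients()

f_N  = lambda t: basis_row(t, M) @ lam             # reconstruction of f
df_N = lambda t: basis_row(t, M, deriv=1) @ lam    # reconstruction of f'
```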

    Figure 1 compares the reconstruction results ("improve") of this paper with the results ("tradition") of [10]. Where the function σ(x) changes sharply, our results are closer to the exact function than those of [10].

    Figure 1.  Observation points, noise data and reconstruction results "improve" and "tradition" in this paper and [10], respectively.

    In this section, we give two applications of our numerical differentiation method to two kinds of inverse problems.

    We consider a nonlinear ill-posed problem: the identification of the coefficient $c$ in the boundary value problem

    $$ f'(x)+c(x)f(x)=p(x),\quad f(0)=h_0, $$

    from the solution $f(x)$. We can obtain $c(x)$ directly from

    $$ c(x)=\frac{p(x)-f'(x)}{f(x)}. \quad (6.1) $$

    The example we take here is $f(x)=10e^{\sin(\pi x)}$, $p(x)=0$ and $h_0=10$. In our computation, the observation data $\{(x_i,y_i)\}_{i=1}^{N}$ satisfy

    $$ y=f(x)+\sigma(x)\varepsilon, $$

    where $\sigma(x)=5\sin(150\pi x)+5.1$. From these observation data and the improved method of this paper, we reconstruct $f(x)$ and $f'(x)$. The results are presented in Figure 2.

    Figure 2.  Observation points, noise data and reconstruction results "improve" and "tradition" in this paper and [10], respectively.

    Through the reconstructed $f(x)$ and $f'(x)$ and Eq (6.1), we obtain the estimator of $c(x)$ shown in Figure 3, where our method performs better than the traditional one.
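    Assuming the reconstructions are rebuilt from this example's data as in the Section 5 sketch (named f_N and df_N there), the estimator of $c$ is then a direct application of Eq (6.1); c_hat is a hypothetical helper name:

```python
def c_hat(t, p=lambda t: 0.0):
    # Plug the reconstructions into Eq (6.1): c = (p - f') / f.
    # Here p = 0, so this reduces to c(x) = -f'(x)/f(x).
    return (p(t) - df_N(t)) / f_N(t)
```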

    Figure 3.  Reconstructions of c: "improve" and "tradition" in this paper and [10], respectively.

    In this subsection, we apply the method of this paper to a simple but interesting problem in Computerized Tomography (CT).

    We assume that the attenuation coefficient of an object with respect to an X-ray at $x$ is $p(x)$. We scan the cross-section with an X-ray beam $L$ of unit intensity. The intensity past the object is $e^{-\int_L p(x)\,dx}$. We define the function $f$ as follows:

    $$ f(L):=\int_L p(x)\,dx. \quad (6.2) $$

    The main problem in CT is to recover the function $p$ from $f$. However, in many cases it is enough to reconstruct the interfaces of the different media, that is, the discontinuity points of $p$, which correspond directly to the nondifferentiable points of $f$. Thus we just need to determine the nondifferentiable points of $f(x)$.

    Let $D$ denote the triangular cross-section of an object. The attenuation coefficient is 0 outside the object and 1 inside, that is:

    $$ p(x)=\begin{cases}1, & x\in D;\\ 0, & x\notin D,\end{cases} \quad (6.3) $$

    see Figure 4.

    Figure 4.  Picture of D.

    From Figure 4 and Eq (6.2), the function $f(x)$ can be calculated directly:

    $$ f(x)=\begin{cases}10x-2, & 0.2\le x<0.5,\\ 8-10x, & 0.5\le x\le 0.8,\\ 0, & \text{elsewhere}.\end{cases} \quad (6.4) $$

    In our computation, $\{(x_i,y_i)\}_{i=1}^{N}$ denotes the observation data, where $y$ satisfies Eq (1.1) and $\sigma(x)=5\sin(150\pi x)+5$. Let the number of data $N=300$, the number of knots $M=50$, mesh size $d=1/M=0.02$, regularization parameter $\alpha=10^{-4}$ and the parameter $k=0.01$ in (4.6).

    In Figure 5, we present the observation data, the noise data and the reconstructions of $f(x)$ and $f'(x)$. Compared with the traditional method in [10], the improved approach of this paper reconstructs $f'(x)$ better and finds the three nondifferentiable points of $f(x)$: 0.2, 0.5 and 0.8.

    Figure 5.  Observation points, noise data and reconstruction results-"improve" and "tradition" in this paper and [10], respectively.

    In this paper, we propose a weighted online regularization method, Algorithm 1, which updates the reconstruction results from new input data without reprocessing old data. The error estimate in Section 3 shows that the algorithm reaches the best convergence order when the amount of observed data $N$ and the dimension $M$ of the linear space satisfy $M\sim N^{1/5}$. Compared with the classical B-spline method, weight coefficients and a relatively small number of knots for interpolation are used to reduce the effect of heteroscedasticity and the computational cost. Moreover, a feasible selection method for the regularization parameter, the number of knots and the weight coefficients is provided in Table 1.

    The biggest advantage of this algorithm is that it solves the problem of low data accuracy caused by heteroscedasticity by processing a large amount of data. It can significantly reduce the error of the observation data while maintaining computational efficiency.

    In future work, we will consider the higher-dimensional heteroscedasticity problem, which requires higher regularity of the function, and analyze other radial basis functions, such as Sobolev radial basis functions, to solve high-dimensional problems.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    This work was supported by Doctoral Research Initiation Fund of Shenzhen Institute of Information Technology with grant (No. SZIIT2023KJ003).

    The author declares no competing interest.



    [1] J. Cheng, X. Z. Jia, Y. B. Wang, Numerical differentiation and its applications, Inverse Probl. Sci. En., 15 (2007), 339–357. https://doi.org/10.1080/17415970600839093 doi: 10.1080/17415970600839093
    [2] M. Hanke, O. Scherzer, Inverse problems light: numerical differentiation, Am. Math. Mon., 108 (2001), 512–521. https://doi.org/10.1080/00029890.2001.11919778 doi: 10.1080/00029890.2001.11919778
    [3] D. H. Xu, Y. H. Xu, M. B. Ge, Q. F. Zhang, Models and numerics for differential equations and inverse problems, Beijing: Science Press, 2021.
    [4] P. Craven, G. Wahba, Smoothing noisy data with spline functions, Numer. Math., 31 (1978), 377–403. https://doi.org/10.1007/BF01404567 doi: 10.1007/BF01404567
    [5] D. L. Ragozin, Error bounds for derivative estimates based on spline smoothing of exact or noisy data, J. Approx. Theory, 37 (1983), 335–355. https://doi.org/10.1016/0021-9045(83)90042-4 doi: 10.1016/0021-9045(83)90042-4
    [6] J. P. Kaipio, E. Somersalo, Statistical and computational inverse problems, New York: Springer, 2005. https://doi.org/10.1007/b138659
    [7] L. Wasserman, All of nonparametric statistics, New York: Springer, 2006. https://doi.org/10.1007/0-387-30623-4
    [8] G. Claeskens, T. Krivobokova, J. D. Opsomer, Asymptotic properties of penalized spline estimators, Biometrika, 96 (2009), 529–544. https://doi.org/10.1093/biomet/asp035 doi: 10.1093/biomet/asp035
    [9] P. H. C. Eilers, B. D. Marx, Flexible smoothing with b-splines and penalties, Statist. Sci., 11 (1996), 89–121. https://doi.org/10.1214/ss/1038425655 doi: 10.1214/ss/1038425655
    [10] J. Zhang, J. Cheng, M. Zhong, A tikhonov regularization based algorithm for scattered data with random noise, arXiv: 2105.00747. https://doi.org/10.48550/arXiv.2105.00747
    [11] J. A. Fessler, Penalized weighted least-squares image reconstruction for positron emission tomography, IEEE T. Med. Imaging, 13 (1994), 290–300. https://doi.org/10.1109/42.293921 doi: 10.1109/42.293921
    [12] K. R. Ridgway, J. R. Dunn, J. L. Wilkin, Ocean interpolation by four dimensional weighted least squares-application to the waters around Australasia, J. Atmos. Ocean. Tech., 19 (2002), 1357–1375. https://doi.org/10.1175/1520-0426(2002)019 doi: 10.1175/1520-0426(2002)019
    [13] J. M. Wooldridge, Introductory econometrics: a modern approach, Boston: Cengage Learning, 2012.
    [14] Q. Feng, J. Hannig, J. S. Marron, A note on automatic data transformation, Stat., 5 (2016), 82–87. https://doi.org/10.1002/sta4.104 doi: 10.1002/sta4.104
    [15] J. Kalina, On heteroscedasticity in robust regression, International Days of Statistics and Economics, 41 (2011), 228–237. https://doi.org/10.1111/j.1467-9310.2011.00660.x doi: 10.1111/j.1467-9310.2011.00660.x
    [16] B. Sun, L. Ma, T. Shen, R. Geng, Y. Zhou, Y. Tian, A robust data-driven method for multiseasonality and heteroscedasticity in time series preprocessing, Wirel. Commun. Mob. Com., 2021 (2021), 6692390. https://doi.org/10.1155/2021/6692390 doi: 10.1155/2021/6692390
    [17] M. Marzjarani, A comparison of a general linear model and the ratio estimator, International Journal of Statistics and Probability, 9 (2020), 54–65. https://doi.org/10.5539/ijsp.v9n3p54 doi: 10.5539/ijsp.v9n3p54
    [18] A. Bashan, N. M. Yagmurlu, Y. Ucar, A. Esen, A new perspective for the numerical solution of the modified equal width wave equation, Math. Method. Appl. Sci., 44 (2021), 8925–8939. https://doi.org/10.1002/mma.7322 doi: 10.1002/mma.7322
    [19] A. Bashan, N. M. Yagmurlu, Y. Ucar, A. Esen, Finite difference method combined with differential quadrature method for numerical computation of the modified equal width wave equation, Numer. Meth. Part. D. E., 37 (2021), 690–706. https://doi.org/10.1002/num.22547 doi: 10.1002/num.22547
    [20] A. Bashan, A. Esen, Single soliton and double soliton solutions of the quadratic-nonlinear Korteweg-de Vries equation for small and long-times, Numer. Meth. Part. D. E., 37 (2021), 1561–1582. https://doi.org/10.1002/num.22597 doi: 10.1002/num.22597
    [21] Y. Ucar, N. M. Yagmurlu, A. Bashan, Numerical solutions and stability analysis of modified burgers equation via modified cubic b-spline differential quadrature methods, Sigma J. Eng. Nat. Sci., 37 (2019), 129–142.
    [22] A. Bashan, N. M. Yagmurlu, A mixed method approach to the solitary wave, undular bore and boundary-forced solutions of the regularized long wave equation, Comp. Appl. Math., 41 (2022), 169. https://doi.org/10.1007/s40314-022-01882-7 doi: 10.1007/s40314-022-01882-7
    [23] A. Bashan, N. M. Yagmurlu, Y. Ucar, A. Esen, Numerical approximation to the MEW equation for the single solitary wave and different types of interactions of the solitary waves, J. Differ. Equ. Appl., 28 (2022), 1193–1213. https://doi.org/10.1080/10236198.2022.2132154 doi: 10.1080/10236198.2022.2132154
    [24] G. Micula, S. Micula, Handbook of splines, Dordrecht: Springer, 1999. https://doi.org/10.1007/978-94-011-5338-6
    [25] A. Koppel, G. Warnell, E. Stump, A. Ribeiro, Parsimonious online learning with kernels via sparse projections in function space, J. Mach. Learn. Res., 20 (2019), 83–126.
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
