A | T | C | G | |
A | 0 | 1859 | 0 | 0 |
T | 1543 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 1019 |
G | 0 | 0 | 856 | 0 |
In this paper, we consider the linear Rayleigh-Taylor instability of an equilibrium state of 3D gravity-driven compressible viscoelastic fluid with the elasticity coefficient κ is less than a critical number κc in a moving horizontal periodic domain. We first construct the maximal growing mode solutions to the linearized equations by studying a family of modified variational problems, and then we prove an estimate for arbitrary solutions to the linearized equations.
Citation: Caifeng Liu. Linear Rayleigh-Taylor instability for compressible viscoelastic fluids[J]. AIMS Mathematics, 2023, 8(7): 14894-14918. doi: 10.3934/math.2023761
[1] | Rajish Kumar P, Sunil Jacob John . On redundancy, separation and connectedness in multiset topological spaces. AIMS Mathematics, 2020, 5(3): 2484-2499. doi: 10.3934/math.2020164 |
[2] | Jerolina Fernandez, Hüseyin Işık, Neeraj Malviya, Fahd Jarad . Nb-fuzzy metric spaces with topological properties and applications. AIMS Mathematics, 2023, 8(3): 5879-5898. doi: 10.3934/math.2023296 |
[3] | Budi Nurwahyu, Naimah Aris, Firman . Some results in function weighted b-metric spaces. AIMS Mathematics, 2023, 8(4): 8274-8293. doi: 10.3934/math.2023417 |
[4] | Arife Atay . Disjoint union of fuzzy soft topological spaces. AIMS Mathematics, 2023, 8(5): 10547-10557. doi: 10.3934/math.2023535 |
[5] | Gültekin Soylu, Müge Çerçi . Metrization of soft metric spaces and its application to fixed point theory. AIMS Mathematics, 2024, 9(3): 6904-6915. doi: 10.3934/math.2024336 |
[6] | Li Shen, Jian Liu, Guo-Wei Wei . Evolutionary Khovanov homology. AIMS Mathematics, 2024, 9(9): 26139-26165. doi: 10.3934/math.20241277 |
[7] | Yaoqiang Wu . On partial fuzzy k-(pseudo-)metric spaces. AIMS Mathematics, 2021, 6(11): 11642-11654. doi: 10.3934/math.2021677 |
[8] | Saif Ur Rehman, Arjamand Bano, Hassen Aydi, Choonkil Park . An approach of Banach algebra in fuzzy metric spaces with an application. AIMS Mathematics, 2022, 7(5): 9493-9507. doi: 10.3934/math.2022527 |
[9] | Hasanen A. Hammad, Maryam G. Alshehri . Generalized Ξ-metric-like space and new fixed point results with an application. AIMS Mathematics, 2023, 8(2): 2453-2472. doi: 10.3934/math.2023127 |
[10] | Mazhar Mehmood, Abdullah Shoaib, Nabil Mlaiki . Fixed point results on triple controlled quasi rectangular metric like spaces. AIMS Mathematics, 2023, 8(5): 10049-10066. doi: 10.3934/math.2023509 |
In this paper, we consider the linear Rayleigh-Taylor instability of an equilibrium state of 3D gravity-driven compressible viscoelastic fluid with the elasticity coefficient κ is less than a critical number κc in a moving horizontal periodic domain. We first construct the maximal growing mode solutions to the linearized equations by studying a family of modified variational problems, and then we prove an estimate for arbitrary solutions to the linearized equations.
The strings of DNA sequences are shaped from nucleotides which are bonded together. DNA has four nucleotides called guanine G, cytosine C, adenine A and thymine T (or uracil U). G (resp. A) is paired with C (resp. T or U). The interaction between them is folding. The chain of nucleotides may be folding and bonding, but these interactions only occur under specific energy conditions. Here, we utilize nucleotide chains in conformity with the topological model; see [1,10]. A change or metamorphosis is called a mutation. Chromosome and gene alterations, known as mutations in biology, frequently manifest physically. The consequences of a mutation depend on the region where the genetic material's sequence has changed. On the other hand, insertion or deletion mutations might result in the production of gene products that are not functional. Large-scale mutations can also occur, resulting in the inversion, insertion, duplication, deletion, transposition, or translocation of lengthy strands of DNA. A mutation's outcome could be negative, positive, neutral, or even barely noticeable. A mutation may result in the removal or addition of a particular function, altered levels of expression, or even mortality in the developing embryo. A lot of scientists have worked hard to find and fix gene mutations. They have employed a few techniques for investigation, including single-stranded DNA oligonucleotide analysis, single-strand conformation polymorphism analysis, two-dimensional gene scanning, protein truncation testing, and denaturing high performance liquid chromatography [5,35]. The use of computer technology for the management of biological data is known as bioinformatics. To collect, store, analyze, and combine biological and family data for use in the discovery and development of gene-based drugs, information processing systems are utilized. The increase of publicly accessible genetic data as a result of the Human Genome Project has sparked the need for bioinformatics capabilities. A virus may cause mutations or the host may edit them, and sequencing mistakes can further complicate matters.
Multiset theory was introduced by Gostelow [23]. The concept of a multiset (or bag) is the generalization of a set. A member of a multiset has more than one membership (see [7,8,29,36,37]); the use of multisets in mathematics predates the name multiset by nearly 90 years.
Topology is a branch of geometry with the name of rubber sheet geometry. It has many real-life applications and solves some problems that are directly or indirectly related to continuity. Its study does not depend on the dimension, i.e., increasing or decreasing can occur without cutting [26,31,38]. Using the neighborhood system, graphs have been represented topologically, as in [14,32], and some topologies have been represented by neighborhoods and graphs, as in [28]. Recently, both graphs and rough sets have been used to represent structures such as self-similar fractals [11,15], the human heart [12,13,33] and DNA [17,18,19], making them useful in physics, medicine and biology [2,3,4], respectively.
Graph theory is a mathematical tool to solve some real-life problems. Graphs can be used to model many types of relations and processes in physical, biological [30], social and information systems. Many practical problems can be represented by graphs. Many previous studies have investigated the similarity of genetic sequences [20,21,27,34].
Our aim with this paper is to examine the existence of gene mutations based on relations, and through the use of metric space, topological structures and graph-based models. We generate a code that depends on the multiset, the relation and the metric space between the sequences of genes (DNA sequences) to determine the presence of the mutation, its locations, if any, and the amino acids. Graph theory is used to determine mutations of genes, and the similarity between DNA sequences will be studied. Finally, we combine the concept of a multiset and association indices to study the similarity between mutations for genes.
In what follows, a short survey of multisets and the corresponding theories will be given based on the work of Yager in [39]. Also, we introduce a short survey on genetic mutations, as described in [5,37], and some methods for mutation analysis and mutation detection are mentioned.
Definition 2.1. [9,24,39] A multiset M assigned from a nonempty set X and presented by a function CM(x):X→N, where N denotes the natural numbers. CM(x) represents the number of element x which occurs in M. In other words, M from X={x1,x2, ⋯,xn} to N is written as M= {m1x1,m2x2, ⋯, mnxn}, where mi is the number of xi, for i =1,2,⋯,n that can occur in M.
Proposition 2.2. Let M and N be two multisets assigned on X. Then, the following holds
(i) M=N if CM(x)=CN(x) ∀ x∈X.
(ii) M⊆N if CM(x)≤CN(x) ∀ x∈X.
(iii) W= M∪N if CW(x)= Max{CM(x),CN(x)} ∀ x∈X.
(v) W= M⊖N if CW(x)= Max{CM(x)−CN(x),0} ∀ x∈X.
(vi) W= M⊕N if CW(x)= CM(x)+CN(x) ∀ x∈X.
(vii) W= M∩N if CW(x)= Min{CM(x),CN(x)} ∀ x∈X.
Here, ⊖ and ⊕ denote a multiset subtraction and a multiset addition, respectively. It is noted that any set is a special case of multiset.
Definition 2.3. [22] Let K1 and K2 be two multisets assigned on X, and have CK1 and CK2, respectively. The Cartesian product of K1 and K2 is defined by K1×K2= {(mx,ny)mn: x∈mK1,y∈nK2}.
Definition 2.4. [22] A submultiset R of M×M is said to be a multiset relation on M, if for each (mx,ny) of R, there is a product of C1(x,y) and C2(x,y) which can be counted. The relationship between mx and ny can be formulated as mxRny.
In graph theory [10], the set of vertices will be denoted by V of a finite set. The set of edges have the form E(V)= {{u,v} s. t. u,v∈V u≠v}. In other words, u,v are called adjacent vertices. In this paper, the graph will be denoted by G=(V,E), VG is the vertex of G and EG is the set of edges. The graph G=(VG,EG) is a directed graph if each edge has a direction. The Łkaszyk-Karmowski distance function [43] is a function defining a distance between two random variables or two random vectors. The axioms of this function are as follows:
● d(x,y)>0,
● d(x,y)=d(y,x),
● d(x,z)≤d(x,y)+d(y,z).
In this section, the concepts of multisets, metric spaces and multiset relations are applied in MSC code building. The MSC code determines the existence of a mutation locations and number of mutations; it also identifies amino acids. The MSC code specifies the number of (A, T, G, C) elements, the relation between them, the places of their difference their numbers, and the different amino acids. Also, the code can be used to study the similarity between the DNA sequences (numbers of elements, matches and mismatches).
Definition 3.1. Let M3′5′ be a sense strand of DNA and M5′3′ be an antisense strand of DNA. Define a multiset of DNA sequence as M={mix:x∈{A,T,G,C}, where mi is the time of occurring for x}
Remark 3.2. In Definition 3.1, there are two DNA multisets M1 and M2 for M3′5′ and M5′3′, respectively.
Definition 3.3. Let M1 and M2 be DNA multisets. Define the DNA Cartesian product of M1 and M2 by M1×M2= {(mx,ny)mn: x∈mM1,y∈nM2}.
Definition 3.4. Let M1 and M2 be DNA multisets. Define multibinary relation R⊆M1×M2= {(mx,ny): x∈mM1,y∈nM2}.
Definition 3.5. Let M1 and M2 be DNA multisets. Define the correlation coefficient between M1 and M2 as CM1,M2=1|M1||M2|∑xi∈M1,yi∈M2C(xi)C(yi), where C(xi) and C(yi) are time of occurring for xi and yi in M1 and M2, respectively.
Corollary 3.6. CM1,M1=1|M1|2∑xi∈M1(C(xi))2.
Corollary 3.7. 0<CM1,M2≤1.
Proof. Since M1≠ϕ→|M1|≠0, M2≠ϕ→|M2|≠0 and |M1|≥CM1(xi), |M2|≥CM1(yi), then |M1||M2|≥∑CM1(xi)CM2(yi). Therefore, 0<CM1,M2≤1.
Remark 3.8. n(M1) is the number of elements existing in DNA multiset M1, and n(M1)=4 at most.
Definition 3.9. Let M1 and M2 be DNA multisets, n(M1)= n(M1). Define the distance function between M1 and M2 as dDNA(M1,M2)= CM1,M2√CM1,M1×CM2,M2.
Remark 3.10. From Definition 3.9, dDNA(M1,M2)= ∑xi,yiC(xi)C(yi)√∑xi(CM1(xi))2×∑xi(CM2(yi))2.
Theorem 3.11. dDNA(M1,M2) is associated with the following axioms:
(i) dDNA>0,
(ii) dDNA(M1,M1)=1,
(ii1) dDNA(M1,M2)=dDNA(M2,M1),
(iv) dDNA(M1,M3)+dDNA(M3,M2)≥dDNA(M1,M2).
Proof. (i) By Corollaries 3.6 and 3.7, 0<CM1,M2≤1, CM1,M1>0 and CM2,M2>0. Then, dDNA>0.
(ii) By Corollary 3.6, dDNA(M1,M1)=1.
(iii) By Definition 3.5, dDNA(M1,M2)= CM1,M2√CM1,M1×CM2,M2 = CM2,M1√CM2,M2×CM1,M1= dDNA(M2,M1).
(iv) Since |M1| ≥ CM1(xi), |M2| ≥ CM2(yi) and |M3| ≥ CM3(zi), then, by Definition 3.5, CM1,M2= 1|M1||M2| ∑xi∈M1,yi∈M2C(xi)C(yi), CM2,M3= 1|M2||M3| ∑zi∈M3,yi∈M2C(zi)C(yi) and CM1,M3= 1|M1||M3|∑xi∈M1,zi∈M3C(xi)C(zi). To prove that dDNA(M1,M3)+dDNA(M3,M2)≥dDNA(M1,M2), it is sufficient to prove that CM1,M2√CM1,M1×CM2,M2 ≤ CM1,M3√CM1,M1×CM3,M3+ CM3,M2√CM3,M3×CM2,M2. Since CM1,M3+ CM3,M2= 1|M1||M3| ∑xi∈M1,zi∈M3C(xi)C(zi)+ 1|M3||M2| ∑zi∈M3,yi∈M2C(zi)C(yi)= 1|M2||M1||M3| |M2| ∑xi∈M1,zi∈M3 C(xi)C(zi)+ 1|M1||M3||M2| |M1| ∑zi∈M3,yi∈M2 C(zi)C(yi) ≥ 1|M2||M1||M3| |M2| ∑xi∈M1,zi∈M3C(xi)C(zi) ≥ 1|M2||M1||M3| ∑yi∈M2C(yi) ∑xi∈M1,zi∈M3C(xi)C(zi) ≥ 1|M1||M2| ∑xi∈M1,yi∈M2C(xi)C(yi)= CM1,M2, where |M2||M1| ≤ ∑(C(xi))2 × ∑(C(yi))2, |M2||M3| ≤ ∑(C(yi))2 × ∑(C(zi))2, |M3||M1| ≤ ∑(C(zi))2 ×∑(C(xi))2, |M2||M1| ≥ √∑(C(xi))2×∑(C(yi))2, |M2||M3| ≥ √∑(C(yi))2×∑(C(zi))2 and |M3||M1| ≥ √∑(C(zi))2×∑(C(xi))2. Therefore, |M2||M1| ≥√|M2||M1| ≥√CM1,M1×CM2,M2.
Remark 3.12. We can call the function dDNA a DNA metric space.
Remark 3.13. Theorem 3.11 satisfies the condition of Łkaszyk-Karmowski distance.
Proposition 3.14. The distance function 1−dDNA is a metric space.
Proof. Refer to Theorem 3.11.
Theorem 3.15. If dDNA(M1,M2) = 1, then there is no mutation.
Proof. Let M1= {n1/G,n2/A,n3/T,n4/C}, M2= {m1/C,m2/T,m3/A,m4/G} and dDNA(M1,M2) = 1. Then, by Theorem 3.11, we get that dDNA(M1,M2)= 1 =CM1,M2√CM1,M1×CM2,M2. Then, CM1,M2= √CM1,M1 implies that C2M1,M2= CM1,M1 × CM2,M2. Using Definition 3.5, (n1m1+n2m2+n3m3+n4m4)2 = (n21+n22+n23+n24) ⋅ (m21+m22+m23+m24). Then, n1=λm1, n2=λm2, n3=λm3 and n4=λm4. So, if λ=1, then n1=m1, n2=m2, n3=m3 and n4=m4. This means that there is no mutation.
Corollary 3.16. From Theorem 3.15, we have that dDNA(M1,M2)≠1; then, there is a mutation.
We present the MSC code in the Appendix as an algorithm which is used to generate multisets, relations and a metric space between M1 and M2. Some examples are given to illustrate the proposed results and MSC code algorithm.
Example 3.17. Arabidopsis thaliana gamma-glutamylcysteine synthetase gene (abbr. CAD2) [44]
TairAccession:1005028114.GenBankAccession:AF068299.SequenceLength5277. |
5′ATCGATATGTAACACAAT⋯TGTATGTTTTT3′;
3′TAGCTATACATTGTGTTA⋯ACATACAAAAA5′. Using the MSC code algorithm, we have M1= {1019G,1543A,1859T,856C}, |M1|= 5277;
M2= {1019C,1543T,1859A,856G}, |M2|= 5277.
The distance between M1 and M2 equals 1 (no mutation). The relation between M1 and M2 according to their MSC code, is described in Table 1. The relation between M1 and M2 is
A | T | C | G | |
A | 0 | 1859 | 0 | 0 |
T | 1543 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 1019 |
G | 0 | 0 | 856 | 0 |
R= {(1859T,1859A),(1543A,1543T),(1019G,1019C),(856C,856G)}. This relation indicates no mutation.
Example 3.18. If we do a mutation in CAD2 [44] in Example 3.1. Using the MSC code algorithm, we have M1= {1014G,1539A,1859T,860C}, |M1|= 5272; M2= {1014C,1543T,1859A,856G}, |M2|= 5272.
The distance between M1 and M2 equals 0.9999979580282978 according to the MSC code; the result was a mutation and this corresponds to data reported by the National Center for Biotechnology Information (NCBI) [44]. The position of the mutation is
[2568,2578,2595,2609,2639,5076];
[C,T,,C,C,,G,C];
[T,T,C,T,A,T].
The amino acid resulting from the mutation is presented in Table 2. The relation between M1 and M2 is outlined in Table 2. The relation between M1 and M2 is R= {(1G, 1A), (1T, 1T), (3C, 3T), (1C, 1C), (1539A, 1539T), (1858T, 1858A), (1013G, 1013C), (856C, 856G)} according to their MSC code as described in Table 3. This relation indicates a mutation.
5′⋯3′=M1 | 3′⋯5′=M2 | Amino acid 5′⋯3′=M1 | Amino acid 3′⋯5′=M2 | Position | |
0 | T | C | ATT | TAC | 2568 |
1 | T | T | TTA | TAT | 2578 |
2 | C | C | TGC | ACC | 2595 |
3 | T | C | TTT | ACA | 2609 |
4 | A | G | AAA | TGT | 2639 |
5 | T | C | ATT | TAC | 5076 |
A | T | C | G | |
A | 0 | 1858 | 0 | 1 |
T | 1539 | 1 | 3 | 0 |
C | 0 | 0 | 1 | 1013 |
G | 0 | 0 | 856 | 0 |
Corollary 3.19. From Theorem 3.15, the relation will be R= {(n1/G,n1/C)/n1n1, (n2/A,n2/T)/n2n2, (n3/T,n3/A)/n3n3, (n4/C,n4/G)/n4n4}.
Proposition 3.20. Let R be a relation between DNA multisets M1 and M2. Then, if R is either reflexive or transitive, then there is a mutation.
Proof. Suppose that M1= {n1x:x∈{A,T,G,C}, where n1 is the number occurrences of {x}= {n1/G,n2/A,n3/T,n4/C}, M2= m1y: y∈{A,T,G,C}, where m1 is the number occurrences of {y}={m1/C,m2/T,m3/A,m4/G} and R={(mx,ny)}:x∈mM1,y∈nM2}. Then, every C is linked with G. On the other side, A is linked with T≡U. Otherwise, a mutation will be occurred.
In the study of the congruence and determination of the presence of mutations between DNA sequences, we have four bases {A,T,G,C}; thus, we have the 12 mutation rates A→C, A→G, ⋯, T→G at a particular site. In the study of the similarity between the DNA sequences, we have 12 difference rates A→T, A→G, ⋯, T→A at a particular site. We can use these rates to study SARS-CoV-2 through the mutation of its genes. So, our study can yield a model of the pattern of mutations in SARS-CoV-2 and the alternative model for the mutations that occur in SARS- CoV-2 can be developed. If the length ni,j>0, then we suggest Si={A,T,C,G}, which has more than an average likelihood of linking to Sj. For each i=1,⋯,4, define Ri= Si∪Sj, ni,j>0. The collection R0= {Ri}4i is not itself a topology, but we extend it to one, defining 0 to be a minimal topological structure on genotypes containing R0. The topological space τ0 will be generated by a basis induced by a finite intersection of the sets in R0. The topological structure is referred as a mutation space and is called a mutation topological structure.
In this section, we use the proposed MSC code algorithm, the following definition is given.
Definition 4.1. Let X be the set of nucleotides of a DNA sequence such that X={A,T,G,C}, and let there exist a bonding between xi,MSCyj in X, referred to as n(xi,yj)≠0. Otherwise, n(xi,yj)=0.
Definition 4.2. Let X be the set of nucleotides of a DNA sequence. Define a relation R∗={(x,y): n(x,y)≠0,x,y∈X}.
We state some properties for the cases of mutations and no mutation.
(i) If there is no mutations, then
● R∗ is not reflexive,
● R∗ is symmetric,
● R∗ is transitive.
(ii) If there is a mutation, then R∗ may be reflexive, symmetric and transitive.
Example 4.3. {NM 000518.4 Homo sapiens hemoglobin subunit beta (abbr. HBB), mRNA [44] sequence: HBB gene range: 1 to 626};
5′ACATTTGCTT⋯CATTGC3′;
3′TGTAAACGAA⋯GTAACG5′;
M1= {157G, 167A, 137T, 165C}, |M1|= 626;
M2= {157C, 167T, 137A, 165G}, |M2|= 626.
The relation between M1 and M2 according to the MSC code is described in Tables 4 and 5. R∗= {(T,A), (A,T), (G,C), (C,G)}. This relation indicates no mutation and is consistent with the report by the NCBI [44].
A | T | C | G | |
A | 0 | 137 | 0 | 0 |
T | 167 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 157 |
G | 0 | 0 | 165 | 0 |
A | T | C | G | |
A | - | √ | - | - |
T | √ | - | - | - |
C | - | - | - | √ |
G | - | - | √ | - |
Example 4.4. If we do a mutation in CAD2, then M1= {1012G, 1541A, 1856T, 856C}, |M1|= 5265, and M2= {1012C, 1540T, 1858A, 855G}, |M2|= 5265. The position of the mutation [17,686,5073] is [G,C,A] and [A,A,C]. The amino acid which results from the mutation is presented in Table 6. The distance between M1 and M2 equals 0.999999362175112, according to the MSC code. The relation between M1 and M2 according to the MSC code, is described in Tables 7 and 8. R∗= {(T,A), (C,A), (G,A), (A,T), (A,C), (G,C), (C,G)}. So, there is a mutation.
5′⋯3′ | 3′⋯5′ | Amino acid 5′⋯3′ | Amino acid 3′⋯5′ | Position | |
0 | G | A | AGA | TAT | 17 |
1 | C | A | TCT | AAA | 686 |
2 | A | C | ACA | TGC | 5073 |
A | T | C | G | |
A | 0 | 1856 | 1 | 1 |
T | 1540 | 0 | 0 | 0 |
C | 1 | 0 | 0 | 1011 |
G | 0 | 0 | 855 | 0 |
A | T | C | G | |
A | - | √ | √ | √ |
T | √ | - | - | - |
C | √ | - | - | √ |
G | - | - | √ | - |
Next, a topological structure will be defined in terms of R∗.
Definition 4.5. Let R∗ be a relation on X. Define a subbase S= {xR∗:x∈X} for some topology τDNA on X.
Example 4.6. (continued from Example 4.3) S= {AR∗, TR∗, CR∗, GR∗}= S= {{A}, {T},{C}, {G}}. Then, the base β will be {X,{A}, {T},{C}, {G}} and τDNA= {X,ϕ,{A}, {T}, {C}, {G}, {A,T}, {T,C}, {C,G}, {G,A}, {A,C}, {T,G}, {A,T,C}, {A,T,G}, {T,C,G}, {A,G,C}}. Therefore, this space is discrete. This means that, if every subset of X= {A,T,C,G} is open and closed, then there is no mutation.
Example 4.7. (continued from Example 4.4) S= {{T,C}, {A}, {A,G}, {A,C}}. Therefore, a base β= {X, {T,C}, {A}, {A,G}, {A,C}, {C}} and τDNA= {X, ϕ, {T,C}, {A}, {A,G}, {A,C}, {C}, {A,T,C}, {A,C,G}}≡ general topology.
Now, the existence of a mutation will be determined based on the type of topological structure.
Proposition 4.8. If the DNA sequence has a mutation, then the generated topology is a general topological structure.
Proof. Let τDNA be a class of sets on X generated by the mutation of DNA sequences. Consider that τDNA= {G:G= ⋃i(⋂njAij), Aij∈S} is a class of sets of X. Now, it is sufficient to prove that τDNA is a topological structure.
(i) ∩nj∈ϕAij=X∈τDNA and ⋃i∈ϕ(⋂njAij)=ϕ∈τDNA.
(ii) G1,G2,⋯Gn∈τDNA; then, G1=⋃i1(⋂nj1Ai1j1), G2=⋃i2(⋂nj2Ai2j2),⋯,Gn=⋃in(⋂njnAinjn). G1∩G2∩⋯∩Gn= ⋃i1,i2⋯in(n⋂j1Ai1j1)∩n⋂j2Ai2j2∩⋯∩⋃in⋂njnAinjn)= ⋃i1,i2,⋅,in(n⋂k1Bk1jk), where Bkjk=Ai1j1∩Ai2j2∩⋯∩Ainjn. Since each Ai1j1,Ai2j2⋯Ainjn∈S, Bkjk∈β; therefore, G1∩G2∩⋯∩Gn∈τDNA.
(ii) G1,G2,⋯Gn⋯∈τDNA; then, G1∪G2∪⋯∪Gn∪⋯= ⋃i1,i2⋯in(n⋂j1Ai1j1 ∪ n⋂j2Ai2j2 ∪⋯∪ ⋃inn⋂jnAinjn∪⋯). Hence, G1∪G2∪⋯∪Gn∪⋯∈τDNA.
Proposition 4.9. If there is no mutation in DNA, then τDNA is a discrete topology.
Proof. Suppose that the DNA sequence has no mutation. Then, n(x,x)=0 and n(x,y)≠0 ∀ x,y∈X. Hence, xR∗= {{y}: ∀ y∈X}. This means that A→T, C→G, T→A, G→C. So, S= {{y}:y∈X}. Therefore, τDNA is discrete.
The converse of Proposition 4.9 may not be true, in general.
Example 4.10. S={{T},{A}, {C,G},{C}} is a subbase for a discrete topological structure. But, there is a mutation because G is bonded with C.
Example 4.11. Let a DNA sequence be 5′ACGT3′ and 3′GATC5′. Then, S= {{G}, {A}, {T}, {C}} and τDNA is a discrete topology. But, there is a mutation, as no gene consists of only four nucleotides (length 4 only or a little more); the length of a gene is measured in kilobytes.
Corollary 4.12. If the topological structure generated from the DNA sequences has
(1)- a general topological structure, then there is a mutation;
(2)- a discrete topology, then there may or may not be a mutation. We use Theorem 3.15, Corollary 3.16 and Proposition 3.20 to figure out if there is a mutation.
Bass et al. [6] provided an overview of commonly used association indices, including the Jaccard index and the Pearson correlation coefficient, and compared their performance on different types of analysis for a biological network. An association index is a measure that quantifies interaction profile similarity. They discussed the differences and similarities between association indices. There exist many association indices:
(i) The Jaccard index: JAB= |NA∩NB||NA∪NB|;
(ii) The Simpson index: SAB= |NA∩NB|Min{|NA|,|NB|};
(iii) The geometric index: GAB= |NA∩NB|2|NA|.|NB|;
(iv) The cosine index: CAB=|NA∩NB|√|NA⋅NB|.
We apply association indices to determine whether there is a mutation by calculating the association indices for each pair of nucleotides {A,T,G,C} of the gene. If the associations are zero, there is no mutation; otherwise, there is a mutation.
Example 5.1. (continued from Example 4.3)
e1= 167= e5, e2= 157= e7, e3= 165= e6, e4= 137= e8.
Let X-type ={A}, Y-type ={T}.
Then, we have the following:
Jaccard = 02=0, Simpson = 01=0, Geometric = 01=0, Cosine = 01=0. Then, Jaccard = Simpson = Geometric = Cosine = 0. This is for all nucleotides {A,T,G,C}. Hence, there is no mutation. This is shown in Figures 1–3.
Example 5.2. (continued from Example 4.4)
Let X-type = {A}, Y-type = {G}.
Since e1= 1540= e5, e2=1011= e7, e3=855=e6 and e4=1856=e8, e9=1=e12, e10=1=e13 and e11=1=e14, we have the following:
Jaccard = 12,
Simpson = 11,
Geometric = 12,
Cosine = 1√2.
Then, Jaccard, Simpson, Geometric and Cosine ≠0. This is for all nucleotides A,T,G,C. Hence, there is a mutation. This is shown in Figures 4–7.
Note that the degree of a node A, say, |NA|, is defined as the number of nodes with which it interacts and |NA∩NB| is the shared partners. Biological processes are implemented through complex interaction networks. Metrics known as association indices can be used to quantify the similarity between genes through the use of a multiset. So, |NA| is the cardinality of a multiset M. Then, the similarity association indices become
MJAB= |NA∩NB||NA∪NB|, MSAB= |NA∩NB|Min{|NA|,|NB|}, MGAB= |NA∩NB|2|NA|⋅|NB and MCAB= |NA∩NB|√|NA|⋅|NB|.
The dissimilarity association indices are as follows
M∗JAB=1−|NA∩NB||NA∪NB|, M∗SAB=1−|NA∩NB|Min{|NA|,|NB|}, M∗GAB=1−|NA∩NB|2|NA|⋅|NB and M∗GAB=1−|NA∩NB|√|NA|⋅|NB|.
Remark 5.3. (i) MAB≥MAB.
(ii) If |NA|=|NB|, then MAB=MAB.
Theorem 5.4. The similarity association indices are DNA metric spaces.
Theorem 5.5. The dissimilarity association indices are metric spaces.
Example 5.6. (continued from Example 5.1)
Let X-type ={A}, Y-type ={T}; also, e1, e8 and e8=137.
NA={e1T,}, NB={e8A}.
Jaccard = 0304=0,
Simpson = 0137=0,
Geometric = 022879=0,
Cosine = 0151.26=0,
Then, Jaccard = Simpson = Geometric = Cosine = 0. This is for all nucleotides {A,T,G,C} since e1=1540=e5, e2=1011=e7, e3=855=e6 and e4=1856=e8, e9=1=e12, e10=1=e13; e11=1=e14. Hence, there is no mutation. This is shown in Figure 8.
Example 5.7. (continued from Example 5.2)
Let X-type = {A}, Y-type = {G}, e1= 1540, e6= 855 and e9 = 1. NA= {e1T,e9C}, NB={e6C}.
MJAB=|NA∩NB||NA∪NB| = |{e1T,e9C}∩{e6C}||NA∪NB| = |min{e9,e6}||max{{e6,e9},e1} = |{e9}||{e6,e1} = 12395≠0,
MSAB= |NA∩NB|Min{|NA|,|NB|}≠0,
MGAB= |NA∩NB|2|NA|⋅|NB≠0,
MCAB= |NA∩NB|√|NA|⋅|NB|≠0. Additionally, e1=1540=e5, e2=1011=e7, e3=855=e6; e4=1856=e8, e9=1=e12, e10=1=e13 and e11= 1= e14.
Then, Jaccard = Simpson = Geometric = Cosine ≠0. This is for all nucleotides {A,T,G,C}. Hence, there is a mutation. This is shown in Figure 9.
Example 5.8. (A similarity and dissimilarity between the sequences of DNA)
Let GATACCCCCCGG, GATACGACCCGG, GATACGCCCCGG, CATACGACTCGG and GATAGACTCGG be five sequences for DNA. Then, A= {3G, 2A, 1T, 6C}, |A|=12, B= {4C, 1T, 3A, 4G}, |B|= 12, C= {5C, 1T,2A, 4G}, |C|= 12, D= {4C, 2T, 3A, 3G}, |D|= 12 and E= {3C, 2T, 3A, 4G}, |E|= 12. Hence, M∗J(AB)=0.286, M∗G(AB)=0.306, M∗C(AB)=0.17, M∗S(AB)=0.17, M∗J(AC)=0.154, M∗G(AC)=0.16, M∗C(AC)=0.084, M∗S(AC)=0.084, M∗J(AD)=0.286, M∗G(AD)=0.306, M∗C(AD)=0.167, M∗S(AD)=0.17, M∗J(AE)=0.4, M∗G(AE)=0.438, M∗C(AE)=0.25 and M∗S(AE)=0.25. But, the balance of dissimilarity is (A,A)=0, (A,B)=0.17, (A,C)=0.08, (A,D)=0.33 and (A,E)=0.25, according to the NCBI [44]. By matching the results of the association indices with the reports from the NCBI, it was found that the association indices M∗C and M∗S are the best.
The complicated DNA research has become easier by using topology. Recently, many topologists found new methods to examine the mutations of DNA by using a combination of multiset topology and graph theory. Moreover, our presented results for repairing compatibility between the mathematical methods and biological solutions. In addition, we give a decision of mutation that is dependent on the metrics between two sequences of a gene and the topological structure derived from the relations. In the future, we can benefit from mutations by applying them end epidemics and in the fields of industry and agriculture. We have studied and identified mutations and showed how to make new ones, including how to fix mutations and apply Mathematica to construct models. Consequently, they are very significant in decision-making [25,40,41,42]. The introduced techniques are very useful in application because they pave the way for more topological applications for real-life problems. We also have an interesting application of our approaches to DNA sequences. The study of similarity between DNA sequences will be used to solve problems related to diseases and viruses, such as COVID-19 [16], which is an important example of mutations nowadays.
The authors declare they have not used artificial intelligence tools in the creation of this article.
This work was supported by Researchers Supporting Project number (RSP2023R488), King Saud University, Riyadh, Saudi Arabia.
The authors declare that there are no conflicts of interest.
[caption = Read the data from files and print the length of each DNA sequence, label = {Read}, language = python] |
import pandas as pd |
m1 = pd.read_csv('M1.txt', header = None) |
M2 = pd.read_csv('M2.txt', header = None) |
print (len(M1.values[0][0])) |
print (len(M2.values[0][0])) |
output |
[caption = Mh_dna is a function to count A, T, G, C and distance function to calculate distance, label = {Mh_dna}, language = python] import |
math def Mh_dna(x): |
count_a = 0 |
count_t = 0 |
count_g = 0 |
count_c = 0 |
for i in x: |
if i = ='A': |
count_a = count_a+1 |
if i = ='T': |
count_t = count_t+1 |
if i = ='G': |
count_g = count_g+1 |
if i = ='C': |
count_c = count_c+1 |
return count_a, count_t, count_g, count_c |
def distance(M1, M2): |
#n2 = no(A), n3 = no(T), n1 = no(G), n4 = no(C) |
#m3 = no(A), m2 = no(T), m4 = no(G), m1 = no(C) |
n2, n3, n1, n4 = Mh_dna(M1) |
m3, m2, m4, m1 = Mh_dna(M2) |
C_aa = n2**2+n3**2+n1**2+n4**2 |
C_bb = m2**2+m3**2+m1**2+m4**2 |
C_ab = n1*m1+n2*m2+n3*m3+n4*m4 |
dist = C_ab/math.sqrt(C_aa* C_bb) |
return dist |
def sequence_IDENTICAL(seq_a, seq_b): |
len1 = len(seq_a) |
len2 = len(seq_b) |
mismatches = [] |
for pos in range (0, min(len1, len2)) : |
if seq_a[pos]! = seq_b[pos]: |
mismatches.append('|') |
else: |
mismatches.append(' ') |
print (seq_a) |
print ("".join(mismatches)) |
print (seq_b) |
def seq_count_pair(seq_a, seq_b): |
len1 = len(seq_a) |
len2 = len(seq_b) |
columns = ['A', 'T', 'C', 'G'] |
index = ['A', 'T', 'C', 'G'] |
df = pd.DataFrame(0, columns = columns, index = index) |
for pos in range (0, min(len1, len2)): |
for i, j in enumerate(columns): |
if (seq_a[pos] = = columns[i] and seq_b[pos] = = columns[0]): |
k = columns[0] |
df[j][k] = df[j][k]+1 |
elif(seq_a[pos] = = columns[i] and seq_b[pos] = = columns[1]): |
k = columns[1] |
df[j][k] = df[j][k]+1 |
elif(seq_a[pos] = = columns[i] and seq_b[pos] = = columns[2]): |
k = columns[2] |
df[j][k] = df[j][k]+1 |
elif(seq_a[pos] = = columns[i] and seq_b[pos] = = columns[3]): |
k = columns[3] |
df[j][k] = df[j][k]+1 |
return df |
def sequence_complement(seq_a, seq_b): |
len1 = len(seq_a) |
len2 = len(seq_b) |
i = [] |
mismatches = [] |
for pos in range (0, min(len1, len2)) : |
if ((seq_a[pos] = = 'A' and seq_b[pos] = = 'T') |
or(seq_a[pos] = = 'T' and seq_b[pos] = = 'A') |
or(seq_a[pos] = = 'G' and seq_b[pos] = = 'C') |
or(seq_a[pos] = = 'C' and seq_b[pos] = = 'G')): |
mismatches.append('') |
else: |
i.append(pos) |
if(len(i)>0): |
x = [seq_a[j] for j in i] |
y = [seq_b[j] for j in i] |
print(i) |
print (x) |
print (y) |
kg = [] |
kg2 = [] |
for k1 in i: |
if k1%3==0: |
k2=str(seq_a[k1-2]+seq_a[k1-1]+seq_a[k1]) |
k3=str(seq_b[k1-2]+seq_b[k1-1]+seq_b[k1]) |
kg.append(k2) |
kg2.append(k3) |
elif k1%3==1: |
k2=str(seq_a[k1]+seq_a[k1+1]+seq_a[k1+2]) |
k3=str(seq_b[k1]+seq_b[k1+1]+seq_b[k1+2]) |
kg.append(k2) |
kg2.append(k3) |
elif k1%3==2: |
k2 = str(seq_a[k1-1]+seq_a[k1]+seq_a[k1+1]) |
k3 = str(seq_b[k1-1]+seq_b[k1]+seq_b[k1+1]) |
kg.append(k2) |
kg2.append(k3) |
return i, x, y, kg, kg2 |
import pandas as pd |
M1 = pd.read_csv('M1.txt', header = None) |
M2 = pd.read_csv('M2.txt', header = None) |
print (len(M1.values[0][0])) |
print (len(M2.values[0][0])) |
M11 = M1.values[0][0] n2, n3, n1, n4 = Mh_dna(M11) |
print("M1: count A=", n2, "count T=", n3, "count G=", n1, "count C=", n4) |
M22 = M2.values[0][0] m3, m2, m4, m1 = Mh_dna(M22) |
print("M2: count A=", m3, "count T=", m2, "count G=", m4, "count C=", m1) |
t = distance(M11, M22) print('distance = ', t) |
df_p = seq_count_pair(M11, M22) i, x, y, kg, kg2 = sequence_complement(M11, M22) |
df2 = pd.DataFrame({'Position':i, 'M1':x, 'M2':y, 'Amino acid M1':kg, 'Amino acid M2':kg2}) |
columns = ['A', 'T', 'C', 'G'] |
index = ['A', 'T', 'C', 'G'] |
df = pd.DataFrame(0, columns = columns, index = index) |
[1] | S. Chandrasekhar, Hydrodynamic and hydromagnetic stability, Clarendon Press, 1961. |
[2] |
R. Duan, F. Jiang, On the Rayleigh-Taylor instability for incompressible, inviscid magnetohydrodynamic flows, SIAM J. Appl. Math., 71 (2011), 1990–2013. https://doi.org/10.1137/110830113 doi: 10.1137/110830113
![]() |
[3] |
D. Ebin, Ill-posedness of the Rayleigh-Taylor and Helmholtz problems for incompressible fluids, Commun. Part. Diff. Eq., 13 (1988), 1265–1295. https://doi.org/10.1080/03605308808820576 doi: 10.1080/03605308808820576
![]() |
[4] | G. L. Gui, Z. F. Zhang, Global stability of the compressible viscous surface waves in an infinite layer, 2022, arXiv: 2208.06654. |
[5] | Y. Guo, I. Tice, Compressible, inviscid Rayleigh-Taylor instability, Indiana U. Math. J., 60 (2011), 677–712. |
[6] |
Y. Guo, I. Tice, Linear Rayleigh-Taylor instability for viscous, compressible fluids, SIAM J. Math. Anal., 42 (2010), 1688–1720. http://dx.doi.org/10.1137/090777438 doi: 10.1137/090777438
![]() |
[7] |
G. J. Huang, F. Jiang, W. W. Wang, On the nonlinear Rayleigh-Taylor instability of nonhomogeneous incompressible viscoelastic fluids under L2-norm, J. Math. Anal. Appl., 455 (2017), 873–904. https://doi.org/10.1016/j.jmaa.2017.06.022 doi: 10.1016/j.jmaa.2017.06.022
![]() |
[8] |
H. Hwang, Y. Guo, On the dynamical Rayleigh-Taylor instability, Arch. Ration. Mech. An., 167 (2003), 235–253. https://doi.org/10.1007/s00205-003-0243-z doi: 10.1007/s00205-003-0243-z
![]() |
[9] |
F. Jiang, On effects of viscosity and magnetic fields on the largest growth rate of linear Rayleigh-Taylor instability, J. Math. Phys., 57 (2016), 111503. https://doi.org/10.1063/1.4966924 doi: 10.1063/1.4966924
![]() |
[10] |
F. Jiang, S. Jiang, On instability and stability of three-dimensional gravity flows in a bounded domain, Adv. Math., 264 (2014), 831–863. https://doi.org/10.1016/j.aim.2014.07.030 doi: 10.1016/j.aim.2014.07.030
![]() |
[11] |
F. Jiang, S. Jiang, On linear instability and stability of the Rayleigh-Taylor problem in magnetohydrodynamics, J. Math. Fluid Mech., 17 (2015), 639–668. https://doi.org/10.1007/s00021-015-0221-x doi: 10.1007/s00021-015-0221-x
![]() |
[12] |
F. Jiang, S. Jiang, On the stabilizing effect of the magnetic field in the magnetic Rayleigh-Taylor problem, SIAM J. Math. Anal., 50 (2018), 491–540. https://doi.org/10.1137/16M1069584 doi: 10.1137/16M1069584
![]() |
[13] |
F. Jiang, S. Jiang, G. X. Ni, Nonlinear instability for nonhomogeneous incompressible viscous fluids, Sci. China Math., 56 (2013), 665–686. https://doi.org/10.1007/s11425-013-4587-z doi: 10.1007/s11425-013-4587-z
![]() |
[14] |
F. Jiang, S. Jiang, Y. J. Wang, On the Rayleigh-Taylor instability for the incompressible viscous magnetohydrodynamic equations, Commun. Part. Diff. Eq., 39 (2014), 399–438. https://doi.org/10.1080/03605302.2013.863913 doi: 10.1080/03605302.2013.863913
![]() |
[15] |
F. Jiang, S. Jiang, G. C. Wu, On stabilizing effect of elasticity in the Rayleigh-Taylor problem of stratified viscoelastic fluids, J. Funct. Anal., 272 (2017), 3763–3824. https://doi.org/10.1016/j.jfa.2017.01.007 doi: 10.1016/j.jfa.2017.01.007
![]() |
[16] |
J. Jang, I. Tice, Instability theory of the Navier-Stokes-Poisson equations, Anal. PDE, 6 (2013), 1121–1181. https://doi.org/10.2140/apde.2013.6.1121 doi: 10.2140/apde.2013.6.1121
![]() |
[17] |
J. Jang, I. Tice, Y. J. Wang, The compressible viscous surface-internal wave problem: Nonlinear Rayleigh-Taylor instability, Arch. Ration. Mech. An., 221 (2016), 215–272. https://doi.org/10.1007/s00205-015-0960-0 doi: 10.1007/s00205-015-0960-0
![]() |
[18] |
H. Kull, Theory of the Rayleigh-Taylor instability, Phys. Rep., 206 (1991), 197–325. https://doi.org/10.1016/0370-1573(91)90153-D doi: 10.1016/0370-1573(91)90153-D
![]() |
[19] |
J. Prüss, G. Simonett, On the Rayleigh-Taylor instability for the two-phase Navier-Stokes equations, Indiana Univ. Math. J., 59 (2010), 1853–1872. https://doi.org/10.1512/iumj.2010.59.4145 doi: 10.1512/iumj.2010.59.4145
![]() |
[20] |
L. Rayleigh, Investigation of the character of the equilibrium of an in compressible heavy fluid of variable density, P. Lond. Math. Soc., s1-14 (1882), 170–177. https://doi.org/10.1112/plms/s1-14.1.170 doi: 10.1112/plms/s1-14.1.170
![]() |
[21] |
G. I. Taylor, The stability of liquid surface when accelerated in a direction perpendicular to their planes. I., Proc. Roy. Soc. London A, 201 (1950), 192–196. https://doi.org/10.1098/rspa.1950.0052 doi: 10.1098/rspa.1950.0052
![]() |
[22] |
W. W. Wang, Y. Y. Zhao, On the Rayleigh-Taylor instability in compressible viscoelastic fluids, J. Math. Anal. Appl., 463 (2018), 198–221. https://doi.org/10.1016/j.jmaa.2018.03.018 doi: 10.1016/j.jmaa.2018.03.018
![]() |
[23] |
Y. J. Wang, I. Tice, The viscous surface-internal wave problem: nonlinear Rayleigh-Taylor instability, Commun. Part. Diff. Eq., 37 (2012), 1967–2028. https://doi.org/10.1080/03605302.2012.699498 doi: 10.1080/03605302.2012.699498
![]() |
[24] |
Y. J. Wang, I. Tice, C. Kim, The viscous surface-internal wave problem: Global well-posedness and decay, Arch. Ration. Mech. Anal., 212 (2014), 1–92. https://doi.org/10.1007/s00205-013-0700-2 doi: 10.1007/s00205-013-0700-2
![]() |
[25] |
Y. Y. Zhao, W. W. Wang, On the Rayleigh-Taylor instability incompressible viscoelastic fluids under L1-norm, J. Comput. Appl. Math., 383 (2021), 113130. https://doi.org/10.1016/j.cam.2020.113130 doi: 10.1016/j.cam.2020.113130
![]() |
1. | M. Badr, Radwan Abu-Gdairi, A. A. Nasef, Mutations of Nucleic Acids via Matroidal Structures, 2023, 15, 2073-8994, 1741, 10.3390/sym15091741 | |
2. | Andrzej Tomski, Szymon Łukaszyk, Reply to "Various issues around the L1-norm distance", 2024, 2976-730X, 1, 10.59973/ipil.77 | |
3. | Md Mirazul Hoque, Baby Bhattacharya, Binod Chandra Tripathy, Connectedness and Compactness via Subspace Mixed M-Topologies, 2024, 94, 0369-8203, 335, 10.1007/s40010-024-00884-w | |
4. | H. Saber Osman, E. El-Seidy, A. E. Radwan, A. M. Kozae, S. A. El-Sheikh, Ab̧delfattah El-Atik, Connectivity on multi-pretopological structures and decision-making for COVID-19, 2025, 36, 1012-9405, 10.1007/s13370-025-01248-2 | |
5. | Satabdi Ray, Md Mirazul Hoque, Baby Bhattacharya, Binod Chandra Tripathy, Separation axioms in fuzzy multiset topological spaces, 2025, 0219-1377, 10.1007/s10115-025-02352-2 | |
6. | Radwan Abu-Gdairi, R. Mareay, M. Badr, On Multi-Granulation Rough Sets with Its Applications, 2024, 79, 1546-2226, 1025, 10.32604/cmc.2024.048647 |
A | T | C | G | |
A | 0 | 1859 | 0 | 0 |
T | 1543 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 1019 |
G | 0 | 0 | 856 | 0 |
5′⋯3′=M1 | 3′⋯5′=M2 | Amino acid 5′⋯3′=M1 | Amino acid 3′⋯5′=M2 | Position | |
0 | T | C | ATT | TAC | 2568 |
1 | T | T | TTA | TAT | 2578 |
2 | C | C | TGC | ACC | 2595 |
3 | T | C | TTT | ACA | 2609 |
4 | A | G | AAA | TGT | 2639 |
5 | T | C | ATT | TAC | 5076 |
A | T | C | G | |
A | 0 | 1858 | 0 | 1 |
T | 1539 | 1 | 3 | 0 |
C | 0 | 0 | 1 | 1013 |
G | 0 | 0 | 856 | 0 |
A | T | C | G | |
A | 0 | 137 | 0 | 0 |
T | 167 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 157 |
G | 0 | 0 | 165 | 0 |
A | T | C | G | |
A | - | √ | - | - |
T | √ | - | - | - |
C | - | - | - | √ |
G | - | - | √ | - |
5′⋯3′ | 3′⋯5′ | Amino acid 5′⋯3′ | Amino acid 3′⋯5′ | Position | |
0 | G | A | AGA | TAT | 17 |
1 | C | A | TCT | AAA | 686 |
2 | A | C | ACA | TGC | 5073 |
A | T | C | G | |
A | 0 | 1856 | 1 | 1 |
T | 1540 | 0 | 0 | 0 |
C | 1 | 0 | 0 | 1011 |
G | 0 | 0 | 855 | 0 |
A | T | C | G | |
A | - | √ | √ | √ |
T | √ | - | - | - |
C | √ | - | - | √ |
G | - | - | √ | - |
A | T | C | G | |
A | 0 | 1859 | 0 | 0 |
T | 1543 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 1019 |
G | 0 | 0 | 856 | 0 |
5′⋯3′=M1 | 3′⋯5′=M2 | Amino acid 5′⋯3′=M1 | Amino acid 3′⋯5′=M2 | Position | |
0 | T | C | ATT | TAC | 2568 |
1 | T | T | TTA | TAT | 2578 |
2 | C | C | TGC | ACC | 2595 |
3 | T | C | TTT | ACA | 2609 |
4 | A | G | AAA | TGT | 2639 |
5 | T | C | ATT | TAC | 5076 |
A | T | C | G | |
A | 0 | 1858 | 0 | 1 |
T | 1539 | 1 | 3 | 0 |
C | 0 | 0 | 1 | 1013 |
G | 0 | 0 | 856 | 0 |
A | T | C | G | |
A | 0 | 137 | 0 | 0 |
T | 167 | 0 | 0 | 0 |
C | 0 | 0 | 0 | 157 |
G | 0 | 0 | 165 | 0 |
A | T | C | G | |
A | - | √ | - | - |
T | √ | - | - | - |
C | - | - | - | √ |
G | - | - | √ | - |
5′⋯3′ | 3′⋯5′ | Amino acid 5′⋯3′ | Amino acid 3′⋯5′ | Position | |
0 | G | A | AGA | TAT | 17 |
1 | C | A | TCT | AAA | 686 |
2 | A | C | ACA | TGC | 5073 |
A | T | C | G | |
A | 0 | 1856 | 1 | 1 |
T | 1540 | 0 | 0 | 0 |
C | 1 | 0 | 0 | 1011 |
G | 0 | 0 | 855 | 0 |
A | T | C | G | |
A | - | √ | √ | √ |
T | √ | - | - | - |
C | √ | - | - | √ |
G | - | - | √ | - |