Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Download
Standard view
Full view
of .
Look up keyword
Like this
1Activity
0 of .
Results for:
No results containing your search query
P. 1
A Novel Hybridization of ABC with CBR for Pseudoknotted RNA Structure

A Novel Hybridization of ABC with CBR for Pseudoknotted RNA Structure

Ratings: (0)|Views: 79 |Likes:
Published by ijcsis
The RNA molecule is substantiated to play important functions in living cells. The class of RNA with pseudoknots, has essential roles in designing remedies for many virus diseases in therapeutic domain. These various useful functions can be inferred from RNA secondary structure with pseudoknots. Many computational intensive efforts have been emerged with the aim of predicting the pseudoknotted RNA secondary structure. The computational approaches are much promising to predict the RNA structure.
The reason behind this is that, the experimental methods for determining the RNA tertiary structure are difficult, timeconsuming and tedious. In this paper, we introduce ABCRna, a novel method for predicting RNA secondary structure with pseudoknots. This method combines heuristic-based KnotSeeker with a thermodynamic programming model, UNAFold. ABCRna is a hybrid swarm-based intelligence method inspired by the secreting honey process in natural honey-bee colonies. The novel aspect of this method is adapting Case-Based Reasoning (CBR) and knowledge base, two prominent Artificial Intelligence techniques. They are employed particularly to enhance the quality performance of the proposed method. The CBR provides an intelligent decision, which results more accurate predicted RNA structure. This modified ABCRna method is tested using different kinds of RNA sequences to prove and compare its efficiency against other pseudoknotted RNA predicted methods in the literature. The proposed ABCRna algorithm performs faster with significant improvement in accuracy, even for long RNA sequences.
The RNA molecule is substantiated to play important functions in living cells. The class of RNA with pseudoknots, has essential roles in designing remedies for many virus diseases in therapeutic domain. These various useful functions can be inferred from RNA secondary structure with pseudoknots. Many computational intensive efforts have been emerged with the aim of predicting the pseudoknotted RNA secondary structure. The computational approaches are much promising to predict the RNA structure.
The reason behind this is that, the experimental methods for determining the RNA tertiary structure are difficult, timeconsuming and tedious. In this paper, we introduce ABCRna, a novel method for predicting RNA secondary structure with pseudoknots. This method combines heuristic-based KnotSeeker with a thermodynamic programming model, UNAFold. ABCRna is a hybrid swarm-based intelligence method inspired by the secreting honey process in natural honey-bee colonies. The novel aspect of this method is adapting Case-Based Reasoning (CBR) and knowledge base, two prominent Artificial Intelligence techniques. They are employed particularly to enhance the quality performance of the proposed method. The CBR provides an intelligent decision, which results more accurate predicted RNA structure. This modified ABCRna method is tested using different kinds of RNA sequences to prove and compare its efficiency against other pseudoknotted RNA predicted methods in the literature. The proposed ABCRna algorithm performs faster with significant improvement in accuracy, even for long RNA sequences.

More info:

Published by: ijcsis on Dec 05, 2010
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as PDF, TXT or read online from Scribd
See more
See less

12/05/2010

pdf

text

original

 
(IJCSIS) International Journal of Computer Science and Information Security,Vol.
8
 , No.
8
 , 2010
A Novel Hybridization of ABC with CBR forPseudoknotted RNA Structure
Ra’ed M. Al-Khatib, Nur’Aini Abdul Rashid and Rosni AbdullahSchool of Computer ScienceUniversiti Sains Malaysia USMPenang, Malaysiarmaak.cod09@student.usm.my, {nuraini, rosni} @ cs.usm.my
 Abstract
— The RNA molecule is substantiated to playimportant functions in living cells. The class of RNA withpseudoknots, has essential roles in designing remedies formany virus diseases in therapeutic domain. These varioususeful functions can be inferred from RNA secondarystructure with pseudoknots. Many computational intensiveefforts have been emerged with the aim of predicting thepseudoknotted RNA secondary structure. The computationalapproaches are much promising to predict the RNA structure.The reason behind this is that, the experimental methods fordetermining the RNA tertiary structure are difficult, time-consuming and tedious. In this paper, we introduce ABCRna, anovel method for predicting RNA secondary structure withpseudoknots. This method combines heuristic-basedKnotSeeker with a thermodynamic programming model,UNAFold. ABCRna is a hybrid swarm-based intelligencemethod inspired by the secreting honey process in naturalhoney-bee colonies. The novel aspect of this method is adaptingCase-Based Reasoning (CBR) and knowledge base, twoprominent Artificial Intelligence techniques. They areemployed particularly to enhance the quality performance of the proposed method. The CBR provides an intelligentdecision, which results more accurate predicted RNAstructure. This modified ABCRna method is tested usingdifferent kinds of RNA sequences to prove and compare itsefficiency against other pseudoknotted RNA predicted methodsin the literature. The proposed ABCRna algorithm performsfaster with significant improvement in accuracy, even for longRNA sequences.
 Keywords-
 
 RNA
 
 secondary
 
 structure; pseudoknots; Case-Bases Reasoning; Artificial Bee Colony (ABC) algorithm.
I.
 
I
NTRODUCTION
Ribonucleic acid or (RNA) is one of the nucleic acids,which plays diverse roles and functions. Basically, one kindof RNA is the messenger RNA (mRNA). It works as anintermediary in carrying the genetic information code fromDNA to make proteins [1]. This carried genetic code is usedin the natural process for synthesizing proteins in living cell.However, the recent biological studies confirmed that thereare other kinds of RNAs, which play various useful roles [2].The latest discovered functions of RNA molecule, include:splicing introns, catalyst for reaction and a regular in cellularactivities [3, 4]. Predicting the RNA structure is the key todetermine and scrutinize the active functions of RNAmolecule. This fact is emphasized by central dogma inbiochemistry and biology research domain [5, 6]. The RNAsecondary structural outputs provide the base for shaping theRNA three-dimension (3D) structure, which is the first stepof the RNA tertiary structure phase.The importance of the computational methods forpredicting RNA secondary structure has been acknowledgedas a demanding research area, by computer scientists. Also,there are many conditions, facing the experimental methodsthat are used by biologists [7, 8]. The Nuclear MagneticResonance (NMR) and X-ray crystallography are the twopopular experimental purification methods that are used todetermine the RNA 3D spatial structure [9, 10]. Lateststudies confirmed that many classes of RNA moleculebroadly fold in the pseudoknot motif [11, 12]. Whereas, theRNA structural functions of pseudoknot elements, have beenemphasized to be prominent for medical processes anddesigning anti-viral treatments, in therapeutic research [13].Consequently, the computational RNA prediction methodsfor predicting the RNA secondary structures are extensivelyutilized with manageable efforts [14].The RNA molecules come in two main shapes: the Stem-loop and the Pseudoknots, as illustrated in Figure 1 in termsof RNA structure classifiers [15]. The Stem-loop is a non-crossing RNA structure motif. While, the Pseudoknots is acrossing RNA structure, which plausibly has been spotted by[16]. Further, the pseudoknotted RNAs has been proven toplay several vital roles. From complexity points of view, thetop prediction methods of RNA without pseudoknotsfunctional element are MFold [17] and Vienna [18]algorithms which execute with complexities
O
(
n
3
) in timeand
O
(
n
2
) in space. PknotsRG [19] is one of the most properalgorithm for predicting RNA with pseudoknots. It requires
O
(
n
4
) and
O
(
n
2
) in time and space complexities, respectively.Even if the pseudoknotted RNA secondary structureprediction problem has been stated as Non-deterministicPolynomial time (NP)-Complete problem [20, 21], it is aninsisted matter to be solved [22, 23], in recent years.In order to overcome the prediction problem of RNAsecondary structure with pseudoknots, this article introduces
School of Computer Sciences, University Sains Malaysia Penang, Malaysia.
289http://sites.google.com/site/ijcsis/ISSN 1947-5500
 
(IJCSIS) International Journal of Computer Science and Information Security,Vol.
8
 , No.
8
 , 2010
a nature-inspired hybrid method called “ABCRna”.Innovatively, this approach combines a new derivation fromArtificial Bee Colony (ABC) algorithm with a specialdeterministic constraints [24]. On top of this, it is borrowedfrom the Artificial Intelligence (AI) field, which is a kind of nature swarm-intelligence [25]. The objective of thisproposed method is to build the entire RNA secondarystructure with pseudoknots from a given single-strandedRNA primary sequence. Indeed, this proposed method is acombination of KnotSeeker (heuristic-based method [3])with UNAFold (a dynamic programming method [26]) forsolving the RNA structural related issue. This hybrid methodis a new derivation from ABC algorithm. It adapts theinspired swarm-based intelligence behavior of the honey-bees in collecting nectar and converting that to honey androyal jelly [27]. Naturally, every individual worker bee visitsmany flower patches during the round-trip of collectingnectar and pollen. Then it goes back to the hive to submit themixed nectar to the nurse bee. Finally, the nurse bee startsmaking honey by a natural biological secreting process.Intuitively, the proposed RNA structural hybrid methodis deployed and built to solve the related pseudoknotted RNAbioinformatics problem. By a deeper understanding of theCBR technique [28], the proposed hybrid model obtains aglobal optima RNA structural assurance results with moreaccuracy and better performance. Finally, the results showthat the ABCRna method significantly improves theexecution time and the accuracy in both sensitivity andspecificity. This improvement when comparing the outputswith the other pseudoknotted RNA prediction methodsexisting in the state-of-the-art like; FlexStem [29], HotKnots[30] and PknotsRG [19].The remainder of this article is ordered as follows: In thenext section, we start with describing the secondary structureof the RNA molecule, in computer context representation. Insection 3 background materials, gives a concise expression tothe generic ABC optimization method. Then, a derivation of 
Figure 1. A stem-loop and pseudoknots of RNA structures types.
 ABC is adapted to generate the proposed method. Next, theCBR as a modern AI technique, is extensively and widelydiscussed, from theoretical concept. Section 4 presents theproposed method with the implemental mapping betweenpseudoknotted RNA secondary structural prediction and thesecreting process of making honey. Subsequently, thefollowing section reports the comparative benchmark of theproposed method. The results of ABCRna is comparingagainst the results of other RNA prediction methods in theliterature. Finally, the article ends with conclusion remarks,in section 6.II.
 
S
ECONDARY
S
TRUCTURES OF
RNA
 
 A.
 
 RNA Stem-Loop (non-pseudoknots)
The single-stranded RNA molecule forms many foldedstructures in hierarchal shape; the primary RNA singlesequence, the secondary structure of RNA molecule, thethree-dimensional (3D) or tertiary RNA functional structureand the quaternary structure for RNA polymerase [31].Generally, the RNA computational methods predict thesecondary structure of the given RNA primary sequence.Thus, the
 RNA secondary structure
defines: as an RNAstructural motif, which in some parts includes the double-stranded motifs. These parts joined by complementary andcanonical base pairings with the other parts, which are thenon-paired single bases. The double-stranded motif partscoming in several shaped of stem-loops:
hairpin
,
internal
(or
interior 
),
bulge
,
multi
-
branch
external bases and
stacking
(or
helices
) loops. As explained above and illustrated in Figure2, the RNA primary sequence (RNA bases) folds and joinson itself in real RNA secondary structure by hydrogenchemical bonds for low energy and more stability [15]. Inmathematical and computational representation concept, thevarious layers of RNA structures can be defined as follows:
 
 b
=
 
b
1
,
b
2
,
,
b
i
,
…, b
n
, where
 b
is an RNA primarysequence and
b
i
is the RNA base or nucleotide [32, 33].The element
b
i
is also a member of set which includes{‘
 A
’,’
’,’
G
’,’
’,’
 N 
’}. While, the first four alphabets arerepresentation of the original paired bases (paired-nucleotides) of the real RNA molecule:
 Adenine, Cytosine,Guanine
and
Uracil
, respectively. The last nucleotide ‘
 N 
is assigned to the non-paired base. Such that the
n
is thelength of the given RNA sequence and
1≤≤
.
 
S
={(
b
i
,
b
 j
)}, such that (
b
i
,
b
 j
) belongs to the canonical basepairs.
S
is the secondary structure of the given RNAprimary sequence which satisfies the following conditions:-
 
(
b
i
,
b
 j
)
{(
 A
,
), (
,
 A
), (
G
,
), (
,
G
), (
G
,
) ,(
,
G
)},these are the sets of RNA base-pairs. While, the basepairs include in the set {
A-U 
,
U-A
,
G-C 
,
C-G
} is aWatson-Crick RNA base-pairs [34], the set {
A-U 
,
U-A
}is a Wobble RNA base-pair [35].-
 
Then
S
= {(
b
i
,
b
 j
):
1≤<≤
and
 >
},where

is a threshold constant number depend onthe limit length of the minimum un-paired bases in astem-loop (hairpin, stem or bulge ... etc). The

istypically taken to be equal three.
290http://sites.google.com/site/ijcsis/ISSN 1947-5500
 
(IJCSIS) International Journal of Computer Science and Information Security,Vol.
8
 , No.
8
 , 2010
-
 
If (
b
i
,
b
 j
)
 
S
, (
b
,
b
l
)
 
S
and if 
b
i
=
b
, then
b
 j
=
b
l
. Thisimplies (
b
i
,
b
 j
) = (
b
,
b
l
). In another words, every base(nucleotide) in RNA secondary structure make join byhydrogen bond at most with another one base (non-tripleor only allow one-to-one).-
 
If (
b
i
,
b
 j
)
 
S
, (
b
,
b
l
)
 
S
and
<
, this can include twolocation elements in RNA stem-loop structure (
non- pseudoknots
):
 
If 
<<<
, then the two base pairs are form atype of nested location elements (nested-fashion), asdepictured in Figure 3 a
.
 
 
If 
<<<
, then the two base pairs are form atype of juxtaposed location elements (juxtaposed-fashion) [36], as shown in Figure 3 b
.
 
 B.
 
 RNA with Pseudoknots
The majority of RNA molecule classes fold in functionalstructural elements called pseudoknots. Indeed, they belongto the (3D) tertiary structure element and perform animportant useful roles and constructive functions [37].The pseudoknots substructure can theoretically satisfy thefollowing term. If there are two base pairs (
b
i
,
b
 j
) and (
b
,
b
l
),then satisfy the conditions:
<<<
or
<<<
,as shown in Figure 3 c and d. These two base paired shapesare represented the pseudoknots RNA structural elements. Inanother word, the pseudoknots is a crossing sub-structuralfunctional element in the RNA molecules. It formsinteraction the unpaired bases part of the stem-loop, whichfolds back and join in a loop region located outside thatstem-loop.In spite of the prediction algorithms of RNA withpseudoknots structural elements, have been proven to be NP-complete problem [21]. It is a demanding research areabecause of the pseudoknotted RNAs has importance as keyfunctions. Further it plays essential roles in viral and cellularregulatory [38].
Figure 2. Different RNA element shapes, the image is adapted from [39].Figure 3. The diagrammatic position relation between different types of RNA base pairs. (a) two base-pair in juxtaposed fashion. (b) two base-pair innested fashion. (c)&(d) two base-pair in pseudoknots.
III.
 
B
ACKGROUND
M
ATERIALS
 
 A.
 
Problem Statement of RNA with Pseudoknot 
Pseudoknotted RNA secondary structure is the problemof predicting its secondary structure from a given primarysequence. Particularly, it has recently become attractiveresearch area. Due to that the RNA with pseudoknots, hasmany important and useful roles, which needs to be solvedcomputationally [40]. The existing pseudoknotted RNAprediction algorithms perform in exponential timecomplexity. The best prediction method run, in the worstcase,
O
(
n
4
) in time and
O
(
n
2
) in space [19]. Thus they runvery slowly and need an ever increasing memory-space,especially for long sequences. Veritably, this means that theprediction solving algorithms of the pseudoknotted RNAsecondary structural problem, suffer from long executiontime and storage complexities. To the best knowledge of theauthors, the final structural results suffer from poor qualityand inaccuracy, for long RNA sequences.The pseudoknots class of the RNA structural predictionissue, has been proven an NP-complete problem [20].Increasingly, the collecting nectar to make honey is aninspired field for the bioinformatics researchers, which isderived from the original ABC model [24]. In this article, anew hybrid method as a sub-area of swarm intelligenceapproaches for solving the pseudoknotted RNA structuralproblem is adapted. Besides that the CBR as a modern AItechnique highlighted a way to be deployed, in term of enhancement the final results of the proposed hybridABCRna model. From comparison points of view, we findthis method improved the accuracy of the RNA structuraloutputs with good performance.
 B.
 
Swarm-Intelligence in AI TechnologySwarm Intelligence
(SI): is an emergent and bioinspiredfield of AI, which has been generated from numerousresearches in social insect’s behavioural models [41]. Thephrase
swarm
comes up to present solution to overcome theoptimization problems. These optima solutions have beensuccessfully got by utilizing the co-operative and
291http://sites.google.com/site/ijcsis/ISSN 1947-5500

You're Reading a Free Preview

Download
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->