Professional Documents
Culture Documents
Download textbook 11Th International Conference On Practical Applications Of Computational Biology Amp Bioinformatics 1St Edition Florentino Fdez Riverola ebook all chapter pdf
Download textbook 11Th International Conference On Practical Applications Of Computational Biology Amp Bioinformatics 1St Edition Florentino Fdez Riverola ebook all chapter pdf
https://textbookfull.com/product/practical-applications-of-
computational-biology-and-bioinformatics-13th-international-
conference-florentino-fdez-riverola/
https://textbookfull.com/product/8th-international-conference-on-
practical-applications-of-computational-biology-bioinformatics-
pacbb-2014-1st-edition-julio-saez-rodriguez/
https://textbookfull.com/product/practical-applications-of-
computational-biology-bioinformatics-14th-international-
conference-pacbb-2020-1st-edition-gabriella-panuccio/
https://textbookfull.com/product/emerging-trends-in-applications-
and-infrastructures-for-computational-biology-bioinformatics-and-
systems-biology-systems-and-applications-1st-edition-arabnia/
https://textbookfull.com/product/genedis-2018-computational-
biology-and-bioinformatics-panayiotis-vlamos/
https://textbookfull.com/product/introduction-to-bioinformatics-
with-r-a-practical-guide-for-biologists-chapman-hall-crc-
computational-biology-series-1st-edition-edward-curry/
https://textbookfull.com/product/proceedings-of-the-
international-conference-on-information-technology-systems-
icits-2018-1st-edition-alvaro-rocha/
Advances in Intelligent Systems and Computing 616
Florentino Fdez-Riverola
Mohd Saberi Mohamad
Miguel Rocha
Juan F. De Paz
Tiago Pinto Editors
11th International
Conference on Practical
Applications of
Computational Biology
& Bioinformatics
Advances in Intelligent Systems and Computing
Volume 616
Series editor
Janusz Kacprzyk, Polish Academy of Sciences, Warsaw, Poland
e-mail: kacprzyk@ibspan.waw.pl
About this Series
The series “Advances in Intelligent Systems and Computing” contains publications on theory,
applications, and design methods of Intelligent Systems and Intelligent Computing. Virtually
all disciplines such as engineering, natural sciences, computer and information science, ICT,
economics, business, e-commerce, environment, healthcare, life science are covered. The list
of topics spans all the areas of modern intelligent systems and computing.
The publications within “Advances in Intelligent Systems and Computing” are primarily
textbooks and proceedings of important conferences, symposia and congresses. They cover
significant recent developments in the field, both of a foundational and applicable character.
An important characteristic feature of the series is the short publication time and world-wide
distribution. This permits a rapid and broad dissemination of research results.
Advisory Board
Chairman
Nikhil R. Pal, Indian Statistical Institute, Kolkata, India
e-mail: nikhil@isical.ac.in
Members
Rafael Bello Perez, Universidad Central “Marta Abreu” de Las Villas, Santa Clara, Cuba
e-mail: rbellop@uclv.edu.cu
Emilio S. Corchado, University of Salamanca, Salamanca, Spain
e-mail: escorchado@usal.es
Hani Hagras, University of Essex, Colchester, UK
e-mail: hani@essex.ac.uk
László T. Kóczy, Széchenyi István University, Győr, Hungary
e-mail: koczy@sze.hu
Vladik Kreinovich, University of Texas at El Paso, El Paso, USA
e-mail: vladik@utep.edu
Chin-Teng Lin, National Chiao Tung University, Hsinchu, Taiwan
e-mail: ctlin@mail.nctu.edu.tw
Jie Lu, University of Technology, Sydney, Australia
e-mail: Jie.Lu@uts.edu.au
Patricia Melin, Tijuana Institute of Technology, Tijuana, Mexico
e-mail: epmelin@hafsamx.org
Nadia Nedjah, State University of Rio de Janeiro, Rio de Janeiro, Brazil
e-mail: nadia@eng.uerj.br
Ngoc Thanh Nguyen, Wroclaw University of Technology, Wroclaw, Poland
e-mail: Ngoc-Thanh.Nguyen@pwr.edu.pl
Jun Wang, The Chinese University of Hong Kong, Shatin, Hong Kong
e-mail: jwang@mae.cuhk.edu.hk
Editors
123
Editors
Florentino Fdez-Riverola Juan F. De Paz
Escuela Superior de Ingeniería Informática Departamento de Informática y Automática
Universidad de Vigo Universidad de Salamanca
Ourense Salamanca
Spain Spain
Miguel Rocha
Department de Informática
Universidade do Minho
Braga
Portugal
v
vi Preface
and highly valuable work and support. Their effort has helped to contribute to the
success of the PACBB’17 event. PACBB’17 would not exist without your
assistance.
General Co-chairs
Program Committee
vii
viii Organization
Organising Committee
xi
xii Contents
1
ESEI - Escuela Superior de Ingeniería Informática, Edificio Politécnico, Universidad de Vigo,
Campus Universitario As Lagoas s/n, 32004 Ourense, Spain
{hlfernandez,dgpena,mrjato,riverola}@uvigo.es
2
CINBIO - Centro de Investigaciones Biomédicas, University of Vigo,
Campus Universitario Lagoas-Marcosende, 36310 Vigo, Spain
3
UCIBIO-REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia,
Universidade Nova de Lisboa, 2829-516 Caparica, Portugal
{jeduardoaraujo,jlcapelom}@bioscopegroup.org
1 Introduction
2D-gel electrophoresis and mass spectrometry using matrix assisted laser desorption
ionization coupled to time of flight analysers, MALDI-TOF-MS, are widely used in
conjunction in order to perform proteome analysis [1, 2]. In brief, while the comparison
of 2D-gels allows obtaining a set of differentially expressed spots, MALDI-TOF-MS
allows to identify the proteins separated in such spots.
Once in the laboratory, the samples were centrifuged, and then the supernatant was
withdrawn, aliquoted and stored at –80 °C until analysis. Most abundant proteins
(MAPs) in plasma can mask or interfere with the detection of proteins belonging to the
low-abundance protein fraction [9]. To avoid this problem, protein equalization from
plasma samples was performed with dithiothreitol, DTT, according to the protocol
described by Warder et al. [10] with minor modifications as described by Fernández
et al. [11] and Araújo et al. [12–14]. This process was performed with five replicates for
each patient. Then, the total protein content was determined using a Bradford protein
assay [15].
Two dimensional gel electrophoresis separation was carried out by duplicate for each
patient and for the healthy pool. Then, 2D-gels obtained for each patient and the pool
of healthy volunteers were compared using the Progenesis SameSpots software v4.0
(NonLinear Dynamics) to find out the differentially expressed proteins. All spots of
interest were excised and subjected to in-gel protein(s) digestion and then to protein
fingerprint identification by mass spectrometry using MALDI-TOF-MS [16]. Finally,
S2P was used to process the spots data (i.e. differentially expressed spots) obtained with
the SameSpots software as well as to analyze them along with the protein identifications
obtained from Mascot.
2.2 Implementation
S2P v1.0.0 is implemented in Java and it was constructed using the AIBench framework
[17], which has been demonstrated to be suitable for rapid development of scientific
applications [18, 19]. The Graphical User Interface (GUI) was constructed in Java Swing
using freely available extensions such as SwingX or GC4S. S2P also makes use of several
well-established open-source libraries such as JFreeChart, charts4j, iText and the
Apache Commons Mathematics library.
The source code of the project is freely available at https://github.com/sing-
group/S2P under a GNU GPL 3.0 License (http://www.gnu.org/copyleft/gpl.html). It is
divided into three modules: (i) core, which contains the default implementation API, (ii)
gui, which contains several reusable GUI components, and (iii) aibench, which contains
a GUI application based on the AIBench framework.
With the goal of showing the main features of S2P as well as its usefulness to analyse
real data, this section shows how it has been used to process and analyse the case study
data presented.
Figure 1 illustrates the five main steps where S2P was used through the experiments:
(1) to merge the SameSpots report into a single table where all samples can be compared;
(2) to design the MALDI plate; (3) to load and filter the Mascot identifications; (4) to
link the Mascot identifications with their corresponding spots using the MALDI plate;
and (5) to examine and analyse spots data along with Mascot identifications. All data
4 H. López-Fernández et al.
The case study dataset was composed by 7 anonymous patients diagnosed with
bladder cancer, 7 anonymous patients diagnosed with lower urinary tract symptoms
(LUTS) and 6 healthy individuals that were pooled. The Progenesis SameSpots software
was used to compare the 2D-gels corresponding to each individual against the health
pool’s 2D-gels to obtain the differentially expressed spots. These results were exported
using the “Export report” option of SameSpots, which creates one HTML file per
comparison (i.e. 14 files in this case). S2P was then used to parse and merge these reports
into a single table with samples in columns and spots in rows (Step 1 of Fig. 1). This
table was exported into a comma-separated values (CSV) file that can be easily reopen
with S2P as well as external applications such as Excel, LibreOffice or R.
Then, these differentially expressed spots were first treated and then analysed
through MALDI-TOF MS in order to identify their protein content. To do that, a dedi‐
cated sample treatment is done [16] and the pool of peptides obtained is spotted twice
into a MALDI plate, which is then introduced into the MALDI apparatus for analysis.
Usually, researchers fill a sheet with the position of the spots in the plate so that they
can trace back where each spot was placed. This is important to know which spot is
associated to each MALDI spectrum and, therefore, to know which Mascot identifica‐
tions are associated to each spot. However, keeping a unique handwritten copy of this
key information is risky as it can be lost or mislead and, most likely, it will be no way
S2P: A Desktop Application for Fast and Easy Processing 5
to recover this information. For these two reasons, S2P incorporates a MALDI plate
editor that allows the storage of digital copies of experiments’ plates as well as print
them into PDF files (Step 2 of Fig. 1). S2P also allows to automatically filling the plate
using a set of previously loaded spots (Step 1 of Fig. 1), allowing the user to define
parameters such as matrix dimensions (i.e. number of rows and columns) or the number
of replicates of each spot. In our case study, S2P was used to create the MALDI plate
and to obtain a printed copy of it that is used to guide the experimental work.
Once the MALDI-TOF MS analysis was done, the MALDI-based spectra of the
digested protein(s) were submitted to Mascot in order to identify the proteins. Then,
they were exported into a HTML file that was loaded into S2P in order to remove dupli‐
cated entries and exclude identifications with a Mascot score under 56 (Step 3 of Fig. 1).
This processed list of Mascot identifications was exported into a CSV file so that it can
be directly loaded into S2P later or used in other applications (e.g. Excel). Then, these
Mascot identifications integrated with the spots data using the MALDI plate (Step 4 of
Fig. 1) to know which identifications are associated with each spot.
Finally (Step 5 of Fig. 1), S2P allows an integrated analysis of the spots data and the
Mascot identifications (Fig. 2). In the context of our case study, this option was firstly
used to try to identify potential biomarkers of the two conditions of interest. When the
healthy pool was compared with the bladder cancer patients, four differentially expressed
spots present in at least 5 of 7 bladder cancer patients (Fig. 3A) were found. The corre‐
sponding proteins were: (i) Serum albumin (Spot Number [SN] = 137), (ii) Gelsolin
(SN = 137), (iii) Fibrinogen gamma chain (SN = 337), (iv) Ig alpha-1 chain C region
(SN = 360), (v) Ig alpha-2 chain C region (SN = 360) and (vi) Haptoglobin (SN = 266).
When the healthy pool was compared with the LUTS patients, we found five differen‐
tially expressed spots that were present in at least 4 of 7 LUTS patients (Fig. 3B). The
associated proteins were the following: (i) CD5 antigen-like (SN = 244), (ii) Heparin
cofactor 2 (SN = 175 and SN = 190), (iii) Hemopexin (SN = 175), (iv) Serum albumin
(SN = 192 and SN = 190) and (v) Inter-alpha-trypsin inhibitor heavy chain H4 (SN = 88).
As it can be seen in Fig. 3, a small set of candidate biomarkers was identified that
can be associated to each disease. Due to this reason, a complementary approach was
experimented: exporting all spots data from Samespots instead of exporting only those
spots that were differentially expressed when each individual and the healthy pool were
compared. This way, we used S2P to process these new dataset (analogously to step 1)
and then to find spots whose average value were statistically different between bladder
cancer and LUTS patients. Following this strategy, 40 differentially expressed spots (i.e.
having t-test p-values corrected using Benjamini-Hochberg less than 0.05) between
bladder cancer and LUTS were found, 27 of which have protein identifications associ‐
ated (corresponding to 14 unique proteins). This also allowed us to compare the distri‐
bution of the expression values of each condition using box plots. For instance, Fig. 4
shows the box plots of the two spots identified in Fig. 3 that are differentially expressed
between bladder cancer and LUTS patients. This information must be carefully
analysed, but the usefulness of S2P to fast and accurate process and analyse data is thus
proven.
Finally, it is important to remark that doing the steps described above manually took
more than two weeks of handling. Now, with the help of S2P this data processing time
has been dramatically reduced to a few minutes. Moreover, S2P offers the additional
data analysis features shown that also allow researchers saving a lot of their valuable
time.
4 Conclusions
Acknowledgements. This work has been partially funded by (i) the “Platform of integration of
intelligent techniques for analysis of biomedical information” project (TIN2013-47153-C3-3-R)
from Spanish Ministry of Economy and Competitiveness, (ii) the “Discovery of biomarkers for
bladder carcinoma diagnosis” project from Nova Medical School, (iii) Unidade de Ciências
Biomoleculares Aplicadas-UCIBIO, which is financed by national funds from FCT/MEC/
Portugal (UID/Multi/04378/2013), and (iv) Consellería de Cultura, Educación e Ordenación
Universitaria (Xunta de Galicia) and FEDER (European Union). H. López-Fernández is supported
by a post-doctoral fellowship from Xunta de Galicia. J. L. Capelo acknowledges Associação
Cientifica ProteoMass for financial support. J. E. Araújo acknowledges the financial support given
by the Portuguese Foundation for Science and Technology under doctoral grant number
SFRH/BD/109201/2015. SING group thanks CITI (Centro de Investigación, Transferencia e
Innovación) from University of Vigo for hosting its IT infrastructure.
References
1. Susnea, I., Bernevic, B., Wicke, M., Ma, L., Liu, S., Schellander, K., Przybylski, M.:
Application of MALDI-TOF-Mass spectrometry to proteome analysis using stain-free gel
electrophoresis. In: Cai, Z., Liu, S. (eds.) Applications of MALDI-TOF Spectroscopy, pp. 37–
54. Springer, Heidelberg (2012)
2. Martinez, J.L.C., Espiño, C.L., Santos, H.M.: Mass spectrometry-based proteomics: what is
it expecting ahead? J. Proteomics 145, 1–2 (2016)
3. Nagalla, S.R., Canick, J.A., Jacob, T., Schneider, K.A., Reddy, A.P., Thomas, A., Dasari, S.,
Lu, X., Lapidus, J.A., Lambert-Messerlian, G.M., Gravett, M.G., Roberts, C.T., Luthy, D.,
Malone, F.D., D’Alton, M.E.: Proteomic analysis of maternal serum in down syndrome:
identification of novel protein biomarkers. J. Proteome Res. 6, 1245–1257 (2007)
4. Thongboonkerd, V., Mcleish, K.R., Arthur, J.M., Klein, J.B.: Proteomic analysis of normal
human urinary proteins isolated by acetone precipitation or ultracentrifugation. Kidney Int.
62, 1461–1469 (2002)
5. Hsueh, C.-T., Liu, D., Wang, H.: Novel biomarkers for diagnosis, prognosis, targeted therapy
and clinical trials. Biomark. Res. 1, 1 (2013)
6. Baker, M.: Reproducibility: Seek out stronger science. Nature 537, 703–704 (2016)
8 H. López-Fernández et al.
7. Zeeberg, B.R., Riss, J., Kane, D.W., Bussey, K.J., Uchio, E., Linehan, W.M., Barrett, J.C.,
Weinstein, J.N.: Mistaken Identifiers: Gene name errors can be introduced inadvertently when
using Excel in bioinformatics. BMC Bioinformatics 5, 80 (2004)
8. Ziemann, M., Eren, Y., El-Osta, A.: Gene name errors are widespread in the scientific
literature. Genome Biol. 17 (2016)
9. Anderson, N.L.: The Human Plasma Proteome: History, Character, and Diagnostic Prospects.
Mol. Cell. Proteomics 1, 845–867 (2002)
10. Warder, S.E., Tucker, L.A., Strelitzer, T.J., McKeegan, E.M., Meuth, J.L., Jung, P.M., Saraf,
A., Singh, B., Lai-Zhang, J., Gagne, G., Rogers, J.C.: Reducing agent-mediated precipitation
of high-abundance plasma proteins. Anal. Biochem. 387, 184–193 (2009)
11. Fernández, C., Santos, H.M., Ruíz-Romero, C., Blanco, F.J., Capelo-Martínez, J.-L.: A
comparison of depletion versus equalization for reducing high-abundance proteins in human
serum. Electrophoresis 32, 2966–2974 (2011)
12. Araújo, J.E., Santos, T., Jorge, S., Pereira, T.M., Reboiro-Jato, M., Pavón, R., Magriço, R.,
Teixeira-Costa, F., Ramos, A., Santos, H.M.: Matrix-assisted laser desorption/ionization time-
of-flight mass spectrometry-based profiling as a step forward in the characterization of
peritoneal dialysis effluent. Anal. Methods 7, 7467–7473 (2015)
13. Araújo, J.E., Jorge, S.: Teixeira e Costa, F., Ramos, A., Lodeiro, C., Santos, H.M., Capelo,
J.L.: A cost-effective method to get insight into the peritoneal dialysate effluent proteome. J.
Proteomics 145, 207–213 (2016)
14. Araújo, J.E., Jorge, S., Magriço, R., Costa, T.E, Ramos, A., Reboiro-Jato, M., Fdez-Riverola,
F., Lodeiro, C., Capelo, J.L., Santos, H.M.: Classifying patients in peritoneal dialysis by mass
spectrometry-based profiling. Talanta 152, 364–370 (2016)
15. Bradford, M.M.: A rapid and sensitive method for the quantitation of microgram quantities
of protein utilizing the principle of protein-dye binding. Anal. Biochem. 72, 248–254 (1976)
16. Oliveira, E., Araújo, J.E., Gómez-Meire, S., Lodeiro, C., Perez-Melon, C., Iglesias-Lamas,
E., Otero-Glez, A., Capelo, J.L., Santos, H.M.: Proteomics analysis of the peritoneal dialysate
effluent reveals the presence of calcium-regulation proteins and acute inflammatory response.
Clin. Proteomics 11, 17 (2014)
17. Glez-Peña, D., Reboiro-Jato, M., Maia, P., Rocha, M., Díaz, F., Fdez-Riverola, F.: AIBench:
a rapid application development framework for translational research in biomedicine.
Comput. Methods Programs Biomed. 98, 191–203 (2010)
18. López-Fernández, H., Reboiro-Jato, M., Glez-Peña, D., Méndez Reboredo, J.R., Santos, H.M.,
Carreira, R.J., Capelo-Martínez, J.L., Fdez-Riverola, F.: Rapid development of Proteomic
applications with the AIBench framework. J. Integr. Bioinforma. 8, 171 (2011)
19. Reboiro-Jato, M., Glez-Peña, D., Méndez-Reboredo, J.R., Santos, H.M., Carreira, R.J.,
Capelo, J.L., Fdez-Riverola, F.: Building proteomics applications with the aibench application
framework (2011)
Multi-Enzyme Pathway Optimisation
Through Star-Shaped Reachable Sets
1 Introduction
1.1 Multi-Enzyme Pathways
In this paper, we consider a set of chemical reactions catalysed by several
enzymes. Such reactions take place inside cells and are also used in synthetic
biology, e.g. in manufacturing of chemical compounds, biodegradation, medi-
cine, etc. Currently, there are large databases of enzymes based on which path-
ways can be constructed to turn given substrates into desired products [1]. The
enzyme kinetic optimisation of these processes is high on the agenda as it may
lead to a substantial economy of time and consumables. Such optimisation may
This research was supported by the National Sustainability Programme of the Czech
Ministry of Education, Youth and Sports (LO1214) and the RECETOX research
infrastructure (LM2011028).
Electronic supplementary material The online version of this chapter (doi:10.
1007/978-3-319-60816-7 2) contains supplementary material, which is available to
authorized users.
c Springer International Publishing AG 2017
F. Fdez-Riverola et al. (eds.), 11th International Conference on Practical
Applications of Computational Biology & Bioinformatics, Advances in Intelligent
Systems and Computing 616, DOI 10.1007/978-3-319-60816-7 2
10 S. Mazurenko et al.
also provide insights into the evolution of cells since some studies suggest that
optimal pathways are evolutionarily advantageous and can be predicted based
on the genetic information of living cells [2].
We consider an n-step chemical reaction in which the state variables are
the concentrations of metabolites produced and consumed in the course of the
reaction:
E0 E2 En−2 En−1
S −−→ M1 −−→ . . . −−−→ Mn−2 −−−→ P.
The control here are the concentrations of enzymes Ei , the sum of which is
limited from above. We will prove that under some general assumptions about
the rate equations, one can expect the set of all the possible states of such
systems to be star-shaped at any point in time. As a result, an optimisation of
the pathway using star-shaped reachable sets [3] can be implemented to obtain
the maximum concentration of the final product and the corresponding optimal
profile of enzymes.
which indicates that at any moment in time the total enzyme concentration
must not exceed a certain predefined value Emax . This limitation, for example,
Multi-Enzyme Pathway Optimisation Through Star-Shaped Reachable Sets 11
rε (l, 0) = ε → +0.
(here for i = 0 symbols ∂s r/∂li and li should be omitted).
As far as initial condition (B) is concerned, we will replace the coordinate x1
with x∗1 = x1 − 1. If in addition to the above we require that fi is non-negative
and non-decreasing in x1 , the following holds:
Corollary 1. Suppose for (1) the initial concentration x1 (0) = 1, xi (0) = 0 for
i = 2..n, and f0 ≡ 0. Moreover, suppose that in addition to the requirements of
Proposition 1 on fi , the fi that depend on x1 are non-negative and non-decreasing
in x1 . Then for (1) with the new coordinate x∗1 = x1 − 1 Proposition 1 holds.
2.3 Examples
Here we will list the examples of (1) relevant to the enzyme kinetics, for which
Proposition 1 holds:
All the above functions may be present in any combination, thereby providing
a significant flexibility for the model selection.
Moreover, the same enzyme can be used in different steps if the following
additional requirement holds: for any enzyme e used in several reactions the
value λfi (x )/fi (λx ) is independent of i for the respective i s. This will be the
case, e.g. in Michaelis-Menten kinetics since the free enzyme, and consequently,
the denominator of fi , will be the same across the respective i s. Reversible
reactions are also covered. In other cases when the star-shapedness cannot be
guaranteed, one may still use general reachable set methods [13], albeit forgoing
the advantage of the reduced dimensionality.
We will now proceed to several examples.
Example 1. The first example is a three-metabolite scheme with a constant sup-
ply of substrate zero, and it demonstrates the standard bang-bang optimal profile
[2,9,10] (Fig. 1):
This switching between the two regimes stems from the intuitive fact that the
rate of the reaction is increasing with the increase in x1 . As a result, the optimal
strategy is to accumulate x1 first and then to switch to production of x2 .
Now, the simple accumulation of x1 will not yield an optimal solution; due to
the inhibition, the reaction rate would decrease for large values of x1 . Hence, e1
should be switched on earlier and not to its maximal value as can be seen from
the optimal control synthesis in Fig. 2.
Another random document with
no related content on Scribd:
back
back
back
back
back
back
back
back
back
back
back
back
back
back
back
back
back
back
back
back