Jonathan Wijaya
Keywords: Wastewater treatment, Sequencing batch reactor (SBR), Machine learning, Artificial intelligence,
Microbiome, 16S rRNA gene
2 Materials/Methods
In this study, two WWTPs with different operational characteristics (A2O and SBR) were used. Monthly samples were taken from the aerobic tanks of both WWTPs from February 2018 until July 2019. Triplicate samples from each process (anaerobic and aerobic) and from the influent of the SBR were also taken in February 2018. The DNA was extracted and assessed for quality and quantity according to the manufacturer's standards. Polymerase chain reaction (PCR) amplification of the 16S rRNA genes and sequencing on a MiSeq sequencer were conducted on the DNA samples. The raw paired-end sequencing results were pre-processed using a bioinformatics tool (MOTHUR) to check sequence quality and remove chimeric sequences. The sequences were aligned to the taxonomy and clustered into operational taxonomic units (OTUs) at a > 97% nucleotide identity cut-off. MOTHUR generates a table with the OTU numbers, taxonomy, and sequence counts in each sample. Classification and regression analyses using six different machine learning (ML) models were constructed to show the prediction performance for WW operational characteristics (A2O or SBR) and an environmental factor (water temperature). In addition to the machine learning analysis,

Figure 1: Alpha and beta diversity of the two types of WWTPs.

Figure 1A shows that the A2O and SBR processes can be differentiated by the ordination analysis (NMDS). The stress value of < 0.3 shows that the NMDS plot adequately represents the ordination. The A2O and SBR processes are also significantly different (P < 0.05) according to PERMANOVA. Figure 1B-E shows the alpha diversity indices (Chao, ACE, Shannon, and Inverse Simpson). Asterisks indicate significant differences between groups. The SBR process has lower richness but higher diversity.

Figure 2: ML model classification performance
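As a rough illustration of three of the alpha diversity indices named for Figure 1B-E, the sketch below computes Chao1 (richness), Shannon, and Inverse Simpson (diversity) from a list of per-OTU read counts for one sample. It uses the common textbook formulas; the exact estimators applied in the study's MOTHUR workflow may differ (for example, through bias corrections), and ACE is omitted for brevity. The function names and the two example samples are hypothetical and not taken from the study.

```python
import math


def shannon(counts):
    """Shannon index H' = -sum(p_i * ln(p_i)) over OTUs with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def inverse_simpson(counts):
    """Inverse Simpson index 1 / sum(p_i^2), using plain relative abundances."""
    total = sum(counts)
    return 1.0 / sum((c / total) ** 2 for c in counts if c > 0)


def chao1(counts):
    """Bias-corrected Chao1 richness: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)).

    F1 and F2 are the numbers of OTUs observed exactly once and exactly twice.
    """
    s_obs = sum(1 for c in counts if c > 0)
    f1 = sum(1 for c in counts if c == 1)
    f2 = sum(1 for c in counts if c == 2)
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


# Hypothetical samples (illustrative counts only, not study data):
# the first has more OTUs but is dominated by one of them,
# the second has fewer OTUs with more even abundances.
uneven_sample = [900, 40, 30, 10, 5, 5, 4, 3, 2, 1]
even_sample = [200, 180, 170, 160, 150, 140]

for name, sample in [("uneven", uneven_sample), ("even", even_sample)]:
    print(name, chao1(sample), shannon(sample), inverse_simpson(sample))
```

A community can therefore score lower on a richness estimator such as Chao1 while scoring higher on Shannon or Inverse Simpson, because the latter two also reward evenness.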
The core microbiome for ML modelling was chosen based on a Pareto law chart, keeping OTUs with an occurrence of more than 80% and a relative abundance of more than 1%. Figure 2 shows the results of the ML models in classifying the A2O and SBR processes. Four models (SVMRBF, SVML, LR, and RF) achieved high prediction accuracy, with accuracies ranging from 93% to 96% and areas under the curve (AUC) from 0.98 to 0.99. The confusion matrices in Figure 2C-F show the detailed predictions of each model. The highest prediction performance was achieved by SVMRBF and SVML, both of which are support vector machine algorithms.
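The core-microbiome filter and the reported evaluation measures can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the study's pipeline: "occurrence" is taken to mean the fraction of samples in which an OTU has a non-zero count, "relative abundance" is taken as the mean across samples, and the classifiers themselves (SVM, LR, RF) are not reproduced. All function names, the OTU table, and the labels and scores are hypothetical.

```python
def core_otus(table, min_occurrence=0.80, min_rel_abundance=0.01):
    """Return OTU ids passing both thresholds.

    table maps an OTU id to a list of read counts, one per sample.
    Assumes every sample has at least one read.
    """
    otus = list(table)
    n_samples = len(table[otus[0]])
    totals = [sum(table[o][j] for o in otus) for j in range(n_samples)]
    core = []
    for o in otus:
        occurrence = sum(1 for c in table[o] if c > 0) / n_samples
        mean_rel = sum(table[o][j] / totals[j] for j in range(n_samples)) / n_samples
        if occurrence > min_occurrence and mean_rel > min_rel_abundance:
            core.append(o)
    return core


def confusion_counts(y_true, y_pred, positive):
    """Counts of true/false positives and negatives for a two-class problem."""
    pairs = list(zip(y_true, y_pred))
    return {
        "tp": sum(1 for t, p in pairs if t == positive and p == positive),
        "fn": sum(1 for t, p in pairs if t == positive and p != positive),
        "fp": sum(1 for t, p in pairs if t != positive and p == positive),
        "tn": sum(1 for t, p in pairs if t != positive and p != positive),
    }


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted class matches the true class."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


def auc(y_true, scores, positive):
    """AUC as the probability that a positive sample outscores a negative one.

    Ties between a positive and a negative score count as one half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical OTU table with five samples (illustrative counts only).
example_table = {
    "Otu1": [500, 400, 600, 550, 450],  # frequent and abundant
    "Otu2": [5, 0, 0, 3, 0],            # found in only 2 of 5 samples
    "Otu3": [2, 3, 1, 2, 4],            # frequent but rare
    "Otu4": [493, 597, 399, 445, 546],  # frequent and abundant
}
print(core_otus(example_table))

# Hypothetical true labels, predicted labels, and SBR scores for four samples.
truth = ["A2O", "A2O", "SBR", "SBR"]
predicted = ["A2O", "SBR", "SBR", "SBR"]
sbr_scores = [0.1, 0.8, 0.7, 0.9]
print(confusion_counts(truth, predicted, positive="SBR"))
print(accuracy(truth, predicted), auc(truth, sbr_scores, positive="SBR"))
```

Accuracy depends on the decision threshold behind the predicted labels, whereas AUC is computed from the ranking of the scores alone, which is why a model can have an AUC close to 1 with a somewhat lower accuracy.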