Professional Documents
Culture Documents
A R T I C L E I N F O A B S T R A C T
Keywords: Modern computer-assisted synthesis offers solid support for easy virtual screening of easily synthesizable com
Synthetic accessibility pounds. By using the synthetic accessibility score this restriction may be bypassed. In the present work, a
Computer-assisted synthesis planning chemical space of organic semi-conductors is generated. Synthetic accessibility of generated organic semi-
Organic solar cells
conductors is studied using Synthetic accessibility (SA) score and Synthetic Bayesian classifier (SYBA) score.
Machine learning
These methods are very fast. A significance difference is observed between results obtained from two groups. The
distribution of predicted scores using two methods is also different. More research is required in this field.
Generated organic semi-conductors are screened on the basis of calculated scores. This study is indicating the
potential of synthetic accessibility scores in the virtual screening. These faster methods are ideal for large scale
screening.
* Corresponding author.
E-mail address: ahmadchemist@hotmail.com (F. Ahmad).
https://doi.org/10.1016/j.cplett.2023.140852
Received 27 July 2023; Received in revised form 7 September 2023; Accepted 24 September 2023
Available online 25 September 2023
0009-2614/© 2023 Elsevier B.V. All rights reserved.
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
and absorbing electrons are affected by energy levels. Their backbones have single and multiple bonds, resulting in a continual
Semiconducting polymers comprised of extended chains (macro network of overlapping π-orbitals. On excitation, charge carriers (elec
molecules) are made of repeating units called monomers. These poly trons and holes) can migrate in this delocalized π-electron system.
mers feature a conjugated π-electron system extending along their Despite the fact that semiconducting polymers typically have less
backbone, which makes them relatively good conductors of electricity. mobility than inorganic semiconductors, improvements in materials and
2
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
2. Computational methods
3
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
less parameter. research [23,24]. Organic semi-conductors are the origin of evolving
optoelectronic technologies for example plastic solar cells and flexible
2.3. Similarity analysis transistors [25]. Polymeric materials are also stimulating because of the
statistical depiction of the typical number of reiterating units [26,27].
The clustering and heat map is used to performed the chemical Device efficiency, though important for functional applications, poorly
similarity analysis to understand structural behavior. Extended Con imitate the realization of molecular scheme. The variety of synthetic
nectivity Fingerprints of diameter 4 (ECFP4) is used for this purpose organic chemistry facilitates the ingredients to be tweaked such that
[18]. In first step, fingerprints are generated from chemical structures. In they can be integrated into numerous applications [28–30]. The func
second step, similarity is represented using a similarity coefficient, its tioning of organic semi-conductors can be altered by refining their
values ranging from 0 (i.e., molecules are considered as totally different) compositional components. Judicious molecular design helps to regu
to 1 (i.e., molecules are considered as identical). In third step, results are late frontier molecular orbital energy levels and influence performance
expressed using cluster and heatmap. of organic semi-conductors in solar cell devices [31–33]. Molecular
design plays a vital function in rising organic semi-conductors for
3. Results and discussion electronic devices in existing and incipient tools [34,35]. By directing
the ideal molecular design of organic semi-conductors, we can expand
3.1. Designing of organic semi-conductors their functioning and generate new multipurpose materials in countless
arenas [36,37]. We have selected electron-rich and electron-deficient
Organic semi-conductors are gradually being used as the functional groups to design organic semi-conductors. Chemical structures of
coating in extensive series of latest technologies [19–22]. The designing these groups are given in Fig. 1. A variety of building of blocks are
of organic semi-conductors is an exciting and rapidly evolving field of selected to design the organic semi-conductors of all types. These
4
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
building blocks are famous and easily synthesizable. hard-to-synthesize compounds. While the Synthetic Bayesian accessi
bility (SYBA) works on Bernoulli naïve Bayes classifier. In SYBA classi
fier, a model was trained of already synthesized molecules (easy to
3.2. Synthetic accessibility synthesize) and hypothetical molecules. Easy to synthesized molecules
was randomly grouped from the ZINC database [40] and the hypo
There are methods to study the synthetic accessibility. The accuracy thetical group of molecules was created from an easy-to-synthesize by
of these methods cannot be very high. Therefore, we have selected two Nonpher tool [41] by continuously adding/removing of atom or bond
methods. One is synthetic accessibility score (SAScore)[38] and other is until certain design and complexity threshold. Molecules with negative
Synthetic Bayesian Accessibility (SYBA) [17]. SA Score is rapid tech SYBA score are considered difficult to synthesize and molecules with
nique to learn the synthetic accessibility assessment, through the anal positive SYBA score are considered easy to synthesize. The distribution
ysis of ECFP4 [46] components acquired from one million compounds of calculated SA score is given in Fig. 2. A significant majority of organic
casually selected from the PubChem database [39]. The rationale of semi-conductors have shown high SA score. The distribution of SYBA
measuring the SA score based on the occurrence of molecular compo score is given in Fig. 3. Here values are more evenly distributed. There is
nent in the PubChem database or any database, it has the confirmed significant difference between the behavior of two methods.
contribution in controlling the specific functionality with precision fol
lowed by the specific synthetic route. In this, every molecular compo
nent was assigned a numerical value with frequent components 3.3. Chemical space visualization
constitute positive scores and non/less-frequent components have
negative value. Lesser the SA values, higher the synthetic possibility Chemical space visualization is very important in designing mole
(easy to make) of the particular molecule and vice versa. However, we cules and t-distributed stochastic neighbor embedding (t-SNE) is one of
should bear in mind 6.0 is considered [38] threshold to classify easy- and the best methods [42–44]. t-SNE works by reducing the dimensionality
5
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
6
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
which is necessary for the visualizing high-dimensional chemical space. SYBA score is shown in Fig. 5. SYBA scores are also dispersed in nature.
Fig. 4 shows t-SNE plot showing SA score. Each dot on a t-SNE plot
characterizes a molecule. However, similarity with other molecules
3.4. Screening of organic semi-conductors
indicated by the location of a molecules on t-SNE plot. The distance
between similar molecules is placed together while dissimilar molecules
The chanced availability of large number of organic compounds
are placed farther apart. We can see multiple groups in t-SNE plot.
makes it difficult to synthesized and fabricated all the compounds for
Usually, organic semi-conductors have high SA values. t-SNE plot for
photovoltaic applications [45–47]. We have screened the generated
7
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
organic semi-conductors on the basis of synthetic accessibility scores. As approach will decrease the chances of error. The chemical structures of
we have calculated types of scores. We have selected the organic semi- top 30 organic semi-conductors are given in Figs. 6-8. SA scores and
conductors that are easy to synthesize according to both scores. This SYBA scores are given in Table 1.
8
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
Table 1
Synthetic accessibility score of selected organic semiconductor molecules 1–30.
Name SA Score SYBA Score
1 1 31.73
2 1 31.73
3 1 21.15
4 1 21.15
5 1.73 17.2
6 1.73 17.2
7 1.73 17.2
8 1.73 17.2
9 1.73 17.2
10 1.73 17.2
11 1.78 30.08
12 2.11 25.4
13 2.11 25.4
14 2.11 25.4
15 2.11 25.4
16 2.11 24.02
17 2.31 32.03
18 2.58 27.77
19 2.58 27.77
20 2.58 26.39
21 2.59 5.28
22 2.62 14.79
23 2.62 14.1
24 2.71 5.84
25 2.79 34.71
26 2.79 34.71
27 2.81 29.48
28 3 22.32
29 3 22.32
30 3 22.32
4. Conclusion
In this study, we explore the easily synthesizable OSC molecules by Fig. 9. Clustering of organic semi-conductors on the basis of similarity.
exploring chemical space using two theoretical methods such as SA score
and SYBA score. The results indicate that SA and SYBA are the effective
the time and resources required for experimental synthesis. Our findings
strategies to screen generated organic semi-conductors and prioritize on
stimulate the further research in exploring the computational tactics for
the basis of their higher SA of virtually generated molecules. This
discovery of novel OSCs and other molecules with tailored properties.
screening procedure benefits in narrowing down to the molecules that
are beyond prospective to be synthesized successfully, thereby reducing
9
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
Beriham Basha: Methodology, Investigation, Writing – review & [1] A. Mahmood, Triphenylamine based dyes for dye sensitized solar cells: A review,
Sol. Energy 123 (2016) 127–144.
editing, Funding acquisition. Farooq Ahmad: Software, Visualization, [2] H.-R. Bai, Q. An, M. Jiang, H.S. Ryu, J. Yang, X.-J. Zhou, H.-F. Zhi, C. Yang, X. Li,
Writing – review & editing, Supervision. Misbah Sehar Abbasi: Writing H.Y. Woo, J.-L. Wang, Isogenous Asymmetric-Symmetric Acceptors Enable
– original draft, Software, Formal analysis, Writing – review & editing. Efficient Ternary Organic Solar Cells with Thin and 300 nm Thick Active Layers
Simultaneously, Adv. Funct. Mater. 32 (2022) 2200807.
Tayyaba Najam: Software, Visualization, Writing – review & editing. [3] A. Mahmood, J.-Y. Hu, B. Xiao, A. Tang, X. Wang, E. Zhou, Recent progress in
Syed Shoaib Ahmad Shah: Software, Formal analysis, Writing – review porphyrin-based materials for organic solar cells, J. Mater. Chem. A 6 (2018)
& editing. Z.A. Alrowaili: Writing – review & editing. M.S. Al-Buriahi: 16769–16797.
[4] M.R.S.A. Janjua, Quantum chemical design and prediction that complements
Writing – review & editing. understanding: How do the transition metals enhance the CO2 sensing ability of
inorganic Mg12O12 nanoclusters? J. Phys. Chem. Solid 167 (2022), 110789.
Declaration of Competing Interest [5] M.H. Tahir, T. Mubashir, T.-U.-H. Shah, A. Mahmood, Impact of electron-
withdrawing and electron-donating substituents on the electrochemical and charge
transport properties of indacenodithiophene-based small molecule acceptors for
The authors declare that they have no known competing financial organic solar cells, J. Phys. Org. Chem. 32 (2019) e3909.
interests or personal relationships that could have appeared to influence [6] Q. Nie, A. Tang, Q. Guo, E. Zhou, Benzothiadiazole-based non-fullerene acceptors,
the work reported in this paper. Nano Energy 87 (2021), 106174.
[7] A. Mahmood, A. Irfan, J.-L. Wang, Molecular level understanding of the chalcogen
atom effect on chalcogen-based polymers through electrostatic potential, non-
Data availability covalent interactions, excited state behaviour, and radial distribution function,
Polym. Chem. 13 (2022) 5993–6001.
[8] A. Mahmood, A. Irfan, J.-L. Wang, Machine Learning for Organic Photovoltaic
Data will be made available on request. Polymers: A Minireview, Chin. J. Polym. Sci. 40 (2022) 870–876.
[9] R.S. Bohacek, C. McMartin, W.C. Guida, The art and practice of structure-based
Acknowledgements drug design: a molecular modeling perspective, Med. Res. Rev. 16 (1996) 3–50.
[10] P. Schneider, G. Schneider, De Novo Design at the Edge of Chaos, J. Med. Chem. 59
(2016) 4077–4086.
The authors express their gratitude to Princess Nourah bint Abdul [11] P.S. Kutchukian, E.I. Shakhnovich, De novo design: balancing novelty and confined
rahman University Researchers Supporting Project number chemical space, Expert Opin. Drug Discov. 5 (2010) 789–812.
[12] S. Hussain, Z. Zahid, M. Shahid, M.A. Abid, Synthesis, Spectroscopic and
(PNURSP2023R326), Princess Nourah bint Abdulrahman University, Antimicrobial Studies of Homo- and Heteronuclear Tin(IV)/Pd(II) Complexes of 2-
Riyadh, Saudi Arabia. This work was supported by TianChi Talent Amino-6-(Dithiocarboxyamino)Hexanoic Acid, Arab. J. Sci. Eng. 44 (2019)
funding of Xinjiang Uygur Autonomous Region, Urumqi, China. 6423–6434.
10
B. Basha et al. Chemical Physics Letters 831 (2023) 140852
[13] A. Mahmood, Y. Sandali, J.-L. Wang, Easy and fast prediction of green solvents for [32] M.R.S.A. Janjua, S. Jamil, A. Mahmood, A. Zafar, M. Haroon, H.N. Bhatti, Solvent-
small molecule donor-based organic solar cells through machine learning, PCCP 25 Dependent Non-Linear Optical Properties of 5,5′-Disubstituted-2,2′-bipyridine
(2023) 10417–10426. Complexes of Ruthenium(<span class=“sc-ex”>ii</span>): A Quantum Chemical
[14] A. Mahmood, A. Irfan, J.-L. Wang, Machine learning and molecular dynamics Perspective, Aust. J. Chem. 68 (2015) 1502–1507.
simulation-assisted evolutionary design and discovery pipeline to screen efficient [33] H.-R. Bai, Q. An, H.-F. Zhi, M. Jiang, A. Mahmood, L. Yan, M.-Q. Liu, Y.-Q. Liu,
small molecule acceptors for PTB7-Th-based organic solar cells with over 15% Y. Wang, J.-L. Wang, A Random Terpolymer Donor with Similar Monomers Enables
efficiency, J. Mater. Chem. A 10 (2022) 4170–4180. 18.28% Efficiency Binary Organic Solar Cells with Well Polymer Batch
[15] G. Skoraczyński, M. Kitlas, B. Miasojedow, A. Gambin, Critical assessment of Reproducibility, ACS Energy Lett. (2022) 3045–3057.
synthetic accessibility scores in computer-assisted synthesis planning, [34] S.-U.-D. Khan, A. Mahmood, U.A. Rana, S. Haider, Utilization of electron-deficient
J. Cheminformatics 15 (2023) 6. thiadiazole derivatives as π-spacer for the red shifting of absorption maxima of
[16] G. Landrum, RDKit: Open-source cheminformatics. http://www.rdkit.org. diarylamine-fluorene based dyes, Theor. Chem. Acc. 134 (2014) 1596.
[17] M. Voršilák, M. Kolář, I. Čmelo, D. Svozil, SYBA: Bayesian estimation of synthetic [35] M.R.S.A. Janjua, Impact of symmetry breaking on the performance of non-fullerene
accessibility of organic compounds, J. Cheminformatics 12 (2020) 35. acceptors (NFAs) for photo and thermally stable organic solar cells (OSCs): A DFT-
[18] D. Rogers, M. Hahn, Extended-Connectivity Fingerprints, J. Chem. Inf. Model. 50 based interrogation and investigation, J. Photochem. Photobiol. A Chem. 444
(2010) 742–754. (2023), 115003.
[19] Z. Dai, Q. Guo, Y. Ding, Z. Wang, N. Jiang, E. Zhou, Constructing D-π-A Type [36] M.R.S.A. Janjua, A. Mahmood, F. Ahmad, Solvent effects on nonlinear optical
Polymers as Dopant-Free Hole Transport Materials for High-Performance CsPbI2Br response of certain tetrammineruthenium(II) complexes of modified 1,10-
Perovskite Solar Cells, ACS Appl. Mater. Interfaces 15 (2023) 9784–9791. phenanthrolines, Can. J. Chem. 91 (2013) 1303–1309.
[20] T. Dai, A. Tang, Z. He, M. Du, P. Lei, Q. Zeng, Z. Wang, Y. Wang, S. Lu, Y. Zhong, [37] S. Hussain, S. Ali, S. Shahzadi, M.N. Tahir, S. Ramzan, M. Shahid, Synthesis,
E. Zhou, Modulating intermolecular interactions by collaborative material design spectroscopic characterization, X-ray crystal structure and biological activities of
to realize THF-processed organic photovoltaic with 1.3 V open-circuit voltage, homo- and heterobimetallic complexes with potassium-1-
Energ. Environ. Sci. 16 (2023) 2199–2211. dithiocarboxylatopiperidine-4-carboxylate, Polyhedron 119 (2016) 483–493.
[21] M.I. Abdullah, M.R.S.A. Janjua, M.F. Nazar, A. Mahmood, Quantum Chemical [38] P. Ertl, A. Schuffenhauer, Estimation of synthetic accessibility score of drug-like
Designing of Efficient TC4-Based Sensitizers by Modification of Auxiliary Donor molecules based on molecular complexity and fragment contributions,
and π-Spacer, Bull. Chem. Soc. Jpn 86 (2013) 1272–1281. J. Cheminformatics 1 (2009) 8.
[22] X. Zhao, Q. An, H. Zhang, C. Yang, A. Mahmood, M. Jiang, M.H. Jee, B. Fu, S. Tian, [39] S. Kim, P.A. Thiessen, E.E. Bolton, J. Chen, G. Fu, A. Gindulyte, L. Han, J. He, S. He,
H.Y. Woo, Y. Wang, J.-L. Wang, Double Asymmetric Core Optimizes Crystal B.A. Shoemaker, J. Wang, B. Yu, J. Zhang, S.H. Bryant, PubChem Substance and
Packing to Enable Selenophene-based Acceptor with Over 18 % Efficiency in Compound databases, Nucleic Acids Res. 44 (2016) D1202–D1213.
Binary Organic Solar Cells, Angew. Chem. Int. Ed. 62 (2023) e202216340. [40] T. Sterling, J.J. Irwin, ZINC 15 – Ligand Discovery for Everyone, J. Chem. Inf.
[23] J. Zhou, Z. He, Y. Sun, A. Tang, Q. Guo, E. Zhou, Organic Photovoltaic Cells Based Model. 55 (2015) 2324–2337.
on Nonhalogenated Polymer Donors and Nonhalogenated A-DA′D-A-Type [41] M. Voršilák, D. Svozil, Nonpher: computational method for design of hard-to-
Nonfullerene Acceptors with High VOC and Low Nonradiative Voltage Loss, ACS synthesize structures, J. Cheminformatics 9 (2017) 20.
Appl. Mater. Interfaces 14 (2022) 41296–41303. [42] T. Mubashir, M. Hussain Tahir, Y. Altaf, F. Ahmad, M. Arshad, A. Hakamy,
[24] M.R. Janjua, A. Mahmood, M.F. Nazar, Z. Yang, S. Pan, Electronic absorption M. Sulaman, Statistical analysis and visualization of data of non-fullerene small
spectra and nonlinear optical properties of ruthenium acetylide complexes: a DFT molecule acceptors from Harvard organic photovoltaic database. Structural
study toward the designing of new high NLO response compounds, Acta Chim. similarity analysis with famous non-fullerene small molecule acceptors to search
Slov. 61 (2014) 382–390. new building blocks, J. Photochem. Photobiol. A Chem. 437 (2023), 114501.
[25] L. Yan, H. Zhang, Q. An, M. Jiang, A. Mahmood, M.H. Jee, H.-R. Bai, H.-F. Zhi, [43] T. Mubashir, M. Hussain Tahir, M.H.H. Mahmoud, Z. Shafiq, M. Ashraf, I.H. El
S. Zhang, H.Y. Woo, J.-L. Wang, Regioisomer-Free Difluoro-Monochloro Terminal- Azab, Z.M. El-Bahy, M. Ramzan Saeed Ashraf Janjua, Designing of symmetric and
based Hexa-Halogenated Acceptor with Optimized Crystal Packing for Efficient asymmetric small molecule acceptors for organic solar cells: A farmwork based on
Binary Organic Solar Cells, Angew. Chem. Int. Ed. 61 (2022) e202209454. Machine learning, virtual screening and structural analysis, J. Photochem.
[26] M. Ji, C. Dong, Q. Guo, M. Du, Q. Guo, X. Sun, E. Wang, E. Zhou, Recent Advances Photobiol. A Chem. 444 (2023), 114977.
in Organic Photovoltaic Materials Based on Thiazole-Containing Heterocycles, [44] A. Irfan, A. Mahmood, Designing of Efficient Acceptors for Organic Solar Cells:
Macromol. Rapid Commun. 44 (2023) 2300102. Molecular Modelling at DFT Level, J. Clust. Sci. 29 (2018) 359–365.
[27] S. Hussain, S. Ali, S. Shahzadi, M.N. Tahir, M. Shahid, Synthesis, characterization, [45] S. Hussain, S. Ali, S. Shahzadi, C. Rizzoli, M. Shahid, Diorganotin(IV) Complexes
biological activities, crystal structure and DNA binding of organotin(IV) 5- with Monohydrate Disodium Salt of Iminodiacetic Acid: Synthesis,
chlorosalicylates, J. Coord. Chem. 68 (2015) 2369–2387. Characterization, Crystal Structure and Biological Activities, J. Chin. Chem. Soc. 62
[28] S. Hussain, S. Ali, S. Shahzadi, M.N. Tahir, M. Shahid, Synthesis, characterization, (2015) 793–802.
single crystal XRD and biological screenings of organotin(IV) derivatives with 4-(2- [46] S. Hussain, S. Ali, S. Shahzadi, M. Shahid, Heterobimetallic complexes containing
hydroxyethyl)piperazine-1-carbodithioic acid, J. Coord. Chem. 69 (2016) Sn(IV) and Pd(II) with 4-(2-Hydroxyethyl)piperazine-1-carbodithioic acid:
687–703. Synthesis, characterization and biological activities, Cogent Chemistry 1 (2015)
[29] S. Hussain, S. Ali, S. Shahzadi, M.N. Tahir, M. Shahid, K.S. Munawar, S.M. Abbas, 1029038.
Synthesis, spectroscopy, single crystal XRD and biological studies of multinuclear [47] A. Irfan, A. Mahmood, Computational Designing of Low Energy Gap Small
organotin dicarboxylates, Polyhedron 117 (2016) 64–72. Molecule Acceptors for Organic Solar Cells, J. Mex. Chem. Soc. 61 (2017) 309–316.
[30] L. Wang, Q. An, L. Yan, H.-R. Bai, M. Jiang, A. Mahmood, C. Yang, H. Zhi, J.- [48] A. Mahmood, M. Saqib, M. Ali, M.I. Abdullah, B. Khalid, Theoretical investigation
L. Wang, Non-fullerene acceptors with hetero-dihalogenated terminals induce for the designing of novel antioxidants, Can. J. Chem. 91 (2013) 126–130.
significant difference in single crystallography and enable binary organic solar cells [49] S. Hussain, S. Ali, S. Shahzadi, S.K. Sharma, K. Qanungo, I.H. Bukhari,
with 17.5% efficiency, Energ. Environ. Sci. 15 (2022) 320–333. Homobimetallic complexes containing Sn(IV) with acetylene dicarboxylic acid:
[31] Y. Zhang, Y. Ji, Y. Zhang, W. Zhang, H. Bai, M. Du, H. Wu, Q. Guo, E. Zhou, Recent their syntheses and structural interpretation by spectroscopic, semi-empirical, and
Progress of Y6-Derived Asymmetric Fused Ring Electron Acceptors, Adv. Funct. DFT techniques, J. Coord. Chem. 65 (2012) 278–285.
Mater. 32 (2022) 2205115.
11