Professional Documents
Culture Documents
3. RESULTS AND DISCUSSION For the second part, we set up the machine learning
model for sweetness prediction. The Linear correlation
For the evaluation of the Applicability domain, a matrix (Image 2) of the molecular properties below
database of sweet compounds’ database shows that there is a correlation between Element
(data_qsar.xlsx) was used to build a machine-learning count, Bond count, and Molecular weight since the
model, which was split into a learning and test set, Molecular weight and Bond count are linked with the
containing 155 and 153 molecules, respectively. number of atoms of molecules.
Based on the molecules present in the database, a
comparison was made between the learning set and the
test set, where, approximately, 87% are similar with
224 molecules and 19% are dissimilar with 49
molecules. In the diagram below it is possible to see the
results.
4. CONCLUSION
5. REFERENCES
[01] Hanser T, Barber C, Marchaland JF, Werner S.
Applicability domain: towards a more formal definition. SAR
QSAR Environ Res. 2016 Nov;27(11):893-909. doi:
10.1080/1062936X.2016.1250229. Epub 2016 Nov 9. PMID:
27827546.
5. REFERENCES
[01] Cherkasov A, Muratov EN, Fourches D, Varnek
For the second part, a chemical space was built using A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin
both databases of Fragrance and Taste, which is a YC, Todeschini R, Consonni V, Kuz'min VE, Cramer R,
Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J,
concept in cheminformatics that refers to the property
Richard A, Tropsha A. QSAR modeling: where have you
space with all possible molecules in relation to a given
been? Where are you going to? J Med Chem. 2014 Jun
set of construction principles and conditions.
26;57(12):4977-5010. doi: 10.1021/jm4004285. Epub
2014 Jan 6. PMID: 24351051; PMCID: PMC4074254.