Welcome to Scribd!

AD Standarized Approach

Uploaded by

0% found this document useful (0 votes)

12 views1 page

The document describes an algorithm for determining if compounds are outliers or within the applicability domain of a model. It involves: 1) Standardizing descriptor values for all compounds using mean and standard deviation of the training set descriptors. 2) Computing the maximum and minimum standardized descriptor values for each compound. 3) Compounds with both maximum and minimum values below 3 are considered non-outliers/within domain. 4) Compounds with maximum above 3 require calculating a new standardized value to determine outlier status.

Original Description:

Original Title

AD standarized approach

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

12 views1 page

AD Standarized Approach

Uploaded by

JosAcuña

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

The algorithm and methodology for the proposed approach are discussed below:

a) First, all the descriptors appearing in the developed model (for training and test set
compounds) are standarized using the following formula:

Ski=¿ Xki−x̄ i∨ ¿ ¿
σ Xi
k 1,2,3… nComp (here, nComp = total number of compounds)
i 1,2,3… nDes (here, nDes = total number of descriptors)
Ski Standarized descriptor i for compound k (from the training or test set)
Xki Original descriptor i for compound k (from the training or test set)
x̄ i Mean value of the descriptor Xi for the training set compounds only
σ Xi Standard deviation of the descriptor Xi for the training set compounds only

The above calculation should run for all descriptor values (nComp x nDes) present in the model.
So, we will have now nDes number of the standarized descriptor S i(k) (i=1 to nDes) values for any
compound k.

b) Thereafter, one needs to compute the maximum S i(k) value (|Si|max(k)|) for the compound k.
If |Si|max(k) is lower than or equal to 3, then that compound is not an X-outlier (if in the
training set) or is within applicability domain (if in the test set).
c) If |Si|max(k) is above 3 , then one should compute |Si|min(k). If |Si|min(k) > 3, then the
compound is an X-outlier (If in the training set) or is not within applicability domain (if in
the test set).
d) If |Si|max(k)| > 3 and |Si|min(k)| <3, then one should compute S new(k) from the following
equation:

Snew ( k )= Ś k + 1.28 x σSk

Where,

Snew(k) Snew value for the compound k

Ś k Mean of Si(k) values of the compound k
σSk Standard deviation of Si(k) values of the compound k

If the calculated Snew(k) value is lower than or equal to 3, then that compound is not an X-outlier (If
the training set) or is within applicability domain (if in the test set).

1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
Document5 pages
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
Muhammad Hur Rizvi
No ratings yet
Lab Num
Document29 pages
Lab Num
Navjot Wadhwa
0% (1)
Solution First Point ML-HW4
Document6 pages
Solution First Point ML-HW4
Juan Sebastian Otálora Montenegro
100% (1)
36-708 Statistical Machine Learning Homework #3 Solutions: DUE: March 29, 2019
Document22 pages
36-708 Statistical Machine Learning Homework #3 Solutions: DUE: March 29, 2019
S
No ratings yet
Implementing Custom Randomsearchcv: 'Red' 'Blue'
Document1 page
Implementing Custom Randomsearchcv: 'Red' 'Blue'
Tayub khan.A
No ratings yet
2021SEM2 CS3230 Sols
Document10 pages
2021SEM2 CS3230 Sols
Nancy Q
No ratings yet
Test 1: Compsci 100: Owen Astrachan September 30, 2009
Document17 pages
Test 1: Compsci 100: Owen Astrachan September 30, 2009
Gobara Dhan
No ratings yet
Practice-Sheet 1 - Practice Sheet Practice-Sheet 1 - Practice Sheet
Document2 pages
Practice-Sheet 1 - Practice Sheet Practice-Sheet 1 - Practice Sheet
Shubham Kumar
No ratings yet
Ai Combined Update
Document274 pages
Ai Combined Update
John Doe
No ratings yet
Final Term Test Soln 2020
Document9 pages
Final Term Test Soln 2020
Sourabh Raj
No ratings yet
Comp 372 Assignment 2
Document9 pages
Comp 372 Assignment 2
Hussam Shah
No ratings yet
E9 205 - Machine Learning For Signal Processing: Practice Midterm Exam
Document4 pages
E9 205 - Machine Learning For Signal Processing: Practice Midterm Exam
rishi gupta
No ratings yet
Aviation Maths
Document94 pages
Aviation Maths
yihesak
No ratings yet
Lab Experiment No. 4 Name: Dhruv Jain SAP ID: 60004190030 Div/Batch: A/A2 Aim
Document9 pages
Lab Experiment No. 4 Name: Dhruv Jain SAP ID: 60004190030 Div/Batch: A/A2 Aim
Dhruv jain
No ratings yet
Seguridad ML
Document7 pages
Seguridad ML
andres python
No ratings yet
Linear Models - Numeric Prediction
Document7 pages
Linear Models - Numeric Prediction
ar9vega
No ratings yet
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
Document7 pages
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
Kartik Katekar
No ratings yet
Complex Problem AI
Document13 pages
Complex Problem AI
js9118164
No ratings yet
Guide Function Fit
Document11 pages
Guide Function Fit
Jean claude onana
No ratings yet
Cheat ML
Document1 page
Cheat ML
Rahul Babarwal
No ratings yet
ML Cheatsheet
Document1 page
ML Cheatsheet
francoisroyer7709
No ratings yet
K-Nearest Neighbor On Python Ken Ocuma
Document9 pages
K-Nearest Neighbor On Python Ken Ocuma
Aliyha Dionio
100% (2)
Ex 7
Document17 pages
Ex 7
api-322416213
No ratings yet
Computational Techniques in Statistics: Exercise 1
Document5 pages
Computational Techniques in Statistics: Exercise 1
وجدان الشدادي
No ratings yet
The University of Nottingham
Document4 pages
The University of Nottingham
P6E7P7
No ratings yet
Final 13
Document9 pages
Final 13
نورالدين بوجناح
No ratings yet
Bhaumik-Project - C - Report K Mean Complexity
Document10 pages
Bhaumik-Project - C - Report K Mean Complexity
Mahiye Ghosh
No ratings yet
Machine Learning
Document56 pages
Machine Learning
Mani Vrs
100% (3)
05 Logistic - Regression
Document7 pages
05 Logistic - Regression
adalina
No ratings yet
MapReduce Algorithms For K-Means Clustering
Document11 pages
MapReduce Algorithms For K-Means Clustering
fahmynadhif
No ratings yet
K-Means in Python - Solution
Document6 pages
K-Means in Python - Solution
Rodrigo Violante
No ratings yet
Data Structures and Algorithms Unit - V: Dynamic Programming
Document19 pages
Data Structures and Algorithms Unit - V: Dynamic Programming
Varuni Devi
No ratings yet
Perceptron&ADALINEcode
Document2 pages
Perceptron&ADALINEcode
Karismo Bing
No ratings yet
Assignment 2 Solution
Document12 pages
Assignment 2 Solution
Razin
No ratings yet
Tutorial For Machine Learning
Document14 pages
Tutorial For Machine Learning
Raji Ramachandran
No ratings yet
GyandeepSarmah
Document6 pages
GyandeepSarmah
Aman Bansal
100% (1)
STA302 Week12 Full
Document30 pages
STA302 Week12 Full
tianyuan gu
No ratings yet
Experiment 3.1 K-Mean
Document8 pages
Experiment 3.1 K-Mean
Arslan Mansoori
No ratings yet
21BEC505 Exp2
Document7 pages
21BEC505 Exp2
jay
No ratings yet
Ex 7
Document17 pages
Ex 7
carlos_cabrera_51
No ratings yet
Assignment 5
Document14 pages
Assignment 5
niotihasan02
No ratings yet
System Simulation and Modeling Lab
Document23 pages
System Simulation and Modeling Lab
Pavleen Kaur
No ratings yet
Team1 Lab2
Document13 pages
Team1 Lab2
Loan Nguyễn Thị Minh
No ratings yet
Ain Shams University Faculty of Engineering
Document2 pages
Ain Shams University Faculty of Engineering
سلمى طارق عبدالخالق عطيه Unknown
No ratings yet
Multiplym 2
Document23 pages
Multiplym 2
adyant.gupta2022
No ratings yet
Mslab All 2376
Document32 pages
Mslab All 2376
Anubhav Khurana
No ratings yet
Module 2 Daa FINAL 20201
Document29 pages
Module 2 Daa FINAL 20201
trichy_sathish
No ratings yet
Math10282 Ex05 - An R Session
Document6 pages
Math10282 Ex05 - An R Session
deimante
No ratings yet
Is Lab Aman Agarwal PDF
Document8 pages
Is Lab Aman Agarwal PDF
Aman Bansal
No ratings yet
Automatic Activity Recognition of Weight Lifting Exercises Using Sensor Data
Document4 pages
Automatic Activity Recognition of Weight Lifting Exercises Using Sensor Data
svglolla
No ratings yet
Chapter 6 Built-In Functions
Document5 pages
Chapter 6 Built-In Functions
Killua 0707
No ratings yet
Text Clustering
Document47 pages
Text Clustering
Alex Ciocan
No ratings yet
Assignment 2 Solution
Document2 pages
Assignment 2 Solution
Razin
No ratings yet
Cga Till Lab 9
Document29 pages
Cga Till Lab 9
saloniaggarwal0304
No ratings yet
Homework 2
Document8 pages
Homework 2
mbowman10
No ratings yet
Simple Case Study of Implementing K Means Clustering On The IRIS Dataset
Document4 pages
Simple Case Study of Implementing K Means Clustering On The IRIS Dataset
gargwork1990
No ratings yet
Assignment1 PDF
Document2 pages
Assignment1 PDF
rineeth m
No ratings yet
Mslab All 2982
Document31 pages
Mslab All 2982
Anubhav Khurana
No ratings yet
Numerical Analysis II Essentials
From Everand
Numerical Analysis II Essentials
The Editors of REA
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
Rating: 3.5 out of 5 stars
3.5/5 (8)