The Utility of Multiple Random Sampling in the Development of SAR Models

Authors

N. B. Sussman

O. T. Macina

H. G. Claycamp

S. G. Grant

H. S. Rosenkranz

Proceedings:

Predictive Toxicology of Chemicals: Experiences and Impact of AI Tools

Volume

Issue:

Predictive Toxicology of Chemicals: Experiences and Impact of AI Tools

Track:

Contents

Downloads:

Download PDF

Abstract:

We propose an approach that incorporates into the model-building process resampling of the parent chemical database to form N random databases, from which N independent random models are generated. The multiple random sampling allows us to treat the parent database as an empirical chemical population distribution. This approach helps to overcome to some degree the representation bias in the parent database. Model building researchers will recognize the similarity of this approach to the bootstrap. In fact, it is the bootstrap methodology but applied not only to estimate the distribution of the prediction accuracy/error, but also to evaluate the consistency of random models developed from a given database. The idea for employing the bootstrap approach for this purpose was presented by Efron and Gong in a demonstration that dealt with a similar prediction problem, a situation which they deemed to be "hopelessly beyond traditional theoretical solutions" (Efron and Gong, 1983).

Spring

Predictive Toxicology of Chemicals: Experiences and Impact of AI Tools

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.