Predicting error bars for QSAR models

Timon Schroeter, Anton Schwaighofer, Sebastian Mika, Antonius Ter Laak, Detlev Suelzle, Ursula Ganzer, Nikolaus Heinrich, Klaus Robert Müller

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Unfavorable physicochemical properties often cause drug failures. It is therefore important to take lipophilicity and water solubility into account early on in lead discovery. This study presents log D7 models built using Gaussian Process regression, Support Vector Machines, decision trees and ridge regression algorithms based on 14556 drug discovery compounds of Bayer Schering Pharma. A blind test was conducted using 7013 new measurements from the last months. We also present independent evaluations using public data. Apart from accuracy, we discuss the quality of error bars that can be computed by Gaussian Process models, and ensemble and distance based techniques for the other modelling approaches.

    Original languageEnglish
    Title of host publicationCompLife 2007 - 3rd International Symposium on Computational Life Science
    Pages158-167
    Number of pages10
    DOIs
    Publication statusPublished - 2007
    Event3rd International Symposium on Computational Life Science, CompLife 2007 - Utrecht, Netherlands
    Duration: 2007 Oct 42007 Oct 5

    Publication series

    NameAIP Conference Proceedings
    Volume940
    ISSN (Print)0094-243X
    ISSN (Electronic)1551-7616

    Other

    Other3rd International Symposium on Computational Life Science, CompLife 2007
    Country/TerritoryNetherlands
    CityUtrecht
    Period07/10/407/10/5

    ASJC Scopus subject areas

    • General Physics and Astronomy

    Fingerprint

    Dive into the research topics of 'Predicting error bars for QSAR models'. Together they form a unique fingerprint.

    Cite this