Fast and Accurate Uncertainty Estimation in Chemical Machine Learning.

07:00 EST 3rd January 2019 | BioPortfolio

Summary of "Fast and Accurate Uncertainty Estimation in Chemical Machine Learning."

We present a scheme to obtain an inexpensive and reliable estimate of the uncertainty associated with the predictions of a machine-learning model of atomic and molecular properties. The scheme is based on resampling, with multiple models being generated based on sub-sampling of the same training data. The accuracy of the uncertainty prediction can be benchmarked by maximum likelihood estimation, which can also be used to correct for correlations between resampled models, and to improve the performance of the uncertainty estimation by a cross-validation procedure. In the case of sparse Gaussian Process Regression models, this resampled estimator can be evaluated at negligible cost. We demonstrate the reliability of these estimates for the prediction of molecular energetics, and for the estimation of nuclear chemical shieldings in molecular crystals. Extension to estimate the uncertainty in energy differences, forces, or other correlated predictions is straightforward. This method can be easily applied to other machine learning schemes, and will be beneficial to make data-driven predictions more reliable, and to facilitate training-set optimization and active-learning strategies.
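The resampling idea in the summary can be sketched in a few lines. The example below is a minimal illustration under stated assumptions, not the authors' implementation: plain ridge regression stands in for the Gaussian Process Regression models, the data are synthetic, and all function names (`fit_ridge`, `resampled_models`, `ml_scale`, `gaussian_nll`) are hypothetical. The spread of predictions across sub-sampled models gives a raw uncertainty, and a single scaling factor fitted by maximum likelihood on held-out data recalibrates it.

```python
import numpy as np


def fit_ridge(X, y, reg=1e-3):
    """Closed-form ridge regression weights (a stand-in for any regressor)."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + reg * np.eye(n_features), X.T @ y)


def resampled_models(X, y, n_models=16, frac=0.5, seed=0):
    """Train n_models regressors, each on a random sub-sample of the training set."""
    rng = np.random.default_rng(seed)
    n_sub = int(frac * len(X))
    weights = []
    for _ in range(n_models):
        idx = rng.choice(len(X), size=n_sub, replace=False)
        weights.append(fit_ridge(X[idx], y[idx]))
    return np.array(weights)  # shape (n_models, n_features)


def predict_with_uncertainty(weights, X):
    """Mean and standard deviation of the predictions across the resampled models."""
    preds = X @ weights.T  # shape (n_samples, n_models)
    return preds.mean(axis=1), preds.std(axis=1, ddof=1)


def ml_scale(y_true, mean, sigma):
    """Maximum-likelihood scaling factor for sigma under a Gaussian error model."""
    return np.sqrt(np.mean((y_true - mean) ** 2 / sigma ** 2))


def gaussian_nll(y_true, mean, sigma):
    """Average negative log-likelihood of y_true under N(mean, sigma^2)."""
    return np.mean(0.5 * np.log(2 * np.pi * sigma ** 2)
                   + (y_true - mean) ** 2 / (2 * sigma ** 2))


# Synthetic data (assumption): a linear target with Gaussian noise.
rng = np.random.default_rng(42)
X_all = rng.normal(size=(300, 3))
y_all = X_all @ np.array([1.0, -2.0, 0.5]) + 0.3 * rng.normal(size=300)
X_train, y_train = X_all[:200], y_all[:200]
X_val, y_val = X_all[200:], y_all[200:]

weights = resampled_models(X_train, y_train)
mean_val, sigma_val = predict_with_uncertainty(weights, X_val)

# Calibrate the raw ensemble spread on held-out data.
alpha = ml_scale(y_val, mean_val, sigma_val)
print("scaling factor:", alpha)
print("NLL before calibration:", gaussian_nll(y_val, mean_val, sigma_val))
print("NLL after calibration: ", gaussian_nll(y_val, mean_val, alpha * sigma_val))
```

The scaling factor follows from setting the derivative of the Gaussian log-likelihood with respect to the squared scale to zero, so on the data used to fit it the calibrated likelihood can never be worse than the uncalibrated one. In practice the factor would be fitted on validation or cross-validation data and then applied to new predictions.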


Journal Details

This article was published in the following journal.

Name: Journal of Chemical Theory and Computation
ISSN: 1549-9626


PubMed Articles [15667 Associated PubMed Articles listed on BioPortfolio]

Rapid estimation of activation energy in heterogeneous catalytic reactions via machine learning.

Estimation of activation energies within heterogeneous catalytic reactions is performed using machine learning and a catalyst dataset. In particular, descriptors for determining activation energy are r...

Modeling of stem form and volume through machine learning.

Taper functions and volume equations, which have a consolidated theory, are essential for estimating individual volume. On the other hand, mathematical innovation is dynamic, and may improve the f...

Machine learning in suicide science: Applications and ethics.

For decades, our ability to predict suicide has remained at near-chance levels. Machine learning has recently emerged as a promising tool for advancing suicide science, particularly in the domain of s...

Efficient corrections for DFT noncovalent interactions based on ensemble learning models.

Machine learning has exhibited powerful capabilities in many areas. However, machine learning models are mostly database dependent, requiring a new model if the database changes. Therefore, a universa...

Geometric morphometrics aided by machine learning in craniofacial surgery.

Geometric morphometrics aided by machine learning provide detailed and accurate statistical models of facial form. They promise to be extremely effective tools in surgical planning and assessment; how...

Clinical Trials [4017 Associated Clinical Trials listed on BioPortfolio]

Machine Learning-Based Risk Profile Classification of Patients Undergoing Elective Heart Valve Surgery

Machine learning methods potentially provide a highly accurate and detailed assessment of expected individual patient risk before elective cardiac surgery. Correct anticipation of this ris...

Machine Learning From Fetal Flow Waveforms to Predict Adverse Perinatal Outcomes

The aim of this study is to get a proof of concept for using a computational model of fetal haemodynamics, combined with machine learning based on Doppler patterns of the fetal cardiovascu...

Learning Curve CUSUM of LV Ejection Fraction by Visual Estimation in Novice Practitioners

This retrospective, cross-sectional cohort study was conducted to investigate novice practitioners' learning curve for visual estimation of LV ejection fraction (%) through an echocardio...

Subpopulation-Specific Sepsis Identification Using Machine Learning

The investigators propose to develop and evaluate a hospital department-specific machine learning based clinical decision support (CDS) system for early sepsis prediction, focused on impro...

Computerized Cognitive Bias Intervention for Intolerance of Uncertainty

This investigation examines the efficacy of a brief, one-session computerized interpretation bias modification paradigm (CBM-I) in the reduction of intolerance of uncertainty. Intolerance ...

Medical and Biotech [MESH] Definitions

A MACHINE LEARNING paradigm used to make predictions about future instances based on a given set of labeled paired input-output training (sample) data.

A MACHINE LEARNING paradigm used to make predictions about future instances based on a given set of unlabeled paired input-output training (sample) data.

SUPERVISED MACHINE LEARNING algorithm which learns to assign labels to objects from a set of training examples. Examples are learning to recognize fraudulent credit card activity by examining hundreds or thousands of fraudulent and non-fraudulent credit card activity, or learning to make disease diagnosis or prognosis based on automatic classification of microarray gene expression profiles drawn from hundreds or thousands of samples.

A type of ARTIFICIAL INTELLIGENCE that enables COMPUTERS to independently initiate and execute LEARNING when exposed to new data.

Process in which individuals take the initiative in diagnosing their learning needs, formulating learning goals, identifying resources for learning, choosing and implementing learning strategies, and evaluating learning outcomes (Knowles, 1975).
