Publications Search

CIS Project 22359, Final Technical Report. Discretized Posterior Approximation in High Dimensions

Duersch, Jed A.; Duersch, Jed A.; Catanach, Thomas A.

Our primary aim in this work is to understand how to efficiently obtain reliable uncertainty quantification in automatic learning algorithms with limited training datasets. Standard approaches rely on cross-validation to tune hyper parameters. Unfortunately, when our datasets are too small, holdout datasets become unreliable—albeit unbiased—measures of prediction quality due to the lack of adequate sample size. We should not place confidence in holdout estimators under conditions wherein the sample variance is both large and unknown. More poigniantly, our training experiments on limited data (Duersch and Catanach, 2021) show that even if we could improve estimator quality under these conditions, the typical training trajectory may never even encounter generalizable models.

More Details

TYPE SAND Report YEAR 2021

OSTI DOI

Parsimonious Inference Foundations for Trustworthy-by-Design Machine Learning

Duersch, Jed A.; Catanach, Thomas A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2021

OSTI

Parsimonious Rational Belief

Duersch, Jed A.; Catanach, Thomas A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

A Bayesian Perspective on Machine Learning and UQ

Catanach, Thomas A.; Duersch, Jed A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Randomized projection for rank-revealing matrix factorizations and low-rank approximations

SIAM Review

Duersch, Jed A.; Gu, Ming

Rank-revealing matrix decompositions provide an essential tool in spectral analysis of matrices, including the Singular Value Decomposition (SVD) and related low-rank approximation techniques. QR with Column Pivoting (QRCP) is usually suitable for these purposes, but it can be much slower than the unpivoted QR algorithm. For large matrices, the difference in performance is due to increased communication between the processor and slow memory, which QRCP needs in order to choose pivots during decomposition. Our main algorithm, Randomized QR with Column Pivoting (RQRCP), uses randomized projection to make pivot decisions from a much smaller sample matrix, which we can construct to reside in a faster level of memory than the original matrix. This technique may be understood as trading vastly reduced communication for a controlled increase in uncertainty during the decision process. For rank-revealing purposes, the selection mechanism in RQRCP produces results that are the same quality as the standard algorithm, but with performance near that of unpivoted QR (often an order of magnitude faster for large matrices). We also propose two formulas that facilitate further performance improvements. The first efficiently updates sample matrices to avoid computing new randomized projections. The second avoids large trailing updates during the decomposition in truncated low-rank approximations. Our truncated version of RQRCP also provides a key initial step in our truncated SVD approximation, TUXV. These advances open up a new performance domain for large matrix factorizations that will support efficient problem-solving techniques for challenging applications in science, engineering, and data analysis.

More Details

TYPE Journal Article YEAR 2020

Scopus OSTI DOI