New and Future Capabilities for the Kokkos Programming Model

Hollman, David S.; Trott, Christian R.

Abstract not provided.

TYPE Presentation YEAR 2021

A Universal Abstraction for Async

Niebler, Eric N.; Hollman, David S.

Abstract not provided.

TYPE Conference Poster YEAR 2020

Parameter Sensitivity Analysis of the SparTen High Performance Sparse Tensor Decomposition Software

2020 IEEE High Performance Extreme Computing Conference, HPEC 2020

Myers, Jeremy M.; Dunlavy, Daniel D.; Teranishi, Keita T.; Hollman, David S.

Tensor decomposition models play an increasingly important role in modern data science applications. One problem of particular interest is fitting a low-rank Canonical Polyadic (CP) tensor decomposition model when the tensor has sparse structure and the tensor elements are nonnegative count data. SparTen is a high-performance C++ library which computes a low-rank decomposition using different solvers: a first-order quasi-Newton or a second-order damped Newton method, along with the appropriate choice of runtime parameters. Since default parameters in SparTen are tuned to experimental results in prior published work on a single real-world dataset conducted using MATLAB implementations of these methods, it remains unclear if the parameter defaults in SparTen are appropriate for general tensor data. Furthermore, it is unknown how sensitive algorithm convergence is to changes in the input parameter values. This report addresses these unresolved issues with large-scale experimentation on three benchmark tensor data sets. Experiments were conducted on several different CPU architectures and replicated with many initial states to establish generalized profiles of algorithm convergence behavior.

TYPE Conference Poster YEAR 2020

Parameter Sensitivity Analysis of the SparTen High Performance Sparse Tensor Decomposition Software

Myers, Jeremy M.; Dunlavy, Daniel D.; Teranishi, Keita T.; Hollman, David S.

Abstract not provided.

TYPE Conference Poster YEAR 2020

Evolution of the Kokkos Ecosystem

Miles, Jeffery S.; Trott, Christian R.; Brunini, Victor B.; Bettencourt, Matthew T.; Poliakoff, David Z.; Rajamanickam, Sivasankaran R.; Hollman, David S.; Wolf, Michael W.; Glass, Micheal W.

Abstract not provided.

TYPE Presentation YEAR 2020

Performance and Parallelization of CP-Alternate Poisson Regression Sparse Tensor Decomposition

Teranishi, Keita T.; Hollman, David S.; Myers, Jeremy M.; Dunlavy, Daniel D.

Abstract not provided.

TYPE Conference Poster YEAR 2020

Kokkos Core Status Update

Trott, Christian R.; Liber, Nevin L.; Lebrun-Grandie, Damien L.; Poliakoff, David Z.; Hollman, David S.; Lewis, Cannada L.; Sunderland, Daniel S.

Abstract not provided.

TYPE Presentation YEAR 2020

P1673R2: A free function linear algebra interface based on the BLAS

Hoemmen, Mark F.; Hollman, David S.; Trott, Christian R.; Liber, Nevin L.; Rajamanickam, Sivasankaran R.; Lo, Li-Ta L.; Lopez, Graham L.; Caday, Peter C.; Knepper, Sarah K.; Costa, Timothy B.

Abstract not provided.

TYPE Conference Poster YEAR 2020

mdspan in C++: A Case Study in the Integration of Performance Portable Features into International Language Standards

Proceedings of P3HPC 2019: International Workshop on Performance, Portability and Productivity in HPC - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis

Hollman, David S.; Lelbach, Bryce; Edwards, H.C.; Hoemmen, Mark F.; Sunderland, Daniel S.; Trott, Christian R.

Multi-dimensional arrays are ubiquitous in high-performance computing (HPC), but their absence from the C++ language standard is a long-standing and well-known limitation of their use for HPC. This paper describes the design and implementation of mdspan, a proposed C++ standard multidimensional array view (planned for inclusion in C++23). The proposal is largely inspired by work done in the Kokkos project - a C++ performance-portable programming model deployed by numerous HPC institutions to prepare their code base for exascale-class supercomputing systems. This paper describes the final design of mdspan after a five-year process to achieve consensus in the C++ community. In particular, we will lay out how the design addresses some of the core challenges of performance-portable programming, and how its customization points allow a seamless extension into areas not currently addressed by the C++ Standard but which are of critical importance in the heterogeneous computing world of today's systems. Finally, we have provided a production-quality implementation of the proposal in its current form. This work includes several benchmarks of this implementation aimed at demonstrating the zero-overhead nature of the modern design.

TYPE Conference Poster YEAR 2019

mdspan in C++: A Case Study in the Integration of Performance Portable Features into International Language Standards

Hollman, David S.; Lelbach, Bryce L.; Edwards, H.C.; Hoemmen, Mark F.; Sunderland, Daniel S.; Trott, Christian R.

Abstract not provided.

TYPE Presentation YEAR 2019

The Art of Breaking Things: a new tool for fighting against Hyrum's law in the new world of concept-driven design

Hollman, David S.

Abstract not provided.

TYPE Conference Poster YEAR 2019

Performance and Parallelization of CP-Alternate Poisson Regression Sparse Tensor Decomposition

Teranishi, Keita T.; Hollman, David S.; Barrett, Richard F.; Myers, Jeremy M.; Dunlavy, Daniel D.

Abstract not provided.

TYPE Conference Poster YEAR 2019

A free function linear algebra interface based on the BLAS

Caday, Peter C.; Hoemmen, Mark F.; Hollman, David S.; Liber, Nevin L.; Lo, Li-Ta L.; Lopez, Graham L.; Luszczek, Piotr L.; Knepper, Sarah K.; Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2019

Development of parallel sparse CP-APR tensor decomposition solvers

Teranishi, Keita T.; Hollman, David S.; Dunlavy, Daniel D.; Barrett, Richard F.

Abstract not provided.

TYPE Conference Poster YEAR 2019

The Ongoing Saga of ISO-C++ Executors

Hollman, David S.

Abstract not provided.

TYPE Conference Poster YEAR 2019

Thoughts on Curiously Recurring Template Pattern

Hollman, David S.

Abstract not provided.

TYPE Conference Poster YEAR 2019

The Future of Parallelism and Concurrency in C++ and How It Relates to DOE Abstraction Layers

Hollman, David S.

Abstract not provided.

TYPE Presentation YEAR 2019

Modern C++ in Computational Science

Hollman, David S.; Hoemmen, Mark F.; Sunderland, Daniel S.; Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2019

P1393R0: A General Property Customization Mechanism

Hollman, David S.; Kohlhoff, Chris K.; Lelbach, Bryce L.; Brown, Gordon B.; Dominiak, Michał D.

Abstract not provided.

TYPE Presentation YEAR 2019

Making OpenMP ready for C++ executors

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Scogland, Thomas R.W.; Sunderland, Daniel S.; Olivier, Stephen L.; Hollman, David S.; Evans, Noah; de Supinski, Bronis R.

For at least the last 20 years, many have tried to create a general resource management system to support interoperability across various concurrent libraries. The previous strategies all suffered from additional toolchain requirements, and/or a usage of a shared programming model that assumed it owned/controlled access to all resources available to the program. None of these techniques have achieved widespread adoption. The ubiquity of OpenMP coupled with C++ developing a standard way to describe many different concurrent paradigms (C++23 executors) would allow OpenMP to assume the role of a general resource manager without requiring user code written directly in OpenMP. With a few added features such as the ability to use otherwise idle threads to execute tasks and to specify a task “width”, many interesting concurrent frameworks could be developed in native OpenMP and achieve high performance. Further, one could create concrete C++ OpenMP executors that enable support for general C++ executor-based codes, which would allow Fortran, C, and C++ codes to use the same underlying concurrent framework when expressed as native OpenMP or using language-specific features. Effectively, OpenMP would become the de facto solution for a problem that has long plagued the HPC community.

TYPE Conference Poster YEAR 2019
