Publications Search

Kokkos Kernels: FY20 update

Berger-Vergiat, Luc B.; Rajamanickam, Sivasankaran R.; Dang, Vinh Q.; Ellingwood, Nathan D.; Kelley, Brian M.; Harvey, Evan C.; Wilke, Jeremiah J.; Acer, Seher A.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2021

OSTI DOI

Kokkos Kernels: FY20 update

Rajamanickam, Sivasankaran R.; Berger-Vergiat, Luc B.; Acer, Seher A.; Dang, Vinh Q.; Ellingwood, Nathan D.; Harvey, Evan C.; Kelley, Brian M.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2021

OSTI DOI

Codesign for the Masses

Lewis, Cannada L.; Hammond, Simon D.; Wilke, Jeremiah J.

In this position paper we will address challenges and opportunities relating to the design and codesign of application specific circuits. Given our background as computational scientists, our perspective is from the viewpoint of a highly motivated application developer as opposed to career computer architects

More Details

TYPE Other Report YEAR 2021

OSTI DOI

Evaluating Trade-offs in Potential Exascale Interconnect Technologies

Hemmert, Karl S.; Bair, Ray B.; Bhatale, Abhinav B.; Groves, Taylor G.; Jain, Nikhil J.; Lewis, Cannada L.; Mubarak, Misbah M.; Pakin, Scott P.; Ross, Robert B.; Wilke, Jeremiah J.

This report details work to study trade-offs in topology and network bandwidth for potential interconnects in the exascale (2021-2022) timeframe. The work was done using multiple interconnect models across two parallel discrete event simulators. Results from each independent simulator are shown and discussed and the areas of agreement and disagreement are explored.

More Details

TYPE Other Report YEAR 2020

OSTI DOI

An Evaluation of Ethernet Performance for Scientific Workloads

Proceedings of INDIS 2020: Innovating the Network for Data-Intensive Science, Held in conjunction with SC 2020: The International Conference for High Performance Computing, Networking, Storage and Analysis

Kenny, Joseph P.; Wilke, Jeremiah J.; Ulmer, Craig D.; Baker, Gavin M.; Knight, Samuel K.; Friesen, Jerrold A.

Priority-based Flow Control (PFC), RDMA over Converged Ethernet (RoCE) and Enhanced Transmission Selection (ETS) are three enhancements to Ethernet networks which allow increased performance and may make Ethernet attractive for systems supporting a diverse scientific workload. We constructed a 96-node testbed cluster with a 100 Gb/s Ethernet network configured as a tapered fat tree. Tests representing important network operating conditions were completed and we provide an analysis of these performance results. RoCE running over a PFC-enabled network was found to significantly increase performance for both bandwidth-sensitive and latency-sensitive applications when compared to TCP. Additionally, a case study of interfering applications showed that ETS can prevent starvation of network traffic for latency-sensitive applications running on congested networks. We did not encounter any notable performance limitations for our Ethernet testbed, but we found that practical disadvantages still tip the balance towards traditional HPC networks unless a system design is driven by additional external requirements.

More Details

TYPE Conference Poster YEAR 2020

Scopus OSTI

Modern problems require modern solutions: How modern CMake supports modern C++ in Kokkos

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks

Wilke, Jeremiah J.; Kenny, Joseph P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks

Proceedings - IEEE International Conference on Cluster Computing, ICCC

Wilke, Jeremiah J.; Kenny, Joseph P.

Avoiding communication bottlenecks remains a critical challenge in high-performance computing (HPC) as systems grow to exascale. Numerous design possibilities exist for avoiding network congestion including topology, adaptive routing, congestion control, and quality-of-service (QoS). While network design often focuses on topological features like diameter, bisection bandwidth, and routing, efficient QoS implementations will be critical for next-generation interconnects. HPC workloads are dominated by tightly-coupled mathematics, making delays in a single message manifest as delays across an entire parallel job. QoS can spread traffic onto different virtual lanes (VLs), lowering the impact of network hotspots by providing priorities or bandwidth guarantees that prevent starvation of critical traffic. Two leading topology candidates, Dragonfly and Fat Tree, are often discussed in terms of routing properties and cost, but the topology can have a major impact on QoS. While Dragonfly has attractive routing flexibility and cost relative to Fat Tree, the extra routing complexity requires several VLs to avoid deadlock. Here we discuss the special challenges of Dragonfly, proposing configurations that use different routing algorithms for different service levels (SLs) to limit VL requirements. We provide simulated results showing how each QoS strategy performs on different classes of application and different workload mixes. Despite Dragonfly's desirable characteristics for adaptive routing, Fat Tree is shown to be an attractive option when QoS is considered.

More Details

TYPE Conference Poster YEAR 2020

Scopus OSTI

Modern problems require modern solutions: How modern CMake supports modern C++ in performance portability

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Putting compilers in the simulation co-design loop with surrogate performance models

Wilke, Jeremiah J.; Lewis, Cannada L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Supercomputer in a workstation: simulation as a development platform for network architectures

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Kokkos Kernels

Rajamanickam, Sivasankaran R.; Acer, Seher A.; Berger-Vergiat, Luc B.; Dang, Vinh Q.; Ellingwood, Nathan D.; Kelley, Brian M.; Kim, Kyungjoo K.; Trott, Christian R.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

The Exascale Computing Project: Hardware Evaluation for Interconnects

Hemmert, Karl S.; Wilke, Jeremiah J.; Ross, Rob R.; Groves, Taylor G.; Karlin, Ian K.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Kokkos CMake: Build Systems and for Modern C++

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Evaluation of novel interconnect technologies for ASC applications

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Significant Vendor Impact of Sandia?s Portals Networking Technology

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Supercomputer in a workstation: simulation as a development platform for network architectures

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Batched Linear Algebra in Kokkos Kernels

Rajamanickam, Sivasankaran R.; Berger-Vergiat, Luc B.; Dang, Vinh Q.; Ellingwood, Nathan D.; Kim, Kyungjoo K.; McLendon, William C.; Trott, Christian R.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Build Systems and Package Management for Modern C++

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

The programming and performance challenges of memory-semantic interconnects for scientific computing

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Kokkos Kernels

Rajamanickam, Sivasankaran R.; Berger-Vergiat, Luc B.; Dang, Vinh Q.; Ellingwood, Nathan D.; Kim, Kyungjoo K.; McLendon, William C.; Trott, Christian R.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Distributed memory futures for compile-time deterministic-by-default concurrency in distributed C++ applications

Hollman, David S.; Wilke, Jeremiah J.; Morales, Nicolas M.; Lewis, Cannada L.; Markosyan, Aram H.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Milestone Highlight Slide for STPR04-16

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Milestone Highlight Slide STPR04-15

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

ECP Milestone Memo for 2.3.1.04.14

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Other Report YEAR 2018

OSTI DOI

ECP Milestone Report DARMA-MPI Interoperability WBS 2.3.1.04 Milestone 15

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Other Report YEAR 2018

OSTI DOI

ECP Milestone Report DARMA-Kokkos Data and Execution Interoperability WBS 2.3.1.04 Milestone 16

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Other Report YEAR 2018

OSTI DOI

ECP Milestone Memo for 2.3.1.04.16

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Other Report YEAR 2018

OSTI DOI

ECP Milestone Deliverable Memo ? 2.3.1.04.15

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Other Report YEAR 2018

OSTI DOI

An Asynchronous Task-Based Parallelization Strategy for Multiscale Solid Mechanics

Perez, Jorge P.; Littlewood, David J.; Bennett, Janine C.; Hollman, David S.; Lifflander, Jonathan; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Compiler-assisted Source-to-Source Skeletonization of Application Models for System Simulation

Wilke, Jeremiah J.; Kenny, Joseph P.; Knight, Samuel K.; Rumley, Sebastien R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI DOI

Bringing minimal routing back to HPC through silicon photonics

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Supercomputer in a laptop: Distributed application and runtime development via architecture simulation

Knight, Samuel K.; Kenny, Joseph P.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI DOI

Supercomputer in a laptop: Distributed application and runtime development via architecture simulation

Wilke, Jeremiah J.; Knight, Samuel K.; Kenny, Joseph P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

The Pitfalls of Provisioning Exascale Networks: A Trace Replay Analysis for Understanding Communication Performance

Kenny, Joseph P.; Sargsyan, Khachik S.; Knight, Samuel K.; Michelogiannakis, George M.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI DOI

Flexible and Scalable Applications Models for the Structural Simulation Toolkit

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

A Gentle Introduction to the DARMA Programming Model

Hollman, David S.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Optimizing Dynamic Data-Effect Analysis for Iterative Overdecomposed Applications

Lifflander, Jonathan; Markosyan, Aram H.; Hollman, David S.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

DARMA: A software stack model for supporting asynchronous data effects programming in the Exascale Computing Project (ECP)

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

DARMA Programming Model: Distributed Asynchronous Resilient Models for Applications

Wilke, Jeremiah J.; Lifflander, Jonathan; Hollman, David S.; Borghesi, G.; Markosyan, Aram H.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Putting the "co" back into co-design for system-level studies with the SST macroscale tools

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Mitigating the risk and measuring the reward of data-effects programming for dynamic irregular algorithms using DARMA

Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Interconnect Working Group

Hemmert, Karl S.; Bair, Ray B.; Bhatele, Abhinav B.; Groves, Taylor G.; Hammond, Simon D.; Jain, Nikhil J.; Levenhagen, Michael J.; Mubarak, Misbah M.; Pakin, Scott P.; Ross, Rob R.; Wilke, Jeremiah J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

The pitfalls of provisioning exascale networks: A trace replay analysis for understanding communication performance

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Kenny, Joseph P.; Sargsyan, Khachik S.; Knight, Samuel K.; Michelogiannakis, George; Wilke, Jeremiah J.

Data movement is considered the main performance concern for exascale, including both on-node memory and off-node network communication. Indeed, many application traces show significant time spent in MPI calls, potentially indicating that faster networks must be provisioned for scalability. However, equating MPI times with network communication delays ignores synchronization delays and software overheads independent of network hardware. Using point-to-point protocol details, we explore the decomposition of MPI time into communication, synchronization and software stack components using architecture simulation. Detailed validation using Bayesian inference is used to identify the sensitivity of performance to specific latency/bandwidth parameters for different network protocols and to quantify associated uncertainties. The inference combined with trace replay shows that synchronization and MPI software stack overhead are at least as important as the network itself in determining time spent in communication routines.

More Details

TYPE Conference Poster YEAR 2018

Scopus OSTI

Compiler-assisted Source-to-Source Skeletonization of Application Models for System Simulation

Wilke, Jeremiah J.; Kenny, Joseph P.; Knight, Samuel K.; Sargsyan, Khachik S.; Rumley, Sebastien R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI DOI

Compilers (and humans that use them) need more expressive intermediate representations to handle extreme heterogeneity

Wilke, Jeremiah J.; Hollman, David S.; Lifflander, Jonathan

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Compiler-assisted source-to-source skeletonization of application models for system simulation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Wilke, Jeremiah J.; Kenny, Joseph P.; Knight, Samuel K.; Rumley, Sebastien

Performance modeling of networks through simulation requires application endpoint models that inject traffic into the simulation models. Endpoint models today for system-scale studies consist mainly of post-mortem trace replay, but these off-line simulations may lack flexibility and scalability. On-line simulations running so-called skeleton applications run reduced versions of an application that generate traffic that is the same or similar to the full application. These skeleton apps have advantages for flexibility and scalability, but they often must be custom written for the simulator itself. Auto-skeletonization of existing application source code via compiler tools would provide endpoint models with minimal development effort. These source-to-source transformations have been only narrowly explored. We introduce a pragma language and corresponding Clang-driven source-to-source compiler that performs auto-skeletonization based on provided pragma annotations. We describe the compiler toolchain, validate the generated skeletons, and show scalability of the generated simulation models beyond 100Â K endpoints for example MPI applications. Overall, we assert that our proposed auto-skeletonization approach and the flexible skeletons it produces can be an important tool in realizing balanced exascale interconnect designs.

More Details

TYPE Conference Poster YEAR 2018

Scopus OSTI