Kevin Pedretti
Scalable System Software
Scalable System Software
(505) 844-1399
Sandia National Laboratories, New Mexico
P.O. Box 5800
Albuquerque, NM 87185-1319
Biography
Kevin Pedretti is a Distinguished Member of Technical Staff in the Scalable System Software department at Sandia National Laboratories. He has helped develop several large-scale parallel computers, including the Red Storm system that was productized as the Cray XT line of supercomputers and Astra, the first petascale supercomputer based on Arm processors. Prior to joining Sandia in 2001, he studied engineering at the University of Iowa where he received a B.S.E. in Electrical Engineering in 1999 and an M.S. in Computer Engineering in 2001. His current research interests include operating systems for massively parallel supercomputers, full-stack hardware & software co-design, and exploring cloud technologies in the context of high performance computing.
- 2023 – Sandia Employee Recognition Award – DetNet HPC Software-as-a-Service Team
- 2021 – Sandia Employee Recognition Award – Astra Supercomputer Team
- 2020 – Cray Users’ Group Annual Technical Conference 2020 (CUG’20) Best Paper Award, “Enabling Power Management and Control on Astra: The First Petascale Arm Supercomputer”
- 2020 – Sandia Employee Recognition Award – Led collaborative development of the Advanced Tri-Labs Software Environment (ATSE), critical to the success of Astra
- 2019 – Sandia Employee Recognition Award – Led the collaborative development and deployment of the Advanced Tri-Labs Software Environment (ATSE), critical to the success of the deployment of the first and fastest Arm-based supercomputer (Astra)
- 2019 – Defense Programs Award for Excellence – Astra Supercomputer Team
- 2018 – Sandia Employee Recognition Award – Astra Supercomputer Team
- 2018 – R&D100 Award for Power API
- 2018 – R&D100 Special Recognition Award for Corporate Social Responsibility for the Power API
- 2016 – Defense Programs Award for Excellence – Successful Deployment and Acceptance of the Trinity Supercomputer, Team Member
- 2011 – Defense Programs Award for Excellence – Sandia Red Storm Supercomputer Operating System Team
- 2010 – NNSA Environmental Stewardship Award – Red Storm Energy Savings, Team Member
- 2010 – FLC Award for Excellence in Technology Transfer – Red Storm Massively Parallel Processor Supercomputer Architecture, Team Member
- 2010 – Sandia Employee Recognition Award – Kitten Operating System Virtualization Team, Team Representative
- 2009 – R&D100 Award – Catamount N-Way Lightweight Kernel
- 2006 – R&D100 Award – Compute Process Allocator (Fact Sheet)
- 2006 – Lockheed Martin NOVA Award – Red Storm Supercomputer Design and Development Team
- 2005 – Sandia Award for Excellence – For developing a C-based firmware for the Red Storm network interface
- 2003 – Sandia Award for Excellence – For technical excellence in the design and development of the Red Storm node allocator
- Kevin Pedretti, Andrew J. Younge, Simon D. Hammond, James H. Laros III, Matthew L. Curry, Michael J. Aguilar, Robert J. Hoekstra, Ron Brightwell. Chronicles of Astra: Challenges and Lessons from the First Petascale Arm Supercomputer, International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), November 2020.
- Ryan E. Grant, Simon D. Hammond, James H. Laros III, Michael Levenhagen, Stephen L. Olivier, Kevin Pedretti, H. Lee Ward, Andrew J. Younge. Enabling Power Measurement and Control on Astra: The First Petascale Arm Supercomputer, Proceedings of the 2020 Cray Users’ Group Annual Technical Conference (CUG’20), Virtual Online Event, October, 2020. [Best Paper]
- Simon David Hammond, Clayton Hughes, Michael J. Levenhagen, Courtenay T. Vaughan, Andrew J Younge, Benjamin Schwaller, Michael J Aguilar, Kevin Pedretti, James H. Laros. Evaluating the Marvell ThunderX2 Server Processor for HPC Workloads, The 6th Special Session on High-Performance Computing Benchmarking and Optimization (HPBench’19), July 2019.
- Andrew J. Younge, Kevin Pedretti, Ryan E. Grant, Ron Brightwell. A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds, IEEE International Conference on Cloud Computing Technology and Science (CloudCom’17), December 2017.
- Ryan E. Grant, James H. Laros III, Michael Levenhagen, Stephen L. Olivier, Kevin Pedretti, H. Lee Ward, Andrew J. Younge. Evaluating Energy and Power Profiling Techniques for HPC Workloads, International Green and Sustainable Computing Conference (IGSC’17), October 2017.
- Kurt B. Ferreira, Scott Levy, Kevin Pedretti, Ryan E. Grant. Characterizing MPI Matching via Trace-based Simulation, EuroMPI/USA, September 2017.
- Andrew J. Younge, Kevin Pedretti, Ryan E. Grant, Brian L. Gaines, Ron Brightwell. Enabling Diverse Software Stacks on Supercomputers using High-Performance Virtual Clusters, IEEE International Conference on Cluster Computing (Cluster’17), September 2017.
- Jiannan Ouyang, Brian Kocoloski, John R. Lange, Kevin Pedretti. Achieving Performance Isolation with Lightweight Co-Kernels, ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC’15), Portland, Oregon, June 2015.
- Mehmet Deveci, Sivasankaran Rajamanickam, Vitus J. Leung, Kevin Pedretti, Stephen L. Olivier, David P Bunde, Umit Catalyurek, Karen D. Devine. Exploiting Geometric Partitioning in Task Mapping for Parallel Computers, IEEE International Parallel and Distributed Processing Symposium (IPDPS’14), Phoenix, Arizona, May 2014.
- James Laros, Kevin Pedretti, Suzanne Kelly, Wei Shu, Courtenay Vaughan. Energy Based Performance Tuning for Large Scale High Performance Computing Systems, 20th High Performance Computing Symposium (HPC 2012), Orlando, Florida, March 2012.
- Ming-Yu Hsieh, Jie Meng, Michael Levenhagen, Kevin Pedretti, Ayse Coskun, Arun Rodrigues. SST + gem5 = A Scalable Simulation Infrastructure for High Performance Computing (Short Paper), 5th International ICST Conference on Simulation Tools and Techniques (SIMUTools), March 2012.
- Kurt Ferreira, Jon Stearley, James H. Laros III, Ron Oldfield, Kevin Pedretti, Ron Brightwell, Rolf Riesen, Patrick G. Bridges, Dorian Arnold. Evaluating the Viability of Process Replication Reliability for Exascale Systems, International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), Seattle, Washington, November 2011.
- Ron Brightwell and Kevin Pedretti. An Intra-Node Implementation of OpenSHMEM Using Virtual Address Space Mapping, Fifth Conference on Partitioned Global Address Space Programming Models (PGAS), Galveston Island, Texas, October 2011.
- Kevin Pedretti, Ron Brightwell, Doug Doerfler, K. Scott Hemmert, James H. Laros III. The Impact of Injection Bandwidth Performance on Application Scalability, EuroMPI, Santorini, Greece, September 2011.
- Brian W. Barrett, Ron Brightwell, K. Scott Hemmert, Kevin Pedretti, Kyle Wheeler, Keith D. Underwood. Enhanced Support for OpenSHMEM Communication in Portals, IEEE Hot Interconnects, Santa Clara, California, August 2011.
- John Lange, Kevin Pedretti, Peter Dinda, Patrick Bridges, Chang Bae, Philip Soltero, Alexander Merritt. Minimal Overhead Virtualization of a Large Scale Supercomputer, ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE), Newport Beach, California, March 2011.
- Kurt B. Ferreira, Patrick G. Bridges, Ron Brightwell, Kevin Pedretti. The Impact of System Design Parameters on Application Noise Sensitivity, IEEE International Conference on Cluster Computing, Crete, Greece, September 2010.
- John Lange, Kevin Pedretti, Trammell Hudson, Peter Dinda, Zheng Cui, Lei Xia, Patrick Bridges, Andy Gocke, Steven Jaconette, Michael Levenhagen, Ron Brightwell. Palacios and Kitten: New High Performance Operating Systems For Scalable Virtualized and Native Supercomputing, IEEE International Parallel and Distributed Processing Symposium (IPDPS), Atlanta, Georgia, April 2010.
- James H. Laros III, Kevin Pedretti, Suzanne M. Kelly, John P. Vandyke, Kurt B. Ferreira, Courtenay T. Vaughan, Mark Swan. Topics on Measuring Real Power Usage on High Performance Computing Platforms, IEEE International Conference on Cluster Computing, New Orleans, Louisiana, September 2009.
- Ron Brightwell, Trammell Hudson, Kevin Pedretti. SMARTMAP: Operating System Support for Efficient Data Sharing Among Processes on a Multi-Core Processor, International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’08), Austin, Texas, November 2008.
- Ron Brightwell, Trammell Hudson, Kevin Pedretti, Rolf Riesen, Keith Underwood. Implementation and Performance of Portals 3.3 on the Cray XT3, IEEE International Conference on Cluster Computing, Boston, Massachusetts, September 2005.
- Ron Brightwell, Kevin Pedretti, Keith Underwood. Initial Performance Evaluation of the Cray SeaStar Interconnect, 13th IEEE Symposium on High-Performance Interconnects, Stanford, California, August 2005.
- Kevin Pedretti, Ron Brightwell. A NIC-Offload Implementation of Portals for Quadrics QsNet, Fifth LCI International Conference on Linux Clusters, Austin, Texas, May 2004.
- Nishank Trivedi, Kevin Pedretti, Terry A. Braun, Todd E. Scheetz, Thomas L. Casavant. Alternative Parallelization Strategies in EST Clustering, Seventh International Conference on Parallel Computing Technologies (PaCT), September 2003.
- Kevin Pedretti, Ron Brightwell, Josh Williams. Cplant Runtime System Support for Multi-Processor and Heterogeneous Compute Nodes, IEEE International Conference on Cluster Computing, Chicago, Illinois, September 2002.
- Kevin Pedretti, Todd E. Scheetz, Terry A. Braun, Chad A. Roberts, Natalie L. Robinson, Thomas L. Casavant. A Parallel Expressed Sequence Tag (EST) Clustering Program, Sixth International Conference on Parallel Computing Technologies (PaCT), September 2001.
- Samuel A. Fineberg and Kevin Pedretti. Analysis of 100Mb/s Ethernet for the Whitney Commodity Computing Testbed, Eighth Symposium on the Frontiers of Massively Parallel Computation, Annapolis, Maryland, February, 1999.
- Mehmet Deveci, Karen D. Devine, Kevin Pedretti, Mark Taylor, Sivasankaran Rajamanickam, Umit V. Catalyurek. Geometric Mapping of Tasks to Processors on Parallel Computers with Mesh or Torus Networks, IEEE Transactions on Parallel and Distributed Systems, Volume 30, Issue 9, September 2019.
- Brian Kocoloski, John Lange, Kevin Pedretti, Ron Brightwell. “Hobbes: A Multi-Kernel Infrastructure for Application Composition”, Book Chapter, Operating Systems for Supercomputers and High Performance Computing, 2019, ISBN-13: 978-9811366239.
- Ron Brightwell, Kurt Ferreira, Arthur B. Maccabe, Kevin Pedretti, Rolf Riesen. “Sandia Line of LWKs (Lightweight Kernels)”, Book Chapter, Operating Systems for Supercomputers and High Performance Computing, 2019, ISBN-13: 978-9811366239.
- Kurt B. Ferreira, Scott Levy, Kevin Pedretti, Ryan E. Grant. Characterizing MPI Matching via Trace-based Simulation, Parallel Computing, Volume 77, 2018.
- Ryan E. Grant, Michael Levenhagen, Stephen L. Olivier, David DeBonis, Kevin Pedretti, James H. Laros III. Standardizing Power Monitoring and Control at Exascale, IEEE Computer, Vol. 49, No. 10, pp. 38-46, October 2016.
- Richard F. Barrett, Dougles W. Doerfler, Sudip S. Dosanjh, Simon D. Hammond, Karl S. Hemmert, Michael A. Heroux, Paul T. Lin, Kevin Pedretti, Arun F. Rodrigues, Timothy G. Trucano, Justin P. Lutjiens. Exascale Design Space Exploration and Co-design, Future Generation Computer Systems, January 2014.
- James H. Laros III, Kevin Pedretti, Suzanne M. Kelly, Wei Shu, Kurt Ferreira, John Van Dyke, Courtenay T. Vaughan. Energy-Efficient High Performance Computing – Measurement and Tuning, Springer Publications, SpringerBriefs in Computer Science ISBN 978-1-4471-4491-5, 2013.
- Mahesh Rajan, Courtenay T. Vaughan, Douglas W. Doerfler, Richard F. Barrett, Paul T. Lin, Kevin Pedretti, K Scott Hemmert. Application-Driven Analysis of Two Generations of Capability Computing Platforms: The Transition to Multicore Processors, Concurrency and Computation: Practice and Experience, Volume 24, Issue 18, December 2012.
- Patrick G. Bridges, Dorian Arnold, Kevin Pedretti, Madhav Suresh, Feng Lu, Peter Dinda, Russ Joseph, Jack Lange. VM-based Emulation of Future Generation HPC Systems, International Journal of High Performance Computing Applications, Volume 26, Number 2, May 2012.
- Kurt B. Ferreira, Patrick G. Bridges, Ron Brightwell, Kevin Pedretti. The Impact of System Design Parameters on Application Noise Sensitivity, Journal of Cluster Computing, 2011.
- Ron Brightwell, Trammell Hudson, Kevin Pedretti, Keith D. Underwood. SeaStar Interconnect: Balanced Bandwidth for Scalable Performance, IEEE Micro, Volume 26, Number 3, May/June 2006.
- Todd E. Scheetz, Nishank Trivedi, Kevin Pedretti, Terry A. Braun, Thomas L. Casavant. Gene Transcript Clustering: A Comparison of Parallel Approaches, Future Generation Computer Systems, Volume 21, Number 5, May 2005.
- Nishank Trivedi, Jared Bischof, Steve Davis, Kevin Pedretti, Todd E. Scheetz, Terry A. Braun, Chad A. Roberts, Natalie L. Robinson, Val C. Sheffield, M. Bento Soares, Thomas L. Casavant. Parallel Creation of Non-redundant Gene Indices from Partial mRNA Transcripts, Future Generation Computer Systems, Volume 18, Number 6, May 2002.
- Ryan C. Braun, Kevin Pedretti, Thomas L. Casavant. Todd E. Scheetz, Clay L. Birkett, Chad A. Roberts. Parallelization of Local BLAST Service on Workstation Clusters, Future Generation Computer Systems, Volume 17, Number 6, April 2001.
- Ville Ahlgren, Stefan Andersson, Jim Brandt, Nicholas P. Cardo, Sudheer Chunduri, Jeremy Enos, Parks Fields, Ann Gentile, Richard Gerber, Joe Greenseid, Annette Greiner, Bilel Hadri, Yun (Helen) He, Dennis Hoppe, Urpo Kaila, Kaki Kelly, Mark Klein, Alex Kristiansen, Steve Leak, Mike Mason, Kevin Pedretti, Jean-Guillaume Piccinali, Jason Repik, Jim Rogers, Susanna Salminen, Mike Showerman, Cary Whitney, Jim Williams. Cray System Monitoring: Successes, Requirements, and Priorities, Proceedings of the 2018 Cray Users’ Group Annual Technical Conference (CUG’18), Stockholm, Sweden, May 2018.
- Adam DeConinck, Hai Ah Nam, David Morton, Amanda Bonnie, Cory Lueninghoener, James M. Brandt, Ann C. Gentile, Kevin Pedretti, Anthony M. Agelastos, Courtenay T. Vaughan, Simon D. Hammond, Benjamin A. Allan, Mike Davis, Jason Repik. Runtime collection and analysis of system metrics for production monitoring of Trinity Phase II, Proceedings of the 2017 Cray Users’ Group Annual Technical Conference (CUG’17), Redmond, Washington, May 2017.
- James H. Laros III, Kevin Pedretti, Ryan E. Grant, Stephen L. Olivier, Michael Levenhagen, David DeBonis, Scott Pakin, Steven Martin, Matthew Kappel, Paul Falde. ACES and Cray Collaborate on Advanced Power Management for Trinity, Proceedings of the 2016 Cray Users’ Group Annual Technical Conference (CUG’16), London, UK, May 2016.
- Jim Brandt, David DeBonis, Ann Gentile, Jim Lujan, Cindy Martin, Dave Martinez, Stephen Olivier, Kevin Pedretti, Narate Taerat, Ron Velarde. Enabling Advanced Operational Analysis Through Multi-Subsystem Data Integration on Trinity, Proceedings of the 2015 Cray Users’ Group Annual Technical Conference (CUG’15), Chicago, Illinois, April 2015.
- Kevin Pedretti, Courtenay T. Vaughan, Richard F. Barrett, Karen Devine, K. Scott Hemmert. Using the Gemini Performance Counters, Proceedings of the 2013 Cray Users’ Group Annual Technical Conference (CUG’13), Napa Valley, California, May 2013.
- Kevin Pedretti, Courtenay Vaughan, Karl Scott Hemmert, Brian Barrett. Application Sensitivity to Link and Injection Bandwidth on a Cray XT4 System, Proceedings of the 2008 Cray Users’ Group Annual Technical Conference (CUG’08), Helsinki, Finland, May 2008.
- Kurt Ferreira, Kevin Pedretti, Michael Levenhagen, Ron Brightwell. Exploring Memory Management Strategies in Catamount, Proceedings of the 2008 Cray Users’ Group Annual Technical Conference (CUG’08), Helsinki, Finland, May 2008.
- Ron Brightwell, Trammell Hudson, Kevin Pedretti, Keith Underwood. An Accelerated Implementation of Portals on the Cray SeaStar, Proceedings of the 2006 Cray Users’ Group Annual Technical Conference (CUG’06), Lugano, Switzerland, May 2006.
- Kevin Pedretti and Trammell Hudson. Developing Custom Firmware for the Red Storm SeaStar Network Interface, Proceedings of the 2005 Cray Users’ Group Annual Technical Conference (CUG’05), Albuquerque, New Mexico, May 2005.
- Ron Brightwell, Trammell Hudson, Kevin Pedretti, Rolf Riesen, Keith Underwood. Portals 3.3 on the Sandia/Cray Red Storm System, Proceedings of the 2005 Cray Users’ Group Annual Technical Conference (CUG’05), Albuquerque, New Mexico, May 2005.
- James A. Ang, Robert A. Ballance, Lee Ann Fisk, Jeanette R. Johnston, Kevin Pedretti. Red Storm Capability Computing Queuing Policy, Proceedings of the 2005 Cray Users’ Group Annual Technical Conference (CUG’05), Albuquerque, New Mexico, May 2005.
- Kevin Pedretti. Accurate Parallel Clustering of EST (Gene) Sequences, Master’s Thesis, Department of Electrical and Computer Engineering, The University of Iowa, Iowa City, Iowa, May 2001.
- Ron Brightwell, Kevin Pedretti, Trammell Hudson. Direct Access Inter-process Shared Memory, Patent #8,566,536, Granted October 2013.
- Kevin Pedretti. Distributed Processor Allocation for Launching Applications in a Massively Connected Processors Complex, Patent #7,454,595, Granted November 2008.
- Ron A. Oldfield, Steven J. Owen, Timothy Shead, Shawn Martin, Christopher Siefert, Mark Frederick Hoemmen, John Kaushagen, Ali Pinar, Matthew Gregor Peterson, Craig Michael Vineyard, Sam Green, Peter Feghali, Vitus J. Leung, Kevin Pedretti, Andrew J. Younge. “2.3.4.04 SNL Data and Visualization: ML Projects at Sandia”, ECP PI Meeting, Houston, TX, January 2019.
- Alexander M. Merritt, Kevin Pedretti, and Karsten Schwan. Techniques for Managing Data Distribution in NUMA Systems, International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC’10), New Orleans, Louisiana, November 2010.
- Kevin Pedretti. Characterization of Intra-node Topology and Locality, International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC’07), Reno, Nevada, November 2007.
- Kevin Pedretti. Chronicles of Astra: Challenges and Lessons from the First Petascale Arm Supercomputer, Energy Efficient High-Performance Computing Working Group Workshop (EE HPC WG), ARM Operational Experiences Panel, December 2020.
- Kevin Pedretti, James H. Laros III, Simon David Hammond. Experiences Scaling a Production Arm Supercomputer to Petaflops and Beyond, Arm Research Summit, September 2019.
- Kevin Pedretti, James H. Laros III, Simon Hammond. Vanguard Astra: Maturing the ARM Software Ecosystem for U.S. DOE/ASC Supercomputing, Workshop on Communication Architectures for HPC, Big Data, Deep Learning and Clouds at Extreme Scale (ExaComm’18), June 2018.
- Kevin Pedretti, James H. Laros III, Simon Hammond. Maturing the ARM Software Ecosystem for U.S. DOE/ASC Supercomputing, SIAM Conference on Parallel Processing for Scientific Computing (SIAMPP’18), March 2018.
- Kevin Pedretti. The Hobbes Node Virtualization Layer: Lessons Learned and Path Forward, SOS 21 Workshop, March 2017.
- Kevin Pedretti. Why Virtualization in Large-Scale HPC?, Panel on Sweet Spots and Limits for Virtualization, SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE’16), April 2016.
- Kevin Pedretti and Torsten Hoefler. A Comparison of Task Mapping Strategies on Two Generations of Cray Systems, SIAM Conference on Parallel Processing for Scientific Computing, February 2014.
- Kevin Pedretti. Kitten: A Lightweight Operating System for Ultrascale Supercomputers, New Mexico Consortium Ultrascale Systems Research Center, August 2011.
- Kevin Pedretti. The Kitten Lightweight Kernel, FastOS Phase II Workshop held in conjunction with ACM International Conference on Supercomputing (ICS09), June 2009.
- Kevin Pedretti. Lightweight Operating Systems for Scalable Native and Virtualized Supercomputing, ORNL Future Technologies Colloquium Series, April 2009.
- Kevin Pedretti. Quad-core Catamount and R&D in Multi-core Lightweight Kernels, Salishan Conference on High-Speed Computing, April 2008.
- James H. Laros III, Kevin Pedretti, Simon D. Hammond, Andrew J. Younge, Matthew L. Curry, Paul T. Lin, Courtenay T. Vaughan. FY19 L2 Milestone #6810 Report: Astra Acceptance and Software Environment Development, Sandia Technical Report, SAND2019-10738, September 2019.
- James H. Laros III, Kevin Pedretti, Simon Hammond, Michael Aguilar, Ron Brightwell, Matthew L. Curry, Ryan E. Grant, Robert Hoekstra, Ruth Klundt, Stephen Monk, Jeffry Ogden, Stephen L. Olivier, Randall Scott, Lee Ward, Andrew J. Younge. FY18 L2 Milestone #6360 Report: Initial Capability of an Arm-based Advanced Architecture Prototype System and Software Environment, Sandia Technical Report, SAND2018-10083, September 2018.
- Mehmet Deveci, Karen D. Devine, Kevin Pedretti, Mark A. Taylor, Sivasankaran Rajamanickam, Umit V.Catalyurek. Geometric Partitioning and Ordering Strategies for Task Mapping on Parallel Computers, Sandia Technical Report, SAND2018-4335R, April 2018.
- Brian W. Barrett, Ron Brightwell, Ryan E. Grant, Scott Hemmert, Kevin Pedretti, Kyle Wheeler, Keith Underwood, Rolf Riesen, Arthur B. Maccabe, Trammell Hudson. The Portals 4.1 Network Programming Interface, Sandia Technical Report, SAND2017-3825, April 2017.
- James H. Laros III, Ryan E. Grant, Michael Levenhagen, Stephen Olivier, Kevin Pedretti, Lee Ward, Andrew J. Younge. High-Performance Computing – Power Application Programming Interface Specification Version 2.0 (HPC Power API v2.0), Sandia Technical Report, SAND2017-2684, March 2017.
- Kevin Pedretti, Suzanne M. Kelly, Michael J. Levenhagen. Summary of Multi-Core Hardware and Programming Model Investigations, Sandia Technical Report, SAND2008-3205, May 2008.
- Rolf Riesen, Ron Brightwell, Kevin Pedretti, Arthur B. Maccabe, Trammell Hudson. The Portals 3.3 Message Passing Interface – Revision 2.1, Sandia Technical Report, SAND20006-0420, April 2006.
- Kevin Pedretti and Samuel A. Fineberg. Analysis of 2D Torus and Hub Topologies of 100 Mb/s Ethernet for the Whitney Commodity Computing Testbed, NAS Technical Report, NAS-97-017, September 1997.
- Gregory A. Koenig, Matthias Maiterth, Siddhartha Jana, Natalie Bates, Kevin Pedretti, Milos Puzovic, Andrea Borghesi, Andrea Bartolini, David Montoya. “Energy and Job Scheduling and Resource Management: Global Survey — An In-Depth Analysis“, Workshop on Data-center Automation, Analytics, and Control (DAAC’18) at SC’18, November 2018.
- Ville Ahlgren, Stefan Andersson, Jim Brandt, Nicholas P. Cardo, Sudheer Chunduri, Jeremy Enos, Parks Fields, Ann Gentile, Richard Gerber, Joe Greenseid, Annette Greiner, Bilel Hadri, Yun (Helen) He, Dennis Hoppe, Urpo Kaila, Kaki Kelly, Mark Klein, Alex Kristiansen, Steve Leak, Mike Mason, Kevin Pedretti, Jean-Guillaume Piccinali, Jason Repik, Jim Rogers, Susanna Salminen, Mike Showerman, Cary Whitney, Jim Williams. Large-Scale HPC Monitoring: Experiences and Recommendations, Workshop on Monitoring and Analysis for High-Performance Computing Systems Plus Applications (HPCMASPA’18), September 2018.
- Scott Levy, Kevin Pedretti, Kurt B. Ferreira. Open Science on Trinity’s Knights Landing Partition: An Analysis of User Job Data, Workshop on Scheduling and Resource Management for Parallel and Distributed Systems (SRMPDS’18), August 2018.
- Kevin Pedretti, Ryan E. Grant, James H. Laros III, Michael Levenhagen, Stephen L. Olivier, Lee Ward, Andrew J. Younge. A Comparison of Power Management Mechanisms: P-states vs. Node-Level Power Cap Control, Workshop on High-Performance Power-Aware Computing (HPPAC’18), May 2018.
- Matthias Maiterth, Gegory Koenig, Kevin Pedretti, Siddhartha Jana, Natalie Bates, Andrea Borghesi, David Richard Montoya, Andrea Bartollini, Milos Puzovic. Energy & Power Aware Job Scheduling & Resource Management: Global Survey – Initial Analysis, Workshop on High-Performance Power-Aware Computing (HPPAC’18), May 2018.
- Noah Evans, Kevin Pedretti, Brian Kocoloski, John Lange, Michael Lange, Patrick G. Bridges. A Cross-Enclave Composition Mechanism for Exascale System Software, Workshop on Runtime and Operating Systems for Supercomputers (ROSS’16), June 2017.
- Ryan E. Grant, Michael Levenhagen, Stephen L. Olivier, David DeBonis, Kevin Pedretti, James H. Laros III. Overcoming Challenges in Scalable Power Monitoring with the Power API, Workshop on High-Performance, Power-Aware Computing (HPPAC’16), May 2017.
- Kevin Pedretti, Stephen L. Olivier, Kurt B. Ferreira, Galen Shipman, Wei Shu. Early Experiences with Node-Level Power Capping on the Cray XC40 Platform, Workshop on Energy Efficient Supercomputing (E2SC’15), held in conjunction with SC’15, Austin, Texas, November 2015. [slides]
- Ryan E. Grant, Kevin Pedretti, Ann Gentile. Overtime: a tool for analyzing performance variation due to network interference, Workshop on Exascale MPI (ExaMPI’15), Austin, Texas, November 2015.
- Galen Shipman, Patrick McCormick, Kevin Pedretti, Stephen L. Olivier, Kurt B. Ferreira, Ramanan Sankaran, Sean Treichler, Alex Aiken, Michael Bauer. Analysis of Application Sensitivity to System Performance Variability in a Dynamic Task-Based Runtime, Workshop on Runtime Systems for Extreme-Scale Programming Models and Architectures (RESPA’15), Austin, Texas, November 2015.
- Richard F. Barrett, Dylan T. Stark, Courtenay T. Vaughan, Ryan E. Grant, Stephen L. Olivier, Kevin Pedretti. Toward an evolutionary task-parallel integrated MPI + X programming model, Workshop on Programming Models and Applications for Multicores and Manycores (PMAM’15), San Francisco, California, February 2015.
- Brian Kocoloski, John Lange, Hasan Abbasi, David E. Bernholdt, Terry R. Jones, Jai Dayal, Noah Evans, Michael Lang, Jay Lofstead, Kevin Pedretti, Patrick G. Bridges. System-Level Support for Composition of Applications, Workshop on Runtime and Operating Systems for Supercomputers (ROSS’15), held in conjunction with ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC’15), Portland, Oregon, June 2015.
- Jim Brandt, Karen Devine, Ann Gentile, Kevin Pedretti. Demonstrating Improved Application Performance Using Dynamic Monitoring and Task Mapping, Workshop on Monitoring and Analysis for High-Performance Computing Systems Plus Applications (HPCMASPA’14), held in conjunction with IEEE Cluster 2014, Madrid, Spain, September 2014.
- Dylan T. Stark, Richard F. Barrett, Ryan E. Grant, Stephen E. Olivier, Kevin Pedretti, Courtenay T. Vaughan. Early Experiences Co-Scheduling Work and Communication Tasks for Hybrid MPI + X Applications, Workshop on Exascale MPI (ExaMPI’14), New Orleans, Louisiana, November 2014.
- Kurt B. Ferreira, Kevin Pedretti, Ron Brightwell, Patrick G. Bridges, David Fiala, Frank Mueller. Evaluating Operating System Vulnerability to Memory Errors, Workshop on Runtime and Operating Systems for Supercomputers (ROSS’12), held in conjunction with the 26th ACM/SIGARCH International Conference on Supercomputing (ICS’12), Venice, Italy, June 2012.
- Jon Stearley, Kurt Ferreira, David Robinson, Dorian Arnold, Patrick Bridges, Jim Laros, Kevin Pedretti, Rolf Riesen. Does Partial Replication Pay Off?, Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS’12), Boston, Massachusetts, June 2012.
- Patrick G. Bridges, Dorian Arnold, Kevin Pedretti. VM-based Slack Emulation of Large-scale Systems, Workshop on Runtime and Operating Systems for Supercomputers (ROSS’11), held in conjunction with the 25th ACM/SIGARCH International Conference on Supercomputing (ICS’11), Tucson, Arizona, May 2011.
- Courtenay Vaughan, Mahesh Rajan, Richard Barrett, Doug Doerfler, and Kevin Pedretti. Investigating the Impact of the Cielo Cray XE6 Architecture on Scientific Application Codes, Workshop on Large-Scale Parallel Processing (LSPP’11), held in conjunction with the 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS’11), Anchorage, Alaska, May 2011.
- Kevin Pedretti and Patrick G. Bridges. Opportunities for Leveraging OS Virtualization in High-End Supercomputing, Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds (MASVDC’10), held in conjunction with The 43rd IEEE/ACM International Symposium on Microarchitecture (MICRO-43), Atlanta, Georgia, December 2010.
- Ron Brightwell and Kevin Pedretti. Optimizing Multi-Core MPI Collectives with SMARTMAP, The Third International Workshop on Advanced Distributed and Parallel Network Applications (ADPNA’09), held in conjunction with The 37th International Conference on Parallel Processing (ICPP’09), Vienna, Austria, September 2009.
- Ron Brightwell, Kevin Pedretti, Kurt Ferreira. Instrumentation and Analysis of MPI Queue Times on the SeaStar High-Performance Network, Workshop on Advanced Networking and Communications, 17th International Conference on Computer Communications and Networks, St. Thomas, US Virgin Islands, August 2008.
Current Projects
- Vanguard-Astra Arm-based Supercomputer
- Advanced Tri-lab Software Environment (ATSE)
- Kitten Lightweight Kernel
- HPC Power API
- Portals Network Programming Interface