Sandia National Laboratories
Navigation
Current Projects
Publications
Journal Articles
Conference Papers
Workshop Papers
Posters
Technical Reports
Other Papers
Talks
Patents
Awards and Honors

Kevin Pedretti

Principal Member of Technical Staff
Scalable System Software
Kevin Pedretti

Contact Info:
spacer
Kevin Pedretti
P.O. Box 5800, MS 1319
Albuquerque, NM 87185-1319
spacer
ktpedre@sandia.gov
Phone: (505) 844-1399
Fax: (505) 845-7442
Office: CSRI/112


I work in the Scalable System Software department at Sandia National Laboratories. This has given me the opportunity to help develop several large-scale parallel computers, including the Red Storm system that was later productized by Cray as the Cray XT line of supercomputers. Prior to joining Sandia in 2001, I studied engineering at the University of Iowa where I received a B.S.E. in Electrical Engineering in 1999 and an M.S. in Computer Engineering in 2001. My current research interests are in the areas of operating systems for massively parallel supercomputers, techniques for improving application resiliency to hardware faults, high-performance and scalable networking, multi-core processors, and hardware virtualization in the context of high performance computing.


Current Projects

  • Hobbes Extreme Scale Operating System
  • XPRESS: eXascale PRogramming Environment and System Software
  • Kitten Lightweight Kernel
  • Portals
  • HPC Power API
  • Power Reduction Techniques for Next Generation Systems

  • Publications

    Journal Articles

    Conference Papers

    Workshop Papers

    • Kevin T. Pedretti, Stephen L. Olivier, Kurt B. Ferreira, Galen Shipman, Wei Shu. Early Experiences with Node-Level Power Capping on the Cray XC40 Platform, Workshop on Energy Efficient Supercomputing (E2SC 2015), held in conjunction with SC'15, Austin, Texas, November 2015. [slides]
    • Brian Kocoloski, John Lange, Hasan Abbasi, David E. Bernholdt, Terry R. Jones, Jai Dayal, Noah Evans, Michael Lang, Jay Lofstead, Kevin Pedretti, Patrick G. Bridges. System-Level Support for Composition of Applications, Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2015), held in conjunction with ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), Portland, Oregon, June 2015.
    • Jim Brandt, Karen Devine, Ann Gentile, Kevin Pedretti. Demonstrating Improved Application Performance Using Dynamic Monitoring and Task Mapping, Workshop on Monitoring and Analysis for High Performance Computing Systems Plus Applications (HPCMASPA), held in conjunction with IEEE Cluster 2014, Madrid, Spain, September 2014.
    • Kurt B. Ferreira, Kevin T. Pedretti, Ron Brightwell, Patrick G. Bridges, David Fiala, Frank Mueller. Evaluating Operating System Vulnerability to Memory Errors, Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2012), held in conjunction with the 26th ACM/SIGARCH International Conference on Supercomputing (ICS), Venice, Italy, June 2012.
    • Jon Stearley, Kurt Ferreira, David Robinson, Dorian Arnold, Patrick Bridges, Jim Laros, Kevin Pedretti,Rolf Riesen. Does Partial Replication Pay Off?, Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), Boston, Massachusetts, June 2012.
    • Patrick G. Bridges, Dorian Arnold, Kevin T. Pedretti. VM-based Slack Emulation of Large-scale Systems, Workshop on Runtime and Operating Systems for Supercomputers (ROSS), held in conjunction with the 25th ACM/SIGARCH International Conference on Supercomputing (ICS), Tucson, Arizona, May 2011.
    • Courtenay Vaughan, Mahesh Rajan, Richard Barrett, Doug Doerfler, and Kevin T. Pedretti. Investigating the Impact of the Cielo Cray XE6 Architecture on Scientific Application Codes, Workshop on Large-Scale Parallel Processing (LSPP), held in conjunction with the 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Anchorage, Alaska, May 2011.
    • Kevin T. Pedretti and Patrick G. Bridges. Opportunities for Leveraging OS Virtualization in High-End Supercomputing, Workshop on Micro Architectural Support for Virtualization, Data Center Computing, and Clouds (MASVDC'10), held in conjunction with The 43rd IEEE/ACM International Symposium on Microarchitecture (MICRO-43), Atlanta, Georgia, December 2010.
    • Ron Brightwell and Kevin T. Pedretti. Optimizing Multi-Core MPI Collectives with SMARTMAP, The Third International Workshop on Advanced Distributed and Parallel Network Applications (ADPNA 2009), held in conjunction with The 37th International Conference on Parallel Processing (ICPP 2009), Vienna, Austria, September 2009.
    • Ron Brightwell, Kevin T. Pedretti, Kurt Ferreira. Instrumentation and Analysis of MPI Queue Times on the SeaStar High-Performance Network, Workshop on Advanced Networking and Communications, 17th International Conference on Computer Communications and Networks, St. Thomas, US Virgin Islands, August 2008.

    Posters

    Technical Reports

    Other Papers

    Talks

    Patents


    Awards and Honors

    • 2011 - Defense Programs Award for Excellence - Sandia Red Storm Supercomputer Operating System Team, Team Member
    • 2010 - NNSA Environmental Stewardship Award - Red Storm Energy Savings, Team Member
    • 2010 - FLC Award for Excellence in Technology Transfer - Red Storm Massively Parallel Processor Supercomputer Architecture, Team Member
    • 2010 - Sandia Employee Recognition Award - Kitten Operating System Virtualization Team, Team Representative
    • 2009 - R&D100 Award - Catamount N-Way Lightweight Kernel
    • 2006 - R&D100 Award - Compute Process Allocator (Fact Sheet)
    • 2006 - Lockheed Martin NOVA Award - Red Storm Supercomputer Design and Development Team
    • 2005 - Sandia Award for Excellence - For developing a C-based firmware for the Red Storm network interface
    • 2003 - Sandia Award for Excellence - For technical excellence in the design and development of the Red Storm node allocator