JACC.shared: Leveraging HPC Metaprogramming and Performance Portability for Computations That Use Shared Memory GPUs. Conference Paper, April 2025
Using a Large Language Model as a Building Block to Generate Usable Validation and Verification Suite for OpenMP. Conference Paper, March 2025
MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime... Conference Paper, November 2023
CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types. Conference Paper, November 2023
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores. Conference Paper, November 2023
Julia as a unifying end-to-end workflow language on the Frontier exascale system. Conference Paper, November 2023
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs. Conference Paper, November 2023