Optimizing MPI Collectives on Shared Memory Multi-cores

HighRPM: Combining Integrated Measurement and Power Modeling for High-Resolution Power Monitoring

ChatGPT, Make a Secure Malloc for me

Optimizing Multi-grid Computation and Parallelization on Multi-cores

Memory-aware Optimization for Sequences of Sparse Matrix-Vector Multiplications

HiGIL: Hierarchical Graph Inference Learning for Fact Checking

STRONGHOLD: Fast and Affordable Billion-scale Deep Learning Model Training

Towards Scalable Supercomputing Resource Management

AIACC-Training: Optimizing Distributed Deep Learning Training through Multi-streamed and Concurrent Gradient Communications

Automating Reinforcement Learning Architecture Design for Code Optimization