figshare
Browse
artifact.zip (1.12 GB)

A Compiler Framework for Optimizing Dynamic Parallelism on GPUs

Download (1.12 GB)
Version 2 2022-01-03, 11:12
Version 1 2021-11-19, 09:13
software
posted on 2021-11-19, 09:13 authored by Juan Gómez Luna, Onur Mutlu, Wen-mei Hwu, Izzat El Hajj, Mhd Ghaith OlabiMhd Ghaith Olabi
Our artifact is a compiler implemented within Clang for optimizing applications that use dynamic parallelism. Since building the compiler requires building Clang/LLVM which can be time and resource consuming, we provide pre-built binaries in a Docker image, along with the required dependences and the benchmarks/datasets on which the compiler has been evaluated. Reviewers can use these binaries to transform the CUDA code with our optimizations, then compile and run the code on a CUDA-capable GPU. Scripts are provided to automate this process.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC