Preliminary benchmarks showing the performance effect of SYCL Graph in GROMACS. Presented performance is the median of five runs.
Benchmarks on Intel A770 use the PME variant as more representative of typical atomistic simulations and the one where the effect of SYCL Graphs is expected to be more pronounced due to larger number of kernels per step. Due to current limitations of FFT library / SYCL Graph interoperability, the benchmarks on NVIDIA V100 use RF variant of the benchmark systems.