Title

Performance Optimization of Baryon-Block Construction in the Stochastic LapH Method

Authors

Nguyen, Phuong; Hörz, Ben

Abstract

Implementations of measurement kernels in high-level lattice QCD frameworks enable rapid prototyping but can leave hardware capabilities significantly underutilized. This is an acceptable tradeoff if the time spent in unoptimized routines is generally small. However, the computational cost of modern spectroscopy projects can be comparable to, or even exceed, the cost of generating gauge configurations and computing solutions of the Dirac equation. One key kernel in the stochastic LapH method is the computation of baryon blocks. We discuss several implementation strategies and achieve a 7.2x speedup over the current implementation on a system with Intel(R) Xeon(R) Platinum 8358 processors (formerly Ice Lake).
