论文标题

应用车顶线模型进行深度学习绩效优化

Applying the Roofline model for Deep Learning performance optimizations

论文作者

Czaja, Jacek, Gallus, Michal, Wozna, Joanna, Grygielski, Adam, Tao, Luo

论文摘要

在本文中,我们提出了一种使用Intel Xeon自动创建屋顶线模型(NUMA)的方法。最后,我们介绍了Intel Onednn库中实现的高效深度学习原始素的评估。

In this paper We present a methodology for creating Roofline models automatically for Non-Unified Memory Access (NUMA) using Intel Xeon as an example. Finally, we present an evaluation of highly efficient deep learning primitives as implemented in the Intel oneDNN Library.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源