On the following web page:
https://software.intel.com/en-us/articles/intel-optimized-tensorflow-wheel-now-available
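it says: "The conda package of Intel-optimized TensorFlow comes with the new 2018 Intel Python distribution on Linux."
I do not believe this is true.
On our cluster, we have:
Intel Parallel Studio XE 2018
In a separate installation, we have:
Intel Distribution for Python (IDP) 2018 (2.7, 3.6).
Here is a screen capture with evidence that the optimized TensorFlow does not come with IDP 2018: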
[whowell@acf-knl004 ~]$ module load intel-python/2.7/2018.0.018
[whowell@acf-knl004 ~]$ which python
/global/opt/intel/python/2018.0.018/intelpython2/bin/python
[whowell@acf-knl004 ~]$ python -c 'import tensorflow as tf;print tf.__file__;sess = tf.InteractiveSession()'
/global/opt/intel/python/2018.0.018/intelpython2/lib/python2.7/site-packages/tensorflow/__init__.pyc
2017-10-09 18:28:23.284004: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations.
2017-10-09 18:28:23.284243: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations.
2017-10-09 18:28:23.284311: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
2017-10-09 18:28:23.284456: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations.
2017-10-09 18:28:23.284518: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX512F instructions, but these are available on your machine and could speed up CPU computations.
2017-10-09 18:28:23.284579: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use FMA instructions, but these are available on your machine and could speed up CPU computations.
One of the optimizations is building with vector instructions. Because we get warnings that the TensorFlow in use was not built with vector instructions, we cannot be using the optimized version.
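For reference, the instruction sets named by these warnings can be pulled out of a captured log with a few lines of Python. This is only a minimal sketch: it assumes the warning wording matches the `cpu_feature_guard` lines in the session above, and the function name `missing_instruction_sets` is hypothetical, not part of TensorFlow.

```python
import re

# Matches the wording of the cpu_feature_guard warnings in the session above.
_WARNING_RE = re.compile(r"wasn't compiled to use (\S+) instructions")


def missing_instruction_sets(log_text):
    """Return the instruction sets named in the warnings, in order, without duplicates."""
    # Collapse all whitespace so warnings that were wrapped across lines still match.
    normalized = " ".join(log_text.split())
    found = []
    for name in _WARNING_RE.findall(normalized):
        if name not in found:
            found.append(name)
    return found


if __name__ == "__main__":
    sample = (
        "W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library\n"
        "wasn't compiled to use AVX512F instructions, but these are available on your\n"
        "machine and could speed up CPU computations.\n"
    )
    print(missing_instruction_sets(sample))
```

Note that this only summarizes what the warnings claim; it says nothing about whether the installed build actually lacks optimized code paths.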
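We have bioinformaticians with large distributed TensorFlow runs to do, and the new KNL is designed for exactly this purpose. It is critical that we get started with TensorFlow optimized for KNL. Our staff have several years of leading-edge research experience with the Xeon Phi architecture and NVIDIA GPUs, and there may well be opportunities for future collaboration to extend the TensorFlow product.
What do you suggest we do next?
We would greatly appreciate any assistance with this issue.
Dr. Mitch Horton, Joint Institute for Computational Sciences, University of Tennessee, Knoxville, Tennessee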
Posted on 2017-10-16 19:00:17
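It is possible that these warnings are not legitimate. The reason is that there are two ways to get binary code to use architecture-specific instructions.
The warnings only check for way 1; they cannot tell whether Intel's ops use dynamic dispatch logic to automatically select the best instructions for the architecture.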
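To illustrate what dynamic dispatch means in general, here is a minimal Python sketch: the program reads the CPU feature flags at run time and picks a code path, independent of how the calling code was compiled. This is an illustration only and not how Intel's libraries are implemented. It assumes an x86 Linux machine where `/proc/cpuinfo` has a `flags` line, and the function names and kernel labels are hypothetical.

```python
def parse_cpu_flags(cpuinfo_text):
    """Extract the set of CPU feature flags from /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


# Ordered from most to least capable; the labels are illustrative only.
_KERNELS = [
    ("avx512f", "avx512 kernel"),
    ("avx2", "avx2 kernel"),
    ("sse4_2", "sse4.2 kernel"),
]


def select_kernel(flags):
    """Pick the most capable kernel whose required flag is present on this CPU."""
    for flag, kernel in _KERNELS:
        if flag in flags:
            return kernel
    return "generic kernel"


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo") as f:
            flags = parse_cpu_flags(f.read())
    except OSError:
        flags = set()  # not Linux, or the file is unreadable
    print(select_kernel(flags))
```

A compile-time check such as the one behind the warnings cannot see a selection like this, because it happens when the program runs rather than when it is built.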
https://stackoverflow.com/questions/46776718