搜索 - 腾讯云开发者社区-腾讯云

文章/答案/技术大牛

发布

1回答

改变Tensorflow PTXAS定位

support CC 8.6 2021-01-08 20:52:53.437690: W tensorflow/stream_executor/gpu/asm_compiler.cc:194] Used ptxas:314] Unimplemented: /usr/local/cuda-11.0/bin/ptxas ptxas too old.Modify $PATH to customize ptxas location.作为测试，我安装了C

浏览 0提问于2021-01-09得票数 5

2回答

CUDA --ptxas-options="-v“不显示任何输出

尝试通过在CUDA ->命令行->附加选项中添加--ptxas-options="-v"来构建CUDA程序。我仍然没有看到答案中的ptxas信息。host-compilation C++ -c -m 64 -o "x64\Release\CUDA_Dissertation.obj" -odir "x64\Release" -ext none -int real --ptxas-optionsDocuments\Visual Studio 2008\Projects\M

浏览 0修改于2017-05-23得票数 2

2回答

CUDA ptxas警告(条目的堆栈大小)

在编译CUDA代码时，我收到了以下我不理解的警告：'_Z24gpu_kernel_get

浏览 0修改于2019-12-14得票数 3

2回答

Tensorflow 2.4.1 -无法调用ptxas.exe

当我运行脚本(使用卷积算法)时，我会得到一个警告：Couldn't invoke ptxas.exe --version，它是在Call to CreateProcess failed.Modify $PATH to customize ptxas location.我怎么解决这个问题的？

浏览 3修改于2021-03-14得票数 5

1回答

ptxas抱怨(输入)我的可悲的设备功能

%rs2, %r2; st.param.b32 [func_retval0+0], %r1;}ptxas /tmp/a.ptx, line 27; error : Arguments mismatch for instruction 'sad' ptxas

浏览 1修改于2020-02-16得票数 0

1回答

解释ptxas的详细输出，第二部分

当我们用ptxas -v编译内核ptxas -v文件，或者用-ptxas-options=-v从.cu文件编译它时，我们得到了几行输出，例如：ptxas infoint, double*, double*, double*)

浏览 15修改于2019-05-16得票数 1

1回答

nvcc --ptxas-options=-v (寄存器和内存使用)错误

我想用nvcc的--ptxas-options=-v标志编译我的cuda程序，以实现寄存器和内存的使用，以便在CUDA GPU占用计算器中使用它们。

浏览 1修改于2012-03-26得票数 1

回答已采纳

1回答

nvcc fatal：'--ptxas-options=-v'：需要一个数字

尝试构建a Windows port of Faster-RCNN时出现nvcc fatal : '--ptxas-options=-v': expected a number错误。

浏览 37修改于2019-06-10得票数 3

回答已采纳

1回答

如何禁用关于不可确定堆栈大小的ptxas警告？

在编译CUDA设备代码时，您可能会得到错误(为可读性设置行间隔)： ptxas warning : Stack size for entry function '_ZN7kernels11print_stuffIiEEvv

浏览 3修改于2019-12-30得票数 0

回答已采纳

1回答

WIN+CUDA 6.5.19+compute_52 --ptxas-options=-v未显示输出

我还试图通过典型的标志-ptxas-options=-v获取有关PTX输出的其他信息。Files\NVIDIA GPU Computing Toolkit\CUDA\v6.5\include" --keep-dir x64\Release -maxrregcount=0 --ptxas-optionsFiles\NVIDIA GPU Computing Toolkit\CUDA\v6.5\include" --keep-dir x64\Release -maxrregcount=0 --ptxas-opt

浏览 3修改于2015-04-16得票数 1

2回答

cuda 5.0动态并行错误: ptxas致命。无法解析的外部函数'cudaLaunchDevice

我在带有CUDA5的Linux上使用计算能力为35的tesla k20。通过一个简单的子内核调用，它给出了一个编译错误：Unresolved extern function cudaLaunchDevicenvcc --compile -G -O0 -g -gencode arch=compute_35 , code=sm_35 -x cu -o fill.cu fill.o

浏览 0修改于2012-12-20得票数 5

回答已采纳

1回答

ptxas文件中的CUDA外部类链接和未解析的extern函数

extern function '_ZN16LibraryNameSpace5int2_aSEi' C:\Users\Documents\Project\Test\Testing_Files\ptxas

浏览 1修改于2013-06-19得票数 12

回答已采纳

1回答

ptxas“不支持双”警告时使用推力：：排序对一个结构数组

但是，当我使用nvcc进行编译时，会收到以下警告： ptxas /tmp/tmpxft_00005186_00000000-5_antsim.ptx，第1520行；警告:不支持Double。

浏览 11提问于2014-02-03得票数 0

回答已采纳

1回答

将缓存与OpenACC配合使用

$acc缓存时的输出ptxas info : Compiling entry function 'acc_lap2d_39_gpu' forstores, 0 bytes spill loadsptxas info : Compiling, 0 bytes spill loads ptxas info :

浏览 8提问于2015-08-20得票数 1

1回答

如何解释ptx函数名

warn-spills --use_fast_math -maxrregcount 128 nv_wavenet_perf.cu -o nv_wavenet_perf_dualptxasmemory in function '_Z25nv_wavenet_singleBlock_8RIffLi64ELi256ELi256ELi1EEv17nv_wavenet_paramsIT_T0_E' ptxasmemory in function '_Z25nv_wavenet_singleBlock_8RI

浏览 1修改于2018-09-13得票数 1

回答已采纳

1回答

何时与寄存器/局部变量一起使用易失性

以下是这两个版本的ptxas -v输出 __volatile__ float array[32];ptxas info : Compilingentry function '_Z2swPcS_PfiiiiS0_' for 'sm_20'88 bytes stack frame, 0 bytes spill s

浏览 2修改于2014-09-23得票数 5

回答已采纳

1回答

NVCC ptas=-v输出

A我用"nvcc -ccbin=icpc源代码/* -Iinclude -arch=sm_35 --ptxas--arch=sm_35=-v“编译了我的程序。产出如下：ptxas info : 0 bytes gmemptxasinfo : Compiling entry function '_Z21process_full_instance

浏览 4提问于2014-10-13得票数 2

回答已采纳

1回答

在CUDA中更改arch参数会使我使用更多的寄存器

我一直在我的Tesla K20m上写一个内核，当我用-Xptas=-v编译软件时，我得到了以下结果：ptxas info : Compilingentry function '_Z9searchKMPPciPhiPiS1_' for 'sm_10' ptxas info : Used 8 registers, 80 bytes smem，如果我提到参数-arch=sm_35，我的内核执行时间会急剧增加，并且使用的寄存器数量也会增加

浏览 0修改于2017-05-23得票数 0

回答已采纳

1回答

nvcc =-v输出太混乱

很难从输出中读出内核名称，例如：ptxas info : Compiling entry function '_Z14dshape_U_noBigPdS_PKdS1_S1_PKi' for 'sm_20' 0bytes stack frame, 0 bytes spill stor

浏览 0提问于2014-06-17得票数 0

1回答

sm_20显示错误的lmem统计数据？

使用--ptxas-options=-v选项编译的CUDA内核似乎在指定 GPU体系结构时显示错误的sm_20 lmem(本地内存)统计信息。int i = 0; i < num; ++i )}1>ptxas info : Compiling entry function '_Z9fooKernelPi'

浏览 7提问于2011-02-24得票数 1

第 2 页第 3 页第 4 页第 5 页第 6 页第 7 页第 8 页第 9 页第 10 页第 11 页

点击加载更多

改变Tensorflow PTXAS定位

CUDA --ptxas-options="-v“不显示任何输出

CUDA ptxas警告(条目的堆栈大小)

Tensorflow 2.4.1 -无法调用ptxas.exe

ptxas抱怨(输入)我的可悲的设备功能

解释ptxas的详细输出，第二部分

nvcc --ptxas-options=-v (寄存器和内存使用)错误

nvcc fatal：'--ptxas-options=-v'：需要一个数字

如何禁用关于不可确定堆栈大小的ptxas警告？

WIN+CUDA 6.5.19+compute_52 --ptxas-options=-v未显示输出

cuda 5.0动态并行错误: ptxas致命。无法解析的外部函数'cudaLaunchDevice

ptxas文件中的CUDA外部类链接和未解析的extern函数

ptxas“不支持双”警告时使用推力：：排序对一个结构数组

将缓存与OpenACC配合使用

如何解释ptx函数名

何时与寄存器/局部变量一起使用易失性

NVCC ptas=-v输出

在CUDA中更改arch参数会使我使用更多的寄存器

nvcc =-v输出太混乱

sm_20显示错误的lmem统计数据？

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐