Is this a duplicate?
Type of Bug
Silent Failure
Component
cuda.core
Describe the bug
Compiling PTX that references an undefined extern function appears to succeed and produces a cubin with an empty kernel. I compile it like this:
from cuda.core.experimental import Program, ProgramOptions
options = ProgramOptions(arch="sm_75")
program = Program(code, code_type="ptx", options=options)
cubin = program.compile("cubin")
Compiling the same PTX with ptxas fails:
$ ptxas -arch sm_75 add_float16.ptx
ptxas fatal : Unresolved extern function '_ZplRK6__halfS1__1'
Am I using the Program.compile() interface incorrectly?
How to Reproduce
Run the attached file, cudapy.py. It produces a cubin with an empty kernel and shows that ptxas raises an error for the same PTX.
Expected behavior
An exception or some other error raised by Program.compile()
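Until that happens, one possible stopgap is to cross-check the PTX with ptxas before trusting the cubin. The sketch below is not part of cuda.core: `ptxas_command` and `run_or_raise` are hypothetical helper names, and it assumes ptxas is on PATH and that writing its output to the null device with `-o` is acceptable.

```python
import os
import subprocess


def ptxas_command(ptx_path, arch="sm_75", ptxas="ptxas"):
    # Hypothetical helper: same invocation as in this report, plus -o so
    # that no output file is left behind (assumes ptxas is on PATH).
    return [ptxas, "-arch", arch, ptx_path, "-o", os.devnull]


def run_or_raise(cmd):
    # Hypothetical helper: run the command and turn a non-zero exit status
    # into an exception that carries the tool's stderr.
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        message = result.stderr.strip() or f"{cmd[0]} exited with {result.returncode}"
        raise RuntimeError(message)
    return result
```

Calling `run_or_raise(ptxas_command("add_float16.ptx"))` before `Program.compile()` would then raise a `RuntimeError` carrying whatever ptxas wrote to stderr, which for the PTX in this report is the unresolved extern error.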
Operating System
Ubuntu Linux
nvidia-smi output
No response