How to insert CUDA kernel code/lib slice into a triton kernel? #5279

juinshell · 2024-11-28T08:39:52Z

Hello, guys! I'm trying to fuse some CUDA device API calls into a Triton GPU kernel. I found this way. However, libdevice.bc is LLVM IR (NVVM IR) code, and it is hard for me to generate NVVM IR from a .cu file (refer).

So I'd like to ask whether there is a way to call CUDA kernel code snippets from Triton, such as getting the thread block index, performing a sync op, or calling a third-party API like NVSHMEM.
