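I have read the custom op conversion approach described in the custom_op example. I noticed that the operators in that example are all defined in Python, whereas many SOTA algorithms include device-side computation written in CUDA in their models.
Since CUDA kernels currently cannot be compiled on RKNN-series development boards, can I write my own OpenCL kernels that match the inputs and outputs of the CUDA kernels, and use them as custom operators that run on the NPU?
If so, I would like to know where a more detailed explanation can be found.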
More specifically, where can I find an example of registering an NPU custom operator with the rknn_register_custom_ops API? I only found a CPU example in 04_Rockchip_RKNPU_API_Reference_RKNNRT_V2.3.0_CN.
This is probably not supported.