This repository has been archived by the owner on Jun 27, 2024. It is now read-only.
Issues: iree-org/iree-nvgpu
Issues list
[Triton] Compute capability for hal.executable.source must match IREE cuda SM version
bug
Something isn't working
enhancement
New feature or request
#144
opened Jun 15, 2023 by
ezhulenev
[Triton] Construct stream.executable from Triton functions
enhancement
New feature or request
#143
opened Jun 15, 2023 by
ezhulenev
[Triton] Use nested passes to lower Triton to HAL executable
enhancement
New feature or request
#142
opened Jun 15, 2023 by
ezhulenev
[cuDNN] Use semaphores to synchronize before/after cuDNN operations
enhancement
New feature or request
#139
opened Jun 14, 2023 by
ezhulenev
[cuDNN] cuDNN module should create a dedicated CUDA stream
enhancement
New feature or request
#138
opened Jun 14, 2023 by
ezhulenev
[Triton] Add minimal matmul example (no auto tuning)
enhancement
New feature or request
#129
opened Jun 9, 2023 by
ezhulenev
[Triton] Triton executables should support shared memory
enhancement
New feature or request
#128
opened Jun 9, 2023 by
ezhulenev
[Triton] Do not use temp files for passing PTX to HAL executable
enhancement
New feature or request
#126
opened Jun 9, 2023 by
ezhulenev
[Triton] Block dimension should be inferred from the Triton function
enhancement
New feature or request
#125
opened Jun 9, 2023 by
ezhulenev
[Triton] num-warps and num-stages should be a property of triton.executable.export
enhancement
New feature or request
#124
opened Jun 9, 2023 by
ezhulenev
Set up continuous integration for OpenXLA Nvgpu project
enhancement
New feature or request
#58
opened May 4, 2023 by
ezhulenev
[RFC] Integration with cuDNN via IREE compiler/runtime plugins
enhancement
New feature or request
#12
opened Apr 18, 2023 by
ezhulenev