Skip to content

Releases: JuliaGPU/AMDGPU.jl

v0.2.8

06 Jul 16:00
3073525
Compare
Choose a tag to compare

AMDGPU v0.2.8

Diff since v0.2.7

Merged pull requests:

v0.2.7

20 May 19:17
360c030
Compare
Choose a tag to compare

AMDGPU v0.2.7

Diff since v0.2.6

Closed issues:

  • "Spills" from adjacent views of ROCVector (#130)

Merged pull requests:

v0.2.6

06 Apr 19:08
f836632
Compare
Choose a tag to compare

AMDGPU v0.2.6

Diff since v0.2.5

Closed issues:

  • ROCArrays matrix multiplication not working (#103)
  • Data race in kernel packet writing? (#121)

Merged pull requests:

  • Add mark/wait synchronization system (#116) (@jpsamaroo)
  • CompatHelper: bump compat for "GPUCompiler" to "0.11" (#122) (@github-actions[bot])
  • Replace arrays with Refs in ccall. (#123) (@chriselrod)
  • Fix packet launch (#125) (@jpsamaroo)

v0.2.6 for Zenodo

09 Apr 23:06
f836632
Compare
Choose a tag to compare
v0.2.6 for Zenodo Pre-release
Pre-release
Merge pull request #116 from JuliaGPU/jps/mark-wait

Add mark/wait synchronization system

v0.2.5

29 Mar 19:03
308941e
Compare
Choose a tag to compare

AMDGPU v0.2.5

Diff since v0.2.4

Merged pull requests:

v0.2.4

26 Mar 00:03
9f387fa
Compare
Choose a tag to compare

AMDGPU v0.2.4

Diff since v0.2.3

Closed issues:

  • Implement execution contexts (#16)
  • Add/test broadcasting support to ROCArray (#12)
  • Add queue/device/system sync functionality (#24)
  • Support OpenCL.jl as device runtime (#23)
  • Distribute ROCR/ROCT via artifacts (#6)
  • Allow setting Private and Group segment sizes manually (#56)
  • FATAL ERROR: Symbol "ccalllib_libhsa-runtime64445"not found on AMDGPU (#73)
  • test failures and crashes on 580 (#92)
  • Tests allocate memory indefinitely (#106)
  • Check for invalid workgroup sizes (#110)
  • Add example for gridsize usage and workgroup sizing (#113)

Merged pull requests:

v0.2.3

05 Feb 19:00
3ffacab
Compare
Choose a tag to compare

AMDGPU v0.2.3

Diff since v0.2.2

Closed issues:

  • Add support for trap handlers (#8)
  • Unreachable reached in SIISelLowering.cpp due to unhandled AS (#76)
  • Ensure that CI tests all available external libraries (#85)
  • Only load libhsa-runtime64 major version 1 (#93)

Merged pull requests:

v0.2.2

20 Jan 18:12
471f6a7
Compare
Choose a tag to compare

AMDGPU v0.2.2

Diff since v0.2.1

Closed issues:

  • Implement RNGs (#14)
  • Allow 0-argument kernels (#10)
  • Add tests to match CUDAnative (#22)
  • Build fails on OSX (#19)
  • Add options to at-roc to allow initializing globals (#36)
  • Some errors during test. Are they cause for concern? (#70)

Merged pull requests:

v0.2.1

21 Oct 00:00
f681252
Compare
Choose a tag to compare

AMDGPU v0.2.1

Diff since v0.2.0

Closed issues:

  • Throw error in wait() call on queue error (#7)
  • Test code doesn't work (#43)
  • Broken build script? (#46)
  • HSASignal and HSAKernelInstance references can be accidentally GC'd (#63)

Merged pull requests:

v0.2.0

27 Aug 18:00
5bf7abf
Compare
Choose a tag to compare

AMDGPU v0.2.0

Diff since v0.1.2

Merged pull requests: