v0.2.0
What's Changed
- Added JACCASYNC API for threads.jl backend. Note that JACC.Async work… by @pedrovalerolara in #161
- Bump Atomix to 1.0.1 by @williamfgc in #165
- Remove rocm by @williamfgc in #166
- Bump AMDGPU CI version by @williamfgc in #168
- Update CI labels by @williamfgc in #169
- Update README info by @williamfgc in #164
- Added @init_backend for user convenience. by @PhilipFackler in #170
- Update
parallel_reduceby @PhilipFackler in #173 - Remove
Arraytype and addarrayfunction by @PhilipFackler in #177 - Fix bug in oneAPI 1D reduce by @PhilipFackler in #182
- WIP: AMDGPU compute occupancy by @PhilipFackler in #178
- WIP: Better blocks/threads calculations for CUDA backend by @PhilipFackler in #136
- Add parallel_for API with keyword struct by @PhilipFackler in #188
- Use computed occupancy for amdgpu parallel_reduce by @PhilipFackler in #190
- Release v0.2.0 by @PhilipFackler in #191
Full Changelog: v0.1.1...v0.2.0