-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Initial idea of using POCL as a cuda translation layer isnt viable because of POCL not working with image formats on cuda.
Currently reaching out to Yasroslav Pogrebnyak, the developer of the VF_overlay_cuda ffmpeg filter.
Reaching out to nyanmisaka. Seem's to have a lot of experience working on FFmpeg filters and frankly knows more than I do.
In addition to this, collaborating with Ed Borasky to confirm function on jetson platforms.
vf_tonemap_cuda.txt
(renamed from .c to .txt to make github happy )
Missing: tonemap.cu with proper kernel side code. this is easy once I know how to properly call the cuda kernel side from the ffmpeg side.
Standard stride blocks should work, define total amount of blocks using height. most resolution will be 16:9, so by using height parameter, we have a higher chance of hitting divisible by 3 cleanly, so we can take advantage of cuda language data structure.
Other option is taking the R G and B value of a given pixel which is guaranteed to be *3. this might also help for other tone mapping algorithms that use relative offset from local peak luma as input for tonemapping output