diff options
author | SiCong Li <sicong.li@arm.com> | 2023-07-18 17:56:49 +0100 |
---|---|---|
committer | SiCong Li <sicong.li@arm.com> | 2023-07-28 15:29:15 +0000 |
commit | 16b37527906c68885f81a8db35f9d6040d73efec (patch) | |
tree | 9669b5ebda00b3e3b1ac55992c144b09324b5997 /src/dynamic_fusion/runtime/gpu | |
parent | 9129549110527fd53655d3e6b61e8e59bed6f97f (diff) | |
download | ComputeLibrary-16b37527906c68885f81a8db35f9d6040d73efec.tar.gz |
Port ElementwiseBinary to CKW part 2
* Add fp16 support
* Implement broadcasting to elementwise binary
* Implement kernel name and kernel config id
* Always use explicit cast in ckw unary, binary and ternary elementwise
functions. This is to address the accidental use of double literals,
with other benefits.
* Refactor TypeConverter for smaller includes
Resolves COMPMID-6260
Change-Id: I26b726746f8c0dd7b5942ad379d56f4d7642d15f
Signed-off-by: SiCong Li <sicong.li@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9999
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/dynamic_fusion/runtime/gpu')
-rw-r--r-- | src/dynamic_fusion/runtime/gpu/cl/ClKernelRuntime.cpp | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/src/dynamic_fusion/runtime/gpu/cl/ClKernelRuntime.cpp b/src/dynamic_fusion/runtime/gpu/cl/ClKernelRuntime.cpp index 92ca8557f1..15a5632d0b 100644 --- a/src/dynamic_fusion/runtime/gpu/cl/ClKernelRuntime.cpp +++ b/src/dynamic_fusion/runtime/gpu/cl/ClKernelRuntime.cpp @@ -45,6 +45,7 @@ void ClKernelRuntime::configure(const ClCompileContext &compile_ctx, const GpuKe opencl::ClKernelLibrary &klib = opencl::ClKernelLibrary::get(); _kernel = static_cast<cl::Kernel>(compile_ctx.create_kernel(code.name(), code.name(), // program name has to be provided to differentiate between different unfusable components' kernels. + // Each program contains exactly one kernel code.code(), klib.kernel_path() /* Kernel path: Used in cases of embedded kernels */, code.build_options().options(), |