utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ uname -a
Linux utku 4.15.0-29-generic #31-Ubuntu SMP Tue Jul 17 15:39:52 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ clinfo
Number of platforms 1
Platform Name Clover
Platform Vendor Mesa
Platform Version OpenCL 1.1 Mesa 18.0.5
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd
Platform Extensions function suffix MESA
Platform Name Clover
Number of devices 2
Device Name AMD CARRIZO (DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
Device Vendor AMD
Device Vendor ID 0x1002
Device Version OpenCL 1.1 Mesa 18.0.5
Driver Version 18.0.5
Device OpenCL C Version OpenCL C 1.1
Device Type GPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Max compute units 6
Max clock frequency 757MHz
Max work item dimensions 3
Max work item sizes 256x256x256
Max work group size 256
Preferred work group size multiple 64
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 2 / 2
half 8 / 8 (cl_khr_fp16)
float 4 / 4
double 2 / 2 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 1863519436 (1.736GiB)
Error Correction support No
Max memory allocation 465879859 (444.3MiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 32768 bits (4096 bytes)
Global Memory cache type None
Image support No
Local memory type Local
Local memory size 32768 (32KiB)
Max number of constant args 16
Max constant buffer size 465879859 (444.3MiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Profiling timer resolution 0ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_fp64 cl_khr_fp16
Device Name Radeon 500 Series (POLARIS12 / DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
Device Vendor AMD
Device Vendor ID 0x1002
Device Version OpenCL 1.1 Mesa 18.0.5
Driver Version 18.0.5
Device OpenCL C Version OpenCL C 1.1
Device Type GPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Max compute units 8
Max clock frequency 1046MHz
Max work item dimensions 3
Max work item sizes 256x256x256
Max work group size 256
Preferred work group size multiple 64
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 2 / 2
half 8 / 8 (cl_khr_fp16)
float 4 / 4
double 2 / 2 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 3218305024 (2.997GiB)
Error Correction support No
Max memory allocation 1927633305 (1.795GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 32768 bits (4096 bytes)
Global Memory cache type None
Image support No
Local memory type Local
Local memory size 32768 (32KiB)
Max number of constant args 16
Max constant buffer size 1927633305 (1.795GiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Profiling timer resolution 0ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_fp64 cl_khr_fp16
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) Clover
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) Success [MESA]
clCreateContext(NULL, ...) [default] Success [MESA]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) Success (1)
Platform Name Clover
Device Name AMD CARRIZO (DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) Success (2)
Platform Name Clover
Device Name AMD CARRIZO (DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
Device Name Radeon 500 Series (POLARIS12 / DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) Success (2)
Platform Name Clover
Device Name AMD CARRIZO (DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
Device Name Radeon 500 Series (POLARIS12 / DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
ICD loader properties
ICD loader Name OpenCL ICD Loader
ICD loader Vendor OCL Icd free software
ICD loader Version 2.2.11
ICD loader Profile OpenCL 2.1
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ cmake ..
-- The C compiler identification is GNU 7.3.0
-- The CXX compiler identification is Clang 6.0.0
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- We are on: Linux (Linux)
-- Found PkgConfig: /usr/bin/pkg-config (found version "0.29.1")
CMake Warning at CMakeLists.txt:36 (find_package):
By not providing "FindOpenCV.cmake" in CMAKE_MODULE_PATH this project has
asked CMake to find a package configuration file provided by "OpenCV", but
CMake did not find one.
Could not find a package configuration file provided by "OpenCV" with any
of the following names:
OpenCVConfig.cmake
opencv-config.cmake
Add the installation prefix of "OpenCV" to CMAKE_PREFIX_PATH or set
"OpenCV_DIR" to a directory containing one of the above files. If "OpenCV"
provides a separate development package or SDK, be sure it has been
installed.
CUDA_TOOLKIT_ROOT_DIR not found or specified
-- Could NOT find CUDA (missing: CUDA_TOOLKIT_ROOT_DIR CUDA_NVCC_EXECUTABLE CUDA_INCLUDE_DIRS CUDA_CUDART_LIBRARY)
-- CUDA not found. disabled.
-- Configuring done
-- Generating done
-- Build files have been written to: /home/utku/Programlar/waifu2x-converter-cpp-5.2/out
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ make
Scanning dependencies of target conv
[ 5%] Building C object CMakeFiles/conv.dir/conv.c.o
[ 10%] Linking C executable conv
[ 10%] Built target conv
Scanning dependencies of target gensrcs
[ 15%] Generating modelHandler_OpenCL.cl.h
[ 15%] Built target gensrcs
Scanning dependencies of target w2xc
[ 21%] Building CXX object CMakeFiles/w2xc.dir/src/modelHandler.cpp.o
[ 26%] Building CXX object CMakeFiles/w2xc.dir/src/modelHandler_avx.cpp.o
clang: warning: argument unused during compilation: '-s' [-Wunused-command-line-argument]
[ 31%] Building CXX object CMakeFiles/w2xc.dir/src/modelHandler_fma.cpp.o
clang: warning: argument unused during compilation: '-s' [-Wunused-command-line-argument]
[ 36%] Building CXX object CMakeFiles/w2xc.dir/src/modelHandler_sse.cpp.o
clang: warning: argument unused during compilation: '-s' [-Wunused-command-line-argument]
[ 42%] Building CXX object CMakeFiles/w2xc.dir/src/modelHandler_OpenCL.cpp.o
[ 47%] Building CXX object CMakeFiles/w2xc.dir/src/convertRoutine.cpp.o
[ 52%] Building CXX object CMakeFiles/w2xc.dir/src/threadPool.cpp.o
[ 57%] Building CXX object CMakeFiles/w2xc.dir/src/modelHandler_CUDA.cpp.o
[ 63%] Building CXX object CMakeFiles/w2xc.dir/src/w2xconv.cpp.o
/home/utku/Programlar/waifu2x-converter-cpp-5.2/src/w2xconv.cpp:411:10: warning:
enumeration value 'W2XCONV_ERROR_OPENCL' not handled in switch [-Wswitch]
switch (conv->last_error.code) {
^
/home/utku/Programlar/waifu2x-converter-cpp-5.2/src/w2xconv.cpp:441:10: warning:
enumeration value 'W2XCONV_ERROR_OPENCL' not handled in switch [-Wswitch]
switch (e->code) {
^
2 warnings generated.
[ 68%] Building CXX object CMakeFiles/w2xc.dir/src/common.cpp.o
[ 73%] Building CXX object CMakeFiles/w2xc.dir/src/cvwrap.cpp.o
[ 78%] Building CXX object CMakeFiles/w2xc.dir/src/Env.cpp.o
[ 84%] Building CXX object CMakeFiles/w2xc.dir/src/Buffer.cpp.o
[ 89%] Linking CXX shared library libw2xc.so
[ 89%] Built target w2xc
Scanning dependencies of target runbench
[ 94%] Building C object CMakeFiles/runbench.dir/w32-apps/runbench.c.o
[100%] Linking C executable runbench
[100%] Built target runbench
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ ./runbench
Writing OpenCL-Binary to: /home/utku/Programlar/waifu2x-converter-cpp-5.2/out/Radeon_500_Series_(POLARIS12___DRM_3.23.0___4.15.0-29-generic__LLVM_6.0.0).bin
Radeon 500 Series (POLARIS12 / DRM 3.23.0 / 4.15.0-29-generic, LLVM 6.0.0)
start process block (0,0) ...
Iteration #1(1->32)...(18.4992[ms], 8.61471[GFLOPS], 1.9742[GB/s])
Iteration #2(32->32)...(53.7721[ms], 94.8389[GFLOPS], 1.31721[GB/s])
Iteration #3(32->64)...(39.7677[ms], 256.474[GFLOPS], 2.67161[GB/s])
Iteration #4(64->64)...(83.3661[ms], 244.689[GFLOPS], 1.69923[GB/s])
Iteration #5(64->128)...(142.544[ms], 286.21[GFLOPS], 1.49068[GB/s])
Iteration #6(128->128)...(319.534[ms], 255.356[GFLOPS], 0.886654[GB/s])
Iteration #7(128->1)...(7.79648[ms], 81.7627[GFLOPS], 18.3114[GB/s])
total : 0.665865[sec], 238.618[GFLOPS]
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ ls
CMakeCache.txt CTestTestfile.cmake modelHandler_sse.bc
CMakeFiles libw2xc.so modelHandler_sse.ii
cmake_install.cmake Makefile modelHandler_sse.s
conv modelHandler_OpenCL.cl.h runbench
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ sudo make install
[sudo] password for utku:
[ 10%] Built target conv
[ 15%] Built target gensrcs
[ 89%] Built target w2xc
[100%] Built target runbench
Install the project...
-- Install configuration: ""
-- Installing: /usr/local/include/w2xconv.h
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ w2xc
w2xc: komut bulunamadı
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ gensrcs
gensrcs: komut bulunamadı
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ waifu2x-converter-cpp
waifu2x-converter-cpp: komut bulunamadı
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ waifu2x-converter-cpp
waifu2x-converter-cpp: komut bulunamadı
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$ ls
CMakeCache.txt CTestTestfile.cmake modelHandler_OpenCL.cl.h runbench
CMakeFiles install_manifest.txt modelHandler_sse.bc
cmake_install.cmake libw2xc.so modelHandler_sse.ii
conv Makefile modelHandler_sse.s
utku@utku:~/Programlar/waifu2x-converter-cpp-5.2/out$