
rocm-build's People

Contributors

baryluk, xuhuisheng

rocm-build's Issues

[Question] Is it possible to have precompiled Kernels for Navi 14?

Environment

Hardware description
GPU RX5500XT
CPU Ryzen 5 2600
Software version
OS Debian GNU/Linux 11.7
ROCm 5.2
Python 3.9

What is the expected behavior

This is not a bug report, just a question.

As you correctly state: "Please test override hsa version first. Then ROCm will use gfx1030 fatbin to run on gfx1012, which needn't re-compile". This is my case: I can fake my board as gfx1030 and things work. But Stable Diffusion WebUI takes around 3 minutes to compile the kernels, after complaining:

MIOpen(HIP): Warning [SQLiteBase] Missing system database file: gfx1030_11.kdb Performance may degrade. Please follow instructions to install: https://github.com/ROCmSoftwarePlatform/MIOpen#installing-miopen-kernels-package

Of course I can't install gfx1030_11.kdb because it doesn't exist. I know almost nothing about ROCm internals, so perhaps I'm asking something ridiculous. Do you have such a .kdb? Is it mandatory to have the actual GPU to compile the kernels? Can I compile them using the actual GPU, but without compiling the whole of ROCm (which could take hours on my system, assuming I can free enough disk space)?

Thanks in advance
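
For readers landing here: the override quoted above is usually applied through the HSA_OVERRIDE_GFX_VERSION environment variable, with 10.3.0 standing for gfx1030. Below is a minimal Python sketch of launching a program with that variable set. The helper name and the launch command in the comment are made up for illustration, and whether the override works on a given card is not guaranteed.

```python
import os


def rocm_override_env(gfx_version="10.3.0", base_env=None):
    """Return a copy of the environment with the HSA override set.

    gfx_version "10.3.0" asks the ROCm runtime to treat the GPU as gfx1030.
    The caller's environment is copied, never modified in place.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["HSA_OVERRIDE_GFX_VERSION"] = gfx_version
    return env


# Hypothetical usage (launch.py is a placeholder for your own entry point):
#   import subprocess
#   subprocess.run(["python3", "launch.py"], env=rocm_override_env(), check=True)
```

Passing the environment explicitly to the child process keeps the override scoped to that one program instead of the whole shell session.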

rocBLAS build problem on Navi14

My rocBLAS build is not able to find the "Threads" library on my Ubuntu 20.04 system (kernel version 5.4.0-66-generic). Any suggestions? The transcript is below.

sgodey@sgodey-NPL:~/rocm-build$ bash navi14/22.rocblas.sh
|====|
|SLOW|
|====|
Reading package lists... Done
Building dependency tree
Reading state information... Done
gfortran is already the newest version (4:9.3.0-1ubuntu2).
python3-venv is already the newest version (3.8.2-0ubuntu2).
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
~/rocm-build/build/rocblas ~/rocm-build/build/rocblas
HEAD is now at cc18d25f update tensile-tag for SWDEV-257443 (#1176)
patching file library/src/blas_ex/rocblas_gemm_ext2.hpp
HEAD is now at ab44bf46 fix GSU bug: PostGSU kernel refer to Nan data of C matrix even when beta is zero (#1217)
patching file Tensile/Common.py
patching file Tensile/TensileCreateLibrary.py
CMake Error at /usr/share/cmake-3.16/Modules/FindPackageHandleStandardArgs.cmake:146 (message):
Could NOT find Threads (missing: Threads_FOUND)
Call Stack (most recent call first):
/usr/share/cmake-3.16/Modules/FindPackageHandleStandardArgs.cmake:393 (_FPHSA_FAILURE_MESSAGE)
/usr/share/cmake-3.16/Modules/FindThreads.cmake:220 (FIND_PACKAGE_HANDLE_STANDARD_ARGS)
CMakeLists.txt:34 (find_package)

-- Configuring incomplete, errors occurred!
See also "/home/sgodey/rocm-build/build/rocblas/CMakeFiles/CMakeOutput.log".
See also "/home/sgodey/rocm-build/build/rocblas/CMakeFiles/CMakeError.log".
sgodey@sgodey-NPL:~/rocm-build$

I have the below pthread libraries installed:

libopenblas-pthread-dev/focal-updates,now 0.3.8+ds-1ubuntu0.20.04.1 amd64 [installed]
Optimized BLAS (linear algebra) library (dev, pthread)

libopenblas0-pthread/focal-updates,now 0.3.8+ds-1ubuntu0.20.04.1 amd64 [installed,automatic]
Optimized BLAS (linear algebra) library (shared lib, pthread)

libopenblas64-0-pthread/focal-updates,now 0.3.8+ds-1ubuntu0.20.04.1 amd64 [installed]
Optimized BLAS (linear algebra) library (shared lib, 64bit, pthread)

libopenblas64-pthread-dev/focal-updates,now 0.3.8+ds-1ubuntu0.20.04.1 amd64 [installed]
Optimized BLAS (linear algebra) library (dev, 64bit, pthread)
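
A note on the packages listed above: they are OpenBLAS builds that use pthreads, which is a different thing from what CMake's FindThreads looks for. As far as I understand it, FindThreads checks whether the configured compiler can build a small pthread program, so a missing or broken compiler can also produce "Could NOT find Threads". The Python sketch below does a roughly similar check by hand. It is not what CMake runs internally, and the function name and the default compiler name "cc" are assumptions for illustration.

```python
import os
import shutil
import subprocess
import tempfile

# Tiny C program that creates and joins one thread.
PTHREAD_PROBE = r"""
#include <pthread.h>
static void *noop(void *arg) { return arg; }
int main(void) {
    pthread_t t;
    if (pthread_create(&t, NULL, noop, NULL) != 0) return 1;
    return pthread_join(t, NULL);
}
"""


def can_build_pthread_program(compiler="cc"):
    """Return True if `compiler` can compile and link a pthread program."""
    exe = shutil.which(compiler)
    if exe is None:
        # The compiler itself is missing, so the probe cannot even start.
        return False
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "probe.c")
        out = os.path.join(tmp, "probe")
        with open(src, "w") as f:
            f.write(PTHREAD_PROBE)
        result = subprocess.run(
            [exe, "-pthread", src, "-o", out],
            capture_output=True,
            text=True,
        )
        return result.returncode == 0


# Hypothetical usage: try the system compiler first, then the one the build uses.
#   print(can_build_pthread_program("cc"))
#   print(can_build_pthread_program("/opt/rocm/bin/hipcc"))
```

If the probe fails, the CMakeError.log mentioned in the transcript should contain the compiler output that explains why.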

can't find hipcc

When I run bash navi10/22.rocblas.sh, something goes wrong.

CMake Error at /usr/share/cmake-3.16/Modules/CMakeDetermineCXXCompiler.cmake:48 (message):
  Could not find compiler set in environment variable CXX:

  /opt/rocm/bin/hipcc.

Call Stack (most recent call first):
  CMakeLists.txt:31 (project)


CMake Error: CMAKE_CXX_COMPILER not set, after EnableLanguage
-- Configuring incomplete, errors occurred!

Do I need to install something to get hipcc?
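
The error says CMake was told, through the CXX environment variable, to use /opt/rocm/bin/hipcc and found nothing there. The numbered scripts appear meant to run in order, so an earlier step (presumably the HIP one, 18.hip.sh in the build log elsewhere on this page) would need to have been built and installed first; that is an assumption on my part. Here is a small Python sketch that reports the state of the compiler path before a long build is started. The function name is hypothetical.

```python
import os


def check_cxx_compiler(env=None, default="/opt/rocm/bin/hipcc"):
    """Return (path, status) for the compiler named by CXX.

    Falls back to `default` when CXX is unset. Status is one of
    "ok", "missing", or "not executable".
    """
    env = os.environ if env is None else env
    path = env.get("CXX", default)
    if not os.path.isfile(path):
        return path, "missing"
    if not os.access(path, os.X_OK):
        return path, "not executable"
    return path, "ok"


# Hypothetical usage:
#   path, status = check_cxx_compiler()
#   print(f"{path}: {status}")
```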

Getting the 5500 XT to work with a custom build of PyTorch and ROCm

Environment

Hardware Description
GPU AMD Radeon RX 5500 XT
CPU AMD Ryzen 7 5800X
Software Version
OS Arch (Host); Ubuntu 22.04 (Docker)
ROCm 5.2.3
Python 3.10.12

What is the Expected Behavior

All build scripts will pass and install their respective packages; unit tests won't raise runtime errors. It should behave exactly like the precompiled wheel packages for PyTorch 1.13.1 stable and 2.0.0 nightly, which are considered ancient given how rapidly the technology evolves.

The latest stable ROCm version that works properly with the RX 5000 series cards is 5.2.x. Since I'm aware that later versions (5.3+) break compatibility with these cards, I'll try my luck by compiling PyTorch 2.2.0, the latest stable PyTorch version as of writing, against ROCm 5.2.3 using your build script.

I read that someone created a wheel with PyTorch 2.1.0 and can confirm that it works on my system without crashing.

What Actually Happens

Building rocALUTION failed with "illegal instruction detected", similar to the linked comment on issue #35. I guess it can't be used on this card without hacky workarounds. Fortunately, it's not a requirement for PyTorch. All other components built without errors. Here's a simple build log I kept along the way, including the code I had to patch and the additional build dependencies I had to install.

ROCm 5.2.3 gfx1012 Ubuntu 22.04 Docker build log
00.rocm-core.sh: PASS
11.rocm-llvm.sh: PASS
12.roct-thunk-interface.sh: PASS
13.rocm-cmake.sh: PASS
14.rocm-device-libs.sh: PASS
15.rocr-runtime.sh: PASS
* need xxd (apt install xxd)
16.rocminfo.sh: PASS
* need kmod (apt install kmod)
17.rocm-compilersupport.sh: PASS
18.hip.sh: PASS
* need dot (apt install graphviz)
21.rocfft.sh: PASS
* may need GPU exposure in container (/dev/dri; /dev/kfd)
navi14/22.rocblas.sh: PASS
* edit CMakeLists.txt to include Python path
23.rocprim.sh: PASS
24.rocrand.sh: PASS
* download hipRAND sources manually and comment out N/A patch
navi14/25.rocsparse.sh: PASS
26.hipsparse.sh: PASS
27.rocm_smi_lib.sh: PASS
28.rccl.sh: PASS
29.hipfft.sh: PASS
31.rocm-opencl-runtime.sh: PASS
32.clang-ocl.sh: PASS
33.rocprofiler.sh: PASS
* comment out N/A patch
34.roctracer.sh: PASS
35.half.sh: PASS
36.miopen.sh: PASS
* patch Boost 1.74.0 to resolve linker error, see https://github.com/boostorg/spirit/commit/f3998fb2bbbcd29aacfc1b27d92af570d154fb9b
* build it with -fPIC
* add -DCMAKE_CXX_FLAGS="-I${ROCM_INSTALL_DIR}/include/rocblas" and set -DCMAKE_PREFIX_PATH to path of patched Boost to cmake args
37.rocm-utils.sh: PASS
41.rocdbgapi.sh: PASS
42.rocgdb.sh: PASS
* needs GMP (apt install libgmp-dev)
43.rocm-dev.sh: PASS
51.rocsolver.sh: PASS
* add -DCMAKE_CXX_FLAGS="-I${ROCM_INSTALL_DIR}/include/rocblas" to cmake args
52.rocthrust.sh: PASS
53.hipblas.sh: PASS
* add -DCMAKE_CXX_FLAGS="-I${ROCM_INSTALL_DIR}/include/rocblas" to cmake args
54.rocalution.sh: FAIL
* inserting include_directories(${ROCM_PATH}/include/rocblas) inside body of if(SUPPORT_HIP) in CMakeLists.txt errors with illegal instruction detected
55.hipcub.sh: PASS
56.hipsolver.sh: PASS
* add -DCMAKE_CXX_FLAGS="-I${ROCM_INSTALL_DIR}/include/rocblas" to cmake args
57.rocm-libs.sh: PASS
61.amdmigraphx.sh: PASS
* may need cJSON (apt install libcjson-dev)
* apply these changes in dev-requirements.txt for glibc >= 2.34: [email protected] => [email protected]; and in requirements.txt: google => protocolbuffers, [email protected] => [email protected]
* open Embed.cmake to place "#include <string>" in file(WRITE ...) within generate_embed_source function
* add -DCMAKE_CXX_FLAGS="-I${ROCM_INSTALL_DIR}/include/rocblas" to cmake args
62.rock-dkms.sh: PASS
* set permission mask to 755 on both postinst and prerm
71.rocm_bandwidth_test.sh: PASS
72.hipfort.sh: PASS
73.rocmvalidationsuite.sh: PASS
* add -DCMAKE_CXX_FLAGS="-I${ROCM_INSTALL_DIR}/include/rocblas" to cmake args
* modify ROCBLAS_INC_DIR "${ROCM_PATH}/include" to "${ROCM_PATH}/include/rocblas" in CMakeLists.txt
* bump GIT_TAG from release-1.10.0 to release-1.11.0 to bypass uninitialized variable errors in CMakeGtestDownload.cmake
74.rocr_debug_agent.sh: PASS
75.hipify.sh: PASS

PyTorch 2.2.0 with ROCm 5.2.3: PASS
* backport MIOPEN_CONVOLUTION_ATTRIB_DETERMINISTIC = 0 in miopenConvolutionAttrib_t under MIOpen header (ROCm)
* comment out hipblasCgelsBatched, hipblasDgelsBatched, hipblasSgelsBatched, and hipblasZgelsBatched in pytorch/aten/src/ATen/hip/HIPBlas.cpp
* add "#include <hipsolver/internal/hipsolver-types.h>" to pytorch/aten/src/ATen/native/hip/linalg/BatchLinearAlgebraLib.h
Torchaudio: PASS
Torchvision: PASS
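
If you keep a log in the same shape as the one above (a "name: PASS" or "name: FAIL" line followed by "* note" lines), a few lines of Python can summarize it. This is only a sketch for that ad hoc format; the function name and the dictionary layout are my own choices, not part of rocm-build.

```python
import re

# Matches lines such as "54.rocalution.sh: FAIL".
STEP_RE = re.compile(r"^(?P<name>\S.*?):\s*(?P<status>PASS|FAIL)\s*$")


def parse_build_log(text):
    """Parse the build log into a list of {name, status, notes} dicts."""
    steps = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("*"):
            # A note belongs to the most recent step, if there is one.
            if steps:
                steps[-1]["notes"].append(line.lstrip("* ").strip())
            continue
        match = STEP_RE.match(line)
        if match:
            steps.append(
                {"name": match["name"], "status": match["status"], "notes": []}
            )
    return steps


def failed_steps(steps):
    """Return the names of all steps marked FAIL."""
    return [s["name"] for s in steps if s["status"] == "FAIL"]
```

Run on the log above, failed_steps should list only 54.rocalution.sh, and the notes field shows which steps needed manual changes.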

After all the builds finished, I ran your check scripts to make sure everything was installed properly. With the exception of rocALUTION, which apparently isn't supported on this family of cards, everything looked fine. However, the installation seems to be only partially functional: the run-miopen.sh and run-miopen-img.sh check scripts produced compilation errors. All the other checks ran without problems, and thankfully their results are virtually identical to the prebuilds. Below is the output of run-miopen.sh:

MIOPEN_VERSION_MAJOR:2
MIOPEN_VERSION_MINOR:17
MIOPEN_VERSION_PATCH:0
ws_size = 576
find conv algo
MIOpen(HIP): Error [Do] 'amd_comgr_do_action(kind, handle, in.GetHandle(), out.GetHandle())' AMD_COMGR_ACTION_COMPILE_SOURCE_TO_BC: ERROR (1)
MIOpen(HIP): Error [BuildHip] comgr status = ERROR (1)
MIOpen(HIP): Warning [BuildHip] In file included from /tmp/comgr-5f42cb/input/naive_conv.cpp:1:
In file included from /tmp/hip_pch.39257/hip_pch.h:1:
In file included from /root/rocm-test/rocm-5.2/HIP/include/hip/hip_runtime.h:54:
In file included from /usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/thread:44:
In file included from /usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/this_thread_sleep.h:38:
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:666:36: error: no matching conversion for functional-style cast from 'const duration<long, std::ratio<1, 1>>' to '__cd' (aka 'duration<long, ratio<num, den>>')
 return __cd(__cd(__lhs).count() - __cd(__rhs).count());
                                   ^~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:1036:47: note: in instantiation of function template specialization 'std::chrono::operator-<long, std::ratio<1, 1000000000>, long, std::ratio<1, 1>>' requested here
 return __time_point(__lhs.time_since_epoch() -__rhs);
                                              ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:3402:47: note: in instantiation of function template specialization 'std::chrono::operator-<std::filesystem::__file_clock, std::chrono::duration<long, std::ratio<1, 1000000000>>, long, std::ratio<1, 1>>' requested here
   return __file_time{__t.time_since_epoch()} - _S_epoch_diff;
                                              ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:3369:16: note: in instantiation of function template specialization 'std::filesystem::__file_clock::_S_from_sys<std::chrono::duration<long, std::ratio<1, 1000000000>>>' requested here
      { return _S_from_sys(chrono::system_clock::now()); }
               ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:514:2: note: candidate constructor not viable: no known conversion from 'const duration<[...], ratio<[...], 1>>' to 'const duration<[...], ratio<[...], 1000000000>>' for 1st argument
 duration(const duration&) = default;
 ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:521:23: note: candidate template ignored: requirement '__and_<std::is_convertible<const std::chrono::duration<long, std::ratio<1, 1>> &, long>, std::__or_<std::chrono::treat_as_floating_point<long>, std::__not_<std::chrono::treat_as_floating_point<std::chrono::duration<long, std::ratio<1, 1>>>>>>::value' was not satisfied [with _Rep2 = std::chrono::duration<long>]
   constexpr explicit duration(const _Rep2& __rep)
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:529:14: note: candidate template ignored: substitution failure [with _Rep2 = long, _Period2 = std::ratio<1, 1>]: non-type template argument is not a constant expression
   constexpr duration(const duration<_Rep2, _Period2>& __d)
             ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/chrono:512:12: note: candidate constructor not viable: requires 0 arguments, but 1 was provided
 constexpr duration() = default;
           ^
In file included from /tmp/comgr-5f42cb/input/naive_conv.cpp:1:
In file included from /tmp/hip_pch.39257/hip_pch.h:1:
In file included from /root/rocm-test/rocm-5.2/HIP/include/hip/hip_runtime.h:62:
In file included from /root/rocm-test/rocm-5.2/hipamd/include/hip/amd_detail/amd_hip_runtime.h:434:
In file included from /opt/rocm/llvm/lib/clang/14.0.0/include/cuda_wrappers/complex:35:
In file included from /usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/stdexcept:39:
In file included from /usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/string:55:
In file included from /usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6608:
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6620:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long, int, char, int>' requested here
  { return __gnu_cxx::__stoa<long, int>(&std::strtol, "stoi", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<int, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<int, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6625:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long, long, char, int>' requested here
  { return __gnu_cxx::__stoa(&std::strtol, "stol", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6630:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<unsigned long, unsigned long, char, int>' requested here
  { return __gnu_cxx::__stoa(&std::strtoul, "stoul", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6635:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long long, long long, char, int>' requested here
  { return __gnu_cxx::__stoa(&std::strtoll, "stoll", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<long long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<long long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6640:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<unsigned long long, unsigned long long, char, int>' requested here
  { return __gnu_cxx::__stoa(&std::strtoull, "stoull", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6646:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<float, float, char>' requested here
  { return __gnu_cxx::__stoa(&std::strtof, "stof", __str.c_str(), __idx); }
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<float, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<float, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6650:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<double, double, char>' requested here
  { return __gnu_cxx::__stoa(&std::strtod, "stod", __str.c_str(), __idx); }
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<double, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<double, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6654:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long double, long double, char>' requested here
  { return __gnu_cxx::__stoa(&std::strtold, "stold", __str.c_str(), __idx); }
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<long double, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<long double, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6751:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long, int, wchar_t, int>' requested here
  { return __gnu_cxx::__stoa<long, int>(&std::wcstol, "stoi", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<int, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<int, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6756:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long, long, wchar_t, int>' requested here
  { return __gnu_cxx::__stoa(&std::wcstol, "stol", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6761:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<unsigned long, unsigned long, wchar_t, int>' requested here
  { return __gnu_cxx::__stoa(&std::wcstoul, "stoul", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6766:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long long, long long, wchar_t, int>' requested here
  { return __gnu_cxx::__stoa(&std::wcstoll, "stoll", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<long long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<long long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6771:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<unsigned long long, unsigned long long, wchar_t, int>' requested here
  { return __gnu_cxx::__stoa(&std::wcstoull, "stoull", __str.c_str(),
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long long, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<unsigned long long, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6777:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<float, float, wchar_t>' requested here
  { return __gnu_cxx::__stoa(&std::wcstof, "stof", __str.c_str(), __idx); }
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<float, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<float, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6781:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<double, double, wchar_t>' requested here
  { return __gnu_cxx::__stoa(&std::wcstod, "stod", __str.c_str(), __idx); }
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<double, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<double, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:85:7: error: no matching function for call to '_S_chk'
   || _Range_chk::_S_chk(__tmp, std::is_same<_Ret, int>{}))
      ^~~~~~~~~~~~~~~~~~
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:6785:23: note: in instantiation of function template specialization '__gnu_cxx::__stoa<long double, long double, wchar_t>' requested here
  { return __gnu_cxx::__stoa(&std::wcstold, "stold", __str.c_str(), __idx); }
                      ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:70:4: note: candidate function not viable: no known conversion from 'std::is_same<long double, int>' to 'std::false_type' (aka 'integral_constant<bool, false>') for 2nd argument
   _S_chk(_TRet, std::false_type) { return false; }
   ^
/usr/lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/ext/string_conversions.h:73:4: note: candidate function not viable: no known conversion from 'std::is_same<long double, int>' to 'std::true_type' (aka 'integral_constant<bool, true>') for 2nd argument
   _S_chk(_TRet __val, std::true_type)
   ^
17 errors generated when compiling for gfx1012.

terminate called after throwing an instance of 'miopen::Exception'
  what():  /root/rocm-test/rocm-5.2/MIOpen/src/hipoc/hipoc_program.cpp:300: Code object build failed. Source: naive_conv.cpp
run-miopen.sh: line 9: 146789 Aborted                 (core dumped) MIOPEN_ENABLE_LOGGING=0 MIOPEN_LOG_LEVEL=0 ./build/test_miopen

check.sh:

[HIP]        50221153
[rocBLAS]    2.44.0.4a92c6f1
[rocFFT]     1.0.17.d3c798c
[rocPRIM]    201009
[rocRAND]    201009
[rocSPARSE]  200200
[rccl]       21212
[MIOpen]     2 17 0
[rocSOVLER]  3.18.0.d883d5f
[rocThrust]  101500
src/hello_rocalution.cpp:2:10: fatal error: 'rocalution/version.hpp' file not found
#include "rocalution/version.hpp"
         ^~~~~~~~~~~~~~~~~~~~~~~~
1 error generated when compiling for gfx1012.
check.sh: line 38: ./build/hello_rocalution: No such file or directory
[hipCUB]     201012
[hipBLAS]    0 51 0
[hipSPARSE]  200100
[hipRAND]    201009
[hipFFT]     10017
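
The [HIP] value in the check.sh output above looks like a packed integer. Below is a minimal sketch for reading it, assuming the packing major * 10000000 + minor * 100000 + patch. That packing is an assumption, not something the output itself states, and `decode_hip_version` is a hypothetical helper name.

```python
from typing import Tuple


def decode_hip_version(value: int) -> Tuple[int, int, int]:
    """Split a packed HIP version integer into (major, minor, patch).

    Assumption: value == major * 10_000_000 + minor * 100_000 + patch.
    """
    major, rest = divmod(value, 10_000_000)
    minor, patch = divmod(rest, 100_000)
    return major, minor, patch


if __name__ == "__main__":
    # The value reported by check.sh in this report.
    print(decode_hip_version(50221153))
```

Under that assumption the reported value corresponds to a HIP 5.2 build, which matches the ROCm version being compiled here.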

I tried several GCC versions (11.4, 10.5, and 9.4); all of them produced the same error, and I have not been able to fix it. The systemd journal also shows several messages saying "Could not parse number of program headers from core file: invalid `Elf' handle". This was reported upstream and appears to be somewhat specific to ROCm 5.2.3; it has been fixed in 5.3, along with the illegal instruction messages in rocALUTION.

ROCm/MIOpen#1764
rocm-arch/rocm-arch#857

Nevertheless, I went on to compile PyTorch 2.2.0 with hipMAGMA support, along with torchaudio 2.2.0 and torchvision 0.17. PyTorch doesn't build out of the box because it uses the symbols listed below, which are missing from this ROCm version. All of them appear in your target version, 5.4.

  • MIOPEN_CONVOLUTION_ATTRIB_DETERMINISTIC (doesn't exist in ROCm 5.2.3)
  • hipblasCgelsBatched (exists in rocm-5.2/hipBLAS/clients/include/hipblas_no_fortran.hpp)
  • hipblasDgelsBatched (exists in rocm-5.2/hipBLAS/clients/include/hipblas_no_fortran.hpp)
  • hipblasSgelsBatched (exists in rocm-5.2/hipBLAS/clients/include/hipblas_no_fortran.hpp)
  • hipblasZgelsBatched (exists in rocm-5.2/hipBLAS/clients/include/hipblas_no_fortran.hpp)

After making some modifications to the PyTorch code (see the build log), I was able to make it work. If you have any patches that backport the four hipBLAS functions and the MIOpen constant, please share them and let me know how to apply them. Thank you very much!
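
To see which of the symbols above are visible in a given tree, the headers can be scanned directly. This is a minimal sketch, not part of rocm-build: `find_symbols` is a hypothetical helper, and the default root of /opt/rocm is taken from the env.sh in this report.

```python
import os
import re
from typing import Dict, Iterable, List


def find_symbols(root: str, names: Iterable[str]) -> Dict[str, List[str]]:
    """Return, for each name, the header files under root that mention it."""
    names = list(names)
    patterns = {n: re.compile(r"\b" + re.escape(n) + r"\b") for n in names}
    hits: Dict[str, List[str]] = {n: [] for n in names}
    for dirpath, _dirs, files in os.walk(root):
        for filename in sorted(files):
            if not filename.endswith((".h", ".hpp")):
                continue
            path = os.path.join(dirpath, filename)
            with open(path, encoding="utf-8", errors="replace") as handle:
                text = handle.read()
            for name, pattern in patterns.items():
                if pattern.search(text):
                    hits[name].append(path)
    return hits


if __name__ == "__main__":
    wanted = [
        "MIOPEN_CONVOLUTION_ATTRIB_DETERMINISTIC",
        "hipblasCgelsBatched",
        "hipblasDgelsBatched",
        "hipblasSgelsBatched",
        "hipblasZgelsBatched",
    ]
    root = os.environ.get("ROCM_INSTALL_DIR", "/opt/rocm")
    for name, paths in find_symbols(root, wanted).items():
        print(name, "->", paths if paths else "not found")
```

A name that only shows up under a clients directory, as noted in the list above, is likely not part of the installed public headers.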

How to Reproduce

Create an Ubuntu 22.04 Docker container with these flags, and perform a repo init and repo sync on ROCm 5.2.x.

docker run -it --device /dev/dri --device /dev/kfd --volume /mnt/ubuntu22.04:/root/rocm-test ubuntu:22.04

You can change the volume mount point to whatever you have on your end. Then, implement those adjustments as indicated in the build log.

For reference, here's my env.sh file:

#!/bin/bash

export ROCM_INSTALL_DIR=/opt/rocm
export ROCM_MAJOR_VERSION=5
export ROCM_MINOR_VERSION=2
export ROCM_PATCH_VERSION=3
export ROCM_LIBPATCH_VERSION=50203
export CPACK_DEBIAN_PACKAGE_RELEASE=109
export ROCM_PKGTYPE=DEB
export ROCM_GIT_DIR=/root/rocm-test/rocm-5.2
export ROCM_BUILD_DIR=/root/rocm-test/rocm-build/build
export ROCM_PATCH_DIR=/root/rocm-test/rocm-build/patch
export AMDGPU_TARGETS="gfx1012"
# export CMAKE_DIR=/root/rocm-test/cmake-3.18.6
# export PATH=$ROCM_INSTALL_DIR/bin:$ROCM_INSTALL_DIR/llvm/bin:$ROCM_INSTALL_DIR/hip/bin:$CMAKE_DIR/bin:$PATH
export PATH=$ROCM_INSTALL_DIR/bin:$ROCM_INSTALL_DIR/llvm/bin:$ROCM_INSTALL_DIR/hip/bin:$PATH
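
The ROCM_LIBPATCH_VERSION value in the env.sh above is consistent with major * 10000 + minor * 100 + patch. A small sketch under that assumption, useful when adapting the file to another release; `libpatch_version` is a hypothetical helper name.

```python
def libpatch_version(major: int, minor: int, patch: int) -> str:
    """Build a ROCM_LIBPATCH_VERSION-style string.

    Assumption: the value is major * 10000 + minor * 100 + patch,
    which reproduces the 50203 used for ROCm 5.2.3 in this report.
    """
    return str(major * 10000 + minor * 100 + patch)


if __name__ == "__main__":
    print("export ROCM_LIBPATCH_VERSION=" + libpatch_version(5, 2, 3))
```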

Also, run the following beforehand:

apt update && apt install sudo xxd kmod libtinfo5 graphviz libgmp-dev libcjson-dev

Otherwise, your install-dependency.sh script and the builds of specific components such as ROCR-Runtime, HIP, rocminfo, ROCgdb, and AMD MIGraphX won't run.

navi14 (gfx1012): git apply can not find file patch/22.rocblas-ninja-1.patch

Environment

Hardware description
GPU RX 5500 XT
CPU Ryzen 7 5800X
Software version
OS Linux Mint 20.3 (Ubuntu focal 20.04 base) under 5.8.0-48-generic x86_64 kernel
ROCm 4.3
Python 3.8.10

What is the expected behavior

I am trying to build ROCm for my navi14 GPU. Dependencies are installed and the environment is set. I am following the instructions from navi14/README.md.

What actually happens

When I execute bash navi14/22.rocblas.sh, the script exits with an error because git apply cannot find the file patch/22.rocblas-ninja-1.patch. I looked in the repo, and the file has been gone since commit 7759bdb. I am not sure how to proceed from here.
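
A quick way to catch this kind of mismatch before starting a build is to check that every patch a script applies actually exists. This is a rough sketch, not part of rocm-build: `missing_patches` is a hypothetical helper, and it assumes the script calls `git apply` at the start of a line with a path ending in .patch.

```python
import os
import shlex
from typing import List


def missing_patches(script_text: str, base_dir: str) -> List[str]:
    """List patch paths passed to `git apply` that do not exist on disk."""
    missing = []
    for line in script_text.splitlines():
        try:
            tokens = shlex.split(line, comments=True)
        except ValueError:
            # Lines with unbalanced quotes (multi-line strings) are skipped.
            continue
        if tokens[:2] != ["git", "apply"]:
            continue
        for token in tokens[2:]:
            if not token.endswith(".patch"):
                continue
            path = os.path.expandvars(token)
            if not os.path.isabs(path):
                path = os.path.join(base_dir, path)
            if not os.path.isfile(path):
                missing.append(token)
    return missing


if __name__ == "__main__":
    script = "navi14/22.rocblas.sh"
    if os.path.isfile(script):
        with open(script, encoding="utf-8") as handle:
            print(missing_patches(handle.read(), os.getcwd()))
```

Commands chained after `cd ... &&` are not detected by this simple check.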

How to reproduce

The README is not clear on this: I ran bash navi14/22.rocblas.sh before any other script because it is the first recommendation, ahead of the list of 10 scripts to run. Do I need to run it in order with the others as well?

11.llvm-amdgpu.sh fails with recent linux kernel, cyclades removed from kernel headers

Environment

Hardware description
GPU gfx803
CPU ryzen7
Software version
OS debian bookworm
ROCm 4.3
Python 3.9.9
Linux 5.15.0-1-amd64

What is the expected behavior

  • build scripts run

What actually happens

  • llvm build script fails

How to reproduce

  • run 11.llvm-amdgpu.sh with a recent linux kernel

LLVM depends on something called cyclades, which has been removed from the Linux kernel headers. With a recent kernel, the LLVM build therefore fails.

I applied this manually:
https://reviews.llvm.org/rG68d5235cb58f988c71b403334cd9482d663841ab

and it seems to be building now.
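
Whether a system is affected can be checked before starting the long LLVM build. This is a minimal sketch, assuming the header in question is linux/cyclades.h under the system include directory; `has_cyclades_header` is a hypothetical helper name.

```python
import os


def has_cyclades_header(include_root: str = "/usr/include") -> bool:
    """Return True if <linux/cyclades.h> is present under include_root."""
    return os.path.isfile(os.path.join(include_root, "linux", "cyclades.h"))


if __name__ == "__main__":
    if has_cyclades_header():
        print("linux/cyclades.h found; an unpatched LLVM should not hit this error")
    else:
        print("linux/cyclades.h missing; apply the LLVM change linked above first")
```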

How to build patched tensorflow package

Environment

Hardware description
GPU RX 570
CPU Ryzen 5 2600
Software version
OS Ubuntu 20.04.5
ROCm 5.3.0 gfx803 (from this repo)
Python 3.8

Hi, my application needs tensorflow 2.7, so I'd like to build it myself. From the available resources it is not clear to me how the provided tensorflow package is patched to run on gfx803, or whether it is patched at all. Could you give some insight into how you build the tensorflow package, please?

bash 34.miopen.sh gives linker error

Environment

Hardware description
GPU - RX6800
CPU - AMD
Software version
OS - 20.04.2
ROCm - 4.2
Python -

What is the expected behavior

bash 34.miopen.sh

What actually happens

bash 34.miopen.sh

-- MIOpen_VERSION= 2.11.0
-- CMAKE_BUILD_TYPE= Release
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY - Success
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY - Success
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR - Success
-- Librt: /usr/lib/x86_64-linux-gnu/librt.so
-- Performing Test MIOPEN_HAS_address
-- Performing Test MIOPEN_HAS_address - Failed
-- Performing Test MIOPEN_HAS_thread
-- Performing Test MIOPEN_HAS_thread - Failed
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/miopen
[150/152] Linking CXX shared library lib/libMIOpen.so.1.0
FAILED: lib/libMIOpen.so.1.0
: && /opt/rocm/llvm/bin/clang++ -fPIC -O3 -DNDEBUG -s -shared -Wl,-soname,libMIOpen.so.1 -o lib/libMIOpen.so.1.0 src/sqlite/CMakeFiles/sqlite_memvfs.dir/memvfs.cpp.o src/CMakeFiles/MIOpen.dir/buffer_info.cpp.o src/CMakeFiles/MIOpen.dir/check_numerics.cpp.o src/CMakeFiles/MIOpen.dir/convolution.cpp.o src/CMakeFiles/MIOpen.dir/convolution_api.cpp.o src/CMakeFiles/MIOpen.dir/db.cpp.o src/CMakeFiles/MIOpen.dir/db_record.cpp.o src/CMakeFiles/MIOpen.dir/expanduser.cpp.o src/CMakeFiles/MIOpen.dir/find_controls.cpp.o src/CMakeFiles/MIOpen.dir/fusion.cpp.o src/CMakeFiles/MIOpen.dir/op_args.cpp.o src/CMakeFiles/MIOpen.dir/operator.cpp.o src/CMakeFiles/MIOpen.dir/fused_api.cpp.o src/CMakeFiles/MIOpen.dir/load_file.cpp.o src/CMakeFiles/MIOpen.dir/pooling_api.cpp.o src/CMakeFiles/MIOpen.dir/kernel_warnings.cpp.o src/CMakeFiles/MIOpen.dir/logger.cpp.o src/CMakeFiles/MIOpen.dir/lock_file.cpp.o src/CMakeFiles/MIOpen.dir/lrn_api.cpp.o src/CMakeFiles/MIOpen.dir/activ_api.cpp.o src/CMakeFiles/MIOpen.dir/target_properties.cpp.o src/CMakeFiles/MIOpen.dir/handle_api.cpp.o src/CMakeFiles/MIOpen.dir/softmax_api.cpp.o src/CMakeFiles/MIOpen.dir/batch_norm.cpp.o src/CMakeFiles/MIOpen.dir/batch_norm_api.cpp.o src/CMakeFiles/MIOpen.dir/rnn.cpp.o src/CMakeFiles/MIOpen.dir/rnn_api.cpp.o src/CMakeFiles/MIOpen.dir/ctc.cpp.o src/CMakeFiles/MIOpen.dir/ctc_api.cpp.o src/CMakeFiles/MIOpen.dir/temp_file.cpp.o src/CMakeFiles/MIOpen.dir/problem_description.cpp.o src/CMakeFiles/MIOpen.dir/kernel_build_params.cpp.o src/CMakeFiles/MIOpen.dir/find_db.cpp.o src/CMakeFiles/MIOpen.dir/conv_algo_name.cpp.o src/CMakeFiles/MIOpen.dir/conv/problem_description.cpp.o src/CMakeFiles/MIOpen.dir/dropout.cpp.o src/CMakeFiles/MIOpen.dir/dropout_api.cpp.o src/CMakeFiles/MIOpen.dir/readonlyramdb.cpp.o src/CMakeFiles/MIOpen.dir/execution_context.cpp.o src/CMakeFiles/MIOpen.dir/reducetensor.cpp.o src/CMakeFiles/MIOpen.dir/reducetensor_api.cpp.o src/CMakeFiles/MIOpen.dir/md_graph.cpp.o src/CMakeFiles/MIOpen.dir/mdg_expr.cpp.o 
src/CMakeFiles/MIOpen.dir/conv/invokers/gcn_asm_1x1u.cpp.o src/CMakeFiles/MIOpen.dir/conv/invokers/gcn_asm_1x1u_ss.cpp.o src/CMakeFiles/MIOpen.dir/conv/invokers/gcn_asm_1x1u_us.cpp.o src/CMakeFiles/MIOpen.dir/conv/invokers/gen_x_w_y_pad.cpp.o src/CMakeFiles/MIOpen.dir/conv/invokers/ocl_wrw_rdc.cpp.o src/CMakeFiles/MIOpen.dir/conv/invokers/impl_gemm.cpp.o src/CMakeFiles/MIOpen.dir/conv/invokers/impl_gemm_dynamic.cpp.o src/CMakeFiles/MIOpen.dir/invoker_cache.cpp.o src/CMakeFiles/MIOpen.dir/tensor.cpp.o src/CMakeFiles/MIOpen.dir/tensor_api.cpp.o src/CMakeFiles/MIOpen.dir/solver.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_3x3u.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_1x1u.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_1x1u_stride2.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_1x1u_bias_activ.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_5x10u2v2f1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_5x10u2v2b1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_7x7c3h224w224k64u2v2p3q3f1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_dir_BwdWrW3x3.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_dir_BwdWrW1x1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_multipass_wino3x3WrW.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_MP_bidirectional_winograd.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_bin_wino3x3U.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_bin_winoRxS.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_winoRxS_f3x2.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_winoRxS_f2x3.cpp.o src/CMakeFiles/MIOpen.dir/solver/fft.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2D_bwdWrW_2.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2D_bwdWrW_53.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2D_bwdWrW_1x1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2Dfwdgen.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2D11x11.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2Dfwd_exhaustive_search.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2Dfwd.cpp.o 
src/CMakeFiles/MIOpen.dir/solver/conv_ocl_dir2Dfwd1x1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_fwd_v4r1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_fwd_v4r4.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_fwd_v4r4_xdlops_padded_gemm.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_fwd_v4r5_xdlops.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_wrw_v4r4.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_fwd_v4r4_xdlops.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_xdlops_common.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_nonxdlops_common.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_bwd_v1r1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_bwd_v4r1.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_bwd_v4r1_xdlops.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_implicit_gemm_v4r1_dynamic.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_implicit_gemm_wrw_v4r1_dynamic.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_implicit_gemm_wrw_gtc_dynamic_xdlops.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_bwd_v1r1_xdlops.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_implicit_gemm_bwd_v4r1_dynamic.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_wrw_v4r4_xdlops.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_hip_implicit_gemm_wrw_v4r4_xdlops_padded_gemm.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_implicit_gemm_gtc_fwd.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_asm_implicit_gemm_gtc_bwd.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_direct_naive_conv_fwd.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_direct_naive_conv_bwd.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_direct_naive_conv_wrw.cpp.o src/CMakeFiles/MIOpen.dir/solver/conv_direct_naive_conv.cpp.o src/CMakeFiles/MIOpen.dir/tmp_dir.cpp.o src/CMakeFiles/MIOpen.dir/binary_cache.cpp.o src/CMakeFiles/MIOpen.dir/md5.cpp.o 
src/CMakeFiles/MIOpen.dir/sqlite_db.cpp.o src/CMakeFiles/MIOpen.dir/kern_db.cpp.o src/CMakeFiles/MIOpen.dir/bz2.cpp.o src/CMakeFiles/MIOpen.dir/activ.cpp.o src/CMakeFiles/MIOpen.dir/kernel_cache.cpp.o src/CMakeFiles/MIOpen.dir/lrn.cpp.o src/CMakeFiles/MIOpen.dir/mlo_dir_conv.cpp.o src/CMakeFiles/MIOpen.dir/exec_utils.cpp.o src/CMakeFiles/MIOpen.dir/ocl/activ_ocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/batchnormocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/convolutionocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/lrn_ocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/mloNeuron.cpp.o src/CMakeFiles/MIOpen.dir/ocl/mloNorm.cpp.o src/CMakeFiles/MIOpen.dir/ocl/mloPooling.cpp.o src/CMakeFiles/MIOpen.dir/ocl/pooling_ocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/tensorocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/softmaxocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/rnnocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/utilocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/ctcocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/dropoutocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/gcn_asm_utils.cpp.o src/CMakeFiles/MIOpen.dir/ocl/rnn_util_ocl.cpp.o src/CMakeFiles/MIOpen.dir/hip/hip_build_utils.cpp.o src/CMakeFiles/MIOpen.dir/pooling.cpp.o src/CMakeFiles/MIOpen.dir/ocl/fusionopconvocl.cpp.o src/CMakeFiles/MIOpen.dir/ocl/fusionopbiasbnactivocl.cpp.o src/CMakeFiles/MIOpen.dir//db_path.cpp.o src/CMakeFiles/MIOpen.dir//kernel.cpp.o src/CMakeFiles/MIOpen.dir/__/kernel_includes.cpp.o src/CMakeFiles/MIOpen.dir/gemm_v2.cpp.o src/CMakeFiles/MIOpen.dir/miopengemm.cpp.o src/CMakeFiles/MIOpen.dir/hip/hiperrors.cpp.o src/CMakeFiles/MIOpen.dir/hip/handlehip.cpp.o src/CMakeFiles/MIOpen.dir/hipoc/hipoc_kernel.cpp.o src/CMakeFiles/MIOpen.dir/hipoc/hipoc_program.cpp.o -Wl,-rpath,/opt/rocm/rocblas/lib:/opt/rocm/hip/lib:/opt/rocm/lib: -lpthread /usr/local/lib/libbz2.a /opt/rocm/rocblas/lib/librocblas.so.0.1 /usr/local/lib/libboost_filesystem.a -Wl,--version-script=/home/ian/Documents/rocm-build/build/miopen/src/lib.def -Wl,--exclude-libs,ALL -L/usr/local/lib -lsqlite3 -ldl 
-lpthread /usr/lib/x86_64-linux-gnu/librt.so /usr/local/lib/libbz2.a -lsqlite3 -ldl /usr/lib/x86_64-linux-gnu/librt.so --hip-link --offload-arch=gfx1030 -L"/opt/rocm/llvm/lib/clang/12.0.0/include/../lib/linux" -lclang_rt.builtins-x86_64 /opt/rocm/hip/lib/libamdhip64.so.4.2.21155-37cb3a34 -Wl,-rpath-link,/opt/rocm/lib && :
/usr/bin/ld: /usr/local/lib/libboost_filesystem.a(exception.o): relocation R_X86_64_32S against hidden symbol `_ZTVN5boost10filesystem16filesystem_errorE' can not be used when making a shared object
/usr/bin/ld: /usr/local/lib/libboost_filesystem.a(operations.o): relocation R_X86_64_32 against `.rodata.str1.1' can not be used when making a shared object; recompile with -fPIC
/usr/bin/ld: /usr/local/lib/libboost_filesystem.a(path.o): relocation R_X86_64_32 against `.rodata.str1.1' can not be used when making a shared object; recompile with -fPIC
/usr/bin/ld: /usr/local/lib/libboost_filesystem.a(unique_path.o): relocation R_X86_64_32S against `.rodata.str1.16' can not be used when making a shared object; recompile with -fPIC
/usr/bin/ld: /usr/local/lib/libboost_filesystem.a(directory.o): relocation R_X86_64_32S against hidden symbol `_ZN5boost6system6detail10cat_holderIvE24system_category_instanceE' can not be used when making a shared object
clang-12: error: linker command failed with exit code 1 (use -v to see invocation)
ninja: build stopped: subcommand failed.
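
The "recompile with -fPIC" messages name the static archive that was built without position-independent code. In a long log, a small filter makes the offending archives easier to spot. This is a sketch only: `non_pic_archives` is a hypothetical helper and the regular expression is modelled on the ld messages shown above.

```python
import re
from typing import List

# Matches lines such as:
#   /usr/bin/ld: /path/libfoo.a(bar.o): relocation R_X86_64_32 against ...
RELOCATION_LINE = re.compile(r"ld: (\S+\.a)\(([^)\s]+)\): relocation \S+ against")


def non_pic_archives(log: str) -> List[str]:
    """Return the distinct static archives named in ld relocation errors."""
    archives: List[str] = []
    for archive, _member in RELOCATION_LINE.findall(log):
        if archive not in archives:
            archives.append(archive)
    return archives


if __name__ == "__main__":
    sample = (
        "/usr/bin/ld: /usr/local/lib/libboost_filesystem.a(path.o): "
        "relocation R_X86_64_32 against `.rodata.str1.1' can not be used "
        "when making a shared object; recompile with -fPIC\n"
    )
    print(non_pic_archives(sample))
```

Here every error points at the libboost_filesystem.a under /usr/local/lib, so that archive is the one to rebuild with -fPIC or replace.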

How to reproduce

bash 34.miopen.sh

navi10 pytorch build fails due to missing miopen

Environment

Hardware description
GPU Radeon RX 5700 XT
CPU Ryzen
Software version
OS Archlinux
ROCm 5.2.x
Python 3.10.6

What is the expected behavior

Following the navi10 instructions for building pytorch should not require building miopen separately. However, the build fails because miopen is missing.

What actually happens

CMake Error at cmake/public/LoadHIP.cmake:147 (find_package):
By not providing "Findmiopen.cmake" in CMAKE_MODULE_PATH this project has
asked CMake to find a package configuration file provided by "miopen", but
CMake did not find one.

Could not find a package configuration file provided by "miopen" with any
of the following names:

miopenConfig.cmake
miopen-config.cmake

Add the installation prefix of "miopen" to CMAKE_PREFIX_PATH or set
"miopen_DIR" to a directory containing one of the above files. If "miopen"
provides a separate development package or SDK, be sure it has been
installed.
Call Stack (most recent call first):
cmake/public/LoadHIP.cmake:274 (find_package_and_print_version)
cmake/Dependencies.cmake:1264 (include)
CMakeLists.txt:696 (include)
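
CMake is looking for one of the two config files named in the error. Below is a small sketch for locating them under an install prefix so that the directory can be passed as miopen_DIR or added to CMAKE_PREFIX_PATH. `find_miopen_config_dirs` is a hypothetical helper, and /opt/rocm is only an assumed default prefix.

```python
import os
from typing import List

CONFIG_NAMES = ("miopenConfig.cmake", "miopen-config.cmake")


def find_miopen_config_dirs(prefix: str) -> List[str]:
    """Return directories under prefix that contain a miopen CMake config."""
    found = set()
    for dirpath, _dirs, files in os.walk(prefix):
        if any(name in CONFIG_NAMES for name in files):
            found.add(dirpath)
    return sorted(found)


if __name__ == "__main__":
    dirs = find_miopen_config_dirs("/opt/rocm")
    if dirs:
        print("try: -Dmiopen_DIR=" + dirs[0])
    else:
        print("no miopen CMake config found; miopen is probably not installed")
```

An empty result means miopen has to be built and installed before the pytorch build can find it.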

How to reproduce

Follow navi10 build on archlinux.

Fix didn't fix the problem?

Environment

Hardware description
GPU RX 480
CPU R3600
Software version
OS Ubuntu 20.04.2
ROCm 4.1.1
Python 3.7

What is the expected behavior

The given fix didn't fix the problem:
ROCm/ROCm#1454
Maybe I did something wrong?

What actually happens

How to reproduce

tensorflow-rocm fails with hipErrorNoBinaryForGpu (does it need a recompile?)

Environment

Hardware description
GPU - Navi21 RX6800
CPU - AMD
Software version
OS - 20.04.2
ROCm - 4.2
Python -

What is the expected behavior

tensorflow-rocm works after completing the install in this repo

What actually happens

ian@xxxx~/Documents/Src$ /home/ian/.envs/py3tf2/bin/python /home/ian/Documents/Src/test.py
2021-05-27 12:35:24.176982: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set
2021-05-27 12:35:24.177308: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libamdhip64.so
/home/ian/Documents/rocm-build/ROCm/HIP/rocclr/hip_code_object.cpp:486: "hipErrorNoBinaryForGpu: Unable to find code object for all current devices!"
Aborted (core dumped)

How to reproduce

pip install tensorflow-rocm
...
...
Successfully installed tensorflow-rocm-2.4.3

python

import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Conv2D, Flatten, Dropout

model = Sequential()

ERROR: rocBLAS headers can't be found under specified path.

Environment

Hardware description
GPU - RX6800
CPU - AMD Threadripper
Software version
OS - 20.4.2
ROCm - 4.2
Python -

What is the expected behavior

Compilation completes

What actually happens

Reading package lists... Done
Building dependency tree
Reading state information... Done
cmake is already the newest version (3.16.3-1ubuntu1).
libpciaccess-dev is already the newest version (0.16-0ubuntu1).
python-is-python3 is already the newest version (3.8.2-4).
unzip is already the newest version (6.0-25ubuntu1).
doxygen is already the newest version (1.8.17-0ubuntu2).
git is already the newest version (1:2.25.1-1ubuntu3.1).
libpci-dev is already the newest version (1:3.6.4-1ubuntu0.20.04.1).
libpci3 is already the newest version (1:3.6.4-1ubuntu0.20.04.1).
The following packages were automatically installed and are no longer required:
dctrl-tools dkms hsa-amd-aqlprofile libncurses5 libpython2-stdlib libpython2.7-minimal libpython2.7-stdlib libtinfo5 python2
python2-minimal python2.7 python2.7-minimal rock-dkms rock-dkms-firmware
Use 'sudo apt autoremove' to remove them.
0 upgraded, 0 newly installed, 0 to remove and 37 not upgraded.
~/Documents/rocm-build/build/rocmvalidationsuite ~/Documents/rocm-build/build/rocmvalidationsuite
ln: failed to create symbolic link '/usr/bin/python': File exists
-- RVS_OS_TYPE: ubuntu
Reading package lists...
Building dependency tree...
Reading state information...
rocblas is already the newest version (2.38.0.40200-21).
The following packages were automatically installed and are no longer required:
dctrl-tools dkms hsa-amd-aqlprofile libncurses5 libpython2-stdlib
libpython2.7-minimal libpython2.7-stdlib libtinfo5 python2 python2-minimal
python2.7 python2.7-minimal rock-dkms rock-dkms-firmware
Use 'sudo apt autoremove' to remove them.
0 upgraded, 0 newly installed, 0 to remove and 37 not upgraded.
-- RVS_OS_TYPE_NUM: 1
-- CMAKE_BUILD_TYPE = Release
-- RVS_COVERAGE = FALSE
-- RVS_BUILD_TESTS = TRUE
-- CMAKE_BINARY_DIR = /home/ian/Documents/rocm-build/build/rocmvalidationsuite
-- CMAKE_CURRENT_SOURCE_DIR = /home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite
-- RVS_DO_TRACE defined
-- CPACK_GENERATOR DEB
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/rocmvalidationsuite/yaml-download
[1/6] Performing update step for 'yaml'
Current branch master is up to date.
[2/6] No configure step for 'yaml'
[3/6] No build step for 'yaml'
[4/6] No install step for 'yaml'
[5/6] No test step for 'yaml'
[6/6] Completed 'yaml'
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/rocmvalidationsuite/yaml-build
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/rocmvalidationsuite/googletest-download
[1/6] Performing update step for 'googletest'
[2/6] No configure step for 'googletest'
[3/6] No build step for 'googletest'
[4/6] No install step for 'googletest'
[5/6] No test step for 'googletest'
[6/6] Completed 'googletest'
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/rocmvalidationsuite/googletest-build
UT_INC /home/ian/Documents/rocm-build/build/rocmvalidationsuite/googletest-src/googletest/include
UT_LIB /home/ian/Documents/rocm-build/build/rocmvalidationsuite/googletest-build/lib
-- MODULE: rvs
unit test: unit.rvs.1
unit test: unit.rvs.cli
unit test: unit.rvs.gpu_util
unit test: unit.rvs.lognode
unit test: unit.rvs.lognodeint
unit test: unit.rvs.lognoderec
unit test: unit.rvs.lognodestring
unit test: unit.rvs.pcie_caps
unit test: unit.rvs.thread
unit test: unit.rvs.timer
-- MODULE: rvslib
-- MODULE: gm
unit test: unit.gm.1
-- tcd.unit.gm.1 defined
-- value: UT_TCD_1
conf test: ttp.conf.log.gm.single
-- MODULE: gpup
unit test: unit.gpup.nonameaction
conf test: ttp.conf.log.gpup.single
-- MODULE: gst
ERROR: rocBLAS headers can't be found under specified path. Please set ROCBLAS_INC_DIR path. Current value is : /opt/rocm/include
-- MODULE: iet
ERROR: rocBLAS headers can't be found under specified path. Please set ROCBLAS_INC_DIR path. Current value is : /opt/rocm/include
-- MODULE: pebb
conf test: ttp.conf.log.pebb.single
-- MODULE: peqt
conf test: ttp.conf.log.peqt.single
-- MODULE: pesm
unit test: unit.pesm.actionbase
unit test: unit.pesm.sanity
conf test: ttp.conf.log.pesm.5
conf test: ttp.conf.log.pesm.2
conf test: ttp.conf.log.pesm.9
conf test: ttp.conf.log.pesm.3
conf test: ttp.conf.log.pesm.4
conf test: ttp.conf.log.pesm.8
conf test: ttp.conf.log.pesm.1
conf test: ttp.conf.log.pesm.7
conf test: ttp.conf.log.pesm.6
-- MODULE: pqt
-- MODULE: rcqt
-- MODULE: smqt
unit test: unit.smqt.1
-- MODULE: mem
ERROR: rocBLAS headers can't be found under specified path. Please set ROCBLAS_INC_DIR path. Current value is : /opt/rocm/include
-- MODULE: babel
ERROR: rocBLAS headers can't be found under specified path. Please set ROCBLAS_INC_DIR path. Current value is : /opt/rocm/include
-- MODULE: perf
ERROR: rocBLAS headers can't be found under specified path. Please set ROCBLAS_INC_DIR path. Current value is : /opt/rocm/include
-- MODULE: testif
-- Doxygen: /usr/bin/doxygen
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/rocmvalidationsuite
[5/41] Generating RVS man.1 files
warning: Tag 'PERL_PATH' at line 2121 of file 'DoxyMan1' has become obsolete.
To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
warning: Tag 'MSCGEN_PATH' at line 2143 of file 'DoxyMan1' has become obsolete.
To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
[6/41] Generating RVS User Guide
warning: Tag 'PERL_PATH' at line 2163 of file 'DoxyGuide' has become obsolete.
To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
warning: Tag 'MSCGEN_PATH' at line 2185 of file 'DoxyGuide' has become obsolete.
To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
[7/41] Generating RVS Programmer's Reference Guide
warning: Tag 'PERL_PATH' at line 2165 of file 'Doxyfile' has become obsolete.
To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
warning: Tag 'MSCGEN_PATH' at line 2187 of file 'Doxyfile' has become obsolete.
To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/pebb.so/include/action.h:61: warning: Compound pebb_action::bandwidth is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:63: warning: Member gst_ops_type (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:83: warning: Member gst_alpha_val (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:84: warning: Member gst_beta_val (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:88: warning: Member gst_matrix_size_b (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:89: warning: Member gst_matrix_size_c (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:92: warning: Member gst_hot_calls (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:95: warning: Member gst_trans_a (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:96: warning: Member gst_trans_b (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:99: warning: Member gst_lda_offset (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:100: warning: Member gst_ldb_offset (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/action.h:101: warning: Member gst_ldc_offset (variable) of class gst_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:191: warning: Member set_gst_ops_type(std::string _ops_type) (function) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:213: warning: Member gst_trans_a (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:215: warning: Member gst_trans_b (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:219: warning: Member gst_alpha_val (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:221: warning: Member gst_beta_val (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:223: warning: Member gst_lda_offset (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:224: warning: Member gst_ldb_offset (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:225: warning: Member gst_ldc_offset (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:245: warning: Member matrix_size_b (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:246: warning: Member matrix_size_c (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:248: warning: Member gst_hot_calls (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/gst.so/include/gst_worker.h:260: warning: Member gst_ops_type (variable) of class GSTWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:80: warning: Member iet_ops_type (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:98: warning: Member iet_alpha_val (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:99: warning: Member iet_beta_val (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:103: warning: Member iet_matrix_size_b (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:104: warning: Member iet_matrix_size_c (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:107: warning: Member iet_hot_calls (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:110: warning: Member iet_trans_a (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:111: warning: Member iet_trans_b (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:114: warning: Member iet_lda_offset (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:115: warning: Member iet_ldb_offset (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/action.h:116: warning: Member iet_ldc_offset (variable) of class iet_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:204: warning: Member do_gpu_init_training(int gpuIdx, uint64_t matrix_size, std::string iet_ops_type) (function) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:205: warning: Member compute_gpu_stats(void) (function) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:206: warning: Member compute_new_sgemm_freq(float avg_power) (function) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:213: warning: Member gpu_blas (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:222: warning: Member blas_error (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:247: warning: Member sgemm_success (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:263: warning: Member matrix_size_b (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:264: warning: Member matrix_size_c (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:266: warning: Member iet_lda_offset (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:267: warning: Member iet_ldb_offset (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:268: warning: Member iet_ldc_offset (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:270: warning: Member iet_trans_a (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:272: warning: Member iet_trans_b (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:274: warning: Member iet_alpha_val (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:276: warning: Member iet_beta_val (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:278: warning: Member iet_tp_flag (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/iet.so/include/iet_worker.h:280: warning: Member mtx_blas_done (variable) of class IETWorker is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/pebb.so/include/action.h:65: warning: Member bandwidth (typedef) of class pebb_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/pebb.so/include/action.h:67: warning: Member resultBandwidth (variable) of class pebb_action is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/pebb.so/include/action.h:62: warning: Member finalBandwith (variable) of class pebb_action::bandwidth is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/pebb.so/include/action.h:63: warning: Member GPUId (variable) of class pebb_action::bandwidth is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/pebb.so/include/action.h:64: warning: Member CPUId (variable) of class pebb_action::bandwidth is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:64: warning: Member gemm_gflop_count(void) (function) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:91: warning: Member size_d (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:131: warning: Member blas_lda_offset (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:132: warning: Member blas_ldb_offset (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:133: warning: Member blas_ldc_offset (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:142: warning: Member dhlfd (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:150: warning: Member hostarrayA (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:151: warning: Member hostarrayB (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:152: warning: Member hostarrayC (variable) of class rvs_blas is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvs_blas.h:44: warning: The following parameters of rvs_blas::rvs_blas(int _gpu_device_index, int _m, int _n, int _k, int transa, int transb, float aplha, float beta, int lda, int ldb, int ldc, std::string ops_type) are not documented:
parameter 'transa'
parameter 'transb'
parameter 'aplha'
parameter 'beta'
parameter 'lda'
parameter 'ldb'
parameter 'ldc'
parameter 'ops_type'
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/gpu_util.h:83: warning: Member domain_loc_map (variable) of class rvs::gpulist is not documented.
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/src/gpu_util.cpp:425: warning: argument 'pNodeID' of command @param is not found in the argument list of rvs::gpulist::domlocation2gpu(const uint16_t domainID, const uint16_t LocationID, uint16_t *pGPUID)
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/gpu_util.h:70: warning: The following parameters of rvs::gpulist::domlocation2gpu(const uint16_t domainID, const uint16_t LocationID, uint16_t *pGPUID) are not documented:
parameter 'domainID'
parameter 'pGPUID'
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/gpu_util.h:68: warning: The following parameter of rvs::gpulist::domlocation2node(const uint16_t domainID, const uint16_t LocationID, uint16_t *pNodeID) is not documented:
parameter 'domainID'
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/include/rvshsa.h:180: warning: Member PrintTopology() (function) of class rvs::hsa is not documented.
[8/41] Building CXX object rvslib/CMakeFiles/rvslib.dir/__/src/rvs_blas.cpp.o
FAILED: rvslib/CMakeFiles/rvslib.dir/__/src/rvs_blas.cpp.o
/opt/rocm/hip/bin/hipcc -DRVS_DO_TRACE -I/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/rvslib/. -I/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/rvslib/.. -I/opt/rocm/rocm_smi/include -I/opt/rocm/include -I/opt/rocm/hip -fPIC -Xlinker --enable-new-dtags -fno-gpu-rdc --amdgpu-target=gfx803 --amdgpu-target=gfx900 --amdgpu-target=gfx906 --amdgpu-target=gfx908 -I/opt/rocm/hsa/include -O3 -DNDEBUG -std=c++11 -pthread -Wl,-no-as-needed -Wall -fPIC -DRVS_OS_TYPE_NUM=1 -DBUILD_VERSION_MAJOR=0 -DBUILD_VERSION_MINOR=0 -DBUILD_VERSION_PATCH=0 -DLIB_VERSION_STRING="0.0.0" -DBUILD_VERSION_STRING="0.0.0" -MD -MT rvslib/CMakeFiles/rvslib.dir/__/src/rvs_blas.cpp.o -MF rvslib/CMakeFiles/rvslib.dir/__/src/rvs_blas.cpp.o.d -o rvslib/CMakeFiles/rvslib.dir/__/src/rvs_blas.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/src/rvs_blas.cpp
clang-12: warning: -Xlinker --enable-new-dtags: 'linker' input unused [-Wunused-command-line-argument]
clang-12: warning: -Wl,-no-as-needed: 'linker' input unused [-Wunused-command-line-argument]
In file included from /home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/src/rvs_blas.cpp:25:
/home/ian/Documents/rocm-build/ROCm/ROCmValidationSuite/rvslib/../include/rvs_blas.h:30:10: fatal error: 'rocblas.h' file not found
#include "rocblas.h"
^~~~~~~~~~~
1 error generated when compiling for gfx803.
ninja: build stopped: subcommand failed.
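
The fatal error above means the compiler did not find 'rocblas.h' in any directory it searched, which includes every -I directory on the hipcc command line. A minimal Python sketch for checking those directories, assuming you paste in the failing command yourself. The helper names `include_dirs` and `find_header` are made up for illustration and are not part of ROCm or RVS:

```python
import os
import shlex
import tempfile


def include_dirs(compile_command):
    """Return the directories passed with -I in a compiler command line."""
    tokens = shlex.split(compile_command)
    dirs = []
    for i, tok in enumerate(tokens):
        if tok == "-I" and i + 1 < len(tokens):
            dirs.append(tokens[i + 1])      # form: -I <dir>
        elif tok.startswith("-I") and len(tok) > 2:
            dirs.append(tok[2:])            # form: -I<dir>
    return dirs


def find_header(header, dirs):
    """Return those directories from `dirs` that contain `header`."""
    return [d for d in dirs if os.path.isfile(os.path.join(d, header))]


if __name__ == "__main__":
    # Shortened copy of the failing command; only the -I flags matter here.
    command = (
        "/opt/rocm/hip/bin/hipcc -I/opt/rocm/rocm_smi/include "
        "-I/opt/rocm/include -I/opt/rocm/hip -I/opt/rocm/hsa/include "
        "-c rvs_blas.cpp"
    )
    dirs = include_dirs(command)
    print("searched:", dirs)
    print("rocblas.h found in:", find_header("rocblas.h", dirs))

    # Positive case on a throwaway directory, to show what a hit looks like.
    with tempfile.TemporaryDirectory() as tmp:
        open(os.path.join(tmp, "rocblas.h"), "w").close()
        print("demo hit:", find_header("rocblas.h", [tmp]))
```

If the second line prints an empty list, running `find /opt/rocm -name rocblas.h` shows whether the rocBLAS build installed the header somewhere that is not on the include path.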

How to reproduce

bash navi21/rocblas.sh
bash 73.rocmvalidationsuite.sh

tensorflow error on gfx803

Environment

Hardware description
GPU - rx590
Software version
OS - debian sid
ROCm - 5.0.2 from ubuntu apt
Python - 3.9.10
tensorflow - git 2.9.0
pytorch - git 1.12

Using the pip package tensorflow-rocm fails with the error "hipErrorNoBinaryForGpu", so I built and installed tensorflow from source with:

git clone https://github.com/ROCmSoftwarePlatform/tensorflow-upstream.git
cd tensorflow-upstream
./build_rocm_python3
>>> import tensorflow
>>> print(tensorflow.config.experimental.list_physical_devices('GPU'))
2022-03-10 21:21:13.850863: I tensorflow/stream_executor/rocm/rocm_gpu_executor.cc:838] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2022-03-10 21:21:13.911531: I tensorflow/stream_executor/rocm/rocm_gpu_executor.cc:838] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2022-03-10 21:21:13.911596: I tensorflow/stream_executor/rocm/rocm_gpu_executor.cc:838] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2022-03-10 21:21:13.911627: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1899] Ignoring visible gpu device (device: 0, name: AMD Radeon RX 590 Series, pci bus id: 0000:27:00.0) with AMDGPU version : gfx803. The supported AMDGPU versions are gfx1030, gfx900, gfx906, gfx908, gfx90a.
[]
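
The last log line explains the empty device list: this TensorFlow build detects gfx803 but was not compiled for it. A small Python sketch that pulls both pieces of information out of such a line. The function name `parse_ignored_device` is hypothetical, and the regular expression assumes only the message format shown above:

```python
import re


def parse_ignored_device(log_line):
    """Extract (detected_arch, supported_archs) from an 'Ignoring visible
    gpu device' message, or return None if the line does not match."""
    match = re.search(
        r"with AMDGPU version : (\S+?)\. "
        r"The supported AMDGPU versions are (.+?)\.?$",
        log_line.strip(),
    )
    if match is None:
        return None
    detected = match.group(1)
    supported = [arch.strip() for arch in match.group(2).split(",")]
    return detected, supported


if __name__ == "__main__":
    line = (
        "Ignoring visible gpu device (device: 0, name: AMD Radeon RX 590 "
        "Series, pci bus id: 0000:27:00.0) with AMDGPU version : gfx803. "
        "The supported AMDGPU versions are gfx1030, gfx900, gfx906, "
        "gfx908, gfx90a."
    )
    detected, supported = parse_ignored_device(line)
    print(detected, "supported by this build:", detected in supported)
```

When the detected architecture is missing from the supported list, the binary has to be rebuilt with that target included, which is what the remaining steps in this report attempt.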

Next, I applied 22.tensile-gfx803-1.patch to this Tensile checkout:

git clone https://github.com/ROCmSoftwarePlatform/Tensile.git
cd Tensile
git switch -d rocm-5.0.2

Then I built and installed rocBLAS:

git clone https://github.com/ROCmSoftwarePlatform/rocBLAS.git
    git switch -d rocm-5.0.2
    rm -rf library/src/blas3/Tensile/Logic/asm_full/r9nano*
    CXX=/opt/rocm-5.0.2/bin/hipcc cmake -DAMDGPU_TARGETS=gfx803 -DROCM_PATH=/opt/rocm-5.0.2 -DTensile_LOGIC=asm_full -DTensile_ARCHITECTURE=gfx803 -DTensile_CODE_OBJECT_VERSION=V3 -DCMAKE_BUILD_TYPE=Release -DTensile_TEST_LOCAL_PATH=../../Tensile -DBUILD_WITH_TENSILE_HOST=ON -DTensile_LIBRARY_FORMAT=yaml -DRUN_HEADER_TESTING=OFF -DTensile_COMPILER=hipcc -DHIP_CLANG_INCLUDE_PATH=/opt/rocm-5.0.2/llvm/include -DCPACK_SET_DESTDIR=OFF -DCMAKE_PREFIX_PATH=/opt/rocm-5.0.2 -DCMAKE_INSTALL_PREFIX=rocblas-install -DCPACK_PACKAGING_INSTALL_PREFIX=/opt/rocm-5.0.2 -DCPACK_GENERATOR=DEB -G Ninja ..

-- The CXX compiler identification is Clang 14.0.0
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /opt/rocm-5.0.2/bin/hipcc - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Looking for C++ include pthread.h
-- Looking for C++ include pthread.h - found
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE  
-- Use hip-clang to build for amdgpu backend
-- OS detected is debian
-- Performing Test HAVE_gfx803
-- Performing Test HAVE_gfx803 - Success
-- Performing Test HAVE_gfx900
-- Performing Test HAVE_gfx900 - Success
-- Performing Test HAVE_gfx906:xnack-
-- Performing Test HAVE_gfx906:xnack- - Success
-- Performing Test HAVE_gfx908:xnack-
-- Performing Test HAVE_gfx908:xnack- - Success
-- Performing Test HAVE_gfx90a:xnack+
-- Performing Test HAVE_gfx90a:xnack+ - Success
-- Performing Test HAVE_gfx90a:xnack-
-- Performing Test HAVE_gfx90a:xnack- - Success
-- Performing Test HAVE_gfx1010
-- Performing Test HAVE_gfx1010 - Success
-- Performing Test HAVE_gfx1011
-- Performing Test HAVE_gfx1011 - Success
-- Performing Test HAVE_gfx1012
-- Performing Test HAVE_gfx1012 - Success
-- Performing Test HAVE_gfx1030
-- Performing Test HAVE_gfx1030 - Success
/bin/python3 -m venv ~/rocm/rocBLAS/build/virtualenv --system-site-packages --clear
~/rocm/rocBLAS/build/virtualenv/bin/python3 -m pip install ~/rocm/Tensile
Processing ~/rocm/Tensile
  Preparing metadata (setup.py): started
  Preparing metadata (setup.py): finished with status 'done'
Collecting msgpack
  Using cached msgpack-1.0.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (322 kB)
Requirement already satisfied: pyyaml in /usr/lib/python3/dist-packages (from Tensile==4.31.0) (5.4.1)
Building wheels for collected packages: Tensile
  Building wheel for Tensile (setup.py): started
  Building wheel for Tensile (setup.py): finished with status 'done'
  Created wheel for Tensile: filename=Tensile-4.31.0-py3-none-any.whl size=4544414 sha256=31c2449b59d5a6850f2ff31a393b82dfea64eca86f5e8404ee82772d75e578c3
  Stored in directory: /tmp/pip-ephem-wheel-cache-8mgcppg6/wheels/48/8d/c5/ef04e9532161b93e4192d8040bf0ac9ddcb15321e71963e0d3
Successfully built Tensile
Installing collected packages: msgpack, Tensile
Successfully installed Tensile-4.31.0 msgpack-1.0.3
-- using local Tensile from ~/rocm/Tensile, copied to 
-- Adding ~/rocm/rocBLAS/build/virtualenv to CMAKE_PREFIX_PATH
-- The C compiler identification is GNU 11.2.0
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working C compiler: /bin/cc - skipped
-- Detecting C compile features
-- Detecting C compile features - done
-- hip::amdhip64 is SHARED_LIBRARY
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success
-- Found ZLIB: /usr/lib/x86_64-linux-gnu/libz.so (found version "1.2.11") 
-- Found LibXml2: /usr/lib/x86_64-linux-gnu/libxml2.so (found version "2.9.13") 
LLVMObjectYAML_LIBRARY: /opt/rocm-5.0.2/llvm/lib/libLLVMObjectYAML.a
-- hip::amdhip64 is SHARED_LIBRARY
-- Using AMDGPU_TARGETS: gfx803
-- Tensile script: ~/rocm/rocBLAS/build/virtualenv/lib/python3.9/site-packages/Tensile/bin/TensileCreateLibrary
-- Tensile_CREATE_COMMAND: ~/rocm/rocBLAS/build/virtualenv/lib/python3.9/site-packages/Tensile/bin/TensileCreateLibrary;--merge-files;--no-short-file-names;--no-library-print-debug;--code-object-version=V3;--cxx-compiler=hipcc;--library-format=yaml;--architecture=gfx803;~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full;~/rocm/rocBLAS/build/Tensile;HIP
-- Tensile_MANIFEST_FILE_PATH: ~/rocm/rocBLAS/build/Tensile/library/TensileManifest.txt
'~/rocm/rocBLAS/build/virtualenv/lib/python3.9/site-packages/Tensile/bin/TensileCreateLibrary' '--merge-files' '--no-short-file-names' '--no-library-print-debug' '--code-object-version=V3' '--cxx-compiler=hipcc' '--library-format=yaml' '--architecture=gfx803' '~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full' '~/rocm/rocBLAS/build/Tensile' 'HIP' '--generate-manifest-and-exit'

################################################################################
# Tensile Create Library
# Detected local GPU with ISA: gfx803
               cap gfx000 gfx803 
   HasMFMA_bf16_1k      0      0 
        HasAddLshl      0      0 
      HasAtomicAdd      0      0 
   HasCodeObjectV3      0      1 
    HasDirectToLds      0      1 
     HasExplicitCO      0      0 
     HasExplicitNC      0      0 
         HasLshlOr      0      0 
           HasMFMA      0      0 
         HasSMulHi      0      0 
        MaxLgkmcnt      1      1 
          MaxVmcnt      0      1 
      SupportedISA      0      1 
   SupportedSource      1      1 
    v_dot2_f32_f16      0      0 
   v_dot2c_f32_f16      0      0 
         v_fma_f16      0      0 
        v_fmac_f16      0      0 
         v_mac_f16      0      1 
      v_pk_fma_f16      0      0 
     v_pk_fmac_f16      0      0 
         v_fma_f32      0      1 
     v_fma_mix_f32      0      0 
        v_fmac_f32      0      0 
         v_mac_f32      0      1 
     v_mad_mix_f32      0      0 
       HasMFMA_f64      0      0 
     v_dot4_i32_i8      0      0 
    v_dot4c_i32_i8      0      0 
ArchAccUnifiedRegs      0      0 
    CMPXWritesSGPR      1      1 
          HasAccCD      0      0 
        HasEccHalf      0      0 
         HasWave32      0      0 
     SeparateVscnt      0      0 
  Waitcnt0Disabled      0      0 
# Found  hipcc version 5.0.13601-ded05588
# CodeObjectVersion from TensileCreateLibrary: V3
# CxxCompiler       from TensileCreateLibrary: hipcc
# Architecture      from TensileCreateLibrary: gfx803
# LibraryFormat     from TensileCreateLibrary: yaml
# LibraryLogicFiles:
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HBH_GB.yaml
Reading logic files: Launching 16 threads...
Reading logic files: Done.
Processing logic data: 100%|██████████| 108/108 [00:01<00:00, 105.98it/s]
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_VISIBILITY - Success
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY
-- Performing Test COMPILER_HAS_HIDDEN_INLINE_VISIBILITY - Success
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR
-- Performing Test COMPILER_HAS_DEPRECATED_ATTR - Success
-- Configuring done
-- Generating done
CMake Warning:
  Manually-specified variables were not used by the project:
    BUILD_WITH_TENSILE_HOST
-- Build files have been written to: ~/rocm/rocBLAS/build
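
The configure step succeeded, but the CMake warning near the end shows that one of the -D options had no effect. A minimal Python sketch for listing such ignored options from saved CMake output. The function name `unused_cmake_variables` is hypothetical, and the parsing assumes only the warning layout shown above, with or without blank lines around the variable names:

```python
def unused_cmake_variables(cmake_output):
    """Return the variable names listed under CMake's
    'Manually-specified variables were not used by the project' warning."""
    unused = []
    collecting = False
    for line in cmake_output.splitlines():
        stripped = line.strip()
        if stripped.startswith("Manually-specified variables were not used"):
            collecting = True
            continue
        if not collecting or not stripped:
            continue
        # A status line ("-- ...") or a new "CMake ..." message ends the list.
        if stripped.startswith("--") or stripped.startswith("CMake"):
            collecting = False
            continue
        unused.append(stripped)
    return unused


if __name__ == "__main__":
    output = """-- Generating done
CMake Warning:
  Manually-specified variables were not used by the project:
    BUILD_WITH_TENSILE_HOST
-- Build files have been written to: ~/rocm/rocBLAS/build
"""
    print(unused_cmake_variables(output))
```

Any name this returns can be dropped from the cmake command line without changing the build, since the project never read it.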

    ninja

[0/2] Re-checking globbed directories...
[3/248] Generating Tensile Libraries

################################################################################
# Tensile Create Library
# Detected local GPU with ISA: gfx803
               cap gfx000 gfx803 
   HasMFMA_bf16_1k      0      0 
        HasAddLshl      0      0 
      HasAtomicAdd      0      0 
   HasCodeObjectV3      0      1 
    HasDirectToLds      0      1 
     HasExplicitCO      0      0 
     HasExplicitNC      0      0 
         HasLshlOr      0      0 
           HasMFMA      0      0 
         HasSMulHi      0      0 
        MaxLgkmcnt      1      1 
          MaxVmcnt      0      1 
      SupportedISA      0      1 
   SupportedSource      1      1 
    v_dot2_f32_f16      0      0 
   v_dot2c_f32_f16      0      0 
         v_fma_f16      0      0 
        v_fmac_f16      0      0 
         v_mac_f16      0      1 
      v_pk_fma_f16      0      0 
     v_pk_fmac_f16      0      0 
         v_fma_f32      0      1 
     v_fma_mix_f32      0      0 
        v_fmac_f32      0      0 
         v_mac_f32      0      1 
     v_mad_mix_f32      0      0 
       HasMFMA_f64      0      0 
     v_dot4_i32_i8      0      0 
    v_dot4c_i32_i8      0      0 
ArchAccUnifiedRegs      0      0 
    CMPXWritesSGPR      1      1 
          HasAccCD      0      0 
        HasEccHalf      0      0 
         HasWave32      0      0 
     SeparateVscnt      0      0 
  Waitcnt0Disabled      0      0 
# Found  hipcc version 5.0.13601-ded05588
# CodeObjectVersion from TensileCreateLibrary: V3
# CxxCompiler       from TensileCreateLibrary: hipcc
# Architecture      from TensileCreateLibrary: gfx803
# LibraryFormat     from TensileCreateLibrary: yaml
# LibraryLogicFiles:
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_BjlkC_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bljk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_BjlkC_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bjlk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_AlikC_Bljk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_BjlkC_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bjlk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_4xi8BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_4xi8BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_BSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_CB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_CB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_DB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_DB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HBH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HSS_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_HSS_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_I8II_BH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_I8II_BH_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_SB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_SB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_ZB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Alik_Bljk_ZB_GB.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HBH.yaml
#   ~/rocm/rocBLAS/library/src/blas3/Tensile/Logic/asm_full/hip_Cijk_Ailk_Bjlk_HBH_GB.yaml
Reading logic files: Launching 16 threads...
Reading logic files: Done.
Processing logic data: 100%|██████████| 108/108 [00:01<00:00, 105.68it/s]
# Writing Custom CMake
# Writing Kernels...
Generating kernels: Launching 16 threads...
Generating kernels: Done.
569it [00:00, 2086152.95it/s]
Compiling source kernels: Launching 16 threads...
Compiling source kernels: Done.
# Kernel Building elapsed time = 414.6 secs
# Tensile Library Writer DONE
################################################################################

[248/248] Creating library symlink library/src/librocblas.so.0 library/src/librocblas.so

    ninja package
    sudo dpkg -i *.deb

It turns out the error is the same with or without the rocBLAS patch.
I am confused by your README for ROCm 5.0: which should I rebuild, ROCm, TensorFlow, or just rocBLAS?
PyTorch simply builds and works with ROCm 5.0 installed from apt.

21.rocfft.sh, CMAKE_BUILD_WITH_INSTALL_RPATH

Environment

Hardware description
GPU gfx803
CPU ryzen7
Software version
OS debian bookworm
ROCm 4.3
Python 3.9.9
Linux 5.15.0-1-amd64
cmake 3.22.1

What is the expected behavior

  • 21.rocfft.sh builds rocfft

What actually happens

  • build exits with error: CMake Error at library/src/CMakeLists.txt:62 (add_library): The install of the rocfft target requires changing an RPATH from the build tree, but this is not supported with the Ninja generator unless on an ELF-based or XCOFF-based platform. The CMAKE_BUILD_WITH_INSTALL_RPATH variable may be set to avoid this relinking step.

How to reproduce

  • run 21.rocfft.sh

Adding -DCMAKE_BUILD_WITH_INSTALL_RPATH=ON to the cmake cmdline seems to fix the issue.
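
A minimal sketch of where the flag would sit, assuming the script configures rocFFT with the Ninja generator as the error message suggests. The helper name `rocfft_cmake_args` and the other flags are illustrative assumptions, not the actual contents of 21.rocfft.sh; only `-DCMAKE_BUILD_WITH_INSTALL_RPATH=ON` comes from the report above.

```shell
#!/bin/sh
# Hypothetical helper: prints the cmake arguments, one per line.
# Only CMAKE_BUILD_WITH_INSTALL_RPATH is taken from the issue;
# the remaining flags are placeholders for whatever the script already passes.
rocfft_cmake_args() {
    printf '%s\n' \
        "-DCMAKE_BUILD_TYPE=Release" \
        "-DCMAKE_BUILD_WITH_INSTALL_RPATH=ON" \
        "-G" "Ninja"
}

# Print the resulting command line instead of running it.
echo "cmake $(rocfft_cmake_args | tr '\n' ' ')<path-to-rocFFT-source>"
```

With this flag CMake links the build tree with the install RPATH, so the relinking step that the error message complains about is not needed at install time.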

Missing dependency miopen-hip

Environment

Hardware description
GPU - RX6800
CPU - AMD Threadripper
Software version
OS - 20.4.2
ROCm - 4.2
Python -

What is the expected behavior

The shell script is supposed to install the dependency.

What actually happens

The script fails due to the missing dependency.

How to reproduce

bash 56.rocm-libs.sh
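
A small sketch for checking up front whether the package is installed, assuming a Debian or Ubuntu system where `dpkg-query` is available. The function name `missing_pkgs` is hypothetical, and this only reports the state of the system; it does not reflect how 56.rocm-libs.sh handles its dependencies.

```shell
#!/bin/sh
# Hypothetical helper: prints each named package that dpkg does not
# report as fully installed.
missing_pkgs() {
    for pkg in "$@"; do
        if ! dpkg-query -W -f='${Status}' "$pkg" 2>/dev/null \
                | grep -q 'install ok installed'; then
            echo "$pkg"
        fi
    done
}

# Check the dependency named in the issue title before running the script.
missing=$(missing_pkgs miopen-hip)
if [ -n "$missing" ]; then
    echo "not installed: $missing"
fi
```

If `miopen-hip` is reported as missing, it would presumably need to be built or installed before running the script again.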

Build with 5.4.2

Environment

Hardware description
GPU rx580 8GB
CPU ryzen 2600x
Software version
OS Arch
ROCm 5.4.2
Python 3.10

I am not completely sure the problem is here, but when compiling I get a bunch of errors that prevent the Tensile library from being created.

image

image

I will not post the whole error message, since it would take several images, but I think the beginning and the end of the message should be enough.
Although I am using a different OS, the build itself should work, as it has worked with older versions. Sadly, not all older versions are currently available for my OS, which is why I am compiling it instead of using 5.4.1.

19.hip.sh needs CMAKE_HIP_ARCHITECTURES set

Environment

Hardware description
GPU gfx803
CPU ryzen7
Software version
OS debian bookworm
ROCm 4.3
Python 3.9.9
Linux 5.15.0-1-amd64
cmake 3.22.1

What is the expected behavior

  • 19.hip.sh builds HIP

What actually happens

  • HIP not built

How to reproduce

  • run 19.hip.sh

I get build errors for multiple targets like the following:

CMake Error in CMakeLists.txt:
  HIP_ARCHITECTURES is empty for target "directed_tests.hipEnvVar".

Found this:
https://cmake.org/cmake/help/latest/prop_tgt/HIP_ARCHITECTURES.html

Adding -DCMAKE_HIP_ARCHITECTURES=$AMDGPU_TARGETS to the cmake command line seems to fix the issue:

cmake \
    -DCMAKE_BUILD_TYPE=Release \
    -DHIP_COMPILER=clang \
    -DHIP_PLATFORM=amd \
    -DROCM_PATH=$ROCM_INSTALL_DIR \
    -DHSA_PATH=$ROCM_INSTALL_DIR/hsa \
    -DCPACK_INSTALL_PREFIX=$ROCM_INSTALL_DIR/hip \
    -DCMAKE_PREFIX_PATH="$ROCM_BUILD_DIR/rocclr;$ROCM_INSTALL_DIR" \
    -DCMAKE_INSTALL_PREFIX=$ROCM_INSTALL_DIR/hip/ \
    -DCPACK_PACKAGING_INSTALL_PREFIX=$ROCM_INSTALL_DIR/hip/ \
    -DCPACK_GENERATOR=DEB \
    -DCMAKE_HIP_ARCHITECTURES=$AMDGPU_TARGETS \
    -G Ninja \
    $ROCM_GIT_DIR/HIP
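
One caveat, sketched below: if `$AMDGPU_TARGETS` is unset, the flag above would expand to an empty value and the same "HIP_ARCHITECTURES is empty" error would likely come back. The helper name `hip_arch_flag` is hypothetical; it is a guard under that assumption, not part of 19.hip.sh.

```shell
#!/bin/sh
# Hypothetical helper: prints the architecture flag for the given target
# list, or fails with a message if the list is empty.
hip_arch_flag() {
    if [ -z "$1" ]; then
        echo "AMDGPU_TARGETS is empty; set it (e.g. gfx803) before configuring" >&2
        return 1
    fi
    echo "-DCMAKE_HIP_ARCHITECTURES=$1"
}

# Use the current environment; report instead of configuring when unset.
hip_arch_flag "${AMDGPU_TARGETS:-}" || echo "skipping configure: no target set"
```

The printed flag can then be added to the cmake command line in place of the bare `-DCMAKE_HIP_ARCHITECTURES=$AMDGPU_TARGETS`.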

ROCCLR build failure

Environment

Hardware description
GPU - Rx6800
CPU - AMD ThreadRipper
Software version
OS - 20.04.2
ROCm - 4.2
Python - 2.7.18

What is the expected behavior

A successful build.

Scripts 11-17 work well; the build breaks on 18.

What actually happens

bash 18.rocclr.sh

ian@ian-TRX40-AORUS-PRO-WIFI:~/Documents/rocm-build$ bash 18.rocclr.sh
Reading package lists... Done
Building dependency tree       
Reading state information... Done
mesa-common-dev is already the newest version (20.2.6-0ubuntu0.20.04.1).
The following packages were automatically installed and are no longer required:
  libllvm11.0-amdgpu llvm-amdgpu-11.0 llvm-amdgpu-11.0-dev llvm-amdgpu-11.0-runtime llvm-amdgpu-runtime
Use 'sudo apt autoremove' to remove them.
0 upgraded, 0 newly installed, 0 to remove and 10 not upgraded.
~/Documents/rocm-build/build/rocclr ~/Documents/rocm-build/build/rocclr
-- Code Object Manager found at /opt/rocm/lib/cmake/amd_comgr.
-- HSA Runtime found at /opt/rocm/lib/cmake/hsa-runtime64.
-- Found: /usr/lib/x86_64-linux-gnu/libnuma.so
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ian/Documents/rocm-build/build/rocclr
[1/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocsignal.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocsignal.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocsignal.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocsignal.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocsignal.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocsignal.cpp
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocsignal.cpp: In member function ‘virtual uint64_t roc::Signal::Wait(uint64_t, device::Signal::Condition, uint64_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocsignal.cpp:41:10: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   41 |   return hsa_signal_wait_scacquire(
      |          ^~~~~~~~~~~~~~~~~~~~~~~~~
      |          hsa_signal_wait_acquire
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocsignal.cpp: In member function ‘virtual void roc::Signal::Reset(uint64_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocsignal.cpp:50:3: error: ‘hsa_signal_store_screlease’ was not declared in this scope; did you mean ‘hsa_signal_store_release’?
   50 |   hsa_signal_store_screlease(signal_, value);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_release
[2/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocmemory.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocmemory.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocmemory.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocmemory.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocmemory.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.hpp:28,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.cpp:31:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.hpp:28,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.cpp:31:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
[3/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocprintf.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocprintf.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocprintf.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocprintf.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocprintf.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprintf.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.hpp:28,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprintf.cpp:25:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocmemory.hpp:28,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprintf.cpp:25:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.hpp:25,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprintf.cpp:26:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:89:3: error: ‘hsa_code_object_reader_t’ does not name a type; did you mean ‘hsa_code_object_type_t’?
   89 |   hsa_code_object_reader_t hsaCodeObjectReader_; //!< Handle to HSA code reader
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_type_t
[4/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rockernel.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rockernel.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rockernel.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rockernel.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rockernel.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:30,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.hpp:25,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:30,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.hpp:25,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.cpp:21:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.hpp:25,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:89:3: error: ‘hsa_code_object_reader_t’ does not name a type; did you mean ‘hsa_code_object_type_t’?
   89 |   hsa_code_object_reader_t hsaCodeObjectReader_; //!< Handle to HSA code reader
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_type_t
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.cpp: In member function ‘virtual bool roc::LightningKernel::init()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.cpp:75:15: error: ‘hsa_executable_get_symbol_by_name’ was not declared in this scope; did you mean ‘hsa_executable_get_symbol’?
   75 |   hsaStatus = hsa_executable_get_symbol_by_name(program()->hsaExecutable(),
      |               ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |               hsa_executable_get_symbol
[5/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocblit.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocblit.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocblit.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocblit.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocblit.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocblit.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocblit.cpp:22:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocblit.cpp:22:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.hpp:25,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocblit.cpp:25:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:89:3: error: ‘hsa_code_object_reader_t’ does not name a type; did you mean ‘hsa_code_object_type_t’?
   89 |   hsa_code_object_reader_t hsaCodeObjectReader_; //!< Handle to HSA code reader
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_type_t
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocblit.cpp:22:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In instantiation of ‘bool roc::WaitForSignal(hsa_signal_t) [with long unsigned int wait_time = 0; hsa_signal_t = hsa_signal_s]’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocblit.cpp:2435:37:   required from here
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:34: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   63 |                                   wait_time, HSA_WAIT_STATE_ACTIVE) != 0) {
      |                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:34: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   71 |                                   timeout, HSA_WAIT_STATE_ACTIVE) != 0) {
      |                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:36: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope, and no declarations were found by argument-dependent lookup at the point of instantiation [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   74 |                                     kUnlimitedWait, HSA_WAIT_STATE_BLOCKED) != 0) {
      |                                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
[6/11] Building CXX object CMakeFiles/amdrocclr_static.dir/device/device.cpp.o
FAILED: CMakeFiles/amdrocclr_static.dir/device/device.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_HSA_DEVICE -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -isystem /opt/rocm/include -isystem /opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -pthread -std=c++11 -MD -MT CMakeFiles/amdrocclr_static.dir/device/device.cpp.o -MF CMakeFiles/amdrocclr_static.dir/device/device.cpp.o.d -o CMakeFiles/amdrocclr_static.dir/device/device.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.cpp:32:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.cpp:32:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.cpp: In constructor ‘amd::Device::Device()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.cpp:445:37: warning: ‘void* memset(void*, int, size_t)’ clearing an object of type ‘struct device::Info’ with no trivial copy-assignment; use assignment or value-initialization instead [-Wclass-memaccess]
  445 |   memset(&info_, '\0', sizeof(info_));
      |                                     ^
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/device.hpp:222:8: note: ‘struct device::Info’ declared here
  222 | struct Info : public amd::EmbeddedObject {
      |        ^~~~
[7/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/roccounters.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/roccounters.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/roccounters.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/roccounters.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/roccounters.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp:26,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp:26,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.cpp:21:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp: In member function ‘bool roc::PerfCounterProfile::Create()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp:121:36: error: ‘HSA_EXTENSION_AMD_AQLPROFILE’ was not declared in this scope; did you mean ‘HSA_EXTENSION_AMD_PROFILER’?
  121 |     hsa_system_extension_supported(HSA_EXTENSION_AMD_AQLPROFILE, 1, 0, &system_support);
      |                                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                                    HSA_EXTENSION_AMD_PROFILER
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp:127:9: error: ‘hsa_system_get_major_extension_table’ was not declared in this scope; did you mean ‘hsa_system_get_extension_table’?
  127 |     if (hsa_system_get_major_extension_table(HSA_EXTENSION_AMD_AQLPROFILE,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |         hsa_system_get_extension_table
[8/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocprogram.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocprogram.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocprogram.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocprogram.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocprogram.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:30,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:23:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:30,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:23:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:23:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:89:3: error: ‘hsa_code_object_reader_t’ does not name a type; did you mean ‘hsa_code_object_type_t’?
   89 |   hsa_code_object_reader_t hsaCodeObjectReader_; //!< Handle to HSA code reader
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_type_t
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp: In destructor ‘virtual roc::Program::~Program()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:78:7: error: ‘hsaCodeObjectReader_’ was not declared in this scope
   78 |   if (hsaCodeObjectReader_.handle != 0) {
      |       ^~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:79:5: error: ‘hsa_code_object_reader_destroy’ was not declared in this scope; did you mean ‘hsa_code_object_destroy’?
   79 |     hsa_code_object_reader_destroy(hsaCodeObjectReader_);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_destroy
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp: In constructor ‘roc::Program::Program(roc::NullDevice&, amd::Program&)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:86:3: error: ‘hsaCodeObjectReader_’ was not declared in this scope
   86 |   hsaCodeObjectReader_.handle = 0;
      |   ^~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp: In member function ‘virtual bool roc::Program::createGlobalVarObj(amd::Memory**, void**, size_t*, const char*) const’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:167:12: error: ‘hsa_executable_get_symbol_by_name’ was not declared in this scope; did you mean ‘hsa_executable_get_symbol’?
  167 |   status = hsa_executable_get_symbol_by_name(hsaExecutable_, global_name, &hsa_device,
      |            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |            hsa_executable_get_symbol
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp: In member function ‘virtual bool roc::LightningProgram::setKernels(amd::option::Options*, void*, size_t, amd::Os::FileDesc, size_t, std::string)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:495:12: error: ‘hsa_executable_create_alt’ was not declared in this scope; did you mean ‘hsa_executable_create’?
  495 |   status = hsa_executable_create_alt(HSA_PROFILE_FULL, HSA_DEFAULT_FLOAT_ROUNDING_MODE_DEFAULT,
      |            ^~~~~~~~~~~~~~~~~~~~~~~~~
      |            hsa_executable_create
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:507:72: error: ‘hsaCodeObjectReader_’ was not declared in this scope
  507 |   status = hsa_code_object_reader_create_from_memory(binary, binSize, &hsaCodeObjectReader_);
      |                                                                        ^~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:507:12: error: ‘hsa_code_object_reader_create_from_memory’ was not declared in this scope
  507 |   status = hsa_code_object_reader_create_from_memory(binary, binSize, &hsaCodeObjectReader_);
      |            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.cpp:515:12: error: ‘hsa_executable_load_agent_code_object’ was not declared in this scope; did you mean ‘hsa_executable_load_code_object’?
  515 |   status = hsa_executable_load_agent_code_object(hsaExecutable_, agent, hsaCodeObjectReader_, nullptr,
      |            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |            hsa_executable_load_code_object
[9/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocdevice.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocdevice.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocdevice.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocdevice.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocdevice.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:36:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:36:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:39:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:89:3: error: ‘hsa_code_object_reader_t’ does not name a type; did you mean ‘hsa_code_object_type_t’?
   89 |   hsa_code_object_reader_t hsaCodeObjectReader_; //!< Handle to HSA code reader
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_type_t
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp: In static member function ‘static bool roc::Device::init()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:469:40: error: ‘HSA_EXTENSION_AMD_LOADER’ was not declared in this scope; did you mean ‘HSA_EXTENSION_AMD_PROFILER’?
  469 |   hsa_system_get_major_extension_table(HSA_EXTENSION_AMD_LOADER, 1, sizeof(amd_loader_ext_table),
      |                                        ^~~~~~~~~~~~~~~~~~~~~~~~
      |                                        HSA_EXTENSION_AMD_PROFILER
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:469:3: error: ‘hsa_system_get_major_extension_table’ was not declared in this scope; did you mean ‘hsa_system_get_extension_table’?
  469 |   hsa_system_get_major_extension_table(HSA_EXTENSION_AMD_LOADER, 1, sizeof(amd_loader_ext_table),
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_system_get_extension_table
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp: In member function ‘bool roc::Device::create()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:590:7: error: ‘hsa_agent_iterate_isas’ was not declared in this scope; did you mean ‘hsa_agent_iterate_regions’?
  590 |       hsa_agent_iterate_isas(_bkendDevice,
      |       ^~~~~~~~~~~~~~~~~~~~~~
      |       hsa_agent_iterate_regions
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:611:7: error: ‘hsa_isa_get_info_alt’ was not declared in this scope; did you mean ‘hsa_isa_get_info’?
  611 |       hsa_isa_get_info_alt(agent_isas.first_isa, (hsa_isa_info_t)HSA_ISA_INFO_NAME_LENGTH,
      |       ^~~~~~~~~~~~~~~~~~~~
      |       hsa_isa_get_info
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:620:7: error: ‘hsa_isa_get_info_alt’ was not declared in this scope; did you mean ‘hsa_isa_get_info’?
  620 |       hsa_isa_get_info_alt(agent_isas.first_isa, (hsa_isa_info_t)HSA_ISA_INFO_NAME,
      |       ^~~~~~~~~~~~~~~~~~~~
      |       hsa_isa_get_info
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp: In member function ‘hsa_queue_t* roc::Device::acquireQueue(uint32_t, bool, const std::vector<unsigned int>&, amd::CommandQueue::Priority)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.cpp:2609:18: error: ‘HSA_QUEUE_TYPE_COOPERATIVE’ was not declared in this scope; did you mean ‘HSA_QUEUE_TYPE_MULTI’?
 2609 |     queue_type = HSA_QUEUE_TYPE_COOPERATIVE;
      |                  ^~~~~~~~~~~~~~~~~~~~~~~~~~
      |                  HSA_QUEUE_TYPE_MULTI
[10/11] Building CXX object device/rocm/CMakeFiles/oclrocm.dir/rocvirtual.cpp.o
FAILED: device/rocm/CMakeFiles/oclrocm.dir/rocvirtual.cpp.o 
/usr/bin/c++  -DATI_ARCH_X86 -DATI_BITS_64 -DATI_OS_LINUX -DCL_TARGET_OPENCL_VERSION=220 -DCL_USE_DEPRECATED_OPENCL_1_0_APIS -DCL_USE_DEPRECATED_OPENCL_1_1_APIS -DCL_USE_DEPRECATED_OPENCL_1_2_APIS -DCL_USE_DEPRECATED_OPENCL_2_0_APIS -DCOMGR_DYN_DLL -DLITTLEENDIAN_CPU -DOPENCL_C_MAJOR=2 -DOPENCL_C_MINOR=0 -DOPENCL_EXPORTS -DOPENCL_MAJOR=2 -DOPENCL_MINOR=1 -DROCCLR_SUPPORT_NUMA_POLICY -DUSE_COMGR_LIBRARY -DVEGA10_ONLY=false -DWITH_LIGHTNING_COMPILER -DWITH_TARGET_AMDGCN -D__x86_64__ -I/home/ian/Documents/rocm-build/ROCm/ROCclr -I/home/ian/Documents/rocm-build/ROCm/ROCclr/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime/khronos/headers/opencl2.2 -I/opt/rocm/include -I/home/ian/Documents/rocm-build/ROCm/ROCm-OpenCL-Runtime -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/include -I/home/ian/Documents/rocm-build/ROCm/ROCclr/compiler/lib/backends/common -I/home/ian/Documents/rocm-build/ROCm/ROCclr/elf -Idevice/rocm -I/opt/rocm/include/hsa -fPIC   -Wall -Wno-strict-aliasing -Wno-unused-variable -std=c++11 -MD -MT device/rocm/CMakeFiles/oclrocm.dir/rocvirtual.cpp.o -MF device/rocm/CMakeFiles/oclrocm.dir/rocvirtual.cpp.o.d -o device/rocm/CMakeFiles/oclrocm.dir/rocvirtual.cpp.o -c /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:22:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In function ‘bool roc::WaitForSignal(hsa_signal_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:9: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:9: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:11: error: there are no arguments to ‘hsa_signal_wait_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_signal_wait_scacquire’ must be available [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:46,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:22:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h: At global scope:
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:277:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  277 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:442:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope; did you mean ‘hsa_code_object_t’?
  442 |   hsa_loaded_code_object_t loaded_code_object,
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:443:48: error: expected primary-expression before ‘attribute’
  443 |   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
      |                                                ^~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:3: error: expected primary-expression before ‘void’
  444 |   void *value);
      |   ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:444:14: error: expression list treated as compound expression in initializer [-fpermissive]
  444 |   void *value);
      |              ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:489:5: error: ‘hsa_file_t’ was not declared in this scope; did you mean ‘hsa_isa_t’?
  489 |     hsa_file_t file,
      |     ^~~~~~~~~~
      |     hsa_isa_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:490:12: error: expected primary-expression before ‘offset’
  490 |     size_t offset,
      |            ^~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:491:12: error: expected primary-expression before ‘size’
  491 |     size_t size,
      |            ^~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:5: error: ‘hsa_code_object_reader_t’ was not declared in this scope; did you mean ‘hsa_code_object_type_t’?
  492 |     hsa_code_object_reader_t *code_object_reader);
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
      |     hsa_code_object_type_t
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:31: error: ‘code_object_reader’ was not declared in this scope
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                               ^~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:492:49: error: expression list treated as compound expression in initializer [-fpermissive]
  492 |     hsa_code_object_reader_t *code_object_reader);
      |                                                 ^
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:538:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  538 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:543:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  543 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:568:7: error: ‘hsa_loaded_code_object_t’ has not been declared
  568 |       hsa_loaded_code_object_t loaded_code_object,
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:573:5: error: ‘hsa_loaded_code_object_t’ has not been declared
  573 |     hsa_loaded_code_object_t loaded_code_object,
      |     ^~~~~~~~~~~~~~~~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:579:7: error: ‘hsa_file_t’ has not been declared
  579 |       hsa_file_t file,
      |       ^~~~~~~~~~
/opt/rocm/include/hsa/hsa_ven_amd_loader.h:582:7: error: ‘hsa_code_object_reader_t’ has not been declared
  582 |       hsa_code_object_reader_t *code_object_reader);
      |       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rockernel.hpp:25,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:24:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocprogram.hpp:89:3: error: ‘hsa_code_object_reader_t’ does not name a type; did you mean ‘hsa_code_object_type_t’?
   89 |   hsa_code_object_reader_t hsaCodeObjectReader_; //!< Handle to HSA code reader
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_code_object_type_t
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:27:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp: In member function ‘bool roc::PerfCounterProfile::Create()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp:121:36: error: ‘HSA_EXTENSION_AMD_AQLPROFILE’ was not declared in this scope; did you mean ‘HSA_EXTENSION_AMD_PROFILER’?
  121 |     hsa_system_extension_supported(HSA_EXTENSION_AMD_AQLPROFILE, 1, 0, &system_support);
      |                                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                                    HSA_EXTENSION_AMD_PROFILER
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/roccounters.hpp:127:9: error: ‘hsa_system_get_major_extension_table’ was not declared in this scope; did you mean ‘hsa_system_get_extension_table’?
  127 |     if (hsa_system_get_major_extension_table(HSA_EXTENSION_AMD_AQLPROFILE,
      |         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |         hsa_system_get_extension_table
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In member function ‘hsa_signal_t roc::VirtualGPU::HwQueueTracker::ActiveSignal(hsa_signal_value_t, roc::Timestamp*, uint32_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:328:3: error: ‘hsa_signal_silent_store_relaxed’ was not declared in this scope; did you mean ‘hsa_signal_store_relaxed’?
  328 |   hsa_signal_silent_store_relaxed(prof_signal->signal_, init_val);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_relaxed
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In member function ‘void roc::VirtualGPU::HwQueueTracker::ResetCurrentSignal()’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:429:3: error: ‘hsa_signal_silent_store_relaxed’ was not declared in this scope; did you mean ‘hsa_signal_store_relaxed’?
  429 |   hsa_signal_silent_store_relaxed(signal_list_[current_id_]->signal_, 0);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_relaxed
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In member function ‘bool roc::VirtualGPU::dispatchGenericAqlPacket(AqlPacket*, uint16_t, uint16_t, bool, size_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:685:20: error: there are no arguments to ‘hsa_queue_add_write_index_screlease’ that depend on a template parameter, so a declaration of ‘hsa_queue_add_write_index_screlease’ must be available [-fpermissive]
  685 |   uint64_t index = hsa_queue_add_write_index_screlease(gpu_queue_, size);
      |                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:696:19: error: there are no arguments to ‘hsa_queue_load_read_index_scacquire’ that depend on a template parameter, so a declaration of ‘hsa_queue_load_read_index_scacquire’ must be available [-fpermissive]
  696 |   while ((index - hsa_queue_load_read_index_scacquire(gpu_queue_)) >= queueMask) {
      |                   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/include/top.hpp:101,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/devhostcall.hpp:23,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:726:36: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’?
  726 |             extractAqlBits(header, HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE,
      |                                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:727:28: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’?
  727 |                            HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE),
      |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:728:36: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’?
  728 |             extractAqlBits(header, HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE,
      |                                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:729:28: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’?
  729 |                            HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE),
      |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:726:36: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’?
  726 |             extractAqlBits(header, HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE,
      |                                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:727:28: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’?
  727 |                            HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE),
      |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:728:36: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’?
  728 |             extractAqlBits(header, HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE,
      |                                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:729:28: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’?
  729 |                            HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE),
      |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:744:3: error: there are no arguments to ‘hsa_signal_store_screlease’ that depend on a template parameter, so a declaration of ‘hsa_signal_store_screlease’ must be available [-fpermissive]
  744 |   hsa_signal_store_screlease(gpu_queue_->doorbell_signal, index - 1);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In member function ‘void roc::VirtualGPU::dispatchBarrierPacket(hsa_barrier_and_packet_t*, uint16_t, bool)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:813:20: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope; did you mean ‘hsa_queue_add_write_index_release’?
  813 |   uint64_t index = hsa_queue_add_write_index_screlease(gpu_queue_, 1);
      |                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                    hsa_queue_add_write_index_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:828:19: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope; did you mean ‘hsa_queue_load_read_index_acquire’?
  828 |   while ((index - hsa_queue_load_read_index_scacquire(gpu_queue_)) >= queueMask);
      |                   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                   hsa_queue_load_read_index_acquire
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:834:3: error: ‘hsa_signal_store_screlease’ was not declared in this scope; did you mean ‘hsa_signal_store_release’?
  834 |   hsa_signal_store_screlease(gpu_queue_->doorbell_signal, index);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_release
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/include/top.hpp:101,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/devhostcall.hpp:23,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:844:40: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’?
  844 |           extractAqlBits(packetHeader, HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE,
      |                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:845:27: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’?
  845 |                           HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE),
      |                           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:846:40: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’?
  846 |           extractAqlBits(packetHeader, HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE,
      |                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:847:27: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’?
  847 |                           HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE),
      |                           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:844:40: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’?
  844 |           extractAqlBits(packetHeader, HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE,
      |                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:845:27: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’?
  845 |                           HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE),
      |                           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:846:40: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’?
  846 |           extractAqlBits(packetHeader, HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE,
      |                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:847:27: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’?
  847 |                           HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE),
      |                           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In member function ‘void roc::VirtualGPU::dispatchBarrierValuePacket(const hsa_amd_barrier_value_packet_t*, hsa_amd_vendor_packet_header_t)’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2088:20: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope; did you mean ‘hsa_queue_add_write_index_release’?
 2088 |   uint64_t index = hsa_queue_add_write_index_screlease(gpu_queue_, 1);
      |                    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                    hsa_queue_add_write_index_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2089:19: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope; did you mean ‘hsa_queue_load_read_index_acquire’?
 2089 |   while ((index - hsa_queue_load_read_index_scacquire(gpu_queue_)) >= queueMask) {
      |                   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                   hsa_queue_load_read_index_acquire
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2098:3: error: ‘hsa_signal_store_screlease’ was not declared in this scope; did you mean ‘hsa_signal_store_release’?
 2098 |   hsa_signal_store_screlease(gpu_queue_->doorbell_signal, index);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_release
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/include/top.hpp:101,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/devhostcall.hpp:23,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:21:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2106:41: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’?
 2106 |           extractAqlBits(header.header, HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE,
      |                                         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2107:26: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’?
 2107 |                          HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE),
      |                          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2108:41: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’?
 2108 |           extractAqlBits(header.header, HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE,
      |                                         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2109:26: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’?
 2109 |                          HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE),
      |                          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:200:68: note: in definition of macro ‘ClPrint’
  200 |           amd::log_printf(level, __FILENAME__, __LINE__, format, ##__VA_ARGS__);                   \
      |                                                                    ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2106:41: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’?
 2106 |           extractAqlBits(header.header, HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE,
      |                                         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2107:26: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’?
 2107 |                          HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE),
      |                          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2108:41: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’?
 2108 |           extractAqlBits(header.header, HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE,
      |                                         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:2109:26: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope; did you mean ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’?
 2109 |                          HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE),
      |                          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/utils/debug.hpp:202:51: note: in definition of macro ‘ClPrint’
  202 |           amd::log_printf(level, "", 0, format, ##__VA_ARGS__);                                    \
      |                                                   ^~~~~~~~~~~
In file included from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocdevice.hpp:38,
                 from /home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:22:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In instantiation of ‘bool roc::WaitForSignal(hsa_signal_t) [with long unsigned int wait_time = 0; hsa_signal_t = hsa_signal_s]’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:119:34:   required from here
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:34: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   63 |                                   wait_time, HSA_WAIT_STATE_ACTIVE) != 0) {
      |                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:34: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   71 |                                   timeout, HSA_WAIT_STATE_ACTIVE) != 0) {
      |                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:36: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope, and no declarations were found by argument-dependent lookup at the point of instantiation [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   74 |                                     kUnlimitedWait, HSA_WAIT_STATE_BLOCKED) != 0) {
      |                                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp: In instantiation of ‘bool roc::WaitForSignal(hsa_signal_t) [with long unsigned int wait_time = 50000; hsa_signal_t = hsa_signal_s]’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:392:60:   required from here
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:62:34: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   62 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   63 |                                   wait_time, HSA_WAIT_STATE_ACTIVE) != 0) {
      |                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:70:34: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope; did you mean ‘hsa_signal_wait_acquire’?
   70 |     if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |         ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   71 |                                   timeout, HSA_WAIT_STATE_ACTIVE) != 0) {
      |                                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.hpp:73:36: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope, and no declarations were found by argument-dependent lookup at the point of instantiation [-fpermissive]
   73 |       if (hsa_signal_wait_scacquire(signal, HSA_SIGNAL_CONDITION_LT, kInitSignalValueOne,
      |           ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   74 |                                     kUnlimitedWait, HSA_WAIT_STATE_BLOCKED) != 0) {
      |                                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In instantiation of ‘bool roc::VirtualGPU::dispatchGenericAqlPacket(AqlPacket*, uint16_t, uint16_t, bool, size_t) [with AqlPacket = hsa_kernel_dispatch_packet_s; uint16_t = short unsigned int; size_t = long unsigned int]’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:770:65:   required from here
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:685:55: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope; did you mean ‘hsa_queue_add_write_index_release’?
  685 |   uint64_t index = hsa_queue_add_write_index_screlease(gpu_queue_, size);
      |                    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~
      |                    hsa_queue_add_write_index_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:696:54: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope; did you mean ‘hsa_queue_load_read_index_acquire’?
  696 |   while ((index - hsa_queue_load_read_index_scacquire(gpu_queue_)) >= queueMask) {
      |                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
      |                   hsa_queue_load_read_index_acquire
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:744:29: error: ‘hsa_signal_store_screlease’ was not declared in this scope; did you mean ‘hsa_signal_store_release’?
  744 |   hsa_signal_store_screlease(gpu_queue_->doorbell_signal, index - 1);
      |   ~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In instantiation of ‘bool roc::VirtualGPU::dispatchGenericAqlPacket(AqlPacket*, uint16_t, uint16_t, bool, size_t) [with AqlPacket = hsa_barrier_and_packet_s; uint16_t = short unsigned int; size_t = long unsigned int]’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:776:65:   required from here
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:685:55: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope; did you mean ‘hsa_queue_add_write_index_release’?
  685 |   uint64_t index = hsa_queue_add_write_index_screlease(gpu_queue_, size);
      |                    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~
      |                    hsa_queue_add_write_index_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:696:54: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope; did you mean ‘hsa_queue_load_read_index_acquire’?
  696 |   while ((index - hsa_queue_load_read_index_scacquire(gpu_queue_)) >= queueMask) {
      |                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
      |                   hsa_queue_load_read_index_acquire
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:744:29: error: ‘hsa_signal_store_screlease’ was not declared in this scope; did you mean ‘hsa_signal_store_release’?
  744 |   hsa_signal_store_screlease(gpu_queue_->doorbell_signal, index - 1);
      |   ~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp: In instantiation of ‘bool roc::VirtualGPU::dispatchGenericAqlPacket(AqlPacket*, uint16_t, uint16_t, bool, size_t) [with AqlPacket = hsa_ext_amd_aql_pm4_packet_t; uint16_t = short unsigned int; size_t = long unsigned int]’:
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:793:90:   required from here
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:685:55: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope; did you mean ‘hsa_queue_add_write_index_release’?
  685 |   uint64_t index = hsa_queue_add_write_index_screlease(gpu_queue_, size);
      |                    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~
      |                    hsa_queue_add_write_index_release
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:696:54: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope; did you mean ‘hsa_queue_load_read_index_acquire’?
  696 |   while ((index - hsa_queue_load_read_index_scacquire(gpu_queue_)) >= queueMask) {
      |                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
      |                   hsa_queue_load_read_index_acquire
/home/ian/Documents/rocm-build/ROCm/ROCclr/device/rocm/rocvirtual.cpp:744:29: error: ‘hsa_signal_store_screlease’ was not declared in this scope; did you mean ‘hsa_signal_store_release’?
  744 |   hsa_signal_store_screlease(gpu_queue_->doorbell_signal, index - 1);
      |   ~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |   hsa_signal_store_release
ninja: build stopped: subcommand failed.

How to reproduce

bash 18.rocclr.sh
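
Every error in the log above names an `sc`-prefixed HSA identifier (for example `hsa_signal_store_screlease` or `HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE`) that the compiler cannot find, while suggesting the unprefixed name instead. One possible cause, not confirmed in this report, is that an older `hsa.h` is found ahead of the one installed by the earlier build scripts. A minimal sketch for checking which of these names a given header declares; the header path is an assumption and should be adjusted to the local install:

```python
import re
from pathlib import Path

# Identifiers copied from the error log above.
REQUIRED = [
    "hsa_signal_store_screlease",
    "hsa_signal_wait_scacquire",
    "hsa_queue_add_write_index_screlease",
    "hsa_queue_load_read_index_scacquire",
    "HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE",
    "HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE",
    "HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE",
    "HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE",
]


def missing_symbols(header_text, symbols=REQUIRED):
    """Return the symbols that never occur as whole identifiers in header_text."""
    return [
        s for s in symbols
        if not re.search(r"\b" + re.escape(s) + r"\b", header_text)
    ]


if __name__ == "__main__":
    # Hypothetical location; point this at the hsa.h the build actually includes.
    header = Path("/opt/rocm/include/hsa/hsa.h")
    if header.exists():
        for name in missing_symbols(header.read_text(errors="replace")):
            print("not declared:", name)
    else:
        print("header not found:", header)
```

If the header lacks these names, the HSA runtime headers on the include path are likely older than the ROCclr sources being compiled.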

Navi 14 patches fail in 5.3

Environment

Hardware description
GPU RX 5500 XT 8GB
CPU Ryzen 9 5950X
Software version
OS Ubuntu 22.04.1
ROCm 5.3.3
Python 3.10.6

What is the expected behavior

Compiles fine

What actually happens

|====|
|SLOW|
|====|
~/rocm-build/build/rocblas ~/rocm-build/build/rocblas
HEAD is now at 006a5d65 hotfix: Assembly syntax and coverage for DirectToLds has changed (#1598) (#1610)
error: patch failed: Tensile/Common.py:198
error: Tensile/Common.py: patch does not apply

How to reproduce

Apply the steps of the Navi 14 tutorial.
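
The output above shows that the patch no longer matches `Tensile/Common.py` around line 198 at the checked-out commit. A minimal sketch that extracts the file and line of each rejected hunk from such output, so they can be compared against the upstream source; the function name is hypothetical:

```python
import re

# Matches lines such as "error: patch failed: Tensile/Common.py:198".
_FAILED = re.compile(r"^error: patch failed: (?P<path>.+):(?P<line>\d+)$")


def failed_hunks(output):
    """Return (path, line) pairs for every hunk reported as failed."""
    hunks = []
    for line in output.splitlines():
        m = _FAILED.match(line.strip())
        if m:
            hunks.append((m.group("path"), int(m.group("line"))))
    return hunks


if __name__ == "__main__":
    log = (
        "error: patch failed: Tensile/Common.py:198\n"
        "error: Tensile/Common.py: patch does not apply\n"
    )
    for path, line in failed_hunks(log):
        print(f"{path}: hunk near line {line} was rejected")
```

Running `git apply --check` on the patch file first reports the same errors without modifying the working tree.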

Expanded `__noinline__` macro: cannot compile anything past script 21, including script 22 in navi10

Environment

Hardware description
GPU gfx1010 (RX 5600 XT)
CPU 12700K (AVX-512, P-cores only)
Software version
OS Pop!_OS 22.04
ROCm 5.2.1
Python 5.10
Compiler gcc/g++ 12.1.x

What is the expected behavior

The build should complete properly.

What actually happens

The build fails with an error during macro expansion.

How to reproduce

Building on 22.04, after having built scripts 00 11 12 13 14 15 16 17 18,
attempt to build 21 and `navi10/22`.

log.txt

Error for 28.hipfft.sh

What actually happens

The terminal exits when running 28.hipfft.sh.

I'm very new to Git, Ubuntu and everything. My guess is that the checkout pulled from the ROCm Git repositories does not include hipFFT. So I downloaded it manually and put it in the right folder, then ran 28.hipfft.sh again and it worked.

I'm not sure if it's OK to do so.

Issue downloading from Bintray

Environment

Hardware description
GPU RX6800
CPU AMD
Software version
OS 20.4.2
ROCm 4.2
Python -

What is the expected behavior

bash 34.miopen.sh

completes

What actually happens

Downloading https://dl.bintray.com/boostorg/release/1.72.0/source/boost_1_72_0.tar.bz2
CMake Error at /usr/local/share/cmake/cmakeget/CMakeGet.cmake:151 (message):
Download failed: "HTTP response code said error":

How to reproduce

Requesting https://dl.bintray.com/boostorg/release/1.72.0/source/boost_1_72_0.tar.bz2 returns Forbidden. I cannot find where this URL is referenced.

See the related Boost issue:
boostorg/boost#299
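
JFrog retired Bintray in 2021, so URLs under `dl.bintray.com` no longer resolve to a download. A minimal sketch of rewriting such a URL to another host that keeps the same `release/<version>/source/<file>` layout. The mirror host `archives.boost.io` is an assumption here and should be verified before use, and where the build actually configures this URL is not established in this report:

```python
from urllib.parse import urlsplit, urlunsplit

BINTRAY_HOST = "dl.bintray.com"
BINTRAY_PREFIX = "/boostorg/"
# Assumed mirror host; confirm it serves the file before relying on it.
MIRROR_HOST = "archives.boost.io"


def rewrite_boost_url(url, mirror_host=MIRROR_HOST):
    """Map a Bintray Boost URL onto mirror_host; return other URLs unchanged."""
    parts = urlsplit(url)
    if parts.netloc != BINTRAY_HOST or not parts.path.startswith(BINTRAY_PREFIX):
        return url
    # Drop the "/boostorg" organisation prefix, keep "release/<version>/...".
    new_path = "/" + parts.path[len(BINTRAY_PREFIX):]
    return urlunsplit(("https", mirror_host, new_path, parts.query, parts.fragment))


if __name__ == "__main__":
    old = "https://dl.bintray.com/boostorg/release/1.72.0/source/boost_1_72_0.tar.bz2"
    print(rewrite_boost_url(old))
```

An alternative that avoids the download entirely is to install Boost from the distribution packages, if the build can be pointed at a system copy.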

question about rock-dkms script

Running the script 62.rock-dkms.sh generated a .deb file. Does the user then need to run `dpkg -i rock-dkms_5.1-36_all.deb`?

That command fails with the following error. Thanks for any guidance.

dpkg: regarding rock-dkms_5.1-36_all.deb containing rock-dkms, pre-dependency problem:
rock-dkms pre-depends on rock-dkms-firmware (= 1:5.1-36)
rock-dkms-firmware is not installed.

dpkg: error processing archive rock-dkms_5.1-36_all.deb (--install):
pre-dependency problem - not installing rock-dkms
Errors were encountered while processing:
rock-dkms_5.1-36_all.deb
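
The dpkg message says that `rock-dkms` declares a pre-dependency on `rock-dkms-firmware` at exactly version `1:5.1-36`, so that package has to be installed before `rock-dkms` can be unpacked. Whether the build script also produces the firmware package is not stated here. A minimal sketch that pulls the pre-dependencies out of such dpkg output so they can be installed first; the function name is hypothetical:

```python
import re

# Matches lines such as
# "rock-dkms pre-depends on rock-dkms-firmware (= 1:5.1-36)".
_PREDEP = re.compile(
    r"^\s*(?P<pkg>\S+) pre-depends on (?P<dep>\S+)(?: \((?P<constraint>[^)]*)\))?"
)


def pre_dependencies(dpkg_output):
    """Return (package, dependency, version constraint or None) tuples."""
    found = []
    for line in dpkg_output.splitlines():
        m = _PREDEP.match(line)
        if m:
            found.append((m.group("pkg"), m.group("dep"), m.group("constraint")))
    return found


if __name__ == "__main__":
    log = (
        "dpkg: regarding rock-dkms_5.1-36_all.deb containing rock-dkms, "
        "pre-dependency problem:\n"
        "rock-dkms pre-depends on rock-dkms-firmware (= 1:5.1-36)\n"
        "rock-dkms-firmware is not installed.\n"
    )
    for pkg, dep, constraint in pre_dependencies(log):
        print(f"install {dep} ({constraint}) before {pkg}")
```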

Minor build issues on Debian bookworm

Environment

Hardware description
GPU gfx803
CPU ryzen7
Software version
OS debian bookworm
ROCm 4.3
Python 3.9.9
Linux 5.15.0-1-amd64
cmake 3.22.1

I'm currently trying to build ROCm 4.5 for gfx803 using your feature/build branch. If I run into any more minor issues I'll report them here, instead of opening a new issue for every little thing.

BTW, thanks for your awesome work with the build scripts and especially support for the gfx803.

Build error on rocSPARSE fork following gfx803 instructions

I have installed the 3.9.1 versions of rocm-dev and rocm-libs, plus cmake and libc6-dev, on my Ubuntu 20.10 system, using a Radeon RX680.

But when I run `bash install.sh -di` in xuhuisheng/rocSPARSE, I get:

Creating project build directory in: ./build
Get:1 file:/var/opt/amdgpu-pro-local ./ InRelease
Ign:1 file:/var/opt/amdgpu-pro-local ./ InRelease
Get:2 file:/var/opt/amdgpu-pro-local ./ Release [816 B]
Get:2 file:/var/opt/amdgpu-pro-local ./ Release [816 B]
Get:3 file:/var/opt/amdgpu-pro-local ./ Release.gpg
Ign:3 file:/var/opt/amdgpu-pro-local ./ Release.gpg
Hit:4 http://nl.archive.ubuntu.com/ubuntu groovy InRelease
Hit:5 http://dl.google.com/linux/chrome/deb stable InRelease                                                                            
Hit:6 https://download.docker.com/linux/ubuntu groovy InRelease                                                                         
Hit:7 http://nl.archive.ubuntu.com/ubuntu groovy-updates InRelease                                                                      
Hit:8 http://nl.archive.ubuntu.com/ubuntu groovy-backports InRelease                                                                    
Hit:9 https://packages.microsoft.com/repos/ms-teams stable InRelease                                                                    
Hit:10 https://updates.signal.org/desktop/apt xenial InRelease                                                                          
Hit:11 https://packages.microsoft.com/repos/vscode stable InRelease                                               
Hit:12 https://repo.nordvpn.com/deb/nordvpn/debian stable InRelease                                               
Hit:13 http://security.ubuntu.com/ubuntu groovy-security InRelease                          
Hit:14 https://repo.radeon.com/rocm/apt/3.9.1 xenial InRelease        
Reading package lists... Done
Building dependency tree       
Reading state information... Done
All packages are up to date.
~/workspace/rocSPARSE ~/workspace/rocSPARSE
Building googletest from source; installing into /usr/local
-- Configuring gtest external dependency
-- ExternalGmock using ( 13 ) cores to build with
CMake Deprecation Warning at CMakeLists.txt:63 (cmake_policy):
  The OLD behavior for policy CMP0037 will be removed from a future version
  of CMake.

  The cmake-policies(7) manual explains that the OLD behaviors of all
  policies are deprecated and that a policy should be set to OLD only under
  specific short-term circumstances.  Projects should be ported to the NEW
  behavior and not rely on setting a policy to OLD.


-- Configuring done
-- Generating done
-- Build files have been written to: /home/user/rocSPARSE/build/deps
make[1]: Entering directory '/home/user/rocSPARSE/build/deps'
make[2]: Entering directory '/home/user/rocSPARSE/build/deps'
make[2]: Leaving directory '/home/user/rocSPARSE/build/deps'
make[2]: Entering directory '/home/user/rocSPARSE/build/deps'
[ 12%] Performing update step for 'googletest'
[ 25%] Performing configure step for 'googletest'
-- Configuring done
-- Generating done
-- Build files have been written to: /home/user/rocSPARSE/build/deps/gtest/src/googletest-build
[ 37%] Performing build step for 'googletest'
-- googletest build command succeeded.  See also /home/user/rocSPARSE/build/deps/gtest/src/googletest-stamp/googletest-build-*.log
[ 50%] No install step for 'googletest'
[ 62%] Completed 'googletest'
make[2]: Leaving directory '/home/user/rocSPARSE/build/deps'
[100%] Built target googletest
make[1]: Leaving directory '/home/user/rocSPARSE/build/deps'
make[1]: Entering directory '/home/user/rocSPARSE/build/deps'
make[2]: Entering directory '/home/user/rocSPARSE/build/deps'
make[3]: Entering directory '/home/user/rocSPARSE/build/deps'
make[3]: Leaving directory '/home/user/rocSPARSE/build/deps'
make[3]: Entering directory '/home/user/rocSPARSE/build/deps'
[ 12%] Performing update step for 'googletest'
[ 25%] Performing configure step for 'googletest'
-- Configuring done
-- Generating done
-- Build files have been written to: /home/user/rocSPARSE/build/deps/gtest/src/googletest-build
[ 37%] Performing build step for 'googletest'
-- googletest build command succeeded.  See also /home/user/rocSPARSE/build/deps/gtest/src/googletest-stamp/googletest-build-*.log
[ 50%] No install step for 'googletest'
[ 62%] Completed 'googletest'
make[3]: Leaving directory '/home/user/rocSPARSE/build/deps'
[100%] Built target googletest
make[3]: Entering directory '/home/user/rocSPARSE/build/deps'
make[3]: Leaving directory '/home/user/rocSPARSE/build/deps'
make[3]: Entering directory '/home/user/rocSPARSE/build/deps'
gmake[4]: Entering directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
gmake[5]: Entering directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
gmake[6]: Entering directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
gmake[6]: Leaving directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
[ 25%] Built target gtest
gmake[6]: Entering directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
gmake[6]: Leaving directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
[ 50%] Built target gmock
gmake[6]: Entering directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
gmake[6]: Leaving directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
[ 75%] Built target gmock_main
gmake[6]: Entering directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
gmake[6]: Leaving directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
[100%] Built target gtest_main
gmake[5]: Leaving directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
Install the project...
-- Install configuration: ""
-- Up-to-date: /usr/local/include
-- Up-to-date: /usr/local/include/gmock
-- Up-to-date: /usr/local/include/gmock/gmock-more-actions.h
-- Up-to-date: /usr/local/include/gmock/gmock-actions.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-actions.h.pump
-- Up-to-date: /usr/local/include/gmock/gmock-more-matchers.h
-- Up-to-date: /usr/local/include/gmock/gmock.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-function-mockers.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-matchers.h.pump
-- Up-to-date: /usr/local/include/gmock/gmock-generated-actions.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-matchers.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-nice-strict.h
-- Up-to-date: /usr/local/include/gmock/gmock-matchers.h
-- Up-to-date: /usr/local/include/gmock/internal
-- Up-to-date: /usr/local/include/gmock/internal/gmock-generated-internal-utils.h.pump
-- Up-to-date: /usr/local/include/gmock/internal/gmock-generated-internal-utils.h
-- Up-to-date: /usr/local/include/gmock/internal/gmock-port.h
-- Up-to-date: /usr/local/include/gmock/internal/gmock-internal-utils.h
-- Up-to-date: /usr/local/include/gmock/internal/custom
-- Up-to-date: /usr/local/include/gmock/internal/custom/gmock-generated-actions.h.pump
-- Up-to-date: /usr/local/include/gmock/internal/custom/README.md
-- Up-to-date: /usr/local/include/gmock/internal/custom/gmock-generated-actions.h
-- Up-to-date: /usr/local/include/gmock/internal/custom/gmock-matchers.h
-- Up-to-date: /usr/local/include/gmock/internal/custom/gmock-port.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-function-mockers.h.pump
-- Up-to-date: /usr/local/include/gmock/gmock-cardinalities.h
-- Up-to-date: /usr/local/include/gmock/gmock-generated-nice-strict.h.pump
-- Up-to-date: /usr/local/include/gmock/gmock-spec-builders.h
-- Up-to-date: /usr/local/lib/libgmock.a
-- Up-to-date: /usr/local/lib/libgmock_main.a
-- Up-to-date: /usr/local/lib/pkgconfig/gmock.pc
-- Up-to-date: /usr/local/lib/pkgconfig/gmock_main.pc
-- Up-to-date: /usr/local/lib/cmake/GTest/GTestTargets.cmake
-- Installing: /usr/local/lib/cmake/GTest/GTestTargets-noconfig.cmake
-- Up-to-date: /usr/local/lib/cmake/GTest/GTestConfigVersion.cmake
-- Up-to-date: /usr/local/lib/cmake/GTest/GTestConfig.cmake
-- Up-to-date: /usr/local/include
-- Up-to-date: /usr/local/include/gtest
-- Up-to-date: /usr/local/include/gtest/gtest.h
-- Up-to-date: /usr/local/include/gtest/gtest-death-test.h
-- Up-to-date: /usr/local/include/gtest/gtest-spi.h
-- Up-to-date: /usr/local/include/gtest/gtest-param-test.h
-- Up-to-date: /usr/local/include/gtest/gtest_pred_impl.h
-- Up-to-date: /usr/local/include/gtest/gtest-message.h
-- Up-to-date: /usr/local/include/gtest/internal
-- Up-to-date: /usr/local/include/gtest/internal/gtest-string.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-param-util.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-filepath.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-param-util-generated.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-tuple.h.pump
-- Up-to-date: /usr/local/include/gtest/internal/gtest-port-arch.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-port.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-type-util.h.pump
-- Up-to-date: /usr/local/include/gtest/internal/gtest-internal.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-tuple.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-type-util.h
-- Up-to-date: /usr/local/include/gtest/internal/custom
-- Up-to-date: /usr/local/include/gtest/internal/custom/gtest.h
-- Up-to-date: /usr/local/include/gtest/internal/custom/README.md
-- Up-to-date: /usr/local/include/gtest/internal/custom/gtest-port.h
-- Up-to-date: /usr/local/include/gtest/internal/custom/gtest-printers.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-linked_ptr.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-death-test-internal.h
-- Up-to-date: /usr/local/include/gtest/internal/gtest-param-util-generated.h.pump
-- Up-to-date: /usr/local/include/gtest/gtest-test-part.h
-- Up-to-date: /usr/local/include/gtest/gtest-printers.h
-- Up-to-date: /usr/local/include/gtest/gtest_prod.h
-- Up-to-date: /usr/local/include/gtest/gtest-typed-test.h
-- Up-to-date: /usr/local/include/gtest/gtest-param-test.h.pump
-- Up-to-date: /usr/local/lib/libgtest.a
-- Up-to-date: /usr/local/lib/libgtest_main.a
-- Up-to-date: /usr/local/lib/pkgconfig/gtest.pc
-- Up-to-date: /usr/local/lib/pkgconfig/gtest_main.pc
gmake[4]: Leaving directory '/home/user/rocSPARSE/build/deps/gtest/src/googletest-build'
make[3]: Leaving directory '/home/user/rocSPARSE/build/deps'
[100%] Built target install
make[2]: Leaving directory '/home/user/rocSPARSE/build/deps'
make[1]: Leaving directory '/home/user/rocSPARSE/build/deps'
~/workspace/rocSPARSE
~/workspace/rocSPARSE ~/workspace/rocSPARSE
-- The CXX compiler identification is unknown
-- The Fortran compiler identification is GNU 10.2.0
-- Check for working CXX compiler: /opt/rocm/bin/hipcc
-- Check for working CXX compiler: /opt/rocm/bin/hipcc -- broken
CMake Error at /usr/share/cmake-3.16/Modules/CMakeTestCXXCompiler.cmake:53 (message):
  The C++ compiler

    "/opt/rocm/bin/hipcc"

  is not able to compile a simple test program.

  It fails with the following output:

    Change Dir: /home/user/rocSPARSE/build/release/CMakeFiles/CMakeTmp
    
    Run Build Command(s):/usr/bin/gmake cmTC_9346b/fast && /usr/bin/gmake -f CMakeFiles/cmTC_9346b.dir/build.make CMakeFiles/cmTC_9346b.dir/build
    gmake[1]: Entering directory '/home/user/rocSPARSE/build/release/CMakeFiles/CMakeTmp'
    Building CXX object CMakeFiles/cmTC_9346b.dir/testCXXCompiler.cxx.o
    /opt/rocm/bin/hipcc     -o CMakeFiles/cmTC_9346b.dir/testCXXCompiler.cxx.o -c /home/user/rocSPARSE/build/release/CMakeFiles/CMakeTmp/testCXXCompiler.cxx
    Can't exec "/opt/rocm/llvm/bin/clang++": No such file or directory at /opt/rocm-3.9.1/hip/bin/hipconfig line 141.
    Use of uninitialized value $HIP_CLANG_VERSION in pattern match (m//) at /opt/rocm-3.9.1/hip/bin/hipconfig line 142.
    Use of uninitialized value $HIP_CLANG_VERSION in concatenation (.) or string at /opt/rocm-3.9.1/hip/bin/hipconfig line 145.
    Can't exec "/opt/rocm/llvm/bin/clang++": No such file or directory at /opt/rocm-3.9.1/hip/bin/hipconfig line 141.
    Use of uninitialized value $HIP_CLANG_VERSION in pattern match (m//) at /opt/rocm-3.9.1/hip/bin/hipconfig line 142.
    Use of uninitialized value $HIP_CLANG_VERSION in concatenation (.) or string at /opt/rocm-3.9.1/hip/bin/hipconfig line 145.
    Can't exec "/opt/rocm/llvm/bin/clang++": No such file or directory at /opt/rocm-3.9.1/hip/bin/hipconfig line 141.
    Use of uninitialized value $HIP_CLANG_VERSION in pattern match (m//) at /opt/rocm-3.9.1/hip/bin/hipconfig line 142.
    Use of uninitialized value $HIP_CLANG_VERSION in concatenation (.) or string at /opt/rocm-3.9.1/hip/bin/hipconfig line 145.
    Can't exec "/opt/rocm/llvm/bin/clang++": No such file or directory at /opt/rocm-3.9.1/hip/bin/hipconfig line 141.
    Use of uninitialized value $HIP_CLANG_VERSION in pattern match (m//) at /opt/rocm-3.9.1/hip/bin/hipconfig line 142.
    Use of uninitialized value $HIP_CLANG_VERSION in concatenation (.) or string at /opt/rocm-3.9.1/hip/bin/hipconfig line 145.
    Can't exec "/opt/rocm-3.9.1/llvm/bin/clang": No such file or directory at /opt/rocm/bin/hipcc line 203.
    Use of uninitialized value $HIP_CLANG_VERSION in pattern match (m//) at /opt/rocm/bin/hipcc line 204.
    Use of uninitialized value $HIP_CLANG_VERSION in concatenation (.) or string at /opt/rocm/bin/hipcc line 208.
    Can't exec "/opt/rocm-3.9.1/llvm/bin/clang": No such file or directory at /opt/rocm/bin/hipcc line 897.
    failed to execute: No such file or directory
    gmake[1]: *** [CMakeFiles/cmTC_9346b.dir/build.make:66: CMakeFiles/cmTC_9346b.dir/testCXXCompiler.cxx.o] Error 255
    gmake[1]: Leaving directory '/home/user/rocSPARSE/build/release/CMakeFiles/CMakeTmp'
    gmake: *** [Makefile:121: cmTC_9346b/fast] Error 2
    
    

  

  CMake will not be able to correctly generate this project.
Call Stack (most recent call first):
  CMakeLists.txt:54 (project)


-- Configuring incomplete, errors occurred!
See also "/home/user/rocSPARSE/build/release/CMakeFiles/CMakeOutput.log".
See also "/home/user/rocSPARSE/build/release/CMakeFiles/CMakeError.log".

Also my CMakeError.log might be useful.

Any idea?
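The log above shows hipcc failing because the clang binaries it tries to execute do not exist. A minimal sketch for checking this before running install.sh follows. The two paths are copied from the log; the function name `missing_executables` is a hypothetical helper, not part of ROCm.

```python
import os

# Paths copied from the error log above; adjust them for other ROCm versions.
HIPCC_COMPILER_PATHS = [
    "/opt/rocm/llvm/bin/clang++",
    "/opt/rocm-3.9.1/llvm/bin/clang",
]


def missing_executables(paths):
    """Return the paths that are not existing, executable files."""
    return [p for p in paths if not (os.path.isfile(p) and os.access(p, os.X_OK))]


if __name__ == "__main__":
    missing = missing_executables(HIPCC_COMPILER_PATHS)
    if missing:
        print("hipcc will fail; missing compiler binaries:")
        for path in missing:
            print("  " + path)
    else:
        print("all compiler binaries found")
```

If either path is reported as missing, the CMake compiler check will presumably fail in the same way regardless of the rocSPARSE fork being built.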

navi14 installing tips

Finally compiled all the required scripts for Navi 14 successfully! Thank you.

For MIOpen I needed some additional packages:
libsqlite3-dev
libboost-all-dev
texlive-latex-recommended
libbz2-dev
half
libghc-half-dev

So I am wondering whether rocminfo and clinfo should show my 5500 XT as an agent. Right now I can't see my GPU there. Maybe I should install the drivers first?

Thank you for your work, and please explain what I should see in rocminfo and clinfo.

ROCm 5.4.3, gfx1012, can not build 28.rccl.sh, error: instruction not supported on this GPU

Environment

Hardware description
GPU AMD RX5500M
CPU AMD R7 4800H
Software version
OS Ubuntu 20.04 (Docker) and Debian 12 (host)
ROCm 5.4.3
Python 3.8.10

Hello, I am trying to build a Docker image with your solution, but I cannot build 28.rccl.sh for my RX5500M; it fails with the error "instruction not supported on this GPU" (please see the logs and Dockerfile that I attached).

I also had a patch conflict in 25.rocsparse.sh, but I fixed it manually (the fix is in the archive with the Dockerfile).
I'm not sure, but it looks like we can remove this patch and use CXXFLAGS=-DROCSPARSE_USE_MOVE_DPP=0 instead, as described in ROCm/rocSPARSE@f9446b8.
I'm also not sure that I resolved the patch conflict correctly, but it built without problems.

PS: I used Ubuntu 20.04 as described in your instructions (you can see all the steps in the Dockerfile).

logs.txt
Dockerfiles_and_patch.zip

22.rocblas.sh fails, 'cmake -lpthread'

Environment

Hardware description
GPU gfx803
CPU ryzen7
Software version
OS debian bookworm
ROCm 4.3
Python 3.9.9
Linux 5.15.0-1-amd64
cmake 3.22.1

What is the expected behavior

  • rocBLAS built

What actually happens

  • CMake Error: Unknown argument -lpthread
    CMake Error: Run 'cmake --help' for all supported options.

How to reproduce

  • run 22.rocblas.sh

I think the -lpthread can just be removed from 22.rocblas.sh. It's already somewhere in the generated Makefiles as an argument to gcc.

libMLIRMIOpen not found problem while running script #35

Environment

Hardware description
GPU RX580
CPU i7
Software version
OS Ubuntu 20.04
ROCm 5.1.3
Python 3.8

What is the expected behavior

No error

What actually happens

Error while running script # 35

How to reproduce

Run script #35

Is this an expected hiccup in the build process? Should we be going through each of the ROCm projects and running the install_deps script?

How can I add gfx803 support on ROCm 6.0?

Environment

Hardware description
GPU gfx803
CPU E5-2667v2*2
Software version
OS deepin
ROCm 6.0
Python 3.10

What is the expected behavior

Get the gfx803-architecture professional card, AMD_Radeon_Pro_Duo_Polaris, running ROCm 6.0.

What actually happens

Not attempted yet.

How to reproduce

How should the build parameters be set, and which patches need to be added?

3.9 crashes during building on gfx803 with me but 3.10 does not crash.

Hey man, firstly, thanks for the work.

I have spent two days trying to build ROCm for TensorFlow.

I got to the point of despair, so I am raising this issue.

my setup:
GPU: Sapphire Radeon RX570 4GB
CPU: Intel Celeron
RAM: 8GB

My question is: do you have a 3.10 version of rocSPARSE that works on gfx803?

I tried building your version, but it was for 3.9, right?

I still get the hipErrorNoBinaryForGpu error even after rebuilding your version of rocSPARSE.

Anything would be helpful. Thanks.

invalid output with gfx803 and BUILD_WITH_TENSILE_HOST ON

Finally, I can reproduce the gfx803 errors with BUILD_WITH_TENSILE_HOST using a small sample.

OS Ubuntu-20.04.1
linux Linux 5.4.0-64-generic
ROCm 4.0.1
GPU RX 580 8G
pytorch 1.7.1

Test code: https://github.com/xuhuisheng/rocm-build/blob/feature/check/check/test-pytorch-rocblas.py
In this code, there is one fully connected layer, Y = w0 * x0 + w1 * x1 + b

var value
input num 2
output num 1
weight0 1
weight1 1
bias 10
features 800
batch 32

To keep the case simple, I initialize the weights with [1, 1] and the bias with 10, so training should not change anything and the loss should always be 0.
But on the 12th step, the loss changed to 50. It is reproducible on my computer.

I printed X, Y, output, and loss, and found that the first 16 outputs are 12, which is correct, while the last 16 outputs are 2, which is wrong. It seems the last 16 outputs are missing the bias.

     X tensor([[1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.]], device='cuda:0')
     Y tensor([1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01,
        1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01,
        1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01,
        1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01,
        1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01, 1.2000e+01,
        1.2000e+01, 1.2000e+01], device='cuda:0')
weight tensor([[1., 1.]], device='cuda:0')
  bias tensor([10.], device='cuda:0')
output tensor([[1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [1.2000e+01],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00],
        [2.0000e+00]], device='cuda:0', grad_fn=<AddmmBackward>)
     l tensor(50., device='cuda:0', grad_fn=<MseLossBackward>)

If I use BUILD_WITH_TENSILE_HOST=OFF and recompile rocBLAS, the error goes away.

It is also weird that if I comment out l.backward(), the error instead happens on the 21st step, and the outputs all change to 3.
Why is it unstable? Maybe gfx803 needs PCIe atomics operations? Or is gfx803 just missing some waitcnt() instructions?

     X tensor([[1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.],
        [1., 1.]], device='cuda:0')
     Y tensor([12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000,
        12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000,
        12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000,
        12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000, 12.0000],
       device='cuda:0')
weight tensor([[1., 1.]], device='cuda:0')
  bias tensor([10.], device='cuda:0')
output tensor([[3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.],
        [3.]], device='cuda:0', grad_fn=<AddmmBackward>)
     l tensor(81., device='cuda:0', grad_fn=<MseLossBackward>)
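The loss values in both dumps can be checked by hand. The sketch below assumes the loss is a plain mean squared error over the batch of 32 (the dumps show grad_fn=<MseLossBackward>) and reuses the weights, bias, and inputs listed above; the helper names are illustrative only.

```python
def linear(x0, x1, w0=1.0, w1=1.0, b=10.0):
    """Fully connected layer from the issue: Y = w0 * x0 + w1 * x1 + b."""
    return w0 * x0 + w1 * x1 + b


def mse(outputs, targets):
    """Mean squared error over a batch."""
    return sum((o - t) ** 2 for o, t in zip(outputs, targets)) / len(targets)


batch = 32
targets = [linear(1.0, 1.0)] * batch  # every target is 12.0

# First dump: 16 correct rows, then 16 rows equal to w0*x0 + w1*x1 (bias missing).
first_dump = [12.0] * 16 + [2.0] * 16
# Second dump: all 32 rows are 3.0.
second_dump = [3.0] * batch

print(mse(first_dump, targets))   # 50.0
print(mse(second_dump, targets))  # 81.0
```

Both results match the printed losses, so the loss computation itself looks consistent; the wrong values come from the layer output.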

So I believe it is caused by the new Tensile client, which uses assembly code for optimization. I am not familiar with GCN assembly, so I created an issue on Tensile in the hope that someone can help.
ROCm/Tensile#1287

HIP failing to build

Environment

Hardware description
GPU AMD RX 5500XT
CPU AMD 5950X
Software version
OS Ubuntu 22.04.2
ROCm 5.5
Python 3.10.6

What is the expected behavior

Compilation proceeds normally

What actually happens

at 18.hip.sh:

In file included from /tmp/hip_pch.17620/hip_pch.h:1:
/media/erkinalp/SteamLibrary/ROCm/HIP/include/hip/hip_runtime.h:54:10: fatal error: 'thread' file not found
#include <thread>
         ^~~~~~~~
1 error generated when compiling for gfx1030.
CMake Error at src/CMakeLists.txt:187 (message):
  Failed to embed PCH


-- Configuring incomplete, errors occurred!

How to reproduce

Run the scripts in order.

Any experience with a gfx902 APU -> Ryzen 5850U

Hi,

I have a gfx902 APU -> Ryzen 5850U. I'm just asking around in case someone has had any success getting this card to run with ROCm.

Thanks

❯ rocminfo 
ROCk module is loaded
=====================    
HSA System Attributes    
=====================    
Runtime Version:         1.1
System Timestamp Freq.:  1000.000000MHz
Sig. Max Wait Duration:  18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model:           LARGE                              
System Endianness:       LITTLE                             

==========               
HSA Agents               
==========               
*******                  
Agent 1                  
*******                  
  Name:                    AMD Ryzen 7 PRO 5850U with Radeon Graphics
  Uuid:                    CPU-XX                             
  Marketing Name:          AMD Ryzen 7 PRO 5850U with Radeon Graphics
  Vendor Name:             CPU                                
  Feature:                 None specified                     
  Profile:                 FULL_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        0(0x0)                             
  Queue Min Size:          0(0x0)                             
  Queue Max Size:          0(0x0)                             
  Queue Type:              MULTI                              
  Node:                    0                                  
  Device Type:             CPU                                
  Cache Info:              
    L1:                      32768(0x8000) KB                   
  Chip ID:                 0(0x0)                             
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   1900                               
  BDFID:                   0                                  
  Internal Node ID:        0                                  
  Compute Unit:            16                                 
  SIMDs per CU:            0                                  
  Shader Engines:          0                                  
  Shader Arrs. per Eng.:   0                                  
  WatchPts on Addr. Ranges:1                                  
  Features:                None
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: FINE GRAINED        
      Size:                    28567620(0x1b3e844) KB             
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Accessible by all:       TRUE                               
    Pool 2                   
      Segment:                 GLOBAL; FLAGS: KERNARG, FINE GRAINED
      Size:                    28567620(0x1b3e844) KB             
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Accessible by all:       TRUE                               
    Pool 3                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    28567620(0x1b3e844) KB             
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Accessible by all:       TRUE                               
  ISA Info:                
*******                  
Agent 2                  
*******                  
  Name:                    gfx902                             
  Uuid:                    GPU-XX                             
  Marketing Name:          Cezanne                            
  Vendor Name:             AMD                                
  Feature:                 KERNEL_DISPATCH                    
  Profile:                 BASE_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        128(0x80)                          
  Queue Min Size:          4096(0x1000)                       
  Queue Max Size:          131072(0x20000)                    
  Queue Type:              MULTI                              
  Node:                    1                                  
  Device Type:             GPU                                
  Cache Info:              
    L1:                      16(0x10) KB                        
  Chip ID:                 5688(0x1638)                       
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   2000                               
  BDFID:                   1792                               
  Internal Node ID:        1                                  
  Compute Unit:            28                                 
  SIMDs per CU:            4                                  
  Shader Engines:          2                                  
  Shader Arrs. per Eng.:   2                                  
  WatchPts on Addr. Ranges:4                                  
  Features:                KERNEL_DISPATCH 
  Fast F16 Operation:      FALSE                              
  Wavefront Size:          64(0x40)                           
  Workgroup Max Size:      1024(0x400)                        
  Workgroup Max Size per Dimension:
    x                        1024(0x400)                        
    y                        1024(0x400)                        
    z                        1024(0x400)                        
  Max Waves Per CU:        40(0x28)                           
  Max Work-item Per CU:    2560(0xa00)                        
  Grid Max Size:           4294967295(0xffffffff)             
  Grid Max Size per Dimension:
    x                        4294967295(0xffffffff)             
    y                        4294967295(0xffffffff)             
    z                        4294967295(0xffffffff)             
  Max fbarriers/Workgrp:   32                                 
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    4194304(0x400000) KB               
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Accessible by all:       FALSE                              
    Pool 2                   
      Segment:                 GROUP                              
      Size:                    64(0x40) KB                        
      Allocatable:             FALSE                              
      Alloc Granule:           0KB                                
      Alloc Alignment:         0KB                                
      Accessible by all:       FALSE                              
  ISA Info:                
    ISA 1                    
      Name:                    amdgcn-amd-amdhsa--gfx902:xnack-   
      Machine Models:          HSA_MACHINE_MODEL_LARGE            
      Profiles:                HSA_PROFILE_BASE                   
      Default Rounding Mode:   NEAR                               
      Default Rounding Mode:   NEAR                               
      Fast f16:                TRUE                               
      Workgroup Max Size:      1024(0x400)                        
      Workgroup Max Size per Dimension:
        x                        1024(0x400)                        
        y                        1024(0x400)                        
        z                        1024(0x400)                        
      Grid Max Size:           4294967295(0xffffffff)             
      Grid Max Size per Dimension:
        x                        4294967295(0xffffffff)             
        y                        4294967295(0xffffffff)             
        z                        4294967295(0xffffffff)             
      FBarrier Max Size:       32                                 
*** Done ***             

How to Get This Working On An 18.04 Based System?

I'm trying it on 18.04 now.

One thing I've run into is that the scripts don't gracefully handle paths with spaces in the name. The variables need to be enclosed in double quotes when they are used in the scripts.

For instance lines 5 & 6 of 00.rocm-core.sh need to be like this:

mkdir -p "$ROCM_BUILD_DIR/rocm-core"
cd "$ROCM_BUILD_DIR/rocm-core"

There are probably more places besides those first few lines in the other scripts. I'm going through them now.
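To illustrate why the quotes matter, here is a small sketch using Python's shlex module, which approximates POSIX shell word splitting. shlex does not expand variables, so the path is concatenated by hand, and the build directory shown is a hypothetical value for ROCM_BUILD_DIR.

```python
import shlex

build_dir = "/home/user/my builds"  # hypothetical ROCM_BUILD_DIR containing a space

# Unquoted expansion: the path is split into two separate words.
unquoted = shlex.split("mkdir -p " + build_dir + "/rocm-core")
# Double-quoted expansion, as in the corrected lines above: one word.
quoted = shlex.split('mkdir -p "' + build_dir + '/rocm-core"')

print(unquoted)  # ['mkdir', '-p', '/home/user/my', 'builds/rocm-core']
print(quoted)    # ['mkdir', '-p', '/home/user/my builds/rocm-core']
```

In the unquoted case mkdir would receive two directory arguments, neither of which is the intended path.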

Also, I've created a convenience script to go through the scripts and execute them in order so I don't have to do it manually.
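The convenience script itself is not shown in this issue. A rough sketch of the idea might look like the following, assuming the build scripts follow the NN.name.sh naming seen here (00.rocm-core.sh, 18.hip.sh, 22.rocblas.sh) and that every numbered script in the directory should run. The function names are hypothetical.

```python
import re
import subprocess
from pathlib import Path


def ordered_scripts(directory):
    """Return scripts named like '22.rocblas.sh', sorted by numeric prefix."""
    pattern = re.compile(r"^(\d+)\..+\.sh$")
    found = []
    for path in Path(directory).iterdir():
        match = pattern.match(path.name)
        if match and path.is_file():
            found.append((int(match.group(1)), path.name, path))
    return [path for _, _, path in sorted(found)]


def run_all(directory):
    """Run each script with bash, stopping at the first failure."""
    for script in ordered_scripts(directory):
        print("running", script.name)
        result = subprocess.run(["bash", str(script)])
        if result.returncode != 0:
            print("stopped:", script.name, "exited with", result.returncode)
            return False
    return True
```

Stopping at the first non-zero exit code avoids building later components against a half-installed dependency.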

gfx803 still NaN loss even with rocblas patch

Environment

Hardware description
GPU gfx803 (rx580)
CPU ryzen 3700x
Software version
OS 5.14.2
ROCm 4.3.1
Python 3.9.7
TensorFlow 2.6.0

What is the expected behavior

TensorFlow should work as expected

What actually happens

NaN loss when using TensorFlow

How to reproduce

  1. Install rocm, with rocblas patch (remove library/src/blas3/Tensile/Logic/asm_full/r9nano_*.yaml)
  2. run any model
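For reference, the patch in step 1 amounts to deleting the r9nano_*.yaml Tensile logic files before building rocBLAS. A minimal sketch of that step is below; the relative path and file pattern come from step 1, while the function names and the dry-run default are assumptions added for illustration.

```python
from pathlib import Path

# Relative path and pattern taken from step 1 above.
LOGIC_SUBDIR = Path("library/src/blas3/Tensile/Logic/asm_full")


def r9nano_logic_files(rocblas_root):
    """List the r9nano_*.yaml Tensile logic files under a rocBLAS checkout."""
    return sorted((Path(rocblas_root) / LOGIC_SUBDIR).glob("r9nano_*.yaml"))


def remove_r9nano_logic(rocblas_root, dry_run=True):
    """Delete the files, or only report them when dry_run is True."""
    files = r9nano_logic_files(rocblas_root)
    for path in files:
        print(("would remove " if dry_run else "removing ") + str(path))
        if not dry_run:
            path.unlink()
    return files
```

Calling remove_r9nano_logic with the default dry_run=True only lists what would be deleted, which makes it easy to confirm the patch was actually applied to the checkout being built.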

28.rccl.sh fails to build for navi10

Environment

Hardware description
GPU RX 5700
CPU Ryzen 5 3600
Software version
OS Ubuntu 20.04.6 LTS
ROCm 5.4.x
Python 3.8.10

What is the expected behavior

Build rccl for navi10

What actually happens


|====|
|SLOW|
|====|
/home/tyra/rocm/rocm-build/build/rccl /home/tyra/rocm/rocm-build/build/rccl
-- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11")
-- hip::amdhip64 is SHARED_LIBRARY
-- HIP compiler: clang
-- HIP runtime: rocclr
-- Found rocm_smi at /opt/rocm/include
RPM version 4.14.2.1
-- rocm-cmake: Set license file to /home/tyra/rocm/ROCm/rccl/LICENSE.txt.
-- Configuring done
-- Generating done
-- Build files have been written to: /home/tyra/rocm/rocm-build/build/rccl
[1/4] Updating git_version.cpp if necessary
-- Updating git_version.cpp
[2/4] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o
Warning: The --hipcc-func-supp option has been deprecated and will be removed in the future.
[3/4] Linking CXX shared library librccl.so.1.0.50400
FAILED: librccl.so.1.0.50400 
: && /opt/rocm/bin/hipcc -fPIC -O3 -DNDEBUG   -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0.50400 CMakeFiles/rccl.dir/src/collectives/device/all_reduce.cpp.o CMakeFiles/rccl.dir/src/collectives/device/all_gather.cpp.o CMakeFiles/rccl.dir/src/collectives/device/alltoall_pivot.cpp.o CMakeFiles/rccl.dir/src/collectives/device/reduce.cpp.o CMakeFiles/rccl.dir/src/collectives/device/broadcast.cpp.o CMakeFiles/rccl.dir/src/collectives/device/reduce_scatter.cpp.o CMakeFiles/rccl.dir/src/collectives/device/sendrecv.cpp.o CMakeFiles/rccl.dir/src/collectives/device/onerank_reduce.cpp.o CMakeFiles/rccl.dir/src/collectives/device/functions.cpp.o CMakeFiles/rccl.dir/src/init.cc.o CMakeFiles/rccl.dir/src/graph/trees.cc.o CMakeFiles/rccl.dir/src/graph/rings.cc.o CMakeFiles/rccl.dir/src/graph/paths.cc.o CMakeFiles/rccl.dir/src/graph/search.cc.o CMakeFiles/rccl.dir/src/graph/connect.cc.o CMakeFiles/rccl.dir/src/graph/tuning.cc.o CMakeFiles/rccl.dir/src/graph/topo.cc.o CMakeFiles/rccl.dir/src/graph/xml.cc.o CMakeFiles/rccl.dir/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/src/collectives/all_reduce_api.cc.o CMakeFiles/rccl.dir/src/collectives/all_gather_api.cc.o CMakeFiles/rccl.dir/src/collectives/reduce_api.cc.o CMakeFiles/rccl.dir/src/collectives/broadcast_api.cc.o CMakeFiles/rccl.dir/src/collectives/reduce_scatter_api.cc.o CMakeFiles/rccl.dir/src/collectives/sendrecv_api.cc.o CMakeFiles/rccl.dir/src/collectives/gather_api.cc.o CMakeFiles/rccl.dir/src/collectives/scatter_api.cc.o CMakeFiles/rccl.dir/src/collectives/all_to_all_api.cc.o CMakeFiles/rccl.dir/src/collectives/all_to_allv_api.cc.o CMakeFiles/rccl.dir/src/channel.cc.o CMakeFiles/rccl.dir/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/src/misc/utils.cc.o CMakeFiles/rccl.dir/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/src/misc/profiler.cc.o CMakeFiles/rccl.dir/src/misc/npkit.cc.o CMakeFiles/rccl.dir/src/misc/shmutils.cc.o 
CMakeFiles/rccl.dir/src/misc/signals.cc.o CMakeFiles/rccl.dir/src/misc/socket.cc.o CMakeFiles/rccl.dir/src/misc/param.cc.o CMakeFiles/rccl.dir/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/src/transport/net.cc.o CMakeFiles/rccl.dir/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/src/transport/p2p.cc.o CMakeFiles/rccl.dir/src/transport/shm.cc.o CMakeFiles/rccl.dir/src/transport.cc.o CMakeFiles/rccl.dir/src/debug.cc.o CMakeFiles/rccl.dir/src/group.cc.o CMakeFiles/rccl.dir/src/bootstrap.cc.o CMakeFiles/rccl.dir/src/proxy.cc.o CMakeFiles/rccl.dir/src/net.cc.o CMakeFiles/rccl.dir/src/enqueue.cc.o CMakeFiles/rccl.dir/git_version.cpp.o  --amdgpu-target=gfx1010  -fgpu-rdc  -parallel-jobs=8  -ldl  -lrocm_smi64  -L/opt/rocm/lib  /opt/rocm/lib/libamdhip64.so.5.4.50100  --hip-link  --offload-arch=gfx1010  /opt/rocm/llvm/lib/clang/15.0.0/lib/linux/libclang_rt.builtins-x86_64.a && :
Warning: The --amdgpu-target option has been deprecated and will be removed in the future.  Use --offload-arch instead.
lld: error: ld-temp.o <inline asm>:1:2: instruction not supported on this GPU
        buffer_wbinvl1_vol
        ^


lld: error: too many errors emitted, stopping now (use --error-limit=0 to see all errors)
clang-15: error: amdgcn-link command failed with exit code 1 (use -v to see invocation)
ninja: build stopped: subcommand failed.
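
A log like this one is dominated by a single message repeated many times. A minimal sketch, assuming the build output was saved to a file (the name `build.log` and the function name `summarize_errors` are hypothetical), that collapses the repeated `error:` lines into counts so the distinct failures stand out:

```shell
#!/bin/bash
# Hypothetical helper: collapse repeated error lines in a saved build log.
# Reads the log on stdin and prints each distinct line containing "error:"
# once, prefixed with the number of times it occurred, most frequent first.
summarize_errors() {
  grep -E 'error:' | sort | uniq -c | sort -rn | sed -E 's/^ +//'
}

# Usage (build.log is an assumed file name):
#   summarize_errors < build.log
```

This only summarizes what the linker already printed; it does not change the build or reveal errors beyond the limit at which lld stopped.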

How to reproduce

====== CONFIG ======

export ROCM_INSTALL_DIR=/opt/rocm
export ROCM_MAJOR_VERSION=5
export ROCM_MINOR_VERSION=4
export ROCM_PATCH_VERSION=0
export ROCM_LIBPATCH_VERSION=50400
export CPACK_DEBIAN_PACKAGE_RELEASE=72~20.04
export ROCM_PKGTYPE=DEB
export ROCM_GIT_DIR=/home/tyra/rocm/ROCm
export ROCM_BUILD_DIR=/home/tyra/rocm/rocm-build/build
export ROCM_PATCH_DIR=/home/tyra/rocm/rocm-build/patch
export AMDGPU_TARGETS="gfx1010"
export CMAKE_DIR=/home/tyra/rocm/cmake
export PATH=$ROCM_INSTALL_DIR/bin:$ROCM_INSTALL_DIR/llvm/bin:$ROCM_INSTALL_DIR/hip/bin:$CMAKE_DIR/bin:$PATH

====================

Build script I use

#!/bin/bash

if [ "$EUID" -ne 0 ]; then sudo bash "$0" "$@"; exit; fi
[ "$1" = clean ] && sudo rm -rf rocm-build/ venv/ ROCm/ cmake/ repo && exit

for package in "git" "git-lfs" "python3" "python3-venv" "python-is-python3" "wget"; do
  if ! dpkg -s "$package" &> /dev/null; then
    echo "Installing $package ..."
    apt install -y "$package"
  fi
done

[ -d rocm-build ] || git clone "https://github.com/xuhuisheng/rocm-build"
[ -d venv ] || python3 -m venv venv --system-site-packages
# Group the download and chmod so that both are skipped when repo is already executable.
[ -x repo ] || { wget --progress=bar:force "https://storage.googleapis.com/git-repo-downloads/repo" && chmod +x repo; }

if [ ! -d cmake ]; then
  wget "https://cmake.org/files/v3.18/cmake-3.18.6-Linux-x86_64.tar.gz"
  tar -xf cmake-3.18.6-Linux-x86_64.tar.gz
  mv cmake-3.18.6-Linux-x86_64 cmake
  rm -rf cmake-3.18.6-Linux-x86_64.tar.gz
fi

if [ ! -d ROCm ]; then
  mkdir -p ROCm
  cd ROCm
  git config --global user.email "[email protected]"
  git config --global user.name "$USER"
  git config --global color.ui false
  ../repo init -u "https://github.com/RadeonOpenCompute/ROCm.git" -b roc-5.4.x
  ../repo sync
  cd ..
fi

cd rocm-build
config=$(cat "env.sh" | sed "s:/home/work/local/cmake-3.18.6-Linux-x86_64:$(readlink -f ../cmake):g" | sed "s:/home/work:$(readlink -f ..):g" | sed "s:gfx803:gfx1010:g")
echo -e "\n====== CONFIG ======\n\n\e[34m$(tail -n+3 <<< "$config")\e[0m\n\n===================="
echo "$config" > .config; source .config
source ../venv/bin/activate
progress_file=".progress"

if [ -f "$progress_file" ]; then
  if [ "$1" = "startover" ]; then
    rm "$progress_file"
    checkpoint_index=0
  else
    checkpoint_index=$(<"$progress_file")
  fi
else
  checkpoint_index=0
fi

# Match only names ending in .sh (an unescaped dot matches any character).
readarray -t scripts <<< "$(ls -1 | sort -n | grep '\.sh$' | tail -n+3)"
scripts=(00.rocm-core.sh "${scripts[@]}")

for i in "${!scripts[@]}"; do
  [ $i -lt $checkpoint_index ] && continue
  line="${scripts[$i]}"
  script_name="${line##*/}"
  navi_script="navi10/$script_name"
  [ -f "$navi_script" ] && scripts[$i]="$navi_script"
done

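for i in "${!scripts[@]}"; do
  [ "$i" -lt "$checkpoint_index" ] && continue
  # Keep the directory part so that a navi10/ override selected above is actually used.
  script="${scripts[$i]}"
  read -p $'\e[32m\n\n'"Press ENTER to execute $script"$'\e[0m '
  echo; echo
  # Run from the script's own directory, in a subshell so the working directory is restored.
  (cd "$(dirname "$script")" && bash "$(basename "$script")")
  echo "$i" > "$progress_file"
done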

echo -e "\n\n\e[34m====== Finished ======\e[0m\n\n"

Any ideas which flags I could use or modify?
I couldn't find any relevant Google results for these errors.
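
The build script above prefers a `navi10/` variant of a step when one exists. A minimal sketch of that lookup as a standalone function (the name `resolve_script` and its optional second argument are hypothetical, not part of rocm-build):

```shell
#!/bin/bash
# Hypothetical helper: print the path of the script to run for one build step.
# If a file with the same name exists under the override directory
# (navi10 by default), that path is printed; otherwise the original path is.
resolve_script() {
  local name="${1##*/}"
  local override_dir="${2:-navi10}"
  if [ -f "$override_dir/$name" ]; then
    printf '%s\n' "$override_dir/$name"
  else
    printf '%s\n' "$1"
  fi
}

# Usage, from inside the rocm-build checkout:
#   script=$(resolve_script "some-step.sh")
#   bash "$script"
```

Printing the full relative path, rather than only the file name, makes it visible in the prompt whether the generic or the navi10 variant is about to run.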

Tensorflow, gfx803, `hipErrorNoBinaryForGpu: Unable to find code object for all current devices`

Environment

Hardware description
GPU RX580
CPU 3700X
Software version
OS Linux 5.13.10
ROCm 4.3.0
Python 3.9.6
Tensorflow-rocm 2.5.0

What is the expected behavior

  • TensorFlow should run correctly.

What actually happens

root@darkbox ~# python -m deeporn.fit
WARNING:root:Limited tf.compat.v2.summary API due to missing TensorBoard installation.
WARNING:root:Limited tf.compat.v2.summary API due to missing TensorBoard installation.
WARNING:root:Limited tf.compat.v2.summary API due to missing TensorBoard installation.
WARNING:root:Limited tf.summary API due to missing TensorBoard installation.
WARNING:deeporn.model:test_run=True
2021-08-21 08:57:36.488884: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libamdhip64.so
/home/yay/.cache/yay/hip-rocclr/src/HIP-rocm-4.3.0/rocclr/hip_code_object.cpp:486: "hipErrorNoBinaryForGpu: Unable to find code object for all current devices!"
fish: Job 1, 'python -m deeporn.fit' terminated by signal SIGABRT (Abort)

How to reproduce

  • Run any TensorFlow program.

Do I have to recompile tensorflow?
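
The error says that no code object matching the installed GPU was found in the loaded binary. A minimal sketch, assuming `rocminfo` is installed and reports architectures as `gfx…` names (the function name `list_gfx_targets` is hypothetical), for listing the architecture the runtime sees so it can be compared with the targets a package was built for:

```shell
#!/bin/bash
# Hypothetical helper: extract the distinct gfx architecture names from text
# on stdin, for example the output of rocminfo.
list_gfx_targets() {
  grep -oE 'gfx[0-9a-f]+' | sort -u
}

# Usage, assuming rocminfo is on PATH:
#   rocminfo | list_gfx_targets
```

On the RX580 described here the expected name is gfx803, as in the issue title. Whether a given tensorflow-rocm build includes that target is not something this sketch can determine.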

boost.cmake needs to be added to the repo, as [email protected] is invalid

Environment

Hardware description
GPU - Navi23
CPU - V2000
Software version
OS - Ubuntu 22.04
ROCm - 5.2
Python - 3.10.4

What is the expected behavior

  • Can't build the boost package as described, so CMakeLists.txt cannot be created.

What actually happens

How to reproduce

scp [email protected]:/media/meicai/hgst1/backup-1000/ai/boost.cmake /home/work/local

cp /home/amd/local/boost.cmake /home/work/local/boost_1_72_0/CMakeLists.txt
