dmitrylyakh / hiptt
HIP port of the fast GPU tensor transpose library cuTT
I was just curious to build and run your library. I built the library with the following changes:
g++ -o bin/cutt_test -L/opt/rocm-3.9.1/lib/ -lamdhip64 build/cutt_test.o build/TensorTester.o build/CudaMem.o build/CudaUtils.o build/cuttTimer.o -Llib -lcutt -fPIC
/usr/bin/ld: build/TensorTester.o: relocation R_X86_64_32 against symbol `_Z42__device_stub__setTensorCheckPatternKernelPjj' can not be used when making a PIE object; recompile with -fPIE
/usr/bin/ld: build/CudaUtils.o: relocation R_X86_64_32 against `.rodata.str1.1' can not be used when making a PIE object; recompile with -fPIE
/usr/bin/ld: lib/libcutt.a(cuttkernel.o): relocation R_X86_64_32S against `.rodata' can not be used when making a PIE object; recompile with -fPIE
/usr/bin/ld: lib/libcutt.a(cuttGpuModelKernel.o): relocation R_X86_64_32 against symbol `_Z32__device_stub__runCountersKernelPKiiiiPiS1_S1_' can not be used when making a PIE object; recompile with -fPIE
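A note on the relocation errors above: `-fPIC` and `-fPIE` affect code generation at compile time, so appending `-fPIC` to the link line does not change object files that were already built. Below is a minimal sketch of one possible workaround, assuming a GCC toolchain that produces PIE executables by default and that the existing objects are kept as they are. The object and library paths are copied from the link line in this report; the `RUN` switch is a hypothetical convenience, not part of the project. The alternative suggested by the linker message itself is to rebuild every object and `lib/libcutt.a` from clean with `-fPIE` (or `-fPIC`) added to the compile flags.

```shell
#!/bin/sh
# Sketch only: paths are taken from the link line in the report above.
OBJS="build/cutt_test.o build/TensorTester.o build/CudaMem.o build/CudaUtils.o build/cuttTimer.o"

# Libraries are listed after the objects, since the GNU linker resolves
# symbols from left to right.
LIBS="-Llib -lcutt -L/opt/rocm-3.9.1/lib/ -lamdhip64"

# -no-pie asks the GCC driver for a non-PIE executable, so objects that were
# compiled without -fPIE/-fPIC can be linked unchanged.
LINK_CMD="g++ -no-pie -o bin/cutt_test $OBJS $LIBS"

# Print the command by default; run it only when RUN=1 is set
# (hypothetical switch) and the script is started from the project root.
echo "$LINK_CMD"
if [ "${RUN:-0}" = "1" ]; then
    $LINK_CMD
fi
```

Run it once without `RUN` to inspect the command, then with `RUN=1` from the project root. Whether this also changes the failing test or the 0.00 GB/s benchmark figures is not something this sketch addresses.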
Running the test shows the following message:
TensorTester::checkTranspose FAIL at 98 ref 130 data 40
Test 1 failed
For the benchmark, please suggest options that give meaningful results. Thanks.
./bin/cutt_bench
Using Vega 20 [Radeon VII] SM version 9.0
L2 0.00MB
CPU using vector type AVX2 of length 8
0.00 GB/s
0.00 GB/s
0.00 GB/s
0.00 GB/s
scalarCopy 0.000000 GB/s
0.00 GB/s
0.00 GB/s
0.00 GB/s
0.00 GB/s
vectorCopy 0.000000 GB/s
0.00 GB/s
0.00 GB/s
0.00 GB/s
0.00 GB/s
memcpyFloat 0.000000 GB/s
bench OK
seed 1606747073