Comments (6)
NESoftmaxLayer is in the list of functions which haven't been ported to the new accurate padding yet; it therefore falls back to the overly conservative auto_padding, which is why the size ends up being so big.
We're in the process of porting the remaining functions to the new accurate padding method.
We'll release an update as soon as it's ready.
from computelibrary.
Thanks, I have read about auto_padding.
Now I have changed my code to:
Tensor in,out;
in.allocator()->init(TensorInfo(10,1,Format::F32));
out.allocator()->init(*in.info());
NESoftmaxLayer softmax;
softmax.configure(&in,&out);
in.allocator()->allocate();
out.allocator()->allocate();
float in_data[10];
for(int i=0;i<10;i++) in_data[i]=i;
std::copy_n(in_data, 10,
in.buffer()+in.info()->offset_element_in_bytes(Coordinates(0, 0)));
softmax.run();
float out_data[10];
std::copy_n(out.buffer()+out.info()->offset_element_in_bytes(Coordinates(0, 0)),
4*10, out_data);
for(int i=0;i<10;i++) printf("%f ",out_data[i]);
puts("");
but the output was
0.000000 0.000000 0.000000 128.000000 0.000000 0.000000 0.000000 128.000000 0.000000 0.000000
What am I doing wrong?
Hello @SCUTE-ZZ,
Can you cast your in pointer to float* when you copy the data to the input tensor? (std::copy works differently than memcpy):
std::copy_n(in_data, 10, reinterpret_cast<float*>(in.buffer() + in.info()->offset_element_in_bytes(arm_compute::Coordinates(0, 0))));
Hello @GeorgeARM,
Now I have used memcpy instead of std::copy:
Tensor in,out;
in.allocator()->init(TensorInfo(10,1,Format::F32));
out.allocator()->init(*in.info());
NESoftmaxLayer softmax;
softmax.configure(&in,&out);
in.allocator()->allocate();
out.allocator()->allocate();
float in_data[10];
for(int i=0;i<10;i++) in_data[i]=i;
memcpy(
in.buffer()+in.info()->offset_element_in_bytes(Coordinates(0, 0)),
in_data,
10*sizeof(float));
softmax.run();
float out_data[10];
memcpy(
out_data,
out.buffer()+out.info()->offset_element_in_bytes(Coordinates(0, 0)),
10*sizeof(float));
for(int i=0;i<10;i++)
printf("%f ",out_data[i]);
puts("");
but the output was
-0.000000 -0.000000 -0.000000 -0.000000 -0.000000 -0.000000 -0.000000 -0.000000 -0.000000 -0.000000
@SCUTE-ZZ so it seems there were two problems:
a) you were copying incorrectly with copy_n, which you addressed in the patch above
b) softmax might fail if your output size is not a multiple of 4. We noticed this recently (it is already fixed internally) as we added more boards to our testing farm. An exponent gets raised to a very large negative value, causing the exponent calculation to return -inf, which breaks the sum part of the softmax equation [exp(x - max(x)) / sum(exp(x - max(x)))]; that's why you get -0. We didn't notice this earlier because the boards we initially tested on didn't exhibit this behavior.
In other words, this will be fixed in the next maintenance release in the coming days.
Thanks
@GeorgeARM
I changed the size from 10 to 16 and the output was correct.
Thanks