Hi, Great work. But there are few small problems that puzzle me. 1) The image shape changes from 240240155 to 240240160,what are the considerations here? 2) I can't understand the intmd_encoder_outputs
of the encoder which is the output intmd_x
of transformer
, what's the difference between x
and intmd_x
? Besides, the encoder_output
seems to be not used in Decoder.
def forward(self, x, auxillary_output_layers=[1, 2, 3, 4]):
x1_1, x2_1, x3_1, encoder_output, intmd_encoder_outputs = self.encode(x)
decoder_output = self.decode(
x1_1, x2_1, x3_1, encoder_output, intmd_encoder_outputs, auxillary_output_layers
)
if auxillary_output_layers is not None:
auxillary_outputs = {}
for i in auxillary_output_layers:
val = str(2 * i - 1)
_key = 'Z' + str(i)
auxillary_outputs[_key] = intmd_encoder_outputs[val]
return decoder_output
return decoder_output