- Encoder uses one set of transition parameters.
- Decoder uses another set of transition parameters.
- Only the decoder is connected to a layer of fully connected and softmax loss layer
jianbotang / seq2seq Goto Github PK
View Code? Open in Web Editor NEWThis project forked from freddycct/seq2seq
sequence to sequence using mxnet