understand rnn from motivation

just scratch the surface

N vs N

The length of input and output sequence must be equal

char rnn:
http://karpathy.github.io/2015/05/21/rnn-effectiveness/

N vs 1

Sequence to the Category

1 vs N

https://static.leiphone.com/uploads/new/article/740_740/201709/59a920bc1964b.jpg?imageMogr2/format/jpg/quality/90

the Category to the Sequence

N vs M

Seq2Seq

encoder:

Decoder:

Attention

the length of ‘c’ limits the model performance. Attention mechanism input different c in each different time to solve this problem.

Decoder with attention:

example:

(aij is learn from the model.)

tf