The output of the attention mechanism (which is also the overall output of the network) is a softmax distribution over the input positions, so its dictionary size equals the length of the input.
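To make this concrete, here is a minimal sketch of a single decoding step, assuming the additive scoring form from the Pointer Networks paper, u_j = v · tanh(W1 e_j + W2 d), followed by a softmax over j. The names `pointer_attention` and `random_matrix`, the hidden size, and the random weights are all illustrative assumptions, not taken from any particular implementation. The point is only that the returned distribution has one entry per input element.

```python
import math
import random


def pointer_attention(encoder_states, decoder_state, W1, W2, v):
    """Return a softmax distribution over the input positions.

    Scores follow u_j = v . tanh(W1 e_j + W2 d); this is a sketch,
    not a trained model.
    """
    def matvec(M, x):
        # Plain matrix-vector product on nested lists.
        return [sum(m * xi for m, xi in zip(row, x)) for row in M]

    w2d = matvec(W2, decoder_state)  # shared across all input positions
    scores = []
    for e_j in encoder_states:
        w1e = matvec(W1, e_j)
        hidden = [math.tanh(a + b) for a, b in zip(w1e, w2d)]
        scores.append(sum(vi * hi for vi, hi in zip(v, hidden)))

    # Numerically stable softmax over the input positions.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def random_matrix(rows, cols, rng):
    # Hypothetical helper: random weights stand in for learned parameters.
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
hidden_size = 4  # assumed size, chosen only for the demo
W1 = random_matrix(hidden_size, hidden_size, rng)
W2 = random_matrix(hidden_size, hidden_size, rng)
v = [rng.uniform(-1, 1) for _ in range(hidden_size)]
decoder_state = [rng.uniform(-1, 1) for _ in range(hidden_size)]

# The same parameters handle inputs of different lengths.
for n in (3, 7):
    encoder_states = random_matrix(n, hidden_size, rng)
    probs = pointer_attention(encoder_states, decoder_state, W1, W2, v)
    print(n, len(probs), round(sum(probs), 6))
```

Because the softmax runs over the encoder states rather than over a fixed vocabulary, the same weights produce a 3-way distribution for a 3-element input and a 7-way distribution for a 7-element input.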

What the actual data looks like (the figure shows the first of the 2 data pairs):

[Figure: data sample]

[Figure: data sample, image 2]