Wanted to check if the Attention layer operation implemented as a layer in the dnn module? Link to the "Attention is all you Need" paper https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf
1 | initial version |
Wanted to check if the Attention layer operation implemented as a layer in the dnn module? Link to the "Attention is all you Need" paper https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf