Understanding the Transformer in Ten Minutes
The Transformer is a model that uses the attention mechanism to speed up model training. For background on the attention mechanism, see this article (https://zhuanlan.zhihu.com/p/52119092). The Transformer can be described as a deep learning model based entirely on the self-attention mechanism, as it is suitable for parallel …
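
To make the self-attention idea mentioned above more concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. The excerpt does not give a formula, so this assumes the standard formulation, softmax(QK^T / sqrt(d_k)) V. The function names (`self_attention`, `matmul`, `softmax`) and the toy inputs are illustrative choices, not taken from the article.

```python
import math

def softmax(row):
    # Subtract the max before exponentiating for numerical stability.
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def matmul(a, b):
    # List-of-lists matrix product: (n x k) times (k x m) gives (n x m).
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def self_attention(x, w_q, w_k, w_v):
    # Project the same input sequence into queries, keys, and values.
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    v = matmul(x, w_v)
    d_k = len(k[0])
    # Every position is scored against every other position in one matrix product.
    scores = matmul(q, transpose(k))
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    # Each output row is a weighted average of the value vectors.
    return matmul(weights, v), weights

# Toy input: 3 tokens with 2 features each (hypothetical values).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Identity projections keep the example easy to follow; real models learn these.
identity = [[1.0, 0.0], [0.0, 1.0]]
output, weights = self_attention(x, identity, identity, identity)
```

No row of `weights` depends on the result of another row, so all positions can be computed at the same time. A recurrent model, by contrast, has to process the tokens one after another.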