Can the Attention Mechanism Be Interpreted?
Source: Harbin Institute of Technology SCIR

This article is approximately 9,300 words long, with a recommended reading time of 10+ minutes.

This article discusses the interpretability of the attention mechanism.

Introduction

Since Bahdanau introduced attention as soft alignment in neural machine translation in 2014, a large number of natural language processing works …