Rethinking the Attention Mechanism in Deep Learning

Rethinking the Attention Mechanism in Deep Learning

↑ ClickBlue Text Follow the Jishi Platform Author丨Cool Andy @ Zhihu Source丨https://zhuanlan.zhihu.com/p/125145283 Editor丨Jishi Platform Jishi Guide This article discusses the Attention mechanism in deep learning. It is not intended to review the various frameworks and applications of the Attention mechanism, but rather to introduce four representative and interesting works related to Attention and provide further … Read more

Understanding How to Build Multi-Task Visual Models

Understanding How to Build Multi-Task Visual Models

In the past two years, multi-task learning has been gradually replacing traditional single-task learning and becoming a mainstream research direction in the field of artificial intelligence. The reason is that multi-task learning allows us to gain as much AI capability as possible with minimal human input. For example, ChatGPT is a natural language generation model … Read more