From Word2Vec to GPT: Understanding the Family Tree of NLP Models

From Word2Vec to GPT: Understanding the Family Tree of NLP Models

Analyst Network of Machine Heart Author: Wang Zijia Editor: H4O This article starts from the ancestor level word2vec and systematically sorts out the “genealogy” of GPT and the large NLP “family group” led by word2vec. GPT did not emerge out of nowhere; it is the result of the efforts of many people and a long … Read more

It’s Time to Abandon RNN and LSTM for Sequence Modeling

It's Time to Abandon RNN and LSTM for Sequence Modeling

Selected from Medium Author: Eugenio Culurciello Translation by Machine Heart Contributors: Liu Xiaokun, Siyuan The author states: We have been trapped in the pit of RNNs, LSTMs, and their variants for many years; it is time to abandon them! In 2014, RNNs and LSTMs were revived. We all read Colah’s blog “Understanding LSTM Networks” and … Read more

Summary and Code Implementation of Attention Mechanism in Deep Learning (2017-2021)

Summary and Code Implementation of Attention Mechanism in Deep Learning (2017-2021)

Follow the official account "ML-CVer" Set as "Star", DLCV messages will be delivered! Author丨mayiwei1998 Source丨GiantPandaCV Editor丨极市平台 Abstract Due to the network structures in many papers being embedded into code frameworks, the code tends to be redundant. The author of this article has organized and reproduced the core code based on Attention networks from recent years. … Read more

Attention Mechanism in Recommendation Systems

Attention Mechanism in Recommendation Systems

Produced by NewBeeNLP Author: @Uesugi Shoji Leisure Meeting · Information Retrieval When the attention mechanism has become a rather tasteless technique, using or modifying the attention mechanism must focus on telling a good story: that is, ‘Why use Attention, and why modify Attention.’ Currently, from traditional CF, FM methods to NFM, DeepFM, although deep learning … Read more

In-Depth Understanding of Attention Mechanism in CV

In-Depth Understanding of Attention Mechanism in CV

All Fans Say: “Why Didn’t You Follow Me Sooner!” Hello everyone, I am Canshi. In the field of deep learning, there are many technical terms that can be confusing when first encountered. As you read more, you gradually get the hang of it, but it still feels somewhat lacking. Today, we will discuss a technical … Read more

Understanding Attention Mechanisms in Depth

Understanding Attention Mechanisms in Depth

Recently, I plan to organize the application of Attention in deep recommendation systems, so I wrote this introductory article about Attention. Since it was proposed in the 2015 ICLR paper “Neural machine translation by jointly learning to align and translate”, Attention has flourished in the fields of NLP and computer vision. What is so special … Read more

Understanding the Attention Mechanism in Deep Learning – Part 2

Understanding the Attention Mechanism in Deep Learning - Part 2

[GiantPandaCV Guide] In recent years, Attention-based methods have gained popularity in both academia and industry due to their interpretability and effectiveness. However, the network structures proposed in papers are often embedded within code frameworks for classification, detection, segmentation, etc., leading to redundancy in code. For beginners like me, it can be challenging to find the … Read more

Do We Still Need Attention in Transformers?

Do We Still Need Attention in Transformers?

Selected from interconnects Author: Nathan Lambert Translated by Machine Heart Machine Heart Editorial Team State-space models are on the rise; has attention reached its end? In recent weeks, there has been a hot topic in the AI community: implementing language modeling with attention-free architectures. In short, this refers to a long-standing research direction in the … Read more

An In-Depth Introduction to Attention Mechanism in CV

An In-Depth Introduction to Attention Mechanism in CV

In the field of deep learning, there are many specialized terms that can be quite overwhelming at first glance. However, as we delve deeper, we gradually start to understand them, albeit feeling like something is still missing. Today, we will discuss a specialized term called Attention mechanism! 1. Intuitive Understanding of Attention Imagine a scenario … Read more

Detailed Explanation of Attention Mechanism and Transformer in NLP

Detailed Explanation of Attention Mechanism and Transformer in NLP

Source | Zhihu Author | JayLou Link | https://zhuanlan.zhihu.com/p/53682800 Editor | Deep Learning Matters WeChat Public Account This article is for academic sharing only. If there is any infringement, please contact us to delete. This article summarizes the attention mechanism (Attention) in natural language processing in a Q&A format and provides an in-depth analysis of … Read more