From Word2Vec to BERT: The Evolution of NLP Pre-trained Models

Natural Language Processing. Author: Zhang Junlin; Source: Deep Learning Frontier Notes (Zhihu column); Original link: https://zhuanlan.zhihu.com/p/49271699
This article covers pre-training in natural language processing (NLP). It outlines how pre-training techniques in NLP gradually developed into the BERT model, showing how the ideas behind BERT were formed, …

From Word2Vec to BERT: The Evolution of Word Vectors

Machine Learning Algorithms and Natural Language Processing Recommendations. Author: Xiao Chuan Ryan; Source: https://zhuanlan.zhihu.com/p/58425003
[Introduction to Machine Learning Algorithms and Natural Language Processing] BERT did not come out of nowhere; this article offers some thoughts on how it can be derived from Word2Vec. Recently, my work has been closely related to pre-trained models, but I found that …

Comparison of Word Vectors in NLP: Word2Vec, GloVe, FastText, ELMo, GPT, BERT

Author: JayLou, NLP Algorithm Engineer; Zhihu column: High Energy NLP Journey; republished with authorization; Original: https://zhuanlan.zhihu.com/p/56382372
This article summarizes word vectors in natural language processing in a Q&A format, covering Word2Vec, GloVe, FastText, ELMo, and BERT. Table of Contents: 1. Text Representation and Comparison of Word Vectors 1. …

Pre-training Methods for Language Models in NLP

Recently, in the field of Natural Language Processing (NLP), pre-training methods for language models have brought significant improvements across many NLP tasks and attracted widespread attention. In this article, I summarize some relevant papers I have recently read, selecting a few representative models (including ELMo [1], OpenAI GPT [2], …

In-Depth Analysis of GPT’s Rise: 20 Years of NLP Turmoil

In the era of AI, semantic processing will become the core of operating systems, and the ceiling of every software technology will be set by advances in semantic technology.
By: Gong Zhouzhou; Editor: Yan Zi; Typesetting Editor: Xiao Shuang
Once again, GPT-4 has overshadowed all the established tech companies. In the third …

ACL 2019 Review: Trends in Natural Language Processing

Reported by New Intelligence; Source: Mihail Eric; Editors: Yuanzi, Daming
[New Intelligence Guide] An Alexa AI machine learning scientist reviews the ACL 2019 conference and summarizes current trends in the NLP field from several angles, including reducing bias, practical applications, and model integration capabilities. This week, at ACL 2019 held in …

Natural Language Processing (NLP) Algorithm Learning Path!

Yunzhong, from Aofeisi; Quantum Bit Editor | Public Account QbitAI
NLP has been very popular in recent years and is developing particularly fast, with technologies such as BERT, GPT-3, graph neural networks, and knowledge graphs emerging. We are in an era of information explosion, facing a flood of online resources and papers every day. Often, …

Must-See! Complete Collection of NLP Interview Questions (38)

Hello everyone! I am glad to have the opportunity to share common interview questions in the field of Natural Language Processing (NLP). As an important branch of artificial intelligence, NLP has developed rapidly in recent years and is widely applied across industries. Familiarity with these interview questions can …

Understanding the Nine Layers of Attention Mechanism

Author: Electric Light Phantom Alchemist (graduate topic Top 1, first place in Computer Science at Shanghai Jiao Tong University, first prize in high school physics competition, meme master, national award in computer science from Shanghai Jiao Tong University, currently a PhD student at CUHK); Source: https://zhuanlan.zhihu.com/p/362366192
Attention has become a hot topic in the …

Nine Layers of Understanding Attention Mechanism

Author: Electric Light Phantom Alchemy@Zhihu (Authorized); Editor: Extreme City Platform; Source: https://zhuanlan.zhihu.com/p/362366192
Extreme City Guide: Attention has become popular across the entire AI field; both machine vision and natural language processing are inseparable from Attention, Transformer, or BERT. The author of this article follows the EM nine-layer tower …