A Must-Read for Educators: The Double-Edged Sword of Generative AI

A Must-Read for Educators: The Double-Edged Sword of Generative AI

The wave of artificial intelligence is gradually rising, flowing into the field of education, and becoming a remedy for the “impossible triangle of education“. However, more and more studies indicate that immediate convenience may come at the cost of long-term capability. Who Is the Alarm Bell Ringing For? In 2024, Wharton School published a paper … Read more

Understanding AI: Overview of Five Deep Learning Models

Understanding AI: Overview of Five Deep Learning Models

Deep learning is an important branch of artificial intelligence that has made significant progress in recent years. Among them, RNN, CNN, Transformer, BERT, and GPT are five commonly used deep learning models that have achieved important breakthroughs in fields such as computer vision and natural language processing. This article will briefly introduce these five models … Read more

Full Speech by Shen Xiangyang at the Youth Scientist 50² Forum: 10 Thoughts on Large Models

Full Speech by Shen Xiangyang at the Youth Scientist 50² Forum: 10 Thoughts on Large Models

Click the blue text Follow us Shen Xiangyang Chairman of the Board of Hong Kong University of Science and Technology, Foreign Member of the National Academy of Engineering, USA On September 28, the 4th “Youth Scientist 50² Forum” was held at Southern University of Science and Technology,Foreign Member of the National Academy of Engineering, USAShen … Read more

What Does ‘GPT’ Mean in ChatGPT?

What Does 'GPT' Mean in ChatGPT?

Writing scripts, creating novels, coding, answering questions… the almost omnipotent ChatGPT has become a frequent topic on hot search lists in recent months. At the end of November last year, ChatGPT was launched and quickly went viral on social media. In just five days, the number of registered users exceeded 1 million; within two months, … Read more

Detailed Explanation of ChatGPT and InstructGPT

Detailed Explanation of ChatGPT and InstructGPT

Source: JD Cloud Dolphin Data Science Lab This article is approximately 7000 words long, suggested reading time is 15 minutes. To understand ChatGPT, we must first understand InstructGPT. Introduction The GPT series is a series of pre-trained models from OpenAI, where GPT stands for Generative Pre-Trained Transformer. As the name suggests, the purpose of GPT … Read more

The Evolution of Large Models: From Transformer to DeepSeek-R1

📖 Reading Time: 19 minutes 🕙 Release Date: February 14, 2025 ❝ Recent Hot Articles: The Most Comprehensive Mathematical Principles of Neural Networks (Code and Formulas) Intuitive Explanation Welcome to follow the Zhihu and WeChat public account columns LLM Architecture Column Zhihu LLM Column Zhihu【Boqi】 WeChat Public Account【Boqi Technology Talk】【Boqi Reading】 At the beginning of … Read more

BERT and GPT Outperform Transformers Without Attention or MLPs

BERT and GPT Outperform Transformers Without Attention or MLPs

Machine Heart reported Editors: Du Wei, Ze Nan This article explores the Monarch Mixer (M2), a new architecture that is sub-quadratic in both sequence length and model dimension, demonstrating high hardware efficiency on modern accelerators. From language models like BERT, GPT, and Flan-T5 to image models like SAM and Stable Diffusion, Transformers are sweeping the … Read more

Understanding the Working Principle of GPT’s Transformer Technology

Understanding the Working Principle of GPT's Transformer Technology

Introduction The Transformer was proposed in the paper“Attention is All You Need”, and is now the recommended reference model for Google Cloud TPU. By introducing self-attention mechanisms and positional encoding layers, it effectively captures long-distance dependencies in input sequences and performs excellently when handling long sequences. Additionally, the parallel computing capabilities of the Transformer model … Read more

What Is the Transformer Model?

What Is the Transformer Model?

Welcome to the special winter vacation column “High-Tech Lessons for Kids” brought to you by Science Popularization China! Artificial intelligence, as one of the most cutting-edge technologies today, is changing our lives at an astonishing speed. From smart voice assistants to self-driving cars, from AI painting to machine learning, it opens up a future full … Read more

How to Use GPT to Write Long Articles

How to Use GPT to Write Long Articles

///Providing art education, high school entrance exam, college entrance exam, and art graduate school consulting for thousands of students/// Why is GPT not able to write long articles well? Despite using GPT for a long time, many people still struggle to use it for writing long texts. Vague questioning instructions and the tendency to specify … Read more