Deep Learning: Structured Machine Learning Projects

Deep Learning: Structured Machine Learning Projects

Notes from Andrew Ng’s DeepLearning.ai Course 【Andrew Ng’s DeepLearning.ai Notes 1】Intuitive Explanation of Logistic Regression 【Andrew Ng’s DeepLearning.ai Notes 2】Simple Explanation of Neural Networks (Part 1) 【Andrew Ng’s DeepLearning.ai Notes 2】Simple Explanation of Neural Networks (Part 2) Having trouble with deep networks? Andrew Ng helps you optimize neural networks (1) 【DeepLearning.ai】Deep Learning: Optimizing Neural Networks … Read more

In-Depth Look at Cerebras: Architecture of the World’s Largest AI Chip

In-Depth Look at Cerebras: Architecture of the World's Largest AI Chip

Author|Sean Lie Translator|Hu Yanjun, Cheng Haoyuan In recent years, the scale of neural network models has grown exponentially, from BERT with over 100 million parameters in 2018 to GPT-3 with 175 billion parameters in 2020, an increase of three orders of magnitude in just two years, and this growth shows no signs of stopping. People … Read more

Exploring Pre-Trained Neural Networks for Feature Extraction

Exploring Pre-Trained Neural Networks for Feature Extraction

Introduction In this article, I will explore a common practice in representation learning—using the frozen states of pre-trained neural networks as feature extractors. Specifically, I am interested in comparing the performance of simple models trained using these extracted neural network features with that of fine-tuned neural networks initialized with transfer learning. The intended audience is … Read more

Understanding Deep Learning: From Neurons to BERT

Understanding Deep Learning: From Neurons to BERT

Ali Sister’s Guide: BERT, a landmark in the field of natural language processing, did not appear out of nowhere; it has its development principles behind it. Today, the Ant Financial Wealth Dialogue Algorithm Team has organized and compared the development history of deep learning models in the field of natural language processing. From simple neurons … Read more

Demystifying Large Language Models: Time to Implement Intelligent Cognitive Paradigms in Industry

Demystifying Large Language Models: Time to Implement Intelligent Cognitive Paradigms in Industry

Click Follow us for updates in blue above Cover image: A recent cognitive class on intelligence by the author, demystifying large language models from a comparative perspective “ 𝕀²·ℙarad𝕚g𝕞 Intelligent Square Paradigm Research: Writing to Deconstruct Intelligence。 After all, deep learning LLMs are not the entirety of AI, and the path to AGI is not … Read more

Overview of Transformer Compression

Overview of Transformer Compression

Large models based on the Transformer architecture are playing an increasingly important role in artificial intelligence, especially in the fields of natural language processing (NLP) and computer vision (CV). Model compression methods reduce their memory and computational costs, which is a necessary step for implementing Transformer models on practical devices. Given the unique architecture of … Read more

Real-Time Detection Transformer (RT-DETR) Combined with EBC for Superior Image Representation

Real-Time Detection Transformer (RT-DETR) Combined with EBC for Superior Image Representation

Click the card below to follow「AI Vision Engine」public account ( Note when adding: direction + school/company + nickname/name ) Event-based cameras (EBCs) are a biologically inspired alternative to traditional cameras, emerging due to their advantages in energy efficiency, temporal resolution, and high dynamic range. However, developing corresponding image analysis methods is quite challenging due to … Read more

Hands-On Coding to Learn Transformer Principles

Hands-On Coding to Learn Transformer Principles

AliMei Guide Learn about Transformer, and come write one with the author. As an engineering student, when learning about Transformer, it always feels like understanding is not solid enough unless I write one myself. Knowledge gained from books is often superficial; true understanding requires practice, so take time to debug a few times! Note: No … Read more

Current Research Status of Object Detection Algorithms Based on Transformer

Current Research Status of Object Detection Algorithms Based on Transformer

Object detection is a fundamental task in computer vision that requires us to locate and classify objects. The groundbreaking R-CNN family[1]-[3] and ATSS[4], RetinaNet[5], FCOS[6], PAA[7], and a series of variants[8][10] have made significant breakthroughs in the object detection task. One-to-many label assignment is the core solution, which assigns each ground truth box as a … Read more

Running Stable Diffusion on iPhone: An App That Generates Images in One Minute

Running Stable Diffusion on iPhone: An App That Generates Images in One Minute

Selected from liuliu.me Author: liuliu Translated by Machine Heart Machine Heart Editorial Team Stable Diffusion may soon become popular on mobile devices. Is it difficult to run Stable Diffusion on an iPhone? The author of this article provides the answer: it’s not difficult, and the iPhone still has 50% of its performance available. As we … Read more