A Comprehensive Overview of Graph Transformers

Shared by PanChuang AI. Source | Jishi Platform. Author | whistle@Zhihu. Original link | https://zhuanlan.zhihu.com/p/536489997. Reprinted from | Machine Learning Algorithms and Natural Language Processing. Introduction: Why use Transformers on graphs? Briefly, the benefits Graph Transformers (GT) bring: they can capture long-range dependencies, they mitigate the over-smoothing and over-squashing phenomena, and GT can even integrate GNN and frequency … Read more
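
As a rough illustration of the first benefit (a minimal sketch, not code from the survey): a Graph Transformer lets every node attend to every other node in one layer, so information does not have to travel hop by hop as in message-passing GNNs. The weight matrices and the optional edge_bias term (e.g. derived from shortest-path distances, in the style of Graphormer-like models) are illustrative assumptions.

```python
import torch

def graph_transformer_layer(x, W_q, W_k, W_v, edge_bias=None):
    # x: (num_nodes, d). Unlike GNN message passing, every node attends
    # to every other node in a single step, so long-range dependencies
    # are captured in one layer rather than over many hops.
    q, k, v = x @ W_q, x @ W_k, x @ W_v
    scores = q @ k.T / k.shape[-1] ** 0.5      # scaled dot products
    if edge_bias is not None:                  # optional structural bias,
        scores = scores + edge_bias            # e.g. shortest-path distances
    return torch.softmax(scores, dim=-1) @ v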

Beyond ConvNeXt! Transformer-Style Convolutional Network Visual Baseline Model Conv2Former

The MLNLP community is a well-known machine learning and natural language processing community at home and abroad, covering NLP master’s and doctoral students, university teachers, and industry researchers. The community’s vision is to promote communication and progress between academia and industry in natural language processing and machine learning, in China and internationally, especially for beginners. … Read more

Exploring Similarities Between Transformer, RNN, and SSM

Source: DeepHub IMBA. This article is approximately 4,000 words and a recommended 6-minute read. It explores Transformer, RNN, and Mamba 2: by probing the potential connections between these seemingly unrelated large language model (LLM) architectures, we may open new avenues for exchanging ideas between different models … Read more
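
One concrete connection of this kind (a minimal sketch, not the article's code): causal linear attention, i.e. attention without the softmax, can be computed exactly as an RNN whose state accumulates key-value outer products, which is one way a Transformer-style layer and a recurrent model turn out to be the same computation.

```python
import torch

def linear_attention_recurrent(q, k, v):
    # q, k, v: (seq_len, dim). Causal linear attention (no softmax)
    # computed as an RNN: the running state S accumulates k_t v_t^T,
    # so each step costs O(dim^2) and the sequence cost is linear.
    S = torch.zeros(k.shape[-1], v.shape[-1])  # recurrent state
    outputs = []
    for q_t, k_t, v_t in zip(q, k, v):
        S = S + torch.outer(k_t, v_t)          # state update
        outputs.append(q_t @ S)                # readout
    return torch.stack(outputs)
```

The recurrent form costs O(seq_len · dim²) instead of the O(seq_len² · dim) of materializing the attention matrix, which is the same linear-time property state-space models exploit.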

Plug and Play! Batch Transformer

Click on "Xiaobai Learns Vision" above, select "Star" or "Top" Heavyweight content delivered first Editor’s Recommendation Recently, I saw an interesting article discussing the question, ‘Why hasn’t batch attention been done?’ I would like to share it with everyone, hoping it will be helpful. Reprinted from丨NewBeeNLP © Author | Hou Zhi Institution | University of … Read more

Details of Transformers: 18 Questions Explained

Author | Wang Chen. Editor | Jishi Platform. Original link: https://www.zhihu.com/question/362131975/answer/3058958207. Introduction: Asking questions and thinking them through is the best way to learn. This article explains the overall structure and details of the Transformer by posing and … Read more

Analysis of Mamba: A New Architecture Challenging Transformers and Pytorch Implementation

Click the "Little White Learns Vision" above, select "Star" or "Top" Heavyweight content delivered in real-time Today we will study the paper “Mamba: Linear Time Series Modeling with Selective State Space” in detail. Mamba has been making waves in the AI community, touted as a potential competitor to Transformers. What exactly makes Mamba stand out … Read more

Introduction to Attention Mechanisms in Transformer Models and PyTorch Implementation

These mechanisms are core components of large language models (LLMs) such as GPT-4 and Llama. By understanding them, we can better grasp how these models work and where they can be applied. We will not only discuss the theory but also implement these attention mechanisms from scratch in Python and PyTorch. Through hands-on coding, we can gain … Read more
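
As a taste of the from-scratch approach the article takes, here is a minimal sketch of scaled dot-product self-attention in PyTorch; the weight matrices are illustrative stand-ins for learned projections.

```python
import math
import torch

def self_attention(x, W_q, W_k, W_v):
    # x: (seq_len, d_model). Project into queries, keys, values, then
    # weight each value by the softmax-normalized query-key similarity.
    q, k, v = x @ W_q, x @ W_k, x @ W_v
    scores = q @ k.T / math.sqrt(k.shape[-1])  # scaled dot products
    weights = torch.softmax(scores, dim=-1)    # rows sum to 1
    return weights @ v                         # (seq_len, d_v)

torch.manual_seed(0)
d_model, d_head = 8, 4
x = torch.randn(5, d_model)
out = self_attention(x,
                     torch.randn(d_model, d_head),
                     torch.randn(d_model, d_head),
                     torch.randn(d_model, d_head))
print(out.shape)  # torch.Size([5, 4])
```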

SparX: A Sparse Cross-Layer Connection Mechanism for Vision Mamba and Transformer

This article shares the AAAI 2025 paper SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks. The University of Hong Kong proposes SparX, a sparse cross-layer skip-connection mechanism that enhances Vision Mamba and Transformer networks, delivering strong performance and open-sourced … Read more
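
To make the general idea concrete (an illustrative sketch only, not SparX’s actual design): a sparse cross-layer connection lets a block fuse features from a sparse subset of earlier layers, rather than only its immediate predecessor (as in plain residual stacks) or all predecessors (as in DenseNet-style dense connections).

```python
import torch
import torch.nn as nn

class SparseCrossLayerBlock(nn.Module):
    # Illustrative only: fuse the current features with a sparse subset
    # of cached earlier-layer features (here, every `stride`-th layer).
    def __init__(self, dim, stride=2):
        super().__init__()
        self.stride = stride
        self.fuse = nn.Linear(2 * dim, dim)

    def forward(self, x, cache):
        # cache: list of earlier-layer features, each of shape (n, dim)
        selected = cache[::self.stride]           # sparse subset of layers
        if selected:
            skip = torch.stack(selected).mean(0)  # aggregate selected features
            x = self.fuse(torch.cat([x, skip], dim=-1))
        cache.append(x)
        return x
```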

Unlocking the Secrets of Self-Attention Extrapolation Defects: Ant Group’s New Transformer

Reprinted from | Machine … Read more

Recent Review of Multi-Object Tracking: Based on Transformer, Graph Models, Detection and Association, Siamese Networks

This article is approximately 11,000 words; recommended reading time 10+ minutes. It attempts to summarize and review the latest trends in multi-object tracking (MOT) in computer vision. Paper link: https://arxiv.org/pdf/2209.04796.pdf. Abstract: With the development of autonomous driving technology, multi-object … Read more