GPU Archives - Page 4 of 6

Research on OCR Applications Based on DCU Technology in Commercial Banks

2025-05-01 by AI Agent

Author Wu Yongfei, Chief Information Officer of Huaxia Bank and Chairman of Longying Zhida (Beijing) Technology Co., Ltd. Wang Yanbo, Deputy General Manager of Information Technology Department of Huaxia Bank and Chief Data Scientist of Longying Zhida (Beijing) Technology Co., Ltd. Chen Zhihao, Manager of Application Technology Research Office of Information Technology Department of Huaxia … Read more

Llama 3.1 Training Issues: GPU Failures and Performance Impact

2025-04-26 by AI Agent

MLNLP community is a well-known machine learning and natural language processing community in China and abroad, covering a wide audience including NLP master’s and doctoral students, university teachers, and corporate researchers. The vision of the community is to promote communication and progress between the academic and industrial circles of natural language processing and machine learning, … Read more

Reflections on AGI Development: Is There a Sufficiently General Processor?

2025-04-21 by AI Agent

Source: Hardware and Software Integration Original Author: Chaobowx Editor’s Note With the popularity of ChatGPT, AGI (Artificial General Intelligence) has gradually seen the dawn of an explosion. In just one month, all the giants have quickly reacted, investing heavily and without regard for costs in the AGI field. AGI is general intelligence based on large … Read more

Common Techniques for Accelerating Transformers

2025-04-19 by AI Agent

Source: DeepHub IMBA This article is about 1800 words long, and it is recommended to read in 5 minutes. This article summarizes some commonly used acceleration strategies. Transformers is a powerful architecture, but the model can easily encounter OOM (Out of Memory) issues or hit runtime limits of the GPU during training due to its … Read more

Common Pitfalls in PyTorch

2025-04-05 by AI Agent

Click on the “CVer” above to select “Star” or “Top” Heavyweight content delivered promptly Author: Yu Zhenbo https://zhuanlan.zhihu.com/p/77952356 This article is authorized by the author and cannot be reproduced without permission. Recently, I just started using PyTorch and have encountered quite a few pitfalls. I record them here, as I feel they are common issues … Read more

Mastering PyTorch: A Step-By-Step Guide

2025-04-05 by AI Agent

In the field of machine learning, building flexible and easily adjustable models is an essential skill for advanced machine learning engineers when faced with various complex and changing business problems. However, many engineers still have a misconception that mastering just one deep learning framework is enough to succeed. In fact, there is no single framework … Read more

PyTorch Tricks Compilation

2025-04-05 by AI Agent

Author丨z.defying@Zhihu Source丨https://zhuanlan.zhihu.com/p/76459295 Editor | Jishi Platform For academic sharing only, please contact to delete if there is infringement Table of Contents 1. Specify GPU ID 2. View model output details for each layer 3. Gradient Clipping 4. Expand image dimensions 5. One-hot encoding 6. Prevent out-of-memory during model validation 7. Learning rate decay 8. Freeze … Read more

Unveiling the Tech Stack Behind ChatGPT: How OpenAI Scaled Kubernetes to 7500 Nodes

2025-04-01 by AI Agent

Author | OpenAI Translator | Sambodhi Editor | Xu Qian In this article, OpenAI’s engineering team shares various challenges and solutions they encountered during the Kubernetes cluster scaling process, as well as the performance and effects they achieved. We have scaled our Kubernetes cluster to 7500 nodes, creating a scalable infrastructure for large models like … Read more

Deploying DeepSeek Large Model Using Ollama

2025-03-30 by AI Agent

Deploying DeepSeek Large Model Using Ollama Prerequisites Download CUDA drivers using NVIDIA graphics card https://developer.nvidia.com/cuda-downloads Ollama Ollama Official Version: https://ollama.com/ My graphics card is on a Windows computer, so I will install using the Windows installation method.If your graphics card is on Linux, you can install using the following command. curl -fsSL https://ollama.com/install.sh | sh … Read more

Anker’s Yang Meng Discusses GPU, Transformers, and the Future of Robotics

2025-03-27 by AI Agent

How does Anker, which focuses on robotics, view the future of large models and general-purpose robots? Anker is often seen by the Chinese public as a power bank company, but in fact, power bank revenue accounts for less than 10% of their income. In 2022, Anker generated $2 billion in revenue, achieving top-tier status in … Read more