CUDA Archives - Page 3 of 3

Settings for Reproducible Experiments in PyTorch

2025-05-26 by AI Agent

Click on the above “Beginner’s Guide to Vision” to choose to add “Star” or “Pin“ Important content delivered promptly Author: Alxander@Zhihu (authorized) Source: https://zhuanlan.zhihu.com/p/448284000 Editor: Jishi Platform Jishi Guide During the training process in deep learning, due to random initialization and the randomness of sample reading, repeated experimental results may differ, with some variations being … Read more

Understanding PyTorch Memory Management Mechanism

2025-05-25 by AI Agent

Author丨Mialo@Zhihu Source丨https://zhuanlan.zhihu.com/p/486360176 1. Background Introduction Analyzing the PyTorch memory management mechanism primarily aims to reduce the impact of “memory fragmentation”. A simple example is as follows: As shown in the figure above, suppose we want to allocate 800MB of memory. Although the total free memory is 1000MB, the free memory shown in the upper figure … Read more

Common Pitfalls in PyTorch

2025-04-05 by AI Agent

Click the “CVer” above to select “Star” or “Pin”. Heavyweight content delivered at the first time. Author: Bi Ji Ji https://zhuanlan.zhihu.com/p/59271905 This article is authorized, and no secondary reproduction is allowed without permission. 1. The Differences Between nn.Module.cuda() and Tensor.cuda() Both the cuda() function can achieve memory migration from CPU to GPU for models and … Read more

Running Deekseek-R1 Distillation Model with Llama Edge

2025-03-30 by AI Agent

DeepSeek-R1 uses reinforcement learning to significantly enhance the model’s inference capabilities. In tasks such as mathematics, coding, and natural language reasoning, its performance rivals that of OpenAI’s official version o1.The small model distilled from DeepSeek-R1 effectively inherits the reasoning patterns learned by the large model.This article primarily tests DeepSeek-R1-Distill-Llama-8B-GGUF using Llama Edge. Welcome to experiment … Read more

DeepSeek: Unraveling the AGI Black Box

2025-03-27 by AI Agent

As tech giants erect parameter monuments in the desert of computing power, a squad of engineers adorned with dynamic routing badges is cutting open the metal abdomen of large models with algorithm welding guns. The latest leaked battle map from the DeepSeek laboratory shows that their open-source model is rewriting the underlying game theory of … Read more

Faster R-CNN Model and Deep Learning Environment Setup

2025-02-06 by AI Agent

1. Faster R-CNN Model The R-CNN series networks are the most classic networks in the field of object detection, and their model update ideas are easy to understand. The object detection process is divided into three stages: candidate box generation, feature extraction, classification, and regression. R-CNN is a detection network assembled from many modules, where … Read more