Essential Pytorch Knowledge Explained

Introduction When constructing models, especially when dealing with the input-output data formats between layers, some commonly used data processing functions such as tensor calculations and broadcasting mechanisms are very important. They remain indispensable when later using the Transformers library with pre-trained models. This article aims to explain the most commonly used Pytorch processing functions for … Read more

Common Pitfalls in PyTorch

Common Pitfalls in PyTorch

Click the “CVer” above to select “Star” or “Pin”. Heavyweight content delivered at the first time. Author: Bi Ji Ji https://zhuanlan.zhihu.com/p/59271905 This article is authorized, and no secondary reproduction is allowed without permission. 1. The Differences Between nn.Module.cuda() and Tensor.cuda() Both the cuda() function can achieve memory migration from CPU to GPU for models and … Read more

Common Pitfalls in PyTorch

Common Pitfalls in PyTorch

Click on the “CVer” above to select “Star” or “Top” Heavyweight content delivered promptly Author: Yu Zhenbo https://zhuanlan.zhihu.com/p/77952356 This article is authorized by the author and cannot be reproduced without permission. Recently, I just started using PyTorch and have encountered quite a few pitfalls. I record them here, as I feel they are common issues … Read more

Pytorch-Lightning: A Simplified Framework for Python Deep Learning

Pytorch-Lightning: A Simplified Framework for Python Deep Learning

Pytorch-Lightning: A Simplified Framework for Python Deep Learning! Pytorch-Lightning is a lightweight framework based on PyTorch, specifically designed to simplify the training process of deep learning models. It abstracts common coding patterns, allowing you to focus on building models without getting bogged down in cumbersome details. Today, we’ll discuss the basic usage of this framework … Read more