Common Pitfalls in PyTorch

Common Pitfalls in PyTorch

Click on the “CVer” above to select “Star” or “Top” Heavyweight content delivered promptly Author: Yu Zhenbo https://zhuanlan.zhihu.com/p/77952356 This article is authorized by the author and cannot be reproduced without permission. Recently, I just started using PyTorch and have encountered quite a few pitfalls. I record them here, as I feel they are common issues … Read more

Mastering PyTorch: A Step-By-Step Guide

Mastering PyTorch: A Step-By-Step Guide

In the field of machine learning, building flexible and easily adjustable models is an essential skill for advanced machine learning engineers when faced with various complex and changing business problems. However, many engineers still have a misconception that mastering just one deep learning framework is enough to succeed. In fact, there is no single framework … Read more

PyTorch Tricks Compilation

PyTorch Tricks Compilation

Author丨z.defying@Zhihu Source丨https://zhuanlan.zhihu.com/p/76459295 Editor | Jishi Platform For academic sharing only, please contact to delete if there is infringement Table of Contents 1. Specify GPU ID 2. View model output details for each layer 3. Gradient Clipping 4. Expand image dimensions 5. One-hot encoding 6. Prevent out-of-memory during model validation 7. Learning rate decay 8. Freeze … Read more

Unveiling the Tech Stack Behind ChatGPT: How OpenAI Scaled Kubernetes to 7500 Nodes

Unveiling the Tech Stack Behind ChatGPT: How OpenAI Scaled Kubernetes to 7500 Nodes

Author | OpenAI Translator | Sambodhi Editor | Xu Qian In this article, OpenAI’s engineering team shares various challenges and solutions they encountered during the Kubernetes cluster scaling process, as well as the performance and effects they achieved. We have scaled our Kubernetes cluster to 7500 nodes, creating a scalable infrastructure for large models like … Read more

Deploying DeepSeek Large Model Using Ollama

Deploying DeepSeek Large Model Using Ollama

Deploying DeepSeek Large Model Using Ollama Prerequisites Download CUDA drivers using NVIDIA graphics card https://developer.nvidia.com/cuda-downloads Ollama Ollama Official Version: https://ollama.com/ My graphics card is on a Windows computer, so I will install using the Windows installation method.If your graphics card is on Linux, you can install using the following command. curl -fsSL https://ollama.com/install.sh | sh … Read more

Anker’s Yang Meng Discusses GPU, Transformers, and the Future of Robotics

Anker's Yang Meng Discusses GPU, Transformers, and the Future of Robotics

How does Anker, which focuses on robotics, view the future of large models and general-purpose robots? Anker is often seen by the Chinese public as a power bank company, but in fact, power bank revenue accounts for less than 10% of their income. In 2022, Anker generated $2 billion in revenue, achieving top-tier status in … Read more

Midjourney V6 Style Reference Command Updates

Midjourney V6 Style Reference Command Updates

MJ Announcement: –sref has updated the latest style reference algorithm command this morning –sv, which works best when combined with –sw parameter (based on testing). Official Statement The official announcement states that two new style reference algorithms have been released, which is a new “version control” feature, and the image scoring has reopened free GPU … Read more

Agentic AI Era: NVIDIA’s Technological Revolution and Ambitions

Agentic AI Era: NVIDIA's Technological Revolution and Ambitions

Agentic AI Era: NVIDIA’s Technological Revolution and Ambitions In the field of artificial intelligence, technological advancements often reshape our world at an astonishing pace. Today, we stand on the cusp of a new technological revolution – the dawn of the Agentic AI Era. In this revolution, NVIDIA is not just a participant but a leader. … Read more

TensorFlow Installation Guide – Using Pip to Install TensorFlow

TensorFlow Installation Guide - Using Pip to Install TensorFlow

There are many ways to install TensorFlow. This article will provide a detailed guide on how to install TensorFlow using pip. Available Installation Packages tensorflow — Current version for CPU only (recommended for beginners) tensorflow-gpu — Current version with GPU support (Ubuntu and Windows) tf-nightly — Nightly build for CPU only (unstable) tf-nightly-gpu — Nightly … Read more

How Competitive Is Large Model Training? Unraveling the Mystery of Large Model Computing Power

How Competitive Is Large Model Training? Unraveling the Mystery of Large Model Computing Power

Image Source @GPT Generated ▎The premise of competing for computing power is that computing power is becoming a new business model. The craze of “alchemy” for large models will pass, and computing power service providers need to prepare for the future and pivot in a timely manner. Author|Qin Conghui Editor|Gai Hongda This article was first … Read more