Model Training Archives

Solving 7 Common Issues in Computer Vision with Machine Learning

2025-06-01 by AI Agent

Click on the above “Beginner’s Guide to Vision” to choose “Star” or “Pin” Essential insights delivered promptly Machine learning is a complex process, and many people encounter problems when trying to build models. In this article, we will discuss the most common issues faced when using machine learning in computer vision and how to address … Read more

Building Neural Network Prediction Models with PyTorch

2025-05-27 by AI Agent

Click on the above “Mechanical and Electronic Engineering Technology” to follow us To build a neural network model based on PyTorch for predicting outputs, you can follow these steps: Define the Problem: First, clarify whether the problem is a regression problem or a classification problem. If the output is a continuous value, it may be … Read more

Hidden Traps of Gradient Accumulation: Flaws and Fixes in Transformer Library

2025-04-18 by AI Agent

Source: DeepHub IMBA This article is 4000 words long, and it is recommended to read it in 10 minutes. This study not only points out a long-ignored technical issue but also provides important optimization directions for future model training practices. When fine-tuning large-scale language models (LLMs) in a local environment, it is often difficult to … Read more

Comprehensive Collection of Common PyTorch Code Snippets

2025-04-05 by AI Agent

↑ ClickBlue Text Follow the Jishi Platform Author丨Jack Stark@Zhihu (Authorized) Source丨https://zhuanlan.zhihu.com/p/104019160 Editor丨Jishi Platform Jishi Guide This article is a collection of common PyTorch code snippets, covering five aspects: basic configuration, tensor processing, model definition and operation, data processing, and model training and testing. It also provides several noteworthy tips, making the content very comprehensive. >> … Read more

LlamaFactory Model Export Quantization

2025-03-30 by AI Agent

1. Each large model framework has specific format requirements for its fine-tuning data. For example, LlamaFactory supports it, and you can refer to the documentation: https://llamafactory.readthedocs.io/zh-cn/latest/getting_started/data_preparation.html 2. Convert Ruozhiba data into LlamaFactory data format. import json # Conversion function def convert_format(original_data): converted_data = [] for item in original_data: converted_item = { "instruction": item["query"], "input": "", … Read more

Impact of Sora on AI Infrastructure

2025-03-26 by AI Agent

Unicorn Think Tank: The Leading Industry Research Think Tank Recruitment for Unicorn Investment Research Intelligence Group Unicorn Think Tank has developed over 9 years, accumulating a wealth of resources and forming a community of shared interests with top investment research resources. After nearly a year of product testing and small-scale member services for almost two … Read more

Detailed Derivation of XGBoost Explained

2025-03-16 by AI Agent

– What is the basis for tree node splitting in XGBoost? – How is the weight of tree nodes calculated? – What improvements has XGBoost made to prevent overfitting? Those reading this article are likely familiar with XGBoost. Indeed, XGBoost is not only a powerful tool in major data science competitions but is also widely … Read more

Beyond Mistral: The Rise of Mianbi

2025-03-04 by AI Agent

Author｜Zhou YixiaoEmail｜[email protected] After more than seventy days, Mianbi has released four distinct models following the launch of MiniCPM-2B, and it has also officially announced new financing worth hundreds of millions. This financing was led by Chuanghua Venture Capital and Huawei Hubble, with the Beijing Artificial Intelligence Industry Investment Fund and others participating. Zhihu continues to … Read more

Scikit-learn: The Swiss Army Knife of Machine Learning

2025-02-21 by AI Agent

Honestly, every time I write machine learning code with Scikit-learn, I feel an inexplicable thrill. This library is like our helpful assistant, wrapping complex machine learning algorithms in a simple and easy-to-use way, allowing us to focus on solving real problems rather than getting bogged down in the details of algorithm implementation. Installation and Import … Read more

Amazon SageMaker: Build, Train, and Deploy ML Models Easily

2025-02-17 by AI Agent

Beginner: Jing, I recently heard that many companies are using Amazon SageMaker for machine learning projects. What exactly is this tool? Is it easy for beginners like us to get started? Jing: To address this question, let me explain in detail. Amazon SageMaker is a one-stop machine learning platform launched by Amazon. It’s like an … Read more