XGBoost Tutorial: A Comprehensive Guide

XGBoost Tutorial: A Comprehensive Guide

Source: Machine Learning Algorithms This article is about 8400 words long and is recommended for a 10-minute read. This article provides a detailed explanation of the engineering application methods of XGBoost. The illustrated machine learning practical application demonstrates the application process and chain of machine learning algorithms in a case-driven and code-driven manner, mastering the … Read more

Summary of Reasons for Neural Network Training Not Converging or Failing

Click on 'Xiaobai Learns Vision' above, select to add 'star' or 'top' Important content delivered first Introduction This article analyzes the reasons for model training not converging or failing from both data and model perspectives. Four possible reasons from the data aspect and nine possible issues from the model aspect are summarized. In addition, the … Read more

Solving 7 Common Issues in Computer Vision with Machine Learning

Solving 7 Common Issues in Computer Vision with Machine Learning

Click on the above “Beginner’s Guide to Vision” to choose “Star” or “Pin” Essential insights delivered promptly Machine learning is a complex process, and many people encounter problems when trying to build models. In this article, we will discuss the most common issues faced when using machine learning in computer vision and how to address … Read more

Building Neural Network Prediction Models with PyTorch

Click on the above “Mechanical and Electronic Engineering Technology” to follow us To build a neural network model based on PyTorch for predicting outputs, you can follow these steps: Define the Problem: First, clarify whether the problem is a regression problem or a classification problem. If the output is a continuous value, it may be … Read more

Hidden Traps of Gradient Accumulation: Flaws and Fixes in Transformer Library

Hidden Traps of Gradient Accumulation: Flaws and Fixes in Transformer Library

Source: DeepHub IMBA This article is 4000 words long, and it is recommended to read it in 10 minutes. This study not only points out a long-ignored technical issue but also provides important optimization directions for future model training practices. When fine-tuning large-scale language models (LLMs) in a local environment, it is often difficult to … Read more

Comprehensive Collection of Common PyTorch Code Snippets

Comprehensive Collection of Common PyTorch Code Snippets

↑ ClickBlue Text Follow the Jishi Platform Author丨Jack Stark@Zhihu (Authorized) Source丨https://zhuanlan.zhihu.com/p/104019160 Editor丨Jishi Platform Jishi Guide This article is a collection of common PyTorch code snippets, covering five aspects: basic configuration, tensor processing, model definition and operation, data processing, and model training and testing. It also provides several noteworthy tips, making the content very comprehensive. >> … Read more

LlamaFactory Model Export Quantization

LlamaFactory Model Export Quantization

1. Each large model framework has specific format requirements for its fine-tuning data. For example, LlamaFactory supports it, and you can refer to the documentation: https://llamafactory.readthedocs.io/zh-cn/latest/getting_started/data_preparation.html 2. Convert Ruozhiba data into LlamaFactory data format. import json # Conversion function def convert_format(original_data): converted_data = [] for item in original_data: converted_item = { "instruction": item["query"], "input": "", … Read more

Impact of Sora on AI Infrastructure

Impact of Sora on AI Infrastructure

Unicorn Think Tank: The Leading Industry Research Think Tank Recruitment for Unicorn Investment Research Intelligence Group Unicorn Think Tank has developed over 9 years, accumulating a wealth of resources and forming a community of shared interests with top investment research resources. After nearly a year of product testing and small-scale member services for almost two … Read more

Detailed Derivation of XGBoost Explained

Detailed Derivation of XGBoost Explained

– What is the basis for tree node splitting in XGBoost? – How is the weight of tree nodes calculated? – What improvements has XGBoost made to prevent overfitting? Those reading this article are likely familiar with XGBoost. Indeed, XGBoost is not only a powerful tool in major data science competitions but is also widely … Read more

Beyond Mistral: The Rise of Mianbi

Beyond Mistral: The Rise of Mianbi

Author|Zhou YixiaoEmail|[email protected] After more than seventy days, Mianbi has released four distinct models following the launch of MiniCPM-2B, and it has also officially announced new financing worth hundreds of millions. This financing was led by Chuanghua Venture Capital and Huawei Hubble, with the Beijing Artificial Intelligence Industry Investment Fund and others participating. Zhihu continues to … Read more