Understanding CatBoost Alongside XGBoost and LightGBM

Understanding CatBoost Alongside XGBoost and LightGBM

This article provides an overview of the content: 1. Introduction to CatBoost CatBoost is a machine learning library open-sourced by the Russian search giant Yandex in 2017 and is a type of Boosting algorithm. CatBoost, along with XGBoost and LightGBM, is known as one of the three mainstream tools for GBDT, all of which are … Read more

Introduction to XGBoost Principles

Introduction to XGBoost Principles

1 Algorithm Overview XGBoost (eXtreme Gradient Boosting) is also a member of the Boosting family. To understand its working principle, we first need to briefly introduce the related concepts of AdaBoost and GBDT. AdaBoost focuses on misclassified samples, increasing the weight of misclassified samples each time to train new classifiers. XGBoost is essentially a GBDT, … Read more

Understanding the Decision Process of XGBoost Machine Learning Model

Understanding the Decision Process of XGBoost Machine Learning Model

Using the XGBoost algorithm often yields good results in Kaggle and other data science competitions, making it popular among practitioners. This article analyzes the prediction process of the XGBoost machine learning model using a specific dataset and demonstrates the results through visualization, allowing us to better understand the model’s prediction process. As the industrial application … Read more

Summary of XGBoost Parameter Tuning

Summary of XGBoost Parameter Tuning

XGBoost has shone in Kaggle competitions. In previous articles, the principles of the XGBoost algorithm and the XGBoost splitting algorithm were introduced. Most explanations of XGBoost parameters found online only scratch the surface, making it extremely unfriendly for those new to machine learning algorithms. This article will explain some important parameters while referencing mathematical formulas … Read more

Understanding XGBoost: A Comprehensive Guide

Understanding XGBoost: A Comprehensive Guide

SIGAI Recommendation SIGAI Resource Summary 60% off on courses taught by Teacher Lei Ming XGBoost is a hot algorithm suitable for analyzing abstract data problems, achieving great results in competitions like Kaggle. Although there are many articles introducing the principles and use of XGBoost, few can clearly and thoroughly explain its principles. The goal of … Read more

An Explanation and Derivation of the XGBoost Algorithm

An Explanation and Derivation of the XGBoost Algorithm

This article is excerpted from “Introduction to Machine Learning Basics (Micro Course Version)” 10.5 XGBoost Algorithm XGBoost is a machine learning algorithm based on the gradient boosting algorithm (GBDT) invented by PhD student Tianqi Chen from the University of Washington in February 2014. This algorithm not only has excellent learning performance but also trains efficiently, … Read more

How to Use XGBoost for Time Series Forecasting

How to Use XGBoost for Time Series Forecasting

↑↑↑ Follow and Star “Datawhale” Daily insights & monthly study groups, don’t miss out Datawhale Insights Source: Jason Brownlee, Organized by Data Science THU This article is approximately 3300 words, and is recommended to read in 10minutes This article introduces how to use XGBoost for time series forecasting, including transforming time series into a supervised … Read more

Basics of Machine Learning: Machine Learning and Materials/Chemistry

Basics of Machine Learning: Machine Learning and Materials/Chemistry

How to Obtain 1. Follow the public account below, and click 【Like】 and 【View】 in this article 2. Click 【Get Course】 in the public account to obtain this material There is a course on Basics of Machine Learning: Machine Learning and Materials/Chemistry Basics of Machine Learning: Machine Learning and Materials/Chemistry 1. Introduction to Machine Learning … Read more

Predicting POD24 in Follicular Lymphoma Using ML Models

Predicting POD24 in Follicular Lymphoma Using ML Models

POD24 significantly affects the overall survival (OS) of patients with follicular lymphoma (FL) within 24 months of disease progression. Existing FL risk scoring models have been developed using statistical methods such as logistic regression or Cox regression. The rapid development of machine learning has facilitated the advancement of prognostic models. In China, Professor Song Yuqin, … Read more