Unveiling GBDT: The Superhero of Machine Learning

The Gradient Boosting Decision Tree (GBDT), as a leader in ensemble learning, has won the favor of many data scientists due to its excellent performance in classification and regression tasks. This article will take you deep into the mysterious veil of GBDT and demonstrate how to efficiently implement GBDT using the sklearn library. 1. What … Read more

Implementing Machine Learning Algorithms: CatBoost

Implementing Machine Learning Algorithms: CatBoost

Click the above “Beginner’s Guide to Vision” to select and add a Bookmark or “Pin” Heavyweight content delivered first-hand This article introduces the last powerful engineering implementation model in the GBDT series—CatBoost. CatBoost is one of the three mainstream models under the GBDT framework, alongside XGBoost and LightGBM. CatBoost, open-sourced by the Russian search giant … Read more

LightGBM: A Gradient Boosting Tree Algorithm for Large-Scale Data

LightGBM: A Gradient Boosting Tree Algorithm for Large-Scale Data

1 Algorithm Introduction LightGBM (Light Gradient Boosting Machine, hereinafter referred to as LGBM) is an efficient and scalable machine learning algorithm based on Gradient Boosted Decision Trees (GBDT). As a member of the GBDT framework algorithms and a successor to the XGB algorithm, LGBM effectively integrates a series of advantages from previous GBDT algorithms, including … Read more

Summary of Decision Trees, Random Forests, Bagging, Boosting, Adaboost, GBDT, and XGBoost

Summary of Decision Trees, Random Forests, Bagging, Boosting, Adaboost, GBDT, and XGBoost

Official WeChat account of Tsinghua Big Data Software Team Source: Zhihu This article is about 5000 words long, and it is recommended to read for 5 minutes. This article systematically summarizes the related content about decision trees, random forests, etc. 1、Decision Tree A decision tree is a supervised classification model that essentially selects a feature … Read more

Mathematical Derivation and Pure Python Implementation of Machine Learning Algorithm: XGBoost

Mathematical Derivation and Pure Python Implementation of Machine Learning Algorithm: XGBoost

Click the above “Beginner Learning Vision” to choose to add Star or “Pin” Important content delivered in real-time Since Chen Tianqi proposed XGBoost in 2015, this model has been frequently used as a powerful tool in major data competitions. Its greatest advantages are speed and effectiveness. XGBoost is of the same lineage as GBDT and … Read more

XGBoost: The Next Wave in Machine Learning

XGBoost: The Next Wave in Machine Learning

1 Algorithm Introduction XGBoost (eXtreme Gradient Boosting) is an algorithm based on GBDT, which is an ensemble machine learning algorithm based on decision trees, using Gradient Boosting as its framework. In 2016, Tianqi Chen formally proposed the XGBoost algorithm in his paper “XGBoost: A Scalable Tree Boosting System.” This algorithm efficiently implements GBDT and has … Read more

Common Interview Questions About XGBoost

Common Interview Questions About XGBoost

XGBoost is well-known as a powerful tool in data science competitions and is widely used in the industry. This article shares a collection of frequently asked interview questions about XGBoost that I have compiled over the years, hoping to deepen everyone’s understanding of XGBoost and, more importantly, to provide some assistance when seeking opportunities. 1. … Read more

Comprehensive Explanation of XGBoost Algorithm

Comprehensive Explanation of XGBoost Algorithm

This article is a part of Chapter 10 of the book “Introduction to Machine Learning Basics” (by Huang Haiguang). XGBoost Algorithm XGBoost is a machine learning algorithm based on the Gradient Boosting Decision Tree (GBDT) invented in February 2014 by PhD student Chen Tianqi from the University of Washington. This algorithm not only has excellent … Read more

Understanding CatBoost Alongside XGBoost and LightGBM

Understanding CatBoost Alongside XGBoost and LightGBM

This article provides an overview of the content: 1. Introduction to CatBoost CatBoost is a machine learning library open-sourced by the Russian search giant Yandex in 2017 and is a type of Boosting algorithm. CatBoost, along with XGBoost and LightGBM, is known as one of the three mainstream tools for GBDT, all of which are … Read more

An Explanation and Derivation of the XGBoost Algorithm

An Explanation and Derivation of the XGBoost Algorithm

This article is excerpted from “Introduction to Machine Learning Basics (Micro Course Version)” 10.5 XGBoost Algorithm XGBoost is a machine learning algorithm based on the gradient boosting algorithm (GBDT) invented by PhD student Tianqi Chen from the University of Washington in February 2014. This algorithm not only has excellent learning performance but also trains efficiently, … Read more