A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

data-mining decision-trees distributed gbdt gbm gbrt gradient-boosting kaggle lightgbm machine-learning microsoft parallel python r

Перейти к файлу

James Lamb ed651e8672 [R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 ) * [R-package] enable use of trees with linear models at leaves (fixes #3319) * remove problematic pragmas * fix tests * try to fix build scripts * try fixing pragma check * more pragma checks * ok fix pragma stuff for real * empty commit * regenerate documentation * try skipping test * uncomment CI * add note on missing value types for R * add tests on saving and re-loading booster		2021-01-18 15:44:38 +03:00
.ci	[ci][docs] update linkchecker (#3773 )	2021-01-17 00:28:01 +03:00
.github	[ci] Slightly optimize optional workflows checks (#3762 )	2021-01-15 09:14:14 -06:00
.nuget	[CI] fix nuget version	2020-08-07 08:46:13 +08:00
R-package	[R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 )	2021-01-18 15:44:38 +03:00
compute@36c89134d4	[ci] update CI stuff (#2079 )	2019-04-09 10:23:32 +08:00
docker	[python] Drop Python 2 support (#3581 )	2020-12-09 13:32:00 +03:00
docs	[R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 )	2021-01-18 15:44:38 +03:00
eigen@8ba1b0f41a	Trees with linear models at leaves (#3299 )	2020-12-24 14:01:23 +08:00
examples	Trees with linear models at leaves (#3299 )	2020-12-24 14:01:23 +08:00
external_libs	Fix model locale issue and improve model R/W performance. (#3405 )	2020-12-08 21:36:24 +08:00
helpers	[python] save all param values into model file (#2589 )	2020-03-06 15:42:49 +03:00
include/LightGBM	[R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 )	2021-01-18 15:44:38 +03:00
pmml	[docs][python] made OS detection more reliable and little docs improvements (#1414 )	2018-06-03 12:46:59 +03:00
python-package	[dask] [python-package] Search for available ports when setting up network (fixes #3753 ) (#3766 )	2021-01-15 10:59:45 -06:00
src	[R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 )	2021-01-18 15:44:38 +03:00
swig	[refactor] SWIG - Split pointer manipulation to individual .i file (#3538 )	2020-11-24 12:49:49 +08:00
tests	completely remove tempfile from test_basic (#3767 )	2021-01-15 11:16:12 -06:00
windows	[python-package] remove unused Eigen files, compile with EIGEN_MPL2_ONLY (fixes #3684 ) (#3685 )	2020-12-30 02:16:34 +03:00
.appveyor.yml	[docs][ci] added docs about GPU support out of the box for Windows wheels and small refactoring for dual test (#3660 )	2020-12-22 14:20:01 +03:00
.editorconfig	added editorconfig (#2403 )	2019-09-16 14:38:26 +03:00
.gitignore	[python][tests] small Python tests cleanup (#3715 )	2021-01-04 21:11:24 +03:00
.gitmodules	Trees with linear models at leaves (#3299 )	2020-12-24 14:01:23 +08:00
.readthedocs.yml	[docs] fix R documentation builds (fixes #3655 ) (#3656 )	2020-12-18 13:37:58 -06:00
.vsts-ci.yml	[ci] remove Travis (fixes #3519 ) (#3672 )	2021-01-13 09:55:58 -06:00
CMakeIntegratedOpenCL.cmake	Add option to build with integrated OpenCL (#3144 )	2020-09-21 20:48:53 +03:00
CMakeLists.txt	[python-package] remove unused Eigen files, compile with EIGEN_MPL2_ONLY (fixes #3684 ) (#3685 )	2020-12-30 02:16:34 +03:00
CODE_OF_CONDUCT.md	Create CODE_OF_CONDUCT.md (#803 )	2017-08-18 19:01:47 +08:00
CONTRIBUTING.md	Better documentation for Contributing (#2781 )	2020-02-22 10:56:14 +08:00
LICENSE	added editorconfig (#2403 )	2019-09-16 14:38:26 +03:00
README.md	[ci] remove Travis (fixes #3519 ) (#3672 )	2021-01-13 09:55:58 -06:00
VERSION.txt	[ci] Bump version for development (#3633 )	2020-12-09 13:27:37 +03:00
build-cran-package.sh	[R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 )	2021-01-18 15:44:38 +03:00
build_r.R	[R-package] enable use of trees with linear models at leaves (fixes #3319 ) (#3699 )	2021-01-18 15:44:38 +03:00

README.md

Light Gradient Boosting Machine

LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed and efficient with the following advantages:

Faster training speed and higher efficiency.
Lower memory usage.
Better accuracy.
Support of parallel and GPU learning.
Capable of handling large-scale data.

For further details, please refer to Features.

Benefitting from these advantages, LightGBM is being widely-used in many winning solutions of machine learning competitions.

Comparison experiments on public datasets show that LightGBM can outperform existing boosting frameworks on both efficiency and accuracy, with significantly lower memory consumption. What's more, parallel experiments show that LightGBM can achieve a linear speed-up by using multiple machines for training in specific settings.

Get Started and Documentation

Our primary documentation is at https://lightgbm.readthedocs.io/ and is generated from this repository. If you are new to LightGBM, follow the installation instructions on that site.

Next you may want to read:

Examples showing command line usage of common tasks.
Features and algorithms supported by LightGBM.
Parameters is an exhaustive list of customization you can make.
Parallel Learning and GPU Learning can speed up computation.
Laurae++ interactive documentation is a detailed guide for hyperparameters.
Optuna Hyperparameter Tuner provides automated tuning for LightGBM hyperparameters (code examples).

Documentation for contributors:

How we update readthedocs.io.
Check out the Development Guide.

News

Please refer to changelogs at GitHub releases page.

Some old update logs are available at Key Events page.

External (Unofficial) Repositories

Optuna (hyperparameter optimization framework): https://github.com/optuna/optuna

Julia-package: https://github.com/IQVIA-ML/LightGBM.jl

JPMML (Java PMML converter): https://github.com/jpmml/jpmml-lightgbm

Treelite (model compiler for efficient deployment): https://github.com/dmlc/treelite

cuML Forest Inference Library (GPU-accelerated inference): https://github.com/rapidsai/cuml

daal4py (Intel CPU-accelerated inference): https://github.com/IntelPython/daal4py

m2cgen (model appliers for various languages): https://github.com/BayesWitnesses/m2cgen

leaves (Go model applier): https://github.com/dmitryikh/leaves

ONNXMLTools (ONNX converter): https://github.com/onnx/onnxmltools

SHAP (model output explainer): https://github.com/slundberg/shap

MMLSpark (LightGBM on Spark): https://github.com/Azure/mmlspark

Kubeflow Fairing (LightGBM on Kubernetes): https://github.com/kubeflow/fairing

Kubeflow Operator (LightGBM on Kubernetes): https://github.com/kubeflow/xgboost-operator

ML.NET (.NET/C#-package): https://github.com/dotnet/machinelearning

LightGBM.NET (.NET/C#-package): https://github.com/rca22/LightGBM.Net

Dask-LightGBM (distributed and parallel Python-package): https://github.com/dask/dask-lightgbm

Ruby gem: https://github.com/ankane/lightgbm

LightGBM4j (Java high-level binding): https://github.com/metarank/lightgbm4j

MLflow (experiment tracking, model monitoring framework): https://github.com/mlflow/mlflow

{treesnip} (R {parsnip}-compliant interface): https://github.com/curso-r/treesnip

{mlr3learners.lightgbm} (R {mlr3}-compliant interface): https://github.com/mlr3learners/mlr3learners.lightgbm

Support

Ask a question on Stack Overflow with the lightgbm tag, we monitor this for new questions.
Open bug reports and feature requests (not questions) on GitHub issues.

How to Contribute

Check CONTRIBUTING page.

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Reference Papers

Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, Tie-Yan Liu. "LightGBM: A Highly Efficient Gradient Boosting Decision Tree". Advances in Neural Information Processing Systems 30 (NIPS 2017), pp. 3149-3157.

Qi Meng, Guolin Ke, Taifeng Wang, Wei Chen, Qiwei Ye, Zhi-Ming Ma, Tie-Yan Liu. "A Communication-Efficient Parallel Algorithm for Decision Tree". Advances in Neural Information Processing Systems 29 (NIPS 2016), pp. 1279-1287.

Huan Zhang, Si Si and Cho-Jui Hsieh. "GPU Acceleration for Large-scale Tree Boosting". SysML Conference, 2018.

Note: If you use LightGBM in your GitHub projects, please add lightgbm in the requirements.txt.

License

This project is licensed under the terms of the MIT license. See LICENSE for additional details.