A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
James Lamb 6d825cd3a1
clarify DEBUG-level log about tree depth (#4126)
* clarify DEBUG-level log about tree depth

* more places
2021-04-05 08:28:01 -05:00
.ci [ci] apply cpplint to cpp tests (#4092) 2021-03-28 15:52:42 +03:00
.github [ci] use GitHub Actions to re-generate R configure (#4140) 2021-03-31 14:37:40 +03:00
.nuget [ci][python] run isort in CI linting job (#3990) 2021-02-16 20:09:13 +03:00
R-package [ci] use GitHub Actions to re-generate R configure (#4140) 2021-03-31 14:37:40 +03:00
cmake store all CMake files in one place (#4087) 2021-03-21 16:36:03 +03:00
docker [ci] Added curl library to the installed packages list inside dockerfile-python installation (#4129) 2021-03-28 16:36:06 +03:00
docs clarify DEBUG-level log about tree depth (#4126) 2021-04-05 08:28:01 -05:00
examples [ci] run Dask examples on CI (#4064) 2021-03-14 21:58:15 -05:00
external_libs Move compute and eigen libraries to external_libs folder (#3809) 2021-01-22 17:45:43 +03:00
helpers [python] save all param values into model file (#2589) 2020-03-06 15:42:49 +03:00
include/LightGBM [docs] add missed CUDA device type in docs (#4130) 2021-03-28 22:22:44 +03:00
pmml [docs][python] made OS detection more reliable and little docs improvements (#1414) 2018-06-03 12:46:59 +03:00
python-package [tests][dask] Add voting_parallel algorithm in tests (fixes #3834) (#4088) 2021-04-01 08:51:24 -05:00
src clarify DEBUG-level log about tree depth (#4126) 2021-04-05 08:28:01 -05:00
swig [ci] apply cpplint to cpp tests (#4092) 2021-03-28 15:52:42 +03:00
tests [tests][dask] Add voting_parallel algorithm in tests (fixes #3834) (#4088) 2021-04-01 08:51:24 -05:00
windows Move compute and eigen libraries to external_libs folder (#3809) 2021-01-22 17:45:43 +03:00
.appveyor.yml [ci] Bump version for development (#4094) 2021-03-22 21:11:26 +03:00
.editorconfig [ci][python] run isort in CI linting job (#3990) 2021-02-16 20:09:13 +03:00
.gitignore Add CMake option to enable sanitizers and build gtest (#3555) 2021-03-13 00:53:08 +03:00
.gitmodules Move compute and eigen libraries to external_libs folder (#3809) 2021-01-22 17:45:43 +03:00
.readthedocs.yml [docs][ci] drop special dependency requirements for RTD site (#3884) 2021-01-31 14:15:26 -06:00
.vsts-ci.yml [ci] build CRAN R-package on Azure with every commit and attach it to releases (#4117) 2021-03-27 19:47:33 +03:00
CMakeLists.txt store all CMake files in one place (#4087) 2021-03-21 16:36:03 +03:00
CODE_OF_CONDUCT.md Create CODE_OF_CONDUCT.md (#803) 2017-08-18 19:01:47 +08:00
CONTRIBUTING.md Better documentation for Contributing (#2781) 2020-02-22 10:56:14 +08:00
LICENSE added editorconfig (#2403) 2019-09-16 14:38:26 +03:00
README.md [docs] add dtreeviz to the list of external projects (#4098) 2021-03-23 21:49:50 +03:00
VERSION.txt [ci] Bump version for development (#4094) 2021-03-22 21:11:26 +03:00
build-cran-package.sh store all CMake files in one place (#4087) 2021-03-21 16:36:03 +03:00
build_r.R store all CMake files in one place (#4087) 2021-03-21 16:36:03 +03:00

README.md

Light Gradient Boosting Machine


LightGBM is a gradient boosting framework that uses tree-based learning algorithms. It is designed to be distributed and efficient, with the following advantages:

  • Faster training speed and higher efficiency.
  • Lower memory usage.
  • Better accuracy.
  • Support of parallel, distributed, and GPU learning.
  • Capable of handling large-scale data.

For further details, please refer to Features.

Benefiting from these advantages, LightGBM is widely used in many winning solutions of machine learning competitions.

Comparison experiments on public datasets show that LightGBM can outperform existing boosting frameworks on both efficiency and accuracy, with significantly lower memory consumption. What's more, distributed learning experiments show that LightGBM can achieve a linear speed-up by using multiple machines for training in specific settings.
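To make the phrase "gradient boosting framework that uses tree-based learning algorithms" concrete, here is a minimal, standard-library-only sketch of the general idea: each round fits a one-split tree (a stump) to the residuals of the current prediction and adds a scaled copy of it to the ensemble. This is an illustration under simplifying assumptions (a single feature, squared-error loss, exhaustive threshold search), not LightGBM's implementation; the names `fit_stump`, `fit_boosted`, and `predict` are hypothetical and are not part of the LightGBM API.

```python
from typing import List, Tuple

# A stump is (threshold, value_if_x_le_threshold, value_if_x_gt_threshold).
Stump = Tuple[float, float, float]
# A model is (base_prediction, learning_rate, list_of_stumps).
Model = Tuple[float, float, List[Stump]]


def fit_stump(xs: List[float], residuals: List[float]) -> Stump:
    """Pick the single split on one feature that minimizes squared error."""
    mean = sum(residuals) / len(residuals)
    # Fallback: no split at all, every sample gets the mean residual.
    best: Stump = (float("inf"), mean, 0.0)
    best_err = sum((r - mean) ** 2 for r in residuals)
    # Dropping the largest value guarantees a non-empty right side.
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        lv, rv = sum(left) / len(left), sum(right) / len(right)
        err = sum((r - lv) ** 2 for r in left) + sum((r - rv) ** 2 for r in right)
        if err < best_err:
            best, best_err = (threshold, lv, rv), err
    return best


def predict_stump(stump: Stump, x: float) -> float:
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value


def fit_boosted(xs: List[float], ys: List[float],
                n_rounds: int = 50, learning_rate: float = 0.1) -> Model:
    """Boosting loop for squared error: fit each stump to current residuals."""
    base = sum(ys) / len(ys)
    preds = [base] * len(ys)
    stumps: List[Stump] = []
    for _ in range(n_rounds):
        # For 0.5 * (y - p)^2 the negative gradient is the residual y - p.
        residuals = [y - p for y, p in zip(ys, preds)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, x)
                 for p, x in zip(preds, xs)]
    return base, learning_rate, stumps


def predict(model: Model, x: float) -> float:
    base, learning_rate, stumps = model
    return base + learning_rate * sum(predict_stump(s, x) for s in stumps)


# Toy usage: learn a step function on ten points.
xs = [float(i) for i in range(10)]
ys = [1.0 if x >= 5 else 0.0 for x in xs]
model = fit_boosted(xs, ys)
print(predict(model, 2.0), predict(model, 7.0))
```

The learning rate shrinks each stump's contribution, so the ensemble approaches the target gradually over many rounds instead of fitting it in one step. A production framework replaces the exhaustive search and single feature used here with far more efficient tree construction.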

Get Started and Documentation

Our primary documentation is at https://lightgbm.readthedocs.io/ and is generated from this repository. If you are new to LightGBM, follow the installation instructions on that site.

Next you may want to read:

Documentation for contributors:

News

Please refer to the changelogs on the GitHub releases page.

Some older update logs are available on the Key Events page.

External (Unofficial) Repositories

FLAML (AutoML library for hyperparameter optimization): https://github.com/microsoft/FLAML

Optuna (hyperparameter optimization framework): https://github.com/optuna/optuna

Julia-package: https://github.com/IQVIA-ML/LightGBM.jl

JPMML (Java PMML converter): https://github.com/jpmml/jpmml-lightgbm

Treelite (model compiler for efficient deployment): https://github.com/dmlc/treelite

Hummingbird (model compiler into tensor computations): https://github.com/microsoft/hummingbird

cuML Forest Inference Library (GPU-accelerated inference): https://github.com/rapidsai/cuml

daal4py (Intel CPU-accelerated inference): https://github.com/IntelPython/daal4py

m2cgen (model appliers for various languages): https://github.com/BayesWitnesses/m2cgen

leaves (Go model applier): https://github.com/dmitryikh/leaves

ONNXMLTools (ONNX converter): https://github.com/onnx/onnxmltools

SHAP (model output explainer): https://github.com/slundberg/shap

dtreeviz (decision tree visualization and model interpretation): https://github.com/parrt/dtreeviz

MMLSpark (LightGBM on Spark): https://github.com/Azure/mmlspark

Kubeflow Fairing (LightGBM on Kubernetes): https://github.com/kubeflow/fairing

Kubeflow Operator (LightGBM on Kubernetes): https://github.com/kubeflow/xgboost-operator

ML.NET (.NET/C#-package): https://github.com/dotnet/machinelearning

LightGBM.NET (.NET/C#-package): https://github.com/rca22/LightGBM.Net

Ruby gem: https://github.com/ankane/lightgbm

LightGBM4j (Java high-level binding): https://github.com/metarank/lightgbm4j

lightgbm-rs (Rust binding): https://github.com/vaaaaanquish/lightgbm-rs

MLflow (experiment tracking, model monitoring framework): https://github.com/mlflow/mlflow

{treesnip} (R {parsnip}-compliant interface): https://github.com/curso-r/treesnip

{mlr3learners.lightgbm} (R {mlr3}-compliant interface): https://github.com/mlr3learners/mlr3learners.lightgbm

Support

How to Contribute

See the CONTRIBUTING page.

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information, see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Reference Papers

Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, Tie-Yan Liu. "LightGBM: A Highly Efficient Gradient Boosting Decision Tree". Advances in Neural Information Processing Systems 30 (NIPS 2017), pp. 3149-3157.

Qi Meng, Guolin Ke, Taifeng Wang, Wei Chen, Qiwei Ye, Zhi-Ming Ma, Tie-Yan Liu. "A Communication-Efficient Parallel Algorithm for Decision Tree". Advances in Neural Information Processing Systems 29 (NIPS 2016), pp. 1279-1287.

Huan Zhang, Si Si and Cho-Jui Hsieh. "GPU Acceleration for Large-scale Tree Boosting". SysML Conference, 2018.

Note: If you use LightGBM in your GitHub projects, please add lightgbm to your requirements.txt.

License

This project is licensed under the terms of the MIT license. See LICENSE for additional details.