Граф коммитов

320 Коммитов

Автор SHA1 Сообщение Дата
Nikita Titov fe9f292cb3 added new early_stopping param alias (#2431) 2019-09-26 12:07:59 +08:00
Nikita Titov 2e9f255e91 [docs] refer to RTD site directly (#2415) 2019-09-24 18:38:20 -05:00
Nikita Titov 023749239e
added editorconfig (#2403)
* style minor fixes

* added editorconfig

* updated editorconfig
2019-09-16 14:38:26 +03:00
Guolin Ke ad8e8ccc9c update feature_fraction_bynode (#2381)
* update

* fix a bug

* Update config.h

* Update Parameters.rst
2019-09-12 15:52:52 +03:00
CharlesAuguste f52be9be50 [python] Improved python tree plots (#2304)
* Some basic changes to the plot of the trees to make them readable.

* Squeezed the information in the nodes.

* Added colouring when a dictionnary mapping the features to the constraints is passed.

* Fix spaces.

* Added data percentage as an option in the nodes.

* Squeezed the information in the leaves.

* Important information is now in bold.

* Added a legend for the color of monotone splits.

* Changed "split_gain" to "gain" and "internal_value" to "value".

* Sqeezed leaves a bit more.

* Changed description in the legend.

* Revert "Sqeezed leaves a bit more."

This reverts commit dd8bf14a3ba604b0dfae3b7bb1c64b6784d15e03.

* Increased the readability for the gain.

* Tidied up the legend.

* Added the data percentage in the leaves.

* Added the monotone constraints to the dumped model.

* Monotone constraints are now specified automatically when plotting trees.

* Raise an exception instead of the bug that was here before.

* Removed operators on the branches for a clearer design.

* Small cleaning of the code.

* Setting a monotone constraint on a categorical feature now returns an exception instead of doing nothing.

* Fix bug when monotone constraints are empty.

* Fix another bug when monotone constraints are empty.

* Variable name change.

* Added is / isn't on every edge of the trees.

* Fix test "tree_create_digraph".

* Add new test for plotting trees with monotone constraints.

* Typo.

* Update documentation of categorical features.

* Typo.

* Information in nodes more explicit.

* Used regular strings instead of raw strings.

* Small refactoring.

* Some cleaning.

* Added future statement.

* Changed output for consistency.

* Updated documentation.

* Added comments for colors.

* Changed text on edges for more clarity.

* Small refactoring.

* Modified text in leaves for consistency with nodes.

* Updated default values and documentaton for consistency.

* Replaced CHECK with Log::Fatal for user-friendliness.

* Updated tests.

* Typo.

* Simplify imports.

* Swapped count and weight to improve readibility of the leaves in the plotted trees.

* Thresholds in bold.

* Made information in nodes written in a specific order.

* Added information to clarify legend.

* Code cleaning.
2019-09-08 19:26:55 +03:00
Guolin Ke bbbad73db3
sub-features for node level (#2330)
* add parameter

* implement

* fix bug

* fix bug

* fix according comment

* add test

* Update test_engine.py

* Update test_engine.py

* Update test_engine.py
2019-09-03 11:43:51 +08:00
Nikita Titov faba6cafb7
[docs][R] added R-package docs generation routines (#2176)
* added R-package docs generation routines

* change theme to be more consistent with sphinx_rtd_theme on main site in terms of color scheme

* placed man folder with old Rd files back

* specify full path to conda and make script more readable by one line - one pkg

* removed commented lines from build_r_site script

* made one line - one argument in build_reference() call

* pin R package versions

* fixed conflict
2019-09-01 19:47:04 +03:00
Guolin Ke 0dfda82607
normalize the lambdas in lambdamart objective (#2331)
* norm the lambda scores

* change default to false

* update doc

* typo

* Update Parameters.rst

* Update config.h

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py

* Update rank_objective.hpp

* Update Parameters.rst

* Update config.h

* Update test_sklearn.py

* Update test_sklearn.py

* Update test_sklearn.py
2019-08-25 07:30:10 +08:00
Nikita Titov 686b0cc2f9
[docs][python] allow usage and compilation of 32-bit library (#2188)
* allow usage and compilation of 32-bit library

* added docs
2019-08-17 23:01:35 +03:00
Nikita Titov a8d8b07d3a
[docs] remove aliases from options in params and remove separators in FAQ (#2296)
* removed aliases from options in params

* minor fixes in FAQ
2019-08-07 14:30:55 +03:00
Nikita Titov 6b94c8e0f2 [docs] generate contents in Installation guide automatically (#2298)
* Update Installation-Guide.rst

* Update Installation-Guide.rst
2019-08-05 21:02:33 -05:00
Nikita Titov 28dc333d3f
[docs] fixed params description (#2306) 2019-08-05 00:23:50 +03:00
Nikita Titov 0197dba061
[ci][docs] update the list of ignoring sites for linkchecker (#2307) 2019-08-04 22:14:17 +03:00
Nikita Titov f68e1b50fb
[docs] limit number of files to scan for autosummary generation and reorganize conf.py (#2297)
* limit number of files to scan for autosummary generation

* reorganize conf.py
2019-08-02 14:51:07 +03:00
Guolin Ke c17a5f5b84 Refine the description of ``is_unbalance `` 2019-08-01 13:47:15 +08:00
Alexander L. Hayes 04a56010ec [docs] 📝 FAQ overhaul for linking to individual questions (#2293)
* 📝 FAQ overhaul for Issue #2268

Reformat "Contents" to use the `.. contents::` directive
Reword "Critical" into "Critical Issues"
Reformat "Critical" section to define "critical issues"
Reformat FAQ sections to follow a new format
Reformat FAQ sections so individual questions have links

All sections now follow a new format (below).
A "frequently asked question" may also include a possible
cause and a solution (if the two are not obvious from
the context):

```rst
Section Title
=============

.. contents::
  :local:
  :backlinks: none

1. Question 1
-------------

**Possible Cause**: This is likely due to...

**Solution**: Fix with...
```

* ✏️ Correcting typos and links

Add period to `2. Error messages: ....`
Fix links to FAQ in Installation-Guide.rst

* ✏️ Removing FAQ link and correcting `python-package` README

Drop general FAQ link in `Installation-Guide.rst`
Add FAQ question links to `python-package/README.rst`
2019-07-31 03:10:28 +03:00
James Lamb 11c24cf835
[docs] added FAQ on early stopping (fixes #2270) (#2272)
* added FAQ on early stopping (fixes #2270)

* fixed indentation in FAQ

* Update docs/FAQ.rst

Co-Authored-By: Nikita Titov <nekit94-08@mail.ru>
2019-07-29 16:39:57 -05:00
Alexander L. Hayes 207bb3ef32 [docs] 🎨 Sphinx Autosummary for generating Python-API documentation (#2286)
* 🎨 `sphinx.ext.autosummary` for generating Python-API summaries

Add `docs/.gitignore` to not track autosummary stubs
Add `sphinx.ext.autosummary` in `docs/conf.py`
  Add 'members' and 'inherited-members' as default parameters
  Add 'autosummary = True' for setting output with `:toctree:`
Add `.. autosummary::` tags to replace `.. autoclass::`

Previously the `Python-API.rst` dumped all of the Python API onto
a single page.

This replaces the Python-API documentation with an index listing
all modules, and paginates all functions and classes onto
separate pages.

* ✏️ Corrections following feedback

Drop `docs/.gitignore` to use the general `.gitignore`
Add `show-inheritance` to `autodoc_default_flags` in `docs/conf.py`
Fix `both` to `class` in `autoclass_content` in `docs/conf.py`

* ✏️ Replacing deprecated Sphinx parameter

Fix deprecated `autodoc_default_flags` to `autodoc_default_options`

* ✏️ Adding `autodoc_default_flags` in to support early Sphinx versions

Add `autodoc_default_flags` with parameters from
  `autodoc_default_options`
2019-07-27 14:59:51 +03:00
Nikita Titov 5dc1507e1c
[docs] updated js file attaching mechanism (#2287) 2019-07-27 14:34:21 +03:00
Guolin Ke 5d3a3ea47e fix metric alias (#2273)
* fix metric alias

* fix format

* updated docs

* simplify alias in objective function

* move the alias parsing to config.cpp

* updated docs

* fix multi-class aliases

* updated regression aliases in docs

* fixed trailing space
2019-07-25 17:31:58 +03:00
Belinda Trotta 291752de3e Max bin by feature (#2190)
* Add parameter max_bin_by_feature.

* Fix minor bug.

* Fix minor bug.

* Fix calculation of header size for writing binary file.

* Fix style issues.

* Fix python style issue.

* Fix test and python style issue.
2019-07-08 09:45:36 +08:00
Nikita Titov bf78008b47 switch name and alias of rmse metric (#2257) 2019-07-07 08:42:54 +08:00
Guolin Ke cdba71475f balanced bagging (#2214)
* add balanced bagging

* refine code

* fix format

* clarify usage only for binary application
2019-06-18 10:32:38 +08:00
Nikita Titov 8ecae56258 [ci] add OpenMPI link to ignore list (#2227) 2019-06-13 10:08:25 +08:00
Belinda Trotta b3db9e924a Top k multi error (#2178)
* Implement top-k multiclass error metric. Add new parameter top_k_threshold.

* Add test for multiclass metrics

* Make test less sensitive to avoid floating-point issues.

* Change tabs to spaces.

* Fix problem with test in Python 2. Refactor to use np.testing. Decrease number of training rounds so loss is larger and easier to compare.

* Move multiclass tests into test_engine.py

* Change parameter name from top_k_threshold to multi_error_top_k.

* Fix top-k error metric to handle case where scores are equal. Update tests and docs.

* Change name of top-k metric to multi_error@k.

* Change tabs to spaces.

* Fix formatting.

* Fix minor issues in docs.
2019-05-26 14:08:45 +03:00
leasunhy 6a1a538f45 [docs] remove duplicated param in Python-Intro.rst (#2181)
`num_round` is redundant here because it will be overrideen by `num_trees` in the `param` dictionary.
2019-05-18 20:51:55 +03:00
Nikita Titov a7885b60dd
[ci] removed temp hotfix for Yahoo URL (#2162) 2019-05-16 15:35:40 +03:00
Guolin Ke f01b2aca13
first metric only in earlystopping for cli (#2172)
* first metric only in earlystopping for cli

* code clean

* added note about CLI only usage

* removed note about CLI only usage
2019-05-16 09:47:49 +08:00
Ilya Matiach 6f3fae51fe [doc] minor doc fix for gamma param (#2180) 2019-05-15 18:26:46 +03:00
Laurae 3d8770aff7 [docs] fixing max_depth param description (#2155)
* PR #1879

* Update docs with parameter_generator.py

* Update wrapper doc for sklearn
2019-05-15 17:04:50 +03:00
Nikita Titov f91e5644a3
[python] added ability to pass first_metric_only in params (#2175)
* added ability to pass first_metric_only in params

* simplified tests

* fixed test

* fixed punctuation
2019-05-15 15:44:37 +03:00
Nikita Titov 76170788c4
[ci][docs] updated fix for OpenMP conflicts (#2166)
* test: AppleClang on Travis

* updated fix for OpenMP multiple instances

* updated docs

* switch compiler back
2019-05-12 14:19:53 +03:00
Guolin Ke 94fbe5bb9f
[docs] updated Microsoft GitHub URL (#2152)
* fix travis badge

* updated GitHub Microsoft URL
2019-05-08 13:51:28 +08:00
Nikita Titov cfcc020e58
[ci][docs] generate docs for C API (#2059)
* use file to install deps for docs

* added C_API docs

* use breathe without exhale

* added missed params descriptions and make Doxygen fail for warnings

* escape char hotfix

* ignore unknown directive for rstcheck

* better handle env variable

* hotfix for 'Unknown directive type' error with C_API=NO

* Update .gitignore

* fixed pylint

* use already defined constants in conf.py

* do not suppress Doxygen's output

* addressed review comments

* removed unneeded import
2019-05-05 14:54:37 +03:00
Nikita Titov 93a899ac7c [ci] hotfix for check the docs(#2145) 2019-05-04 23:01:01 +08:00
Nikita Titov 611cf5d414 [python] added plot_split_value_histogram function (#2043)
* added plot_split_value_histogram function

* updated init module

* added plot split value histogram example

* added plot_split_value_histogram to notebook

* added test

* fixed pylint

* updated API docs

* fixed grammar

* set y ticks to int value in more sufficient way
2019-05-01 23:05:16 +09:00
Nikita Titov 5ece53b5e0 [docs] updated the part about Boost in installation guide (#2082)
* updated installation guide

* updated Python installation guide

* added note about opencl path to Windows section

* added space before path in message

* minor correction for option description in Python installation guide
2019-04-30 10:53:49 -07:00
Nikita Titov 8d2ec69f4f
[python][R][docs][ci] better compatibility with Visual Studio 2019 (#2083)
* set platform via A option

* style hotfix

* updated R installation script

* updated Python installation script

* updated CI test script

* provide VS version-ingependent link for redistributables download

* added link to VS 2019 redistributables

* added VS 2019 match for Boost binaries
2019-04-29 13:34:44 +03:00
Nikita Titov 24ad35f748 fixed minor typos (#2119) 2019-04-28 15:35:11 -05:00
Nikita Titov 044a39d0ee [ci] hotfixes (#2132)
* Update Features.rst

* Update .appveyor.yml

* Update test_windows.ps1

* Update test_windows.ps1

* Update .appveyor.yml

* Update test_windows.ps1

* Update .appveyor.yml
2019-04-26 22:06:49 +08:00
Scott Lundberg 89f2021a4d [docs] Update doc string for pred_contrib (#2116)
* Update doc string for pred_contrib

See comments at the end of #1969

* Update basic.py

* Update basic.py

* update doc strings

* update equals sign in doc string

* strip whitespace and gen rst

* strip whitespace
2019-04-19 04:22:17 +03:00
Nikita Titov beb35d567d
[docs] added note about the spoiled probabilities (#2113) 2019-04-18 18:49:21 +03:00
Nikita Titov c56412a859
[docs] updated HDFS guide (#1890)
* updated HDFS guide

* updated guide

* no info about Clang

* pass paths in quotes

* Update README.rst
2019-04-11 23:09:06 +03:00
Nikita Titov b3c31c4015
[docs] Python wrapper doesn't support params in form of list of pairs (#2078)
* fixed Python intro

* fixed typos

* scikit-learn added support of https
2019-04-10 13:26:12 +03:00
remcob-gr 76102284d1 Add Cost Effective Gradient Boosting (#2014)
* Add configuration parameters for CEGB.

* Add skeleton CEGB tree learner

Like the original CEGB version, this inherits from SerialTreeLearner.
Currently, it changes nothing from the original.

* Track features used in CEGB tree learner.

* Pull CEGB tradeoff and coupled feature penalty from config.

* Implement finding best splits for CEGB

This is heavily based on the serial version, but just adds using the coupled penalties.

* Set proper defaults for cegb parameters.

* Ensure sanity checks don't switch off CEGB.

* Implement per-data-point feature penalties in CEGB.

* Implement split penalty and remove unused parameters.

* Merge changes from CEGB tree learner into serial tree learner

* Represent features_used_in_data by a bitset, to reduce the memory overhead of CEGB, and add sanity checks for the lengths of the penalty vectors.

* Fix bug where CEGB would incorrectly penalise a previously used feature

The tree learner did not update the gains of previously computed leaf splits when splitting a leaf elsewhere in the tree.
This caused it to prefer new features due to incorrectly penalising splitting on previously used features.

* Document CEGB parameters and add them to the appropriate section.

* Remove leftover reference to cegb tree learner.

* Remove outdated diff.

* Fix warnings

* Fix minor issues identified by @StrikerRUS.

* Add docs section on CEGB, including citation.

* Fix link.

* Fix CI failure.

* Add some unit tests

* Fix pylint issues.

* Fix remaining pylint issue
2019-04-04 10:35:11 +08:00
sheikheddy fe115bbb72 [docs] Fix typo in Python-Intro.rst (#2074) 2019-04-03 00:21:40 +03:00
James Lamb 572ae40038 [docs] Small aesthetic improvements to RTD docs (#2060)
* Small aesthetic improvements to RTD docs

* fixed markdown table in Development-Guide

* removed unnecessary blank line in conf.py
2019-03-26 22:55:17 +03:00
kenmatsu4 011cc90a77 [python] Use first_metric_only flag for early_stopping function. (#2049)
* Use first_metric_only flag for early_stopping function.

In order to apply early stopping with only first metric, applying first_metric_only flag for early_stopping function.

* upcate comment

* Revert "upcate comment"

This reverts commit 1e75a1a415.

* added test

* fixed docstring

* cut comment and save one line

* document new feature
2019-03-25 13:18:22 +08:00
Nikita Titov e5e9fbea5e fixed broken link (#2065) 2019-03-22 10:46:41 -07:00
Nikita Titov f9ab5f588d [docs] added notes about params usage when data is provided via path and removed unused param (#2024)
* added notes about params usage when data is provided via path

* fixed init score and valid init score params note

* fixed binary params description
2019-02-24 10:59:33 +08:00